Lead Platform Engineer

Job description

About the Team

The Forem team is about 40 individuals from all over the world. In total, approximately 87% of the team comes from underrepresented backgrounds. On the engineering team specifically, 74% are from a group underrepresented in tech. Learn more here: https://www.forem.com/about

We are looking for a Lead Platform Engineer with a focus on scaling best practices, reducing engineer toil, maintaining our current containerized system and fleet, and ensuring the reliability and stability of the Forem platform. This engineer will strongly influence platform engineering by establishing policies, practices, and a framework for how DevOps, SRE, and infrastructure will work now and in the future. This lead engineer will also work with all engineers to understand pain points and balance the "now" against the "future", while proposing foundational decisions and creating effective structures in this area. This individual will need to weigh performance, reliability, stability, and developer happiness to ensure that we provide a world-class community-building experience for our internal team, external contributors, creators, and users. The starting salary for this role is $180,000 plus equity and is not location-based.


Primary Responsibilities

  • Work collaboratively with multiple engineering pods to identify pain points and future-facing reliability work

  • Improve the developer experience around flaky tests, build and deploy options, and testing

  • Maintain hosting infrastructure for both company-run and client sites, including monitoring and incident identification

  • Execute necessary upgrades and patches across our fleet of servers

  • Provide clear onboarding and troubleshooting documentation for engineers and the open source community around deployment-related topics

  • Automate and document tasks to scale services and processes

  • Triage, debug, and fix bugs reported by users

  • Put SRE best practices in place and upskill others on them

  • Lead incident resolution, including identifying and executing any follow-ups in conjunction with the pod

  • Proactively pick up new work, determine solutions and code paths, and unblock yourself by asking for help or finding other solutions in an asynchronous, distributed culture

  • Participate in code reviews, design and implementation conversations, and post-incident reviews

Skills or Experience We’d Like To See

  • 6+ years of professional experience in DevOps, SRE, or infrastructure

  • Experience writing high-quality, readable code, with a focus on performance and maintainability

  • Experience with a range of testing tools, ideally including best practices for both front-end and back-end testing

  • Experience with containerized systems like Kubernetes or Fedora CoreOS

  • Strong experience in Ruby, C, Python, or Bash scripting

  • Experience with deployment systems and tools including AWS, Terraform, and Ansible

  • Experience working in an asynchronous, distributed team

  • High degree of empathy

  • Interest or experience in open source software and/or the open source community

  • Ability to focus on the big picture while shipping iteratively towards that goal

  • Thrives in a fast-paced start-up environment

  • Ability to work independently and as part of a collaborative team

  • Strong communication and problem-solving skills, and the ability to interact with all levels of the organization


Work/life balance and ensuring that folks have what they need for work and for life are incredibly important to us. Below are some of the benefits we offer:

  • Unlimited PTO with a minimum requirement

  • Generous employee equity

  • A remote-first, fully distributed team that encourages schedule flexibility

  • A monthly wellness stipend to keep you at your peak

  • Free mental health services

  • Medical, dental, and vision coverage (U.S. only)

  • An annual professional development budget to keep you learning and growing

  • Office setup budget and monthly coworking stipend to make sure you have everything you need to work effectively wherever you are

  • 401(k) (U.S. only)

Interview process

We want our candidates to have the best possible interview experience because this is as much about you finding the right fit as it is about us finding a great new addition to the team. We value candidates from all backgrounds and experiences and want our interview process to reflect that. Here is what you can expect:

  • Application review. Our application involves more questions up front so that we can get to know you better right from the get-go.

  • Round 1: Hiring manager interview with the Head of Engineering

  • Round 2: A short (90-minute) take-home exercise that is meant to mimic real-life work and provide fodder for a synchronous technical conversation

  • Round 3: Team interview covering your technical experience and talking through your take-home submission

  • Round 4: Team interview covering your core skills around communication, mentorship, and feedback

  • Round 5: Wrap-up. This is primarily a chance for you to ask any final questions you have about the role, team, or company.

We provide feedback to every candidate who participates in Round 1 and beyond so that if Forem isn't a good fit, you know why.