THE ROLE
We are seeking a highly skilled and experienced Staff Platform Engineer to lead a small team of talented engineers in building and maintaining our cutting-edge infrastructure. You will be at the forefront of ensuring our platform, serving over 80 million users monthly, is scalable, reliable, and observable. This role requires a blend of expertise in infrastructure design, automation, cloud technologies (specifically Google Cloud and Kubernetes), and software development.
RESPONSIBILITIES
Leadership: Guide and mentor a team of platform engineers, fostering collaboration and professional growth.
Infrastructure Design and Automation: Architect, implement, and manage our complex infrastructure with a focus on scalability, security, and efficiency. Leverage automation tools to streamline processes and ensure consistency.
Cloud Expertise: Deep understanding of Google Cloud Platform and Kubernetes to optimize our cloud infrastructure for performance and cost-effectiveness.
Software Development: Collaborate with development teams to implement necessary software changes for platform optimization and integration.
Observability Champion: Drive the adoption of an observability-first approach, ensuring comprehensive visibility into system health and performance for all teams.
Problem-Solving: Lead the team in troubleshooting complex technical issues and implementing solutions to prevent future occurrences.
ABOUT YOU
- Extensive experience in infrastructure design, implementation, and management.
- Proven leadership skills with a passion for mentoring and team development.
- Strong understanding of DevOps principles and practices.
- Proficiency in Python.
- Ability to work independently and as part of a collaborative team.
- Excellent communication and problem-solving skills.
- 8+ years working with cloud computing providers, specifically Google Cloud Platform.
- 8+ years working with deployment and orchestration of containerized applications, specifically Kubernetes.
- 8+ years experience using Infrastructure-as-code (IaC) and configuration management tooling (e.g. Ansible, Terraform).
- 8+ years working with monitoring, observability, and visualization platforms (e.g. Honeycomb, Dynatrace, Prometheus, Grafana, Solarwinds).
- 8+ years working with logging and alerting platforms (e.g. Elastic Search, PagerDuty, GoAlert).
- Ability to be on-call. The current schedule requires each North American engineer to be on week-long rotations every month, covering noon to midnight Eastern Time. The schedule is subject to change.
- Ability to occasionally travel to the US or Canada
YOU’RE A GREAT FIT IF YOU ALSO HAVE
- Excitement for online communities and alignment with the Fora mission to help people have fun finding products for their hobbies
$150,000 - $170,000 a year