Virta is the first company with a clinically proven treatment to safely and sustainably reverse type 2 diabetes and other chronic metabolic diseases without the use of medications or surgery. Our innovations in nutritional biochemistry, data science and digital tools, combined with our clinical expertise, are shifting the diabetes treatment paradigm from management to reversal. Our mission: to reverse type 2 diabetes in 100 million people by 2025.
Virta is in a phase of rapid growth, and we are investing heavily in our GCP-based Kubernetes infrastructure to ensure that we have a solid foundation on which to grow. This role provides a key opportunity to develop and instill the site reliability practices that will scale our business to the next level and ensure our patients have continuous access to our life-changing treatment.
As a Site Reliability Engineer at Virta, you will be supporting Virta’s patients and clinical staff by ensuring Virta’s systems are always available and performant. Some of the responsibilities will include:
Build and maintain monitoring systems and processes to ensure product engineers get actionable data for the components they maintain.
Coordinate with the product teams to enhance the scalability and reliability of our systems through analysis and observability improvements.
Engage in capacity planning with load testing and auto-scaling strategies.
Own the incident response process, including developing sustainable practices, capturing learnings, and ensuring blameless postmortems.
Work across the engineering team to encourage excellence in incident response and build a culture of site reliability engineering.
Efficiently troubleshoot issues across our systems and software to determine root causes and impact.
Within your first 90 days at Virta, we expect you will do the following:
Learn Virta’s system and network architecture so that you can take part in incident response and troubleshooting activities.
Improve monitoring and observability tooling to enhance visibility into our systems and software.
Help define and roll out Service Level Objectives and operational readiness within the Virta system.
3+ years of experience in site reliability or comparable roles working in a modern containerized cloud environment.
Proficiency in scripting in at least one language (Bash, Python, Go).
Experience implementing monitoring tools and alerting systems.
Excellent Kubernetes troubleshooting skills, particularly during incident response.
Previous experience developing runbooks and driving process improvement.
Is this role not quite what you're looking for? Join our Talent Community and follow us on LinkedIn to stay connected!
As part of your duties at Virta, you may come into contact with sensitive patient information that is governed by HIPAA. Throughout your career at Virta, you will be expected to follow Virta's security and privacy procedures to ensure our patients' information remains strictly confidential. Security and privacy training will be provided.
Virta has a location-based compensation structure. Starting pay will be based on a number of factors and commensurate with qualifications and experience. For this role, the compensation range is $145,885 - $163,916. Information about Virta’s benefits is on our Careers page at: https://www.virtahealth.com/careers.
#LI-Remote