Join Okta as a Staff Site Reliability Engineer in a pivotal engineering position within our Business Technology team. This remote work-enabled role focuses on enhancing our cloud platform services, emphasizing back-end infrastructure and tooling for corporate teams. As a Staff Site Reliability Engineer, you will empower teams to build and automate infrastructure at scale, ensuring software reliability and predictability.
Key Responsibilities
Build and manage development tools, pipelines, and infrastructure with a security-first mindset.
Engage in Agile ceremonies, author stories, and support team members through demos, knowledge sharing, and architectural sessions.
Promote and implement best practices for creating secure, scalable, and reliable cloud infrastructure.
Develop and maintain essential technical documentation, including network diagrams, runbooks, and procedures.
Design, operate, and monitor Okta's IT infrastructure and cloud services, focusing on efficiency and security standards.
Lead initiatives to enhance our existing cloud platforms, aligning with the latest security standards and best practices.
Develop and manage policies, standards, processes, and procedures to improve infrastructure operations.
Collaborate with software engineers to ensure development aligns with established processes and performs as designed.
Oversee centralized technical processes, including container and image management.
Deliver exceptional customer service to internal users, championing SRE services and DevOps practices.
Required Qualifications
Over 10 years of experience in roles such as SRE, DevOps, Systems Engineer, or similar.
Proven expertise in developing complex applications for cloud infrastructure at scale.
Strong proficiency in managing AWS multi-account environments, including AWS Orgs, AWS IAM, AWS Identity Center, and Stacksets.
Skilled in infrastructure automation using Terraform.
Experienced in developing applications on AWS and other cloud infrastructures, covering compute, storage, networking, and virtualization.
Proficient with Git and constructing deployment pipelines, particularly with Github Actions.
Expertise in tooling and automation with Python.
Knowledgeable in AWS container-based workloads and concepts, notably EKS, ECS, and ECR.
Familiar with monitoring tools such as Splunk, Cloudwatch, and Grafana.
Preferred Qualifications
Experience with scalable network architectures and an understanding of network technologies related to IP internetworking is highly advantageous.
About Okta
Okta is The World’s Identity Company, enabling safe use of technology everywhere through secure access, authentication, and automation. Our products, including the Okta Platform and Auth0 Platform, place identity at the core of business security and growth. At Okta, we value diverse perspectives and seek individuals eager to bring their unique experiences to our team.
Benefits & Perks
Okta offers competitive salaries, excellent health benefits, and a network of supportive professionals. As part of our commitment to health and well-being, employees enjoy comprehensive wellness programs and flexible work arrangements.