Founded in 2016, LogRocket aims to make every experience on the web as perfect as possible. We're solving a huge challenge for product managers and developers: understanding the user experience. LogRocket is the first system that gives these teams complete visibility into their customers' experience of their web apps, through pixel-perfect replays of user sessions and clear insight into logs, errors, and network activity.
Key Responsibilities
Improve the quality of pager alerts while reducing noise
Maintain awareness of engineering initiatives across the organization and monitor their impact on stability, cost, and performance
Keep infrastructure up to date to take advantage of security patches and new features
Improve operational security without sacrificing engineering independence
Required Qualifications
At least 5 years of experience as a Site Reliability Engineer or in a related role
Ability to read and understand product code (writing product code is a nice-to-have!)
Familiarity with the state of the art in cloud technologies, including common providers, specific tools of the trade, and their strengths and weaknesses
Experience operating applications and databases with demanding scalability or availability requirements
Proven expertise in modern container orchestration practices (we use Kubernetes on GKE)
A strong understanding of the performance, architecture, tooling, and cost of cloud systems
A security-focused mindset with a solid understanding of incident response and risk mitigation
A strong collaborator who is transparent about progress on tasks, seeks feedback early and often, and works effectively with the team and customers
About LogRocket
Backed by top investors such as Matrix Partners, Battery Ventures, and Delta-V Capital, we've raised $55M in funding and we're eager to bring talented people on board to support our growth. We're on a mission to improve society's experience with software, and that's where you come in.
Benefits
Extensive health, dental, and vision benefits
Open vacation policy - we all work hard and take time for ourselves when we need it, no strings attached
Three months of fully paid parental leave for any employee welcoming a child into their home
401(k) and commuter benefits
Generous stock options - we all get to own a piece of what we’re building
Regular team outings and activities
Flexible working hours and location
Monthly employee gifts
Example projects
Overhaul a fleet of nginx load balancers handling hundreds of thousands of requests per second without incurring downtime
Work with members of the engineering team to identify and resolve spikes in processing latency in our ingestion worker pool
Automate database scaling to reduce operating costs while maintaining the ability to respond to traffic spikes
Help build tools to streamline the onboarding and release process for customers using LogRocket's On-Premise offering
Improve the performance and reliability of the system our Product Engineering teams use to both test and deploy software