Never miss a job!

Join 1,800+ DevOps engineers getting weekly alerts for remote and US, EU roles that don't show up on the big boards. Junior to senior. Kubernetes, AWS, Terraform — filtered for your stack.

🇪🇺 Secure Your EU Traffic

Ensure digital sovereignty for your infrastructure. Get EU static IPs with full data residency for compliance and peace of mind.

🇪🇺 Get an EU IP with OutboundGateway → GDPR-compliant • Static IPs • EU Data Residency

2024-12-10

Software Engineer (Infrastructure/Platform Services or Site Reliability Engineering)

The Core Platform team is responsible for maintaining and optimizing the data, infrastructure, messaging, and services platform that powers Sift’s online systems. We ensure these systems are always available, reliable, and performing at their best to meet customer needs. In the event of an outage or failure, we follow well-practiced recovery plans to restore services swiftly. Managing such complex, large-scale systems requires continuous monitoring and proactive maintenance to uphold these standards.

What will you do:

Design and build immutable infrastructure and fault-tolerant, multi-AZ/multi-region systems that are resilient and self-healing.
Implement multi-region deployments, such as BigTable clusters spanning multiple regions, with strategies to ensure specific customers are routed to designated regions (e.g., sticky sessions at the regional level).
Optimize local development and testing workflows to be fast, efficient, and seamless.
Create dynamic environments that enable specific services to interact with other environments in real time.
Develop automated bot solutions for deployment and monitoring, integrating with Slack for streamlined updates.
Participate in on-call support and incident response activities, providing 12/7 coverage for one calendar week approximately once every 3-4 weeks.

Technical stack: GCP, AWS, Terraform, Kubernetes, Vault, Jenkins, Kafka, Snowflake, Spark, Java 11, Python 3, Ruby 2.7, Ruby on Rails.

What makes you a strong fit:

You have a deep understanding of large-scale computing and approach infrastructure as code. You're passionate about building immutable infrastructure and resilient, multi-AZ/multi-region systems that can withstand failures. While you recognize the importance of monitoring and alerting, your ultimate goal is to design self-healing systems. Collaboration is key to you, and you strive to act as a force multiplier by making thoughtful trade-offs to drive success.

Key qualifications:

5+ years of experience as a Software Engineer focused on infrastructure/platform services or in a Site Reliability Engineering (SRE) role.
Strong programming skills in languages such as Java, Scala, or Python.
Extensive experience building and managing cloud infrastructure on GCP or AWS.
Expertise in building infrastructure as code and automating provisioning processes using tools like CloudFormation or Terraform.
Proficiency in setting up and managing monitoring and alerting systems, both open-source and commercial.
Familiarity with Docker and container orchestration technologies like Kubernetes, GKE, or AWS ECS.
Experience troubleshooting and resolving production system issues, with a focus on building automated solutions to prevent future occurrences.
Proven expertise in automation and a solid understanding of configuration management tools.

Benefits and perks:

Competitive Compensation: Includes financial rewards, annual 5% bonus, and stock options;
Health Insurance Stipend: Support for your medical and health related needs;
Sports and Wellness Stipend: Encouraging a healthy and active lifestyle;
Work From Home Stipend: Support in creating a productive home office setup;
Education Reimbursement: books, education courses, and conferences to support your professional growth;
Mental Health Days: Additional paid day offs to prioritize your well-being;
Language and Public Speaking Development: English courses and social activities within the company to enhance your communication skills.

Our interview process:

Introduction interview: a 45-minute session with a recruiter to discuss your background and the role.
Technical Screening interview: a 60-minute interview with a member of the engineering team to explore your fit for the position.

Virtual onsite loop with the team: a comprehensive session comprising four interviews lasting approximately 3.5 hours, covering system design, coding abilities, deep dive, and values and behavior-based conversations.

During these sessions, you will have the opportunity to learn about company culture, meet engineers or peers from your team, and discuss distributed system problems. You will have time for interesting questions and gain transparency regarding your future responsibilities and the project.

A little about us:

Sift is the AI-powered fraud platform securing digital trust for leading global businesses. Our deep investments in machine learning and user identity, a data network scoring 1 trillion events per year, and a commitment to long-term customer success empower more than 700 customers to grow fearlessly. Brands including DoorDash, Yelp, and Poshmark rely on Sift to unlock growth and deliver seamless consumer experiences. Visit us at sift.com and follow us on LinkedIn.

Apply

Please let Sift know that you found this role at devopsprojectshq.com as a way to support us,
so we can keep providing you with awesome DevOps jobs.

💼 Upgrade to Premium

Get instant access to exclusive DevOps jobs with €120K+ salaries

Monthly

€16.50/month

Best value for job search

✓ Access to premium jobs
✓ Priority support
✓ Early access to new jobs

Get Started

Best Value

Yearly

€49.50/year

Only €4.13/month - Save 75%

✓ Everything in Monthly
✓ Maximum savings
✓ Best long-term value

Get Started

View All Plans & Features

Never miss a job!

🇪🇺 Secure Your EU Traffic

Software Engineer (Infrastructure/Platform Services or Site Reliability Engineering)

You must be logged in to apply for this job

Please let Sift know that you found this role at devopsprojectshq.com as a way to support us,
so we can keep providing you with awesome DevOps jobs.

💼 Upgrade to Premium

Monthly

Yearly

Similar Jobs

Site Reliability Engineer

Senior Software Engineer, API Platform

DevOps/SRE Engineer

On-Demand DevOps Engineer

Built and hosted in the EU 🇪🇺 we keep your data safe

Never miss a job!

🇪🇺 Secure Your EU Traffic

Software Engineer (Infrastructure/Platform Services or Site Reliability Engineering)

You must be logged in to apply for this job

Please let Sift know that you found this role at devopsprojectshq.com as a way to support us, so we can keep providing you with awesome DevOps jobs.

💼 Upgrade to Premium

Monthly

Yearly

Similar Jobs

Site Reliability Engineer

Senior Software Engineer, API Platform

DevOps/SRE Engineer

On-Demand DevOps Engineer

Built and hosted in the EU 🇪🇺 we keep your data safe

Someone Just Upgraded!

Please let Sift know that you found this role at devopsprojectshq.com as a way to support us,
so we can keep providing you with awesome DevOps jobs.