About Us
From two-person startups to public companies, Rho is the banking platform with everything businesses need to manage cash, control spending, and automate finance busywork. Rho offers corporate cards, banking, treasury, expense management, AP, accounting automation, and more in one integrated platform backed by award-winning support.
About this Role
We are seeking a talented Software Reliability Engineer (SRE) to implement software engineering principles to operations and infrastructure problems. This role focuses on the automation, reliability, and scalability of our systems.
Key responsibilities include:
Partner with engineering teams to establish and implement Service Level Objectives (SLOs) and Service Level Indicators (SLIs), fostering a reliability-focused culture through collaborative incident management processes
Work alongside development teams to optimize critical payment system infrastructure by managing error budgets, collaboratively achieving p99 latency targets, and maintaining appropriate availability levels aligned with SLOs
Drive continuous improvements based on incident learnings and DORA metrics
Support capacity planning and performance tuning for high-traffic, low-latency payment transactions.
Write and maintain basic code to improve logging, metrics collection, and system visibility, ensuring observability best practices are embedded in core payment services
Collaborate with development teams on incident response procedures, including pre-mortem and post-mortem analysis
Partner with engineering teams to maintain comprehensive runbooks and documentation for critical services
Success in this role will be measured by improvements in key metrics including Mean Time to Detect (MTTD), Mean Time to Resolve (MTTR), p99 latencies, high availability, and achievement of SLO targets.
Qualifications
Required:
4+ years of experience as a Site Reliability Engineer or in a similar role, with expertise in maintaining high-performance, scalable web systems with high throughput.
Strong troubleshooting abilities with demonstrated experience in incident management
Proficiency in programming languages (Python, Go, JavaScript, Ruby, or similar), SQL databases, Kubernetes, and CI/CD
Experience with monitoring and observability tools like Prometheus, Grafana, Datadog, PagerDuty, Opsgenie, or similar
Proficiency with cloud platforms and infrastructure like GCP or AWS
Excellent communication skills and proven ability to collaborate in team settings.
Nice to have:
Experience with SLO/SLI implementation
Excellent problem-solving and analytical abilities
Experience in fintech, with a focus on high-volume transaction processing
Experience with message queuing and streaming platforms like Kafka and PubSub
Familiarity with gRPC and microservices architecture
What we offer
Our people are our most valuable asset. Base salary may vary depending on relevant experience, skills, geographic location, and business needs.
Benefits:
Top-notch Private Healthcare Insurance for you and your family members
Generous PTO policy
Lunch at work
Covered costs for parking for onsite staff
Learning and development budget
Paternity leave
Hybrid work environment (with old town Belgrade office)
Diversity is a core value at Rho. We’re passionate about building and sustaining an inclusive and equitable environment for all those involved with our mission, including employees, contractors, candidates, customers and vendors. We believe every member of the Rho community enriches our ability to provide a broad range of ways to understand and engage with the market, identify problems, and drive solutions that align with our mission. We welcome all qualified applications and support each of our Rho’ers with ongoing professional growth opportunities.