We are seeking an experienced Site Reliability Engineer to join our team and help ensure the reliability, scalability, and performance of our client’s systems in both LATAM and the USA. You will work remotely from anywhere in Latin America, in an agile environment, with an awesome team implementing world-class software products.
Activities include designing, developing, installing, and maintaining software solutions that improve the efficiency of cloud operations.
Work with engineering teams to refine deployment and release processes.
Work closely with development teams to ensure that our systems are designed and implemented to be highly available and scalable.
Implement and maintain system security measures.
Monitor systems to collect metrics for tuning and capacity planning.
Automate the deployment, scaling, and monitoring of our systems.
Requirements
5 years of experience related to the role.
Experience with cloud infrastructure providers such as AWS, GCP, or Azure.
Strong experience with automation and configuration management tools such as Ansible.
Familiarity with containerization and orchestration technologies such as Docker and Kubernetes.
Experience with monitoring and logging tools such as Elasticsearch.
Experience with Linux and Windows systems administration.
Strong skills in languages such as Java, Python, C, Ruby, or Unix shell scripting.
Comfortable scripting and debugging.
Nice to have
Bachelor's or graduate degree in computer engineering, computer science, engineering, or information systems management, or equivalent experience.
Proficient in English.
Experience with Dynatrace.
Benefits
A stable, long-term contract.
Continuous training.
Private health insurance.
Flexible schedule.
Work with some of the most talented software engineers in Latin America and the US, doing challenging work and building world-class software for clients in the US and worldwide.