Job Description
This position is listed on behalf of a partner company, who manages all applications and next steps. Our partner is looking for a Site Reliability Engineer (SRE) based in Brazil.
This role is at the core of ensuring reliability, scalability, and performance across mission-critical systems in a highly innovative technology environment. You will be responsible for shaping and evolving observability, incident response, and automation practices that directly impact platform stability and customer experience. Acting as a bridge between development, platform, and security teams, you will help define operational excellence standards and drive a “software as operations” mindset. The environment is fast-paced, collaborative, and strongly oriented toward engineering ownership and continuous improvement. You will work on distributed systems running in Kubernetes-based infrastructures, with strong emphasis on resilience and proactive problem-solving. A key part of your mission will be reducing manual operational work through automation and AI-driven approaches (AIOps). This is a high-impact role where your work will directly improve system reliability and engineering efficiency at scale.
This position is listed on behalf of a partner company, who manages all applications and next steps. Our partner is looking for a Site Reliability Engineer (SRE) based in Brazil.
This role is at the core of ensuring reliability, scalability, and performance across mission-critical systems in a highly innovative technology environment. You will be responsible for shaping and evolving observability, incident response, and automation practices that directly impact platform stability and customer experience. Acting as a bridge between development, platform, and security teams, you will help define operational excellence standards and drive a “software as operations” mindset. The environment is fast-paced, collaborative, and strongly oriented toward engineering ownership and continuous improvement. You will work on distributed systems running in Kubernetes-based infrastructures, with strong emphasis on resilience and proactive problem-solving. A key part of your mission will be reducing manual operational work through automation and AI-driven approaches (AIOps). This is a high-impact role where your work will directly improve system reliability and engineering efficiency at scale.
