Jobgether

Lead Site Reliability Engineer

Canada | Posted Today
Full-time | Hybrid

Job Description

This position is listed on behalf of a partner company, which manages all applications and next steps. Our partner is looking for a Lead Site Reliability Engineer based in Canada.

This is a high-impact technical leadership role focused on improving reliability across large-scale distributed systems that directly impact millions of customers. You will sit at the core of incident response and production stability, working across engineering teams to identify systemic failure patterns and eliminate them at the root. The role blends hands-on engineering with cross-functional influence, requiring you to translate real production incidents into durable architectural and operational improvements. You will help define and elevate reliability standards across the organization, shaping how systems are built, deployed, and operated. Beyond incident response, you will drive long-term resilience through observability, automation, and safer deployment practices. This is a highly collaborative environment where influence matters as much as execution, and where your work compounds across teams and services. You will also help mature a growing SRE practice, moving it from reactive incident handling to proactive reliability engineering.

How Jobgether works:
We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.
We appreciate your interest and wish you the best!
 
Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.
 
 
