Senior Site Reliability Engineer (SRE) | LATAM

ColombiaPosted 1 weeks ago

hybrid

Job Description

Hi, this is Monica Hernandez, Founder and CEO of MAS Global. I started MAS with the idea that we could be more than a business, that’s why we like to say that MAS is More.

I was born and raised in Medellin, Colombia and thanks to a scholarship I became a Software Engineer and built a career in the US, where I now live. Starting MAS was my way to give back to the community, building a bridge between the US and Colombia and rest of Latin America, while creating opportunities in technology for other women, latinos and minorities across the globe.

Join our team for exciting opportunities with large scale projects, to experience a multicultural environment with more than 10 nationalities, all united by the same values.

We are All-In for our clients and our people, we live collaboration so we like to say Juntos Somos MAS, we care about community impact and helping each other (we have a Foundation!), we love providing a positive MAS experience to those we touch, and we stay curious so we are in constant learning and knowledge sharing mode.

Vamos por MAS?

Who We Are

At MAS Global Consulting, we are a premium digital engineering partner delivering technology solutions to some of the world’s most innovative companies — from high-growth startups to Fortune 500 enterprises.
With a people-first culture and a commitment to excellence, we combine nearshore talent, agile delivery, and technical depth to build scalable, high-impact software solutions.

Our teams comprise experienced technologists who are passionate about innovation, collaboration, and delivering measurable value to our clients.

Who You Are

You are an experienced Site Reliability Engineer who thrives on building resilient, scalable, and highly reliable systems. You have a strong background in cloud infrastructure, automation, and DevSecOps practices, focusing on improving system stability, performance, and operational efficiency.

You enjoy working closely with product and engineering teams, translating operational needs into reliable solutions, and continuously optimizing workflows through automation and modern reliability engineering principles.

What You’ll Do

As a Senior Site Reliability Engineer, you will play a key role in ensuring the reliability, scalability, and security of production environments. You will drive automation initiatives, improve monitoring strategies, and support the overall stability of critical systems while promoting best-in-class DevSecOps practices.

Key Responsibilities

Design, build, and deploy solutions that enhance system reliability and optimize operational efficiency.
Develop and optimize CI/CD pipelines to ensure secure and efficient delivery processes.
Provide technical guidance and mentorship on DevSecOps and SRE best practices.
Collaborate with product teams to understand system requirements and reliability needs.
Conduct root cause analysis and post-mortems to prevent incident recurrence through code-driven solutions.
Implement robust monitoring, alerting, and security scanning mechanisms.
Support incident resolution and assist operational teams in troubleshooting production issues.
Promote and implement modern technologies and workflows to improve system performance.
Automate processes to reduce manual operational effort and improve response times.
Provide after-hours emergency support when required.

What You Bring

Technical Skills

5+ years of experience in Site Reliability Engineering, DevOps, or reliability-focused roles, designing, building, and deploying solutions that improve system reliability and operational efficiency.
Strong experience improving reliability through root cause analysis, post-mortems, and code-based prevention of recurring incidents.
Proven ability to design and guide effective CI/CD pipelines while applying DevSecOps best practices.
Solid experience working with AWS in scalable and highly available environments.
Hands-on experience with Terraform and Ansible for infrastructure automation.
Experience implementing monitoring and security scanning solutions.
Strong background managing containerized environments using Docker and Kubernetes.
Ability to identify and implement automation to reduce manual support workload.
Experience with Git, GitLab, and Artifactory for version and artifact management.
Proficiency in Linux and/or Windows scripting to support operational processes.
Experience supporting incident resolution, collaborating with product teams, and providing after-hours support when required.
English proficiency from Intermediate (B2) or higher.

Apply for this position

Required*

See Your Match Score

About masglobalconsulting

More jobs at masglobalconsulting

Senior AI Engineer (Backend - Python)

Colombia

Fullstack Developer - Java + React or Angular | New York, NY

New York, NY

Salesforce Architect – Sales, Service & Experience Cloud

Colombia

Site Reliability Engineer

Colombia

DevOps Engineer

Colombia

Fullstack Developer - Java + React or Angular | Jersey City, NJ

Jersey City, NJ

Similar roles

Executive Director, Site Payment Services

Jobgether · US

Senior Software Engineer, Site Reliability Engineering

Jobgether · Canada

Senior Software Engineer, Site Reliability Engineering

Jobgether · US

Associate Site Merchandiser

Chalhoub Group · Dubai, United Arab Emirates

Site Monitor II - FSP

Jobgether · Brazil

UI/UX Designer (Website)

iwoca Deutschland · London

$50K - $80K