Site Reliability Engineer (SRE) at Rytbank

About the team

We're building the next generation of digital banking infrastructure that combines enterprise-grade reliability with startup agility.

Our SRE team is the backbone of our technology organization, ensuring our platform delivers an "always-on" banking experience while enabling rapid innovation and growth.

You’ll collaborate with some of the sharpest minds in the industry, operating in a supportive and dynamic environment that fosters creativity, exploration, and innovation.

Your next thrilling adventure starts here. Be part of shaping the future of digital banking today!

About the Role

As a SRE, you’ll drive the development and execution of strategies for DevSecOps practices and platform. Your work will ensure seamless collaboration between technology teams, enabling fast and reliable high-quality software delivery.

You’ll be working closely with our SRE team responsible for implementing and managing Infrastructure as Code (IaC), CI/CD pipelines, cloud native & micro-services, automation frameworks, and release management processes, ensuring they align with organizational objectives.

What You'll Do

Lead the design and implementation of highly available, secure, and scalable banking infrastructure using infrastructure as code (IaC) principles
Establish and maintain SLOs/SLIs that define our reliability standards and drive accountability across engineering teams
Serve as an incident commander during critical service disruptions, leading cross-functional response teams with calm expertise
Build and enhance our observability platform, enabling real-time monitoring of our golden signals (uptime, latency, saturation, error rate)
Develop automation solutions for incident response, disaster recovery, and business continuity
Drive our DevSecOps platform to enable safe, rapid deployments through CI/CD, GitOps, and self-service capabilities
Lead FinOps initiatives to optimize infrastructure costs while maintaining performance and reliability
Mentor junior engineers and contribute to a culture of operational excellence

What We're Seeking

Strong understanding of cloud technologies (AWS, Azure, GCP, Alibaba Cloud)
Experience implementing CI/CD pipelines and GitOps workflows
Deep expertise with infrastructure as code tools (Hasicorp Terraform, OpenTofu, CloudFormation, or similar)
Proven ability to design and implement observability solutions using modern monitoring stacks
Experience leading incident response and building post-mortem processes
Strong understanding of Java or any other object-oriented programming language (OOP).
Strong understanding of containerization & orchestration.
Experience with messaging systems such as Kafka is an added advantage.
Familiarity with relational and non-relational databases is a plus.
Ability to balance hands-on technical expertise with strategic decision-making.
Strong problem-solving skills and the ability to make sound decisions under pressure.
A passion for continuous learning, innovation, and professional development.
High ownership of responsibilities, with a focus on delivering results and meeting deadlines.
Financial services experience is a plus but not required

What We Value

Revolutionary in our thinking
Innovative in our products, services and the way we work
Genuine in our intentions
Honourable in our actions
Tenacious in overcoming challenges

JR00000510

Site Reliability Engineer (SRE)

Job Description

See Your Match Score

More jobs at Rytbank

More jobs at Rytbank

Site Reliability Engineer (SRE)​

Job Description

See Your Match Score

More jobs at Rytbank

More jobs at Rytbank

Site Reliability Engineer (SRE)