Back to jobs
R

Site Reliability Engineer (SRE)​

Kuala LumpurPosted 4 months ago
Full-timeremote

Job Description

About the team

We're building the next generation of digital banking infrastructure that combines enterprise-grade reliability with startup agility.

Our SRE team is the backbone of our technology organization, ensuring our platform delivers an "always-on" banking experience while enabling rapid innovation and growth.

You’ll collaborate with some of the sharpest minds in the industry, operating in a supportive and dynamic environment that fosters creativity, exploration, and innovation.

Your next thrilling adventure starts here. Be part of shaping the future of digital banking today!

About the Role

As a SRE, you’ll drive the development and execution of strategies for DevSecOps practices and platform. Your work will ensure seamless collaboration between technology teams, enabling fast and reliable high-quality software delivery.

You’ll be working closely with our SRE team responsible for implementing and managing Infrastructure as Code (IaC), CI/CD pipelines, cloud native & micro-services, automation frameworks, and release management processes, ensuring they align with organizational objectives.

What You'll Do

  • Lead the design and implementation of highly available, secure, and scalable banking infrastructure using infrastructure as code (IaC) principles
  • Establish and maintain SLOs/SLIs that define our reliability standards and drive accountability across engineering teams
  • Serve as an incident commander during critical service disruptions, leading cross-functional response teams with calm expertise
  • Build and enhance our observability platform, enabling real-time monitoring of our golden signals (uptime, latency, saturation, error rate)
  • Develop automation solutions for incident response, disaster recovery, and business continuity
  • Drive our DevSecOps platform to enable safe, rapid deployments through CI/CD, GitOps, and self-service capabilities
  • Lead FinOps initiatives to optimize infrastructure costs while maintaining performance and reliability
  • Mentor junior engineers and contribute to a culture of operational excellence

What We're Seeking

  • Strong understanding of cloud technologies (AWS, Azure, GCP, Alibaba Cloud)
  • Experience implementing CI/CD pipelines and GitOps workflows
  • Deep expertise with infrastructure as code tools (Hasicorp Terraform, OpenTofu, CloudFormation, or similar)
  • Proven ability to design and implement observability solutions using modern monitoring stacks
  • Experience leading incident response and building post-mortem processes
  • Strong understanding of Java or any other object-oriented programming language (OOP).
  • Strong understanding of containerization & orchestration.
  • Experience with messaging systems such as Kafka is an added advantage.
  • Familiarity with relational and non-relational databases is a plus.
  • Ability to balance hands-on technical expertise with strategic decision-making.
  • Strong problem-solving skills and the ability to make sound decisions under pressure.
  • A passion for continuous learning, innovation, and professional development.
  • High ownership of responsibilities, with a focus on delivering results and meeting deadlines.
  • Financial services experience is a plus but not required

What We Value

  • Revolutionary in our thinking
  • Innovative in our products, services and the way we work
  • Genuine in our intentions
  • Honourable in our actions
  • Tenacious in overcoming challenges
JR00000510

See Your Match Score

Sign up and Renata will show you how this job matches your skills and experience.

Get Started Free
Site Reliability Engineer (SRE)​ at Rytbank | Renata