Back to jobs
Rakuten Viki

Senior Site Reliability (DevOps) Engineer(INPD)

SingaporePosted 3 weeks ago
senior

Job Description

Job Description:

Situated in the heart of Singapore's Central Business District, Rakuten Asia Pte. Ltd. is Rakuten's Asia Regional headquarters. Established in August 2012 as part of Rakuten's global expansion strategy, Rakuten Asia comprises various businesses that provide essential value-added services to Rakuten's global ecosystem. Through advertisement product development, product strategy, and data management, among others, Rakuten Asia is strengthening Rakuten Group's core competencies to take the lead in an increasingly digitalized world.

Rakuten Group, Inc. is a global leader in internet services that empower individuals, communities, businesses, and society. Founded in Tokyo in 1997 as an online marketplace, Rakuten has expanded to offer services in e-commerce, fintech, digital content, and communications to approximately 1.7 billion members around the world. The Rakuten Group has nearly 32,000 employees and operations in 30 countries and regions. For more information visit https://global.rakuten.com/corp/

Our Incentive Platform Department (INPD) drives Rakuten's core loyalty and coupon product strategy, executes product development, and ensures successful implementation. We empower Rakuten's ecosystem by creating highly scalable and resilient platforms that prioritize our customers. The Loyalty Platform Section is specifically responsible for developing and operating Rakuten Point and Coupon Platform, the most popular loyalty program in Japan, handling massive real-time traffic daily. We contribute directly to Rakuten's growth and user experience by delivering seamless, impactful solutions.

Why We're Hiring:

We are looking to add a senior engineer who can collaborate with developers to deliver web services faster and more resilient to failure using automation and data driven monitoring. As an experienced senior site reliability engineer, you will be involved in constructing, configuring, upgrading, and monitoring the systems that keep our services up and running in all environments. You should be adept at troubleshooting application issues at both the infrastructure and application level. You should practice sustainable incident response and blameless postmortems. Additionally, you should possess a strong desire to use automated tools to perform all these tasks in a safe and repeatable way. The ideal candidate would propose and drive initiatives to continuously improve the technology and processes to increase productivity and reduce risks.

Responsibilities:

  • Design and build infrastructure and automation to rapidly deliver our web services, specifically focusing on the Rakuten Coupon Platform and related loyalty services.

  • Troubleshoot system and network failures as well as application crashes for high-traffic coupon and loyalty platforms.

  • Create CI/CD pipelines using tools such as Jenkins or similar for robust and continuous deployment of coupon platform features.

  • Proactively engage in service capacity planning and demand forecasting, performance analysis, and system tuning to identify potential issues before they happen, ensuring the reliability of our coupon services.

  • Continuously optimize operations and reduce risk through automation and process improvement for critical loyalty and coupon systems.

  • Act as a technical expert to introduce new technologies and drive the adoption of these technologies within the Coupon Platform Team.

Work Environment & Tech Stack

  • Team & Culture: Can work in globally distributed Agile development teams with a strong sense of ownership, customer service, and integrity demonstrated through clear communication. Powering Coupons across Rakuten's vast ecosystem, the Coupon Platform Team tackles massive real-time traffic daily. We build and maintain the robust platform essential for services like Rakuten Ichiba and Travel, directly contributing to Rakuten's growth and user experience. Be part of our mission to deliver seamless, impactful solutions.

  • Languages: Java, Kotlin

  • Databases/Caches: Cassandra, ETCD, Redis, Couchbase, MySQL

  • Orchestration/Containers: Kubernetes, Docker, Istio

  • Messaging: Kafka, RabbitMQ

  • CI/CD & Monitoring: Jenkins, ELK (Elasticsearch, Logstash, Kibana), Prometheus, Grafana

  • Infrastructure as Code: Ansible, Terraform

  • Cloud Platform: GCP (Google Cloud Platform), Private Cloud Platform

Required Qualifications:

  • 5+ years deploying and managing large scale internet facing web services, with specific experience in high-traffic, real-time transaction systems like loyalty or coupon platforms.

  • Bachelor’s degree (BS) in Computer Science, Engineering or related field, or equivalent work experience.

  • Experience with DevOps processes, culture, and tools (e.g., Chef and Terraform).

  • Demonstrated experience measuring and monitoring availability, latency and overall system health for critical production systems.

  • Experience with CI/CD tools, such as Jenkins, Rundeck for release and operation automation.

  • Strong sense of ownership, customer service, and integrity demonstrated through clear communication.

  • Can work in globally distributed Agile development teams.

  • Experience with container technologies such as Docker and Kubernetes.

  • Experience with network monitoring and observability tools.

Rakuten is an equal opportunities employer and welcomes applications regardless of sex, marital status, ethnic origin, sexual orientation, religious belief, or age.

See Your Match Score

Sign up and Renata will show you how this job matches your skills and experience.

501-1000 employees
San Mateo, California, US
Website
Senior Site Reliability (DevOps) Engineer(INPD) at Rakuten Viki | Renata