Senior Software Engineer, Site Reliability Engineering, Cloud IRT
Posted Yesterday
Job Description
- Engage in and improve the whole lifecycle of services, from inception and design through deployment, operation, and refinement.
- Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.
- Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.
- Build systems and tooling to support the Cloud IRT team: improve visibility into the state of Cloud, detection of large-scale issues, and communications to customers, stakeholders, and customer-facing teams.
- Participate in an on-call rotation supporting critical incident response for Google Cloud Platform (GCP).