Back to jobs

Staff Site Reliability Engineer, Cloud Reliability Intelligence
Posted Today
Job Description
- Own the technical roadmap and long-term architecture for the Evergreen platform, including a unified data model for promise delivery across GCP.
- Design and scale high-performance backend pipelines (Go, Java) and data-rich user interfaces (TypeScript, Angular) used by over 10,000+ Google engineers.
- Prototype and productionize LLM-based features to parse unstructured incident data, automatically file risk tickets, and suggest reliability fixes.
- Partner closely with Product Management, Data Science, and leadership to align multiple organizations on a unified approach to policy measurement and enforcement.