Senior Software Engineer, Site Reliability Engineering, Vertex AI 3P SRE
Posted 2 weeks ago
Job Description
- Engage in and improve the whole lifecycle of services, from inception and design through to deployment, operation, and refinement.
- Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.
- Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.
- Scale systems sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and velocity.
- Practice sustainable incident response and blameless post-mortems.