Google

Software Engineering Manager, Site Reliability Engineering

Posted 3 days ago

Job Description

  • Manage the scale and availability of next-generation Workspace GenAI features. We partner with the Workspace AI SRE team to support complex agentic flows in Editors, ensuring that model-based features are fast and reliable and degrade gracefully under load.
  • Operate our newly partitioned Spanner storage topology. We have completed the physical isolation of Spanner allocations per Editor and shard, aligning storage failure domains with frontend Pod shards. The focus now is on managing elastic resource capacity through autoscaling.
  • Orchestrate large-scale, multi-system restore operations for critical customer data. We contribute to tools and playbooks to coordinate data recovery across dependencies, validating data correctness and restoring integrity after complex platform incidents.
  • Direct the resource headroom and efficiency roadmap for the Editors portfolio.
