Back to jobs

Senior Staff Research Scientist, Gemini Safety Post-Training, DeepMind
Posted 3 days ago
Job Description
- Rethink how safety is trained into models, especially for agentic, long-horizon behavior.
- Design and ship post-training recipes (Reinforcement Learning (RL), Supervised Fine-Tuning (SFT), and beyond) that install safety and alignment properties into Gemini models. You own the path from research to production.
- Build the metrics and evaluations that tell us whether training is actually making models safer in deployment, not just on benchmarks.
- Work directly with the post-training pipeline and infrastructure. Partner with the AGI Safety team to bring alignment research into practical training. Translate between research and production.
- Shape the road map for where safety post-training goes next. Build and grow the team to execute on it.