Back to jobs
Job Description
- Design and create novel data tooling to accelerate Gemini model evaluation, training, and hill climbing to improve agentic capabilities.
- Facilitate ingestion and creation of corpora representing complex worlds, and record human, agentic, and hybrid trajectories through the Reinforcement Learning (RL) environments.
- Build scalable data collection pipelines bridging capturing multi-turn, tool-using agent interactions and enabling rapid iteration on environment complexity and reward design.
- Create human-in-the-loop annotation and trajectory review tooling, analytics dashboards, and agentic orchestration frameworks to continuously generate, curate, and validate high-signal training corpora at scale.
