Research Engineer, Frontier Safety Loss of Control, DeepMind

San Francisco, CA, USAPosted 3 days ago

remote

Identify potential harms from misaligned agents and develop strategies for detection and prevention.
Implement technical controls to monitor agent thoughts, behaviour, and respond to mitigate potential harms.
Integrate various agent behaviour signals from across the organisation to inform response policies.
Conduct adversarial testing of controls.
Work with internal product teams to ensure that control systems are adopted over all high-risk AI surfaces.

Software Product Manager

Multi-Physics Simulation and Test Engineer

Product Marketing Manager, Retail Ads

Account Strategist, Engage, Google Customer Solutions

Senior Network Engineer, Global Network Edge

Platform Solutions Architect III, Retail, Google Cloud

Research Engineer, Frontier Safety Loss of Control, DeepMind at Google | Renata

Job Description