Back to jobs
Google

Research Engineer, Frontier Safety Loss of Control, DeepMind

San Francisco, CA, USAPosted 3 days ago
remote

Job Description

  • Identify potential harms from misaligned agents and develop strategies for detection and prevention.
  • Implement technical controls to monitor agent thoughts, behaviour, and respond to mitigate potential harms.
  • Integrate various agent behaviour signals from across the organisation to inform response policies.
  • Conduct adversarial testing of controls.
  • Work with internal product teams to ensure that control systems are adopted over all high-risk AI surfaces.

See Your Match Score

Sign up and Renata will show you how this job matches your skills and experience.

Get Started Free
Research Engineer, Frontier Safety Loss of Control, DeepMind at Google | Renata