Back to jobs
Dahl Consulting

Research Scientist, Gemini Vision, DeepMind

Posted 2 weeks ago

Job Description

  • Conduct original research in multimodal AI (Gemini), including vision-language models (VLMs), image understanding, OCR and document intelligence, spatial reasoning and embodied perception, image-text alignment and retrieval, agentic multimodal systems, scaling laws, and data infra, pipeline, training data attribution, and mixture optimization.
  • Design, train, and evaluate large-scale transformer-based architectures for image and video understanding.
  • Develop novel methods for multimodal pretraining, instruction tuning, alignment, and reinforcement learning.
  • Collaborate with cross-functional teams to transition research ideas into production-grade Gemini capabilities.
  • Contribute to research direction, experimental design, and scientific strategy within the Gemini organization.

See Your Match Score

Sign up and Renata will show you how this job matches your skills and experience.

Get Started Free
Research Scientist, Gemini Vision, DeepMind at Dahl Consulting | Renata