Back to jobs

This job is no longer available.

The original posting has expired, but this page is kept for context. Continue to current roles from this employer or search similar active jobs.

Dahl Consulting

Research Scientist, ML Efficiency, Google Research

Posted 1 weeks ago
No longer available

Job Description

  • Advance algorithms, sampling techniques and large-scale optimization to make serving and inference of generative AI models more efficient and flexible.This includes model compression, knowledge distillation and quantization strategies.
  • Innovate algorithms and large language model architectures that improve computation efficiency and generalization of training deep learning models.
  • Improve the end-to-end model deployment pipeline that includes entirely new formulations of pretraining, instruction tuning, reinforcement learning, thinking and reasoning.
  • Collaborate with hardware and software teams to optimize kernels and inference engines, across different hardware and model architectures.
  • Optimize latency, memory bandwidth, workloads.
Research Scientist, ML Efficiency, Google Research at Dahl Consulting | Renata