Job Description
This position is listed on behalf of a partner company, who manages all applications and next steps. Our partner is looking for a Senior Machine Learning Engineer (Token Factory) based in France.
This role sits at the intersection of large-scale AI systems and high-performance infrastructure, focusing on optimizing how foundation models are trained and served at scale.
You will contribute to a cutting-edge inference and fine-tuning platform designed to push modern LLMs to their performance limits across massive GPU fleets.
The work directly impacts throughput, latency, and cost efficiency for next-generation AI workloads used in production environments.
You will collaborate with highly specialized engineers across ML, systems, and infrastructure domains in a fast-moving, research-driven environment.
The role combines deep ML expertise with systems-level engineering, requiring strong understanding of both model architecture and hardware behavior.
You will help design and improve critical components such as inference engines, training pipelines, and GPU optimization strategies.
This position is listed on behalf of a partner company, who manages all applications and next steps. Our partner is looking for a Senior Machine Learning Engineer (Token Factory) based in France.
This role sits at the intersection of large-scale AI systems and high-performance infrastructure, focusing on optimizing how foundation models are trained and served at scale.
You will contribute to a cutting-edge inference and fine-tuning platform designed to push modern LLMs to their performance limits across massive GPU fleets.
The work directly impacts throughput, latency, and cost efficiency for next-generation AI workloads used in production environments.
You will collaborate with highly specialized engineers across ML, systems, and infrastructure domains in a fast-moving, research-driven environment.
The role combines deep ML expertise with systems-level engineering, requiring strong understanding of both model architecture and hardware behavior.
You will help design and improve critical components such as inference engines, training pipelines, and GPU optimization strategies.
