Back to jobs
Jobgether

AI Research Engineer (Model Compression & Quantization)

IndiaPosted 5 days ago
Full-timeonsite

Job Description

This position is posted by Jobgether on behalf of a partner company. We are currently looking for an AI Research Engineer (Model Compression & Quantization) in India.

This role sits at the forefront of efficient AI systems research, focusing on making large-scale multimodal models practical for real-world deployment. You will work on advancing state-of-the-art techniques in model compression, enabling LLMs and vision-language models to run efficiently on resource-constrained devices such as mobile and edge hardware. The position combines deep research with hands-on engineering, requiring you to design and optimize pipelines that reduce memory usage, latency, and compute cost without sacrificing model performance. You will explore and implement techniques such as quantization, pruning, and knowledge distillation, contributing directly to scalable AI infrastructure. Operating in a highly research-driven and experimental environment, you will collaborate with AI engineers and researchers to push the boundaries of efficient multimodal intelligence. This is a high-impact role for someone passionate about both cutting-edge AI research and real-world deployment constraints.

This position is posted by Jobgether on behalf of a partner company. We are currently looking for an AI Research Engineer (Model Compression & Quantization) in India.

This role sits at the forefront of efficient AI systems research, focusing on making large-scale multimodal models practical for real-world deployment. You will work on advancing state-of-the-art techniques in model compression, enabling LLMs and vision-language models to run efficiently on resource-constrained devices such as mobile and edge hardware. The position combines deep research with hands-on engineering, requiring you to design and optimize pipelines that reduce memory usage, latency, and compute cost without sacrificing model performance. You will explore and implement techniques such as quantization, pruning, and knowledge distillation, contributing directly to scalable AI infrastructure. Operating in a highly research-driven and experimental environment, you will collaborate with AI engineers and researchers to push the boundaries of efficient multimodal intelligence. This is a high-impact role for someone passionate about both cutting-edge AI research and real-world deployment constraints.

How Jobgether works:
We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.
We appreciate your interest and wish you the best!
 
Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.
 
 
#LI-CL1

See Your Match Score

Sign up and Renata will show you how this job matches your skills and experience.

AI Research Engineer (Model Compression & Quantization) at Jobgether | Renata