Back to jobs
P

ML Research Engineer, AI Evaluation Platform

SeattlePosted 1 months ago
Full-timeremote

Job Description

AI systems are only as trustworthy as the methods used to evaluate them. At Apple, where AI powers experiences for billions of people, getting evaluation right is not a support function—it is a foundational science. Our team, part of Apple Services Engineering, is building that scientific foundation: rigorous, scalable evaluation methodology for LLMs, agentic systems, and human-AI interaction. What makes this team unusual is its interdisciplinary core. You will work alongside measurement scientists (psychometrics, validity theory), ML researchers, and platform engineers—bringing together ML research, statistical rigor, and production engineering. We are looking for an ML Research Engineer who can move fluidly across this landscape: someone who loves implementing the latest techniques in AI, has the engineering instincts to make them robust and scalable, and thrives at the intersection of research and production.

See Your Match Score

Sign up and Renata will show you how this job matches your skills and experience.

ML Research Engineer, AI Evaluation Platform at Pineapple Hospitality Company | Renata