Back to jobs
Apple

ML Engineer - Automated Evaluation and Adversarial Design

CupertinoPosted 1 months ago
Full-timeremote

Job Description

The Productivity and Machine Learning Evaluation team ensures the quality of AI-powered features across a suite of productivity and creative applications; including Creator Studio, used by hundreds of millions of people. This team serves as the primary evaluation function, providing critical quality signals that directly influence model development decisions and product launches. This role focuses on building and scaling automated evaluation systems and designing adversarial and stress-testing methodologies across multiple AI features. The work requires a deep understanding of how AI systems fail and how to measure quality rigorously. As features evolve from single-turn interactions into multi-turn, agentic experiences, the evaluation challenge shifts from assessing individual outputs to stress-testing entire conversation flows and agent decision chains. This is an opportunity to shape the evaluation infrastructure that determines whether AI features meet the bar for hundreds of millions of users.

See Your Match Score

Sign up and Renata will show you how this job matches your skills and experience.

ML Engineer - Automated Evaluation and Adversarial Design at Apple | Renata