ML Engineer - Automated Evaluation and Adversarial Design

CupertinoPosted 1 months ago

Full-timeremote

Job Description

The Productivity and Machine Learning Evaluation team ensures the quality of AI-powered features across a suite of productivity and creative applications; including Creator Studio, used by hundreds of millions of people. This team serves as the primary evaluation function, providing critical quality signals that directly influence model development decisions and product launches. This role focuses on building and scaling automated evaluation systems and designing adversarial and stress-testing methodologies across multiple AI features. The work requires a deep understanding of how AI systems fail and how to measure quality rigorously. As features evolve from single-turn interactions into multi-turn, agentic experiences, the evaluation challenge shifts from assessing individual outputs to stress-testing entire conversation flows and agent decision chains. This is an opportunity to shape the evaluation infrastructure that determines whether AI features meet the bar for hundreds of millions of users.

See Your Match Score

About Apple

Website

More jobs at Apple

Senior Machine Learning Engineer, Wallet, Payment & Commerce

AUSTIN

Sr. Software Development Engineer (Applied ML)

Sunnyvale

Detection and Response Software Engineer |

Seattle

Hardware Engineering Program Manager, iPad and Input Devices

Cupertino

US-Manager

Apple Park Visitor Center

iPhone Product Design (PD) Budget Engineering Program Manager

Cupertino

Similar roles

Quality Engineer

Eaton Cummins Automated Transmission Technologies · Camarillo, CA, US

Quality Engineer

Eaton · Camarillo, CA, US

Senior Executive Engineer (M&E)

Atelier Ten · Singapore

Reservoir Engineer

Shell · Cyberjaya-Wisma Shell

Product Development Engineer

KLA · Hsinchu, Taiwan

Junior Engineer (m/w/d) – CNI & Cloud Networking

Logicalis Asia Pacific · Frankfurt am Main