Back to jobs
Job Description
- Build context engineering pipelines, agentic workflows with tool usage, and robust evaluation frameworks.
- Analyze model behavior, creating high-quality evaluation datasets to identify weaknesses and guide performance improvements.
- Enhance serving and evaluation infrastructure to power new user-facing features.
- Act as a technical bridge between engineering, product, UX, and research teams to translate user needs into valuable features.
- Analyze user metrics and model outputs to enhance personalization and overall system helpfulness.
