Job Description
- Lead the design and architecture of highly scalable, fault-tolerant systems where multi-agent networks reason, plan, and execute complex workflows across vast, distributed codebases.
- Define best practices for the team and the broader organization. Blend traditional distributed systems architecture with advanced LLM orchestration, complex Retrieval-Augmented Generation (RAG) pipelines, and optimization.
- Establish the overarching technical strategy for AI quality and safety. Build automated evaluation frameworks that measure performance, enforce strict security standards, and reliably mitigate risks at scale.
- Manage the most intricate non-deterministic edge cases. Build advanced telemetry and introspection tooling that allows the entire organization to understand, debug, and optimize self-sustaining behavior.
- Drive technical alignment across local pods and global organizations. Mentor junior and mid-level engineers, translate extreme ambiguity into actionable technical roadmaps, and shape the future of AI-driven developer productivity.
