Back to jobs
Job Description
- Define the long-term goal for repair automation of AI/ML infrastructure, focusing on achieving goals through multiple parallel programs.
- Lead and participate in the design of agentic diagnostic systems that utilize Generative AI to automate diagnoses for next-gen networks.
- Work with platform teams to integrate new hardware platforms into the automation ecosystem, driving the qualification and repair workflows required for global fleet turn-up.
- Lead critical safety initiatives, such as automated anomaly detection, to protect fleet health and capacity.
- Mentor a team of junior and executive engineers and influence engineering practices across the broader infrastructure organization to drive consistency in automation and safety standards.
