Tech Lead Manager, Kubernetes AI Infrastructure
Kirkland, WA, USA
Posted 2 weeks ago
On-site
Job Description
- Design, guide, and vet system designs within the scope of the broader area, and write system development code to solve ambiguous problems.
- Design, develop, and maintain Kubernetes-based systems to manage large-scale TPU infrastructure for on-premises and hybrid environments.
- Oversee a team of software engineers specializing in distributed systems and AI infrastructure, while fostering a high-performance, collaborative team environment.
- Collaborate with major frontier AI labs to influence the AI infrastructure roadmap and promote the use of TPUs for advanced ML workloads.
- Work with cross-functional partners to deploy new infrastructure management tools that enhance Kubernetes' ability to handle large-scale GenAI tasks.