Job Description
- Become a trusted advisor to customers, helping them understand and incorporate AI accelerators into their overall cloud and IT strategy by designing training and inferencing platforms.
- Demonstrate how Google Cloud is differentiated, highlighting the power of accelerators by working with customers on POCs, demonstrating features, optimizing model performance, profiling, and benchmarking.
- Design and implement multi-host AI training and inferencing solutions on Google Cloud TPUs, focusing on scalability and performance tuning.
- Conduct performance profiling and optimization of customer models and data pipelines for the TPU architecture, identifying and resolving performance issues.
- Advise customers on best practices for integrating their MLOps workflows with the Google Cloud AI Platform ecosystem for TPU utilization.
