
Deep Learning Engineer
Job Description
About Us
At Hayden AI, we are on a mission to harness the power of computer vision to transform the way transit systems and other government agencies address real-world challenges.
From bus lane and bus stop enforcement to transportation optimization technologies and beyond, our innovative mobile perception system empowers our clients to accelerate transit, enhance street safety, and drive toward a sustainable future.
About the Deep Learning Team
The Deep learning team’s work is at the crux of Hayden AI’s solutions to its customers. The team is responsible for building and maintaining advanced models that are able to perceive the world and the knowledge produced by the models is used downstream to implement various customer use-cases. The team runs a number of models both in the cloud as well as on the edge device serving different scenarios. The team also works closely with the platform team to build out Hayden AI’s MLOps infrastructure to achieve better scale and reliability.
About the Role
As a Deep Learning Engineer at Hayden, you will make key contributions towards building foundational AI capabilities for Hayden that will enable solving complex problems for Hayden AI Customers. In this role, you will build, train, evaluate and deploy state of the art ML models in production at scale. You will report into the Director of AI within the Perception Org and work closely with other engineering as well as product teams to deliver value to Hayden AI customers. We are looking for someone who has familiarity with or growing expertise in at least one of the verticals below — depth in a specific area is valued but not required on day one.
3D vision models to predict depth and 3D structure,
Video/temporal behavior models to predict intent,
Deep understanding of Vision language models and hands-on experience fine tuning them,
Deep understanding of foundation models in perception.
Deep knowledge of Nvidia edge device stack for running ML models and cuda know-how in terms delivering highly optimized models
This position is based in San Francisco and follows a hybrid schedule with at least 3 days in-office per week.
Key Responsibilities
Below are your primary responsibilities. These represent the core areas where you’ll make an impact. As part of a rapidly evolving team, we look forward to your impact expanding over time.
Come up with solutions to complex problems in perception using deep learning.
Articulate ideas both verbally and in written form culminating in clear design and project plan docs for their work
Contribute to team roadmap and planning
Work with cross functional teams across the company and contribute towards delivery of end to end solutions for the customer
Required Qualifications
The qualifications below outline the experience and skills most relevant to success in this role. We recognize that skills and potential come in many forms, and we welcome diverse experiences that advance our mission.
Experience: 1-2 years experience building and deploying machine learning models for perception in production settings.
Core Skills: Solid hands-on experience designing, training, and evaluating machine learning models for perception. Demonstrated ability to independently build ML pipelines and deploy models to cloud environments (AWS, GCP, or Azure); familiarity with MLOps practices including experiment tracking, model versioning, and automated workflows. Hands-on experience in PyTorch, Python, and related skills.
Personal Attributes:Good communicator, self starter and ability to collaborate with others, quick learner who can adapt to a fast paced startup culture
Education: Bachelors or Masters in Computer Science or related field
Nice to Have: Experience working in perception problems in self driving car companies that aligns closely with the work of this team.