Job Description
This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Reinforcement Learning Engineer in the United States.
This role focuses on designing, training, and deploying advanced reinforcement learning systems that solve complex sequential decision-making problems where traditional supervised learning approaches are insufficient. You will work on building intelligent agents that learn through interaction, with applications spanning simulation environments and real-world production systems. The position blends deep research in modern RL methods with hands-on engineering to ensure models are scalable, stable, and safe in production. You will contribute to shaping reward systems, training infrastructure, and evaluation frameworks that directly influence model behavior. The environment is highly technical and research-driven, requiring close collaboration with applied scientists and product teams. This is a high-impact role where your work will transition cutting-edge RL techniques into production-ready systems. You will help define how intelligent agents are trained, evaluated, and continuously improved at scale.
This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Reinforcement Learning Engineer in the United States.
This role focuses on designing, training, and deploying advanced reinforcement learning systems that solve complex sequential decision-making problems where traditional supervised learning approaches are insufficient. You will work on building intelligent agents that learn through interaction, with applications spanning simulation environments and real-world production systems. The position blends deep research in modern RL methods with hands-on engineering to ensure models are scalable, stable, and safe in production. You will contribute to shaping reward systems, training infrastructure, and evaluation frameworks that directly influence model behavior. The environment is highly technical and research-driven, requiring close collaboration with applied scientists and product teams. This is a high-impact role where your work will transition cutting-edge RL techniques into production-ready systems. You will help define how intelligent agents are trained, evaluated, and continuously improved at scale.
