<div> <br><span>We’re looking for an MLE to build and scale distributed reinforcement learning systems for model training. You’ll deploy elastic environment microservices, design reward systems and optimize multi-node and multi-datacenter training pipelines.</span><br><h2><b>Responsibilities:</b></h2> <ul> <li aria-level="1"><span>Designing and implementing RL pipelines from reward modeling to policy optimization</span></li> <li aria-level="1"><span>Optimizing RL training stability and sample efficiency for large models</span></li> <li aria-level="1"><span>Verifying numerical correctness across inference and training</span></li> <li aria-level="1"><span>Performance engineering on trainer-inference communication</span></li> <li aria-level="1"><span>Validating methods from recent publications</span></li> </ul> <h2><b>Qualifications:</b></h2> <ul> <li aria-level="1"><span>Hands-on experience with reinforcement learning in production systems</span></li> <li aria-level="1"><span>Deep understanding of policy-space methods (GRPO, PPO, etc.)</span></li> <li aria-level="1"><span>Experience profiling distributed systems</span></li> </ul> <h2><b>Preferred:</b></h2> <ul> <li aria-level="1">History of OSS contributions</li> <li aria-level="1"><span>Knowledge of TorchTitan and SGLang or vLLM</span></li> </ul> </div>

(Other)

Software Engineering

Data Science

Nous Research is an applied research group focused on LLM architecture, data synthesis, and local inference. We are launching a composer for AI orchestration called Nous-Forge soon.

Saratoga, CA, USA

New York, NY, USA

Nous Research

Full TimeMachine Learning Engineer (Reinforcement Learning)

Collab+Currency Management, LLC

Collab+Currency

Tell us about your professional DNA to get discovered by any company in our network with opportunities relevant to your career goals.

Leverage our network to build your career.

#### Welcome!

Thank you for joining Collabcurrency Network!

To help us best support you in your search, please take a few minutes to tell us about what you are looking for in your next role. We’ll use this information to connect you to relevant opportunities in the Collabcurrency network as they come up.

You can always update this information later.

Welcome to the Collabcurrency talent network

As our companies grow, they look to us to help them find the best talent.

Signal that you'd be interested in working with a Collabcurrency company to help us put the right opportunities at great companies on your radar. The choice to pursue a new career move is then up to you.