Reinforcement Learning Environment Engineer

Open Data Science · san francisco, san francisco county, ca, United-States

Location
san francisco
Job Type
Full-time
Posted
June 07, 2026

Job Description

Reinforcement Learning Environment Engineer

RL Environments; MLE; LLM Tasks; Difficulty Distribution; Remote Contractor; PST Overlap (≥4h); Advanced English (C1/C2);

We’re hiring RL Environments Engineers to design and build MLE/SWE environments that deliver high-quality, diverse tasks with minimal supervision. You will target a specific language model, meet a defined difficulty distribution, and deliver about one task every 10 hours. This is a remote contractor role with ≥4 hours overlap to PST and advanced English (C1/C2) required.

About the company

Preference Model is building the next generation of training data to power the future of AI. Today's models are powerful but fail to reach their potential across diverse use cases because so many of the tasks that we want to use these models for are outside of their training data distribution. Preference Model creates reinforcement learning environments that encapsulate real-world use cases...

Ready to Apply?

Submit your application for Reinforcement Learning Environment Engineer at Open Data Science

Apply Now