Remote Otter LogoRemoteOtter

P.M

Preference Model

Preference Model is at the forefront of developing the next generation of training data to enhance the capabilities of AI. The company focuses on creating reinforcement learning (RL) environments that allow models to tackle research and engineering challenges, learn from realistic feedback loops, and improve their performance across diverse applications. With a founding team experienced in building data infrastructure and datasets for leading AI models, Preference Model collaborates with top AI labs to advance the field and is supported by prominent Silicon Valley venture capital. The company is dedicated to pushing AI closer to its transformative potential.

1 Remote Jobs at Preference Model

P.M

RL Environments Engineer (Contractor, Remote)

Preference Model

2 days ago

Join Preference Model as a remote RL Environments Engineer to design and build environments for teaching LLMs advanced reasoning and machine learning concepts.

USA
Contract
Software Development

No More Jobs Found