P.M

Preference Model

Preference Model is at the forefront of developing the next generation of training data to enhance the capabilities of AI. The company focuses on creating reinforcement learning (RL) environments that allow models to tackle research and engineering challenges, learn from realistic feedback loops, and improve their performance across diverse applications. With a founding team experienced in building data infrastructure and datasets for leading AI models, Preference Model collaborates with top AI labs to advance the field and is supported by prominent Silicon Valley venture capital. The company is dedicated to pushing AI closer to its transformative potential.

1 Remote Jobs at Preference Model

P.M

RL Environments Engineer (Contractor, Remote)

Preference Model

26 weeks ago

Preference Model

Join Preference Model as a remote RL Environments Engineer to design and build environments for teaching LLMs advanced reasoning and machine learning concepts.

USA

Contract

Software Development

26 weeks ago

No More Jobs Found