Remote Otter LogoRemoteOtter

RL Environments Engineer (Contractor, Remote)

Posted 2 days ago
Software Development
Contract
USA

Overview

The RL Environments Engineer will design and build MLE environments to teach LLMs better reasoning and advanced concepts from modern machine learning.

In Short

  • Remote contractor role with ≥4 hours overlap to PST.
  • Strong Python programming skills required.
  • Experience with Docker and production mindset.
  • Clear understanding of LLMs and their limitations.
  • Strong expertise in CUDA or Pallas kernel development preferred.
  • Knowledge in active DL/ML research areas is a plus.
  • Ability to meet throughput expectations and respond quickly to feedback.
  • Experience in building complex interactive RL environments.
  • Strong fundamentals and broad research interests in ML.
  • Creative problem-solving skills in RL-based learning systems.

Requirements

  • Advanced English proficiency (C1/C2).
  • Strong Python (engineering-quality, not notebook-only).
  • Docker experience with a focus on debugging and reliability.
  • Clear understanding of LLMs and their limitations.
  • Expertise in CUDA or Pallas kernel development.
  • Knowledge in generative modeling, geometry, topology, and reasoning.
  • Experience in building complex interactive RL environments.
  • Strong fundamentals in machine learning.
  • Ability to translate research into RLVR problems.
  • Creativity in solving open-ended RL-based learning challenges.

Benefits

  • Work remotely from anywhere.
  • Opportunity to work with leading AI labs.
  • Engage in cutting-edge research in AI.
  • Collaborate with experienced professionals from the AI field.
  • Flexible working hours with a focus on productivity.

P.M

Preference Model

Preference Model is at the forefront of developing the next generation of training data to enhance the capabilities of AI. The company focuses on creating reinforcement learning (RL) environments that allow models to tackle research and engineering challenges, learn from realistic feedback loops, and improve their performance across diverse applications. With a founding team experienced in building data infrastructure and datasets for leading AI models, Preference Model collaborates with top AI labs to advance the field and is supported by prominent Silicon Valley venture capital. The company is dedicated to pushing AI closer to its transformative potential.

Share This Job!

Save This Job!

Similar Jobs:

Dept logo

Senior iOS Engineer - US Remote Contractor

Dept

42 weeks ago

Join DEPT® as a Senior iOS Engineer for a remote contract role focused on innovative mobile development.

USA
Contract
Software Development
$75 - $135/hour
Nearform logo

Senior Data Engineer (Contract, Remote)

Nearform

32 weeks ago

Join Nearform as a Senior Data Engineer on a contract basis, working remotely to design and maintain data platforms.

Worldwide
Contract
Software Development

J.G

Platform Engineer (Remote / Contract)

Jun Group

40 weeks ago

Jun Group is seeking a remote Platform Engineer for a contract role to manage and optimize their infrastructure and CI/CD processes.

USA
Contract
DevOps / Sysadmin
INFUSE logo

Middle AI Engineer (Remote, Contract)

INFUSE

52 weeks ago

Join our team as an AI Engineer to develop and implement AI/ML solutions.

Worldwide
Full-time
Software Development
INFUSE logo

Middle AI Engineer (Remote, Contract)

INFUSE

52 weeks ago

Join our team as an AI Engineer to develop and implement AI/ML solutions using Python and C#.

Worldwide
Full-time
Software Development