Remote Otter LogoRemoteOtter

Machine Learning Engineer, Ads Training Platform - Remote

Posted 2 days ago
Software Development
Full Time
Worldwide
$185,800 - $260,100 USD

Overview

Reddit is looking for a Machine Learning Engineer to design and maintain large-scale distributed training infrastructure for its Ads ML models, enabling fast and reliable model training across large datasets.

In Short

  • Design and maintain distributed training infrastructure.
  • Develop tools on the Ray platform.
  • Debug and profile distributed training jobs.
  • Improve data access patterns with object storage integration.
  • Collaborate with ML engineers to optimize training efficiency.
  • Enhance scheduling and fault tolerance in the training platform.

Requirements

  • 3+ years in infrastructure/platform engineering.
  • 2+ years experience with the Ray platform.
  • Strong understanding of distributed computing principles.
  • Experience with distributed storage systems.
  • Proven debugging skills for distributed jobs.
  • Experience with deep learning frameworks like PyTorch or TensorFlow is a plus.
  • Bonus: experience in model optimization for distributed training.

Benefits

  • Comprehensive Healthcare Benefits.
  • 401k Match.
  • Family Planning Support.
  • Gender-Affirming Care.
  • Mental Health & Coaching Benefits.
  • Flexible Vacation & Global Days off.
  • Generous paid Parental Leave.
  • Paid Volunteer time off.
Reddit logo

Reddit

Reddit is a dynamic online platform that fosters community engagement and discussion across a wide range of topics. As a leading social news aggregation and discussion website, Reddit connects millions of users who share content and participate in conversations. The company is committed to understanding user needs and enhancing the user experience through innovative research and collaboration among cross-functional teams. Reddit values growth and is focused on driving product strategy through actionable insights, making it an exciting place for professionals passionate about user experience and research.

Share This Job!

Save This Job!

Similar Jobs:

Coinbase logo

Machine Learning Engineer - Platform - Remote

Coinbase

18 weeks ago

Join Coinbase as a Machine Learning Engineer to develop innovative solutions utilizing ML and GenAI in the blockchain space.

USA
Full-time
Software Development
$152,405 - $179,300 USD/year
Artera logo

Machine Learning Engineer (Platform) - Remote

Artera

19 weeks ago

Join Artera as a Machine Learning Engineer to develop scalable AI solutions for cancer therapy.

USA
Full-time
Software Development
Khealth logo

Machine Learning Platform Engineer - Remote

Khealth

24 weeks ago

Join K Health as a Machine Learning Platform Engineer to develop cutting-edge AI solutions for healthcare.

Worldwide
Full-time
Software Development
ABBYY logo

Machine Learning Platform Engineer - Remote

ABBYY

26 weeks ago

ABBYY is seeking a Machine Learning Platform Engineer to implement and maintain platform components for ML systems.

India
Full-time
Software Development
Coinbase logo

Machine Learning Engineer - AI Platform - Remote

Coinbase

4 weeks ago

Join Coinbase as a Machine Learning Engineer to build AI infrastructure that enhances customer experiences and drives productivity.

USA
Full-time
Software Development
$152,405 - $179,300 USD/year