
Machine Learning Infrastructure Engineer - Remote

Posted 5 weeks ago

Overview

Waymo is an autonomous driving technology company with the mission to be the most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver (The World's Most Experienced Driver™) to improve access to mobility while saving thousands of lives now lost to traffic crashes. The Waymo Driver powers Waymo One, a fully autonomous ride-hailing service, and can also be applied to a range of vehicle platforms and product use cases. The Waymo Driver has provided over one million rider-only trips, enabled by its experience autonomously driving tens of millions of miles on public roads across 13+ U.S. states and tens of billions of miles in simulation.

In Short

  • Develop the infrastructure components necessary for distributed training
  • Implement automation solutions for provisioning, deployment, monitoring, and scaling of distributed training infrastructure
  • Monitor system health, diagnose issues, and perform routine maintenance to ensure the reliability of the distributed training infrastructure
  • Identify performance bottlenecks and optimization opportunities
  • Improve the developer experience and performance of our scalable ML framework

Requirements

  • Bachelor's degree in Computer Science, Engineering, or related field, or 4+ years equivalent experience
  • Experience building distributed systems for production environments
  • Solid Python or C++ skills
  • Prior experience with Machine Learning frameworks (e.g., TensorFlow, PyTorch) and distributed training algorithms

Benefits

  • Waymo employees are eligible to participate in Waymo’s discretionary annual bonus program, equity incentive plan, and generous Company benefits program, subject to eligibility requirements.
