Remote Otter LogoRemoteOtter

ML Engineer - Pre Training - Remote

Posted Yesterday
Software Development
Full Time
USA

Overview

The ML Engineer - Pre Training will design and optimize large-scale pre-training systems that power Mindbeam’s generative AI models.

In Short

  • Build scalable pre-training pipelines for foundation models.
  • Implement distributed training strategies across GPUs/TPUs.
  • Collaborate with researchers for production-ready workflows.
  • Develop monitoring and fault-tolerance systems.
  • Benchmark and tune performance across hardware and software stacks.
  • Requires a Bachelor’s, Master’s, or PhD in Computer Science or related field.
  • 2+ years of experience with large-scale model training.
  • Strong coding skills in Python and familiarity with ML frameworks.
  • Experience with GPU scheduling and memory optimization.
  • Comfort with containerized environments like Docker/Kubernetes.

Requirements

  • Bachelor’s, Master’s, or PhD in Computer Science, Engineering, or related field.
  • 2+ years of experience with large-scale model training and distributed systems.
  • Strong coding skills in Python and familiarity with ML frameworks (PyTorch, TensorFlow, JAX).
  • Experience with GPU scheduling, memory optimization, and parallelism strategies.
  • Comfort with containerized and orchestrated environments (Docker/Kubernetes).
  • Understanding of high-performance computing and networking bottlenecks.

Benefits

  • Opportunity to work on cutting-edge AI technologies.
  • Collaborative and innovative work environment.
  • Flexible working hours.
  • Access to state-of-the-art resources and tools.
  • Professional growth and development opportunities.
Mindbeam logo

Mindbeam

Mindbeam is at the forefront of developing next-generation AI infrastructure tailored for both open source and enterprise applications. With a strong emphasis on research and innovation, the company is dedicated to advancing state-of-the-art AI technologies. Mindbeam fosters a collaborative environment where researchers, engineers, and visionaries work together, driven by a shared passion for curiosity and openness. The mission focuses on designing and optimizing large-scale pre-training systems that empower generative AI models, ensuring that the tools created can be built upon by others in the community.

Share This Job!

Save This Job!

Similar Jobs:

Cohere logo

Pre-Training Data Engineer - Remote

Cohere

13 weeks ago

Join Cohere as a Pre-Training Data Engineer to develop data infrastructure for advanced language models.

Worldwide
Full-time
Software Development
Anthropic logo

Research Engineer, Pre-training - Remote

Anthropic

10 weeks ago

Join Anthropic as a Research Engineer to develop the next generation of large language models, focusing on safe and steerable AI systems.

USA
Full-time
Software Development
$340,000 - $425,000 USD/year

Enbridge

Engineer in Training I - Remote

Enbridge

9 weeks ago

Join Enbridge as an Engineer in Training I to work on hydraulic system modeling and distribution optimization engineering.

Canada
Full-time
All others
Eventual logo

Software Engineer, Pre-Training/AI - Remote

Eventual

33 weeks ago

Join Eventual as a Software Engineer focused on AI Pretraining, working on cutting-edge AI research and scalable data systems.

CA, USA
Full-time
Software Development
WithersRavenel logo

Engineer in Training (EIT) - Remote

WithersRavenel

52 weeks ago

Join WithersRavenel as an Engineer in Training, a hybrid role focused on civil engineering and project management.

NC, USA
Full-time
All others