RemoteOtter

Machine Learning Operations Manager - Remote

Posted 2 days ago
Software Development
Full Time
India

Overview

As the Machine Learning Operations Manager, you will oversee the end-to-end ML lifecycle — from model training and deployment to monitoring and optimization. You will lead a small, high-performing team of engineers while remaining hands-on in building scalable, reliable, and efficient ML infrastructure.

In Short

  • Manage training infrastructure, experiment tracking, deployment, and continuous optimization.
  • Partner with research teams to streamline training, evaluation, and fine-tuning workflows.
  • Mentor and guide a small team of ML engineers (3–4) while remaining a hands-on individual contributor.
  • Improve latency, throughput, and cost efficiency; ensure robust packaging and runtime reliability.
  • Develop systems for CI/CD, versioning, rollback, A/B testing, monitoring, and alerting.
  • Maintain scalable, secure, and compliant AI environments across training and inference stages.
  • Collaborate with cloud providers (AWS, GCP, Azure) and AI platforms to enhance tooling and optimize costs.
  • Support GenAI and AI-driven projects across teams beyond core MLOps responsibilities.
  • Contribute to architectural planning, documentation, and the continuous evolution of the ML stack.
  • Promote automation, MLOps standards, and operational excellence throughout the ML lifecycle.

Requirements

  • 5+ years of hands-on experience in MLOps or ML/AI Engineering.
  • Strong understanding of ML/DL concepts and applied experience in model training and deployment infrastructure.
  • Proficiency with cloud-native ML tools (e.g., GCP Vertex AI, AWS SageMaker, Kubernetes).
  • Experience working across both model training and inference systems.
  • Familiarity with model optimization methods such as quantization and distillation, and with inference optimization tools such as TensorRT or FasterTransformer.
  • Demonstrated ability to lead complex technical projects independently.
  • Excellent communication and collaboration skills with a cross-functional mindset.
  • Ownership-oriented approach and comfort bringing clarity to ambiguous situations.

Benefits

  • Opportunity to lead and shape a high-performing team.
  • Work on cutting-edge ML infrastructure and technologies.
  • Collaborative and innovative work environment.
  • Flexible working hours and remote work.
  • Professional development and growth opportunities.

Weekday AI

Weekday AI, a Y Combinator-backed company, operates as a sourcing engine on autopilot, helping companies hire engineers through referrals from other tech professionals. Ranked #1 on Product Hunt, Weekday AI offers startups a streamlined hiring process that includes automated outreach and instant reference checks, enabling companies to receive qualified candidates within four days of signing up.
