RemoteOtter

Infrastructure Engineer for AI Systems - Remote

Posted 6 weeks ago

Overview

Moonvalley is building the next-generation creative studio, powered by the most capable video and image foundation models in the world. We are creating the platforms where the first generative Super Bowl ads and Oscar-winning movies will be made.

In Short

  • Manage and scale GPU infrastructure (Kubernetes, Terraform / Pulumi).
  • Maintain ETL pipelines (Spark / Ray / Airflow).
  • Oversee the telemetry platform to monitor system health (Datadog, Grafana, W&B).
  • Manage the code platform (GitHub, CI/CD, PyTorch, Python).
  • Track and optimize assets like datasets, checkpoints, and compute resources.
  • Develop tools, documentation, and guidance for the team.
  • Administer Windows clients and servers.

Requirements

  • Passion for building petabyte-scale systems that enhance efficiency and productivity.
  • Ability to balance quick fixes for urgent needs with long-term, scalable solutions.
  • Strong prioritization skills in a fast-moving, high-impact environment.
  • Comfortable using open-source tools or developing custom solutions when needed.
  • A versatile generalist, eager to learn and adapt to new tools and systems.

Benefits

  • Opportunity to work on cutting-edge AI technology.
  • Shape the future of media and entertainment.
  • Work with top AI talent.
  • Be part of a highly innovative and fast-paced environment.
  • Flexible remote work arrangements.

Similar Jobs:

Infrastructure Systems Engineer - Remote

Valore Partners

2 weeks ago

The Infrastructure Systems Engineer will specialize in Identity and Access Management (IAM) to provide consulting, design, and deployment services for critical IAM solutions.

Identity and Access Management (IAM)
Automation
User Provisioning
Role-Based Access Control (RBAC)
AZ, USA
Full-time
DevOps / Sysadmin

Systems Engineer II - Infrastructure - Remote

iT1

3 weeks ago

Join iT1 as a Systems Engineer II focusing on Infrastructure within a supportive NOC team.

Systems Engineering
Infrastructure
Network Troubleshooting
System Administration
USA
Full-time
DevOps / Sysadmin

Senior IT Infrastructure and Systems Engineer - Remote

iRhythm Technologies

2 weeks ago

Join iRhythm as a Senior IT Infrastructure and Systems Engineer to lead the design and support of endpoint technologies in a remote role based in the UK.

IT Infrastructure
Systems Engineering
Cloud Computing
AWS
UK
Full-time
DevOps / Sysadmin

AI Infrastructure Engineer - Remote

Autohive

2 weeks ago

Join Autohive as an AI Infrastructure Engineer to build and optimize scalable AI infrastructure.

AWS
AI Service Operations
Cloud Infrastructure
Automation
New Zealand
Full-time
DevOps / Sysadmin

AI Infrastructure Engineer - Remote

Ritual

24 weeks ago

Join Ritual as an AI Infrastructure Engineer to design and implement innovative solutions at the intersection of AI and blockchain.

AI
ML
Blockchain
Rust
Worldwide
Full-time
Software Development