RemoteOtter

Manager of Infrastructure Operations - Remote

Posted 3 weeks ago
DevOps / Sysadmin
Full Time
USA

Overview

Voltage Park is seeking a highly skilled and proactive Manager of Infrastructure Operations to lead our 24/7 Infrastructure Operations team responsible for the stability, scalability, and performance of compute, storage, and platform infrastructure. This role plays a key part in delivering always-on, high-performance environments that support AI/ML training, inference, and HPC workloads at scale. The ideal candidate combines technical depth with strong leadership skills and a passion for operational excellence.

In Short

  • Lead a 24/7 Infrastructure Operations team.
  • Develop and maintain operational runbooks and documentation.
  • Collaborate with various teams for infrastructure rollouts and upgrades.
  • Oversee observability systems and drive continuous improvements.
  • Implement best practices for system availability and performance.
  • Be available for on-call support during urgent incidents.
  • Ensure compliance with security and regulatory standards.
  • Inspire and lead a team towards common goals.
  • Promote diversity, equity, and inclusion within the team.
  • Value clear communication and documentation.

Requirements

  • Proficiency in Puppet, Terraform, and Ansible.
  • Strong scripting skills in Bash, Python, or Go.
  • Experience in managing Kubernetes clusters.
  • Track record of architecting and delivering complex systems.
  • Deep understanding of network protocols and security systems.
  • Excellent communication skills.
  • Ability to balance practical development needs with ideal architectures.
  • Strong decision-making skills.
  • Experience in conflict resolution.
  • Ability to communicate a clear vision.

Benefits

  • Full remote flexibility.
  • Opportunity to work with a motivated team.
  • Autonomy in task prioritization.
  • Support for professional development.
  • Focus on execution and collaboration.

Voltage Park

Voltage Park is a pioneering company dedicated to democratizing access to machine learning infrastructure for a diverse range of clients, including large enterprises, research universities, seed-stage startups, and nonprofits. The company stands out as the only cloud provider that offers a platform showcasing all available GPUs for rent, complete with transparent, market-based pricing and long-term reserve contracts. As a rapidly growing startup in the AI infrastructure sector, Voltage Park is committed to providing seamless compute access and fostering innovation in the field of artificial intelligence.

Similar Jobs:

Infrastructure Operations Manager - Remote

Vultr

9 weeks ago

Vultr is seeking an Infrastructure Operations Manager to lead and manage daily operations in a fully remote environment.

Worldwide
Full-time
DevOps / Sysadmin
$105,000 - $120,000/year

IT Infrastructure and Operations Manager - Remote

Hayden AI

10 weeks ago

The IT Infrastructure and Operations Manager will oversee IT systems management, ensuring security and operational efficiency.

CA, USA
Full-time
DevOps / Sysadmin

Manager, Technology Operations (Infrastructure) - Remote

ARC'TERYX

15 weeks ago

The Manager, Technology Operations (Infrastructure) leads the management of physical and cloud networks while fostering team development in a hybrid work environment.

Canada
Full-time
DevOps / Sysadmin
CAD 104,000 - CAD 130,000/year

Manager of Infrastructure - Remote

Fervo Energy

18 weeks ago

The Manager of Infrastructure will lead the planning, implementation, and maintenance of the organization's IT infrastructure.

United States
Full-time
DevOps / Sysadmin

Senior Manager of Data Infrastructure and Operations - Remote

CarGurus

7 weeks ago

CarGurus is seeking a Senior Manager of Data Infrastructure and Operations to lead their data engineering team and drive data-driven decisions across the company.

USA
Full-time
Data Analysis