Remote Otter LogoRemoteOtter

Infrastructure Operations Engineer (GPU Computing) - Enterprise AI - Remote

Posted 46 weeks ago
DevOps / Sysadmin
Full Time
United States

Overview

Aethir is a pioneering technology company at the forefront of GPU-based compute infrastructure, specializing in cutting-edge solutions for diverse industries ranging from AI and machine learning to high-performance computing (HPC).

In Short

  • Manage and optimize GPU-based compute infrastructure.
  • Deploy, configure, and maintain servers, storage, and networking.
  • Implement monitoring and alerting systems.
  • Develop automation scripts and tools.
  • Ensure security and compliance with regulations.
  • Provide tier-3 support for infrastructure issues.
  • Collaborate on capacity planning and scaling.
  • Maintain documentation of infrastructure configurations.
  • Participate in on-call rotation for critical incidents.
  • Foster knowledge sharing within the team.

Requirements

  • Experience with GPU-based infrastructure management.
  • Strong knowledge of monitoring and optimization techniques.
  • Proficiency in automation and orchestration tools.
  • Understanding of security best practices.
  • Experience in incident response and troubleshooting.
  • Ability to plan for capacity and scaling needs.
  • Excellent documentation skills.
  • Strong communication and collaboration abilities.

Benefits

  • Work with cutting-edge technology.
  • Dynamic and innovative work environment.
  • Opportunities for professional growth.
  • Collaborative team culture.
  • Flexible work arrangements.
Aethir logo

Aethir

Aethir is a pioneering provider of Enterprise-grade AI-focused GPU-as-a-service, leveraging a decentralized cloud computing infrastructure to connect GPU providers with enterprise clients in need of powerful GPU chips for AI and machine learning tasks. With a robust network of over 40,000 high-performance GPUs, including 3,000 NVIDIA H100s, Aethir delivers scalable and reliable GPU computing solutions. Backed by prominent Web3 investors and having raised over $130 million, Aethir is at the forefront of decentralized computing innovation, fostering a collaborative and dynamic work environment for its team.

Share This Job!

Save This Job!

Similar Jobs:

Aethir logo

Infrastructure Partner Manager - Enterprise AI GPU Compute Solutions - Remote

Aethir

46 weeks ago

Aethir is looking for an Infrastructure Partner Manager to establish and manage strategic partnerships with GPU hardware providers.

United States
Full-time
Sales / Business
Autohive logo

AI Infrastructure Engineer - Remote

Autohive

8 weeks ago

Join Autohive as an AI Infrastructure Engineer to build and optimize scalable AI infrastructure.

New Zealand
Full-time
DevOps / Sysadmin
Ritual logo

AI Infrastructure Engineer - Remote

Ritual

30 weeks ago

Join Ritual as an AI Infrastructure Engineer to design and implement innovative solutions at the intersection of AI and blockchain.

Worldwide
Full-time
Software Development
Ritual logo

AI Infrastructure Engineer - Remote

Ritual

39 weeks ago

Join Ritual to build the next generation of AI infrastructure in a fully remote environment.

Worldwide
Full-time
Software Development
Ritual logo

AI Infrastructure Engineer - Remote

Ritual

39 weeks ago

Join Ritual as an AI Infrastructure Engineer to work on pioneering blockchain solutions for AI.

Worldwide
Full-time
Software Development