Remote Otter LogoRemoteOtter

Infrastructure Engineer (Compute) - Remote

Posted 10 weeks ago
DevOps / Sysadmin
Full Time
USA

Overview

Fluidstack is the AI Cloud Platform. We build GPU supercomputers for top AI labs, governments, and enterprises. Our customers include Mistral, Poolside, Black Forest Labs, Meta, and more.

In Short

  • Design and implement GPU/ASIC infrastructure at the server, rack, and system level.
  • Troubleshoot complex GPU and compute system related failures.
  • Develop and maintain hardware/firmware management services.
  • Automate all aspects of the server lifecycle.
  • Own end-to-end compute lifecycle, including partnering with vendors on RMAs.
  • Serve as the main point of contact for hardware escalation and troubleshooting.
  • Monitor system performance, identifying and resolving bottlenecks.
  • Automate deployment and management tasks to improve efficiency.
  • Collaborate with storage and network teams to ensure cohesive infrastructure operations.

Requirements

  • 5+ years of experience in compute infrastructure engineering.
  • Strong knowledge of Linux systems administration and performance tuning.
  • Experience with bare metal provisioning tools (MaaS, Metal3, Tinkerbell, or other).
  • Familiarity with GPU hardware and workload optimization, especially kernel and driver level requirements.
  • Proficiency in automation tools (e.g., Ansible, Terraform).
  • Experience operating Kubernetes and SLURM clusters.

Benefits

  • Competitive total compensation package (salary + equity).
  • Retirement or pension plan, in line with local norms.
  • Health, dental, and vision insurance.
  • Generous PTO policy, in line with local norms.
  • Fluidstack is remote first, but has offices in key hubs. For all other locations, we provide access to WeWork.

FluidStack

FluidStack

FluidStack is an innovative AI cloud company that collaborates with leading AI firms globally, including notable names like Poolside, Meta, Modal, and Reka. The company specializes in providing high-performance computing (HPC) as a service, ensuring that its GPU infrastructure operates at peak performance while offering exceptional support to its customers. FluidStack is committed to scaling its operations through automation and efficient deployment of new clusters, making it a key player in the AI cloud industry.

Share This Job!

Save This Job!

Similar Jobs:

Hexa People logo

Infrastructure Engineer - Remote

Hexa People

6 days ago

Join our team as an Infrastructure Engineer responsible for managing and optimizing cloud infrastructure.

Worldwide
Full-time
DevOps / Sysadmin
G2i logo

Infrastructure Engineer - Remote

G2i

1 week ago

Join a remote team as an Infrastructure Engineer to design and maintain Kubernetes clusters and ensure system reliability.

Worldwide
Full-time
DevOps / Sysadmin
75000 USD/year

N.M.S.B

Infrastructure Engineer - Remote

NTT MSC Sdn. Bhd

1 week ago

Join NTT DATA as an Infrastructure Engineer, where you will support the installation and maintenance of network cabling systems in a remote work environment.

USA
Full-time
DevOps / Sysadmin
$84,900 - $106,100/year
River logo

Infrastructure Engineer - Remote

River

2 weeks ago

River is looking for a security-minded Infrastructure Engineer to enhance and secure their systems, primarily using Google Cloud.

USA
Full-time
DevOps / Sysadmin
$150,000 - $220,000/year
Masabi logo

Infrastructure Engineer - Remote

Masabi

3 weeks ago

Join Masabi as an Infrastructure Engineer to ensure platform reliability and scalability in a fully remote role based in Colombia.

Colombia
Full-time
DevOps / Sysadmin