RemoteOtter

Principal Network Engineer - Remote

Posted 6 days ago
DevOps / Sysadmin
Full Time
CA, USA

Overview

The Principal Network Engineer will be responsible for the design, provisioning, and management of networks that underpin and interconnect large-scale GPU clusters on a global scale.

In Short

  • Design and implement a 400GbE spine-leaf network from scratch.
  • Automate network configuration deployment across a 20k-node cluster.
  • Manage and optimize performance of high-performance distributed storage systems.
  • Work on globally distributed cross-datacenter interconnects.
  • Experience with HPC and GPU network technologies.
  • Architect resilient high-performance networks.
  • Network automation using tools like Ansible, Bash, and Python.
  • Familiarity with network hardware from Arista, Cisco, Dell, Juniper, etc.
  • Document network configurations and processes effectively.
  • In-person work at the San Francisco office is preferred, but remote applicants are considered.

Requirements

  • Prior experience with HPC and GPU networks, including RoCEv2, InfiniBand, eBGP, and EVPN/VXLAN.
  • Experience in architecting high-performance networks.
  • Familiarity with network automation tools.
  • Comfortable with configuring next-gen firewalls.
  • Strong documentation skills.

Benefits

  • Generous equity grant.
  • Visa sponsorship available.
  • Retirement matching up to 4%.
  • Comprehensive medical, dental, and vision insurance.
  • Unlimited paid time off and 10+ observed holidays.
  • Parental leave for biological, adoptive, and foster parents.
  • Daily lunch covered for employees.
  • Unlimited office book budget.

The San Francisco Compute Company

The San Francisco Compute Company is revolutionizing the way compute resources are bought and sold, treating compute as a commodity that can be traded in real time. By creating a marketplace for compute contracts, the company aims to provide startups and compute providers with flexible pricing and booking options, ensuring that every FLOP is utilized efficiently. With a focus on large-scale GPU clusters and high-performance networks, SF Compute is dedicated to optimizing the supply chain for compute resources, making them accessible and affordable for all users.
