Remote Otter LogoRemoteOtter

Lead Platform Engineer (HPC & Stateless Linux) - Remote

Posted 1 week ago
DevOps / Sysadmin
Contract
Worldwide

Overview

The Project involves implementing a new on-premises Linux cluster to support rendering and production workloads across European branches, transitioning to a modern stateless architecture.

In Short

  • Implement on-premise cluster using Warewulf.
  • Deploy and configure SLURM as the primary scheduler.
  • Manage environment through Proxmox and design container images via Singularity / Apptainer.
  • Implement Icinga for monitoring and build a custom Conda repository.
  • Collaborate on network architecture and support CI workflows via GitLab CI.
  • Expert-level Linux system administration required.
  • Proven experience with large-scale compute environments.
  • Hands-on experience with stateless deployments and container technologies.
  • Proficiency in Python or Bash for automation.
  • Ability to make high-level architectural decisions.

Requirements

  • Must have expert-level Linux mastery.
  • Experience building or operating large-scale compute environments.
  • Hands-on experience with stateless deployments and virtualization.
  • Proficiency in scripting with Python or Bash.
  • Ability to own the foundational layer of a project.
  • Nice to have experience with SLURM, Warewulf, or similar tools.
  • Experience with Infrastructure-as-Code tools like Ansible or Terraform.
  • Experience in research computing or advanced university labs.

Benefits

  • Flexible working hours.
  • Opportunity to work on cutting-edge technology.
  • Collaborative environment with R&D and IT teams.
  • Initial project phase estimated at 4–8 weeks.
  • Consistent availability during the project window.
PFX logo

PFX

PFX is a dynamic and innovative company specializing in visual effects and compositing for various media projects. With a focus on delivering high-quality results in a fast-paced environment, PFX is dedicated to pushing the boundaries of creativity and technology. The company values skilled professionals who are passionate about their craft and are ready to contribute to exciting short-term projects.

Share This Job!

Save This Job!

Similar Jobs:

Grwm logo

Lead Platform Engineer - Remote

Grwm

30 weeks ago

Join Grwm. as a Lead Platform Engineer to develop user-friendly mobile applications and contribute to the future of social shopping.

Germany
Full-time
Software Development
Solace logo

Lead Platform Engineer - Remote

Solace

33 weeks ago

Join Solace as a Lead Platform Engineer to drive AWS migration and optimize developer workflows in a remote, mission-driven team.

USA
Full-time
DevOps / Sysadmin
Xero logo

Lead Engineer - Platform - Remote

Xero

33 weeks ago

Join Xero as a Lead Engineer to provide technical leadership and drive innovation in platform development.

NZ
Full-time
Software Development

Benepass

Lead Platform Engineer - Remote

Benepass

35 weeks ago

Lead the Site Reliability Engineering and DevOps functions at Benepass, ensuring reliability and scalability of core systems.

USA
Full-time
DevOps / Sysadmin
$170,000 - $195,000/year
TetraScience logo

Lead Platform Engineer - Remote

TetraScience

36 weeks ago

Join TetraScience as a Lead Platform Engineer and help scale their cloud-native data platform to support significant growth.

MA, USA
Full-time
Software Development