Remote Otter LogoRemoteOtter

Platform Engineer – AI/ML Infrastructure - Remote

Posted Yesterday
DevOps / Sysadmin
Full Time
CA, USA

Overview

Deepgram is seeking an expert Platform Engineer to build and operate the hybrid infrastructure for AI/ML research and product development, focusing on creating a robust self-service environment using Kubernetes, AWS, and Infrastructure-as-Code.

In Short

  • Architect and maintain core computing platform using Kubernetes on AWS and on-premise.
  • Develop and manage infrastructure using Infrastructure-as-Code principles with Terraform.
  • Design and optimize AI/ML job scheduling and orchestration systems.
  • Provision and maintain on-premise bare metal server infrastructure for GPU computing.
  • Implement networking and storage solutions for hybrid environments.
  • Develop observability stack for platform health.
  • Collaborate with AI researchers to build tools and workflows.
  • Automate life cycle of single-tenant managed deployments.

Requirements

  • 5+ years of experience in Platform Engineering, DevOps, or Site Reliability Engineering.
  • Hands-on experience building and managing production infrastructure with Terraform.
  • Expert-level knowledge of Kubernetes in a large-scale environment.
  • Experience with HPC job schedulers like Slurm for GPU workloads.
  • Experience managing bare metal infrastructure.
  • Strong scripting and automation skills (Python, Go, Bash).

Benefits

  • Opportunity to work on cutting-edge technology in the AI industry.
  • Collaborative and inclusive work environment.
  • Focus on continuous improvement and developer experience.
Deepgram logo

Deepgram

Deepgram is a pioneering AI company dedicated to revolutionizing human-machine interaction through natural language processing. They provide developers with access to a powerful voice AI platform that includes advanced models for speech-to-text, text-to-speech, and spoken language understanding via a simple API call. With a focus on applications ranging from transcription to sentiment analysis and voice synthesis, Deepgram is the go-to partner for those building innovative voice AI solutions. Backed by notable investors and having raised over $85 million in funding, Deepgram is committed to fostering a diverse and inclusive workplace while driving significant advancements in the AI industry.

Share This Job!

Save This Job!

Similar Jobs:

OP Labs logo

Infrastructure Engineer, Platform - Remote

OP Labs

7 weeks ago

Join as an Infrastructure Engineer to build and operate foundational systems for the OP Stack, focusing on high-performance infrastructure and operational excellence.

Worldwide
Full-time
DevOps / Sysadmin
Astra logo

Senior Infrastructure & Platform Engineer - Remote

Astra

7 days ago

Astra is seeking a Senior Infrastructure & Platform Engineer to design and maintain scalable cloud infrastructure for their financial platform.

USA
Full-time
DevOps / Sysadmin
Fresha logo

Senior Platform Engineer (Infrastructure) - Remote

Fresha

23 weeks ago

Fresha is looking for a Senior Platform Engineer to enhance their platform and improve the software development lifecycle.

GB
Full-time
Software Development
Alma logo

Senior Platform Engineer - Platform Infrastructure - Remote

Alma

31 weeks ago

Alma is seeking a Senior Platform Engineer to design and maintain cloud infrastructure solutions.

Worldwide
Full-time
DevOps / Sysadmin
$200,000 - $220,000/year
Arista Networks logo

Platform Software Infrastructure Engineer - Remote

Arista Networks

3 weeks ago

Join Arista Networks as a Platform Software Infrastructure Engineer to enhance automation and develop tools for manufacturing diagnostics.

India
Full-time
Software Development