Remote Otter LogoRemoteOtter

Senior DevOps Engineer - Remote

Posted 6 weeks ago
DevOps / Sysadmin
Full Time
FL, USA

Overview

We're building AI infrastructure that scales. With 5 distinct environments managing complex multi-cluster Kubernetes deployments, we need infrastructure experts who can architect systems for production readiness while maintaining security and operational excellence. This isn't just maintaining servers—you'll be designing the backbone that powers our AI platform across development, staging, and production environments.

In Short

  • Manage 5 distinct environments (NMS, Sandbox, Development, Staging, Production) with different security and access requirements
  • Design redundancy and failover mechanisms for our centralized NMS hub that manages all environments
  • Develop and maintain Pulumi-based infrastructure using Python
  • Automate resource provisioning and configuration management
  • Implement and maintain certificate-based VPN access with internal DNS resolution
  • Deploy and configure Prometheus, Grafana, Loki, Jaeger, and CloudWatch
  • Deploy and maintain the centralized API that manages all environments from the NMS hub
  • 5+ years in DevOps, SRE, or infrastructure engineering
  • Expert-level Kubernetes experience with EKS and multi-cluster management
  • Understanding of zero-trust architecture principles

Requirements

  • 5+ years in DevOps, SRE, or infrastructure engineering
  • Expert-level Kubernetes experience with EKS and multi-cluster management
  • Strong Python programming skills for infrastructure automation and API development
  • Infrastructure as Code expertise with Pulumi, Terraform, or similar tools
  • Deep AWS knowledge: VPC, EKS, ECR, S3, CloudWatch, IAM, and networking
  • Hands-on experience with Prometheus, Grafana, and centralized logging systems
  • Network security experience including VPN, firewalls, and certificate management

Benefits

  • Zero unplanned downtime across production environments
  • Successfully implement disaster recovery procedures with tested failover mechanisms
  • Achieve 99.9% uptime SLA across all critical services
  • Complete VPN-only access implementation with certificate-based authentication
  • Successfully integrate HashiCorp Vault across all environments
  • Pass security audit with comprehensive logging and monitoring in place
  • Reduce infrastructure provisioning time by 50% through automation
  • Implement comprehensive monitoring with <5 minute mean time to detection
  • Optimize GPU utilization rates above 80% across training workloads
  • Balance VPN-only security requirements with operational efficiency for remote team access
Aldea logo

Aldea

Aldea is an innovative company focused on creating a decision support platform that integrates artificial intelligence with human-centered design. Their mission is to empower individuals to navigate complex choices confidently through well-designed tools. Aldea is dedicated to developing intuitive and impactful user experiences, particularly through their consumer-facing features, and values collaboration among cross-functional teams to ensure that their products meet the real needs of users.

Share This Job!

Save This Job!

Similar Jobs:

Clear Capital logo

Senior DevOps Engineer - Remote

Clear Capital

6 weeks ago

Join our team as a Senior DevOps Engineer to build and maintain AWS cloud infrastructure for financial institutions.

USA
Full-time
DevOps / Sysadmin
$115,000 - $157,500/year
PRAGMATIKE logo

Senior DevOps Engineer - Remote

PRAGMATIKE

6 weeks ago

Join our fully remote team as a Senior DevOps Engineer, focusing on cutting-edge cloud computing and AI technologies.

Worldwide
Full-time
DevOps / Sysadmin
DevLynx logo

Senior DevOps Engineer - Remote

DevLynx

7 weeks ago

Join DevLynx as a Senior DevOps Engineer to optimize and maintain the infrastructure supporting software development projects.

USA
Full-time
DevOps / Sysadmin
Agero logo

Senior DevOps Engineer - Remote

Agero

7 weeks ago

Join Agero as a Senior DevOps Engineer to lead the development of cloud infrastructure and optimize application operations.

Worldwide
Full-time
DevOps / Sysadmin
$100,000 - $140,000 USD/year

Jobgether

Senior DevOps Engineer - Remote

Jobgether

7 weeks ago

Join a mission-driven team as a Senior DevOps Engineer, focused on delivering secure software systems in a remote-first environment.

USA
Full-time
DevOps / Sysadmin