Remote Otter LogoRemoteOtter

Infrastructure Engineer (Observability) - Remote

Posted 3 weeks ago
Software Development
Full Time
USA

Overview

Voltage Park is seeking an Infrastructure Engineer with a focus on Observability to join our Infrastructure Engineering team. Our engineers design and operate the systems that manage thousands of bare-metal servers, GPUs, and high-performance networks across multiple data centers.

This role combines the breadth of a core infrastructure engineer with a specialty in observability and telemetry. You’ll design and operate metrics, logs, traces, and alerting pipelines that provide actionable insights for both internal teams and external customers — helping to ensure reliability and transparency at scale.

This is a fully remote position, although candidates must be based in the continental United States. Unfortunately, we are unable to provide sponsorship for this role.

In Short

  • Design, build, and maintain observability platforms spanning metrics, logs, traces, and events.
  • Create dashboards and alerting for internal stakeholders and scoped visibility for external customers.
  • Ingest and correlate telemetry from GPUs, CPUs, networking, containers, APIs, and BMC/Redfish.
  • Implement noise-resistant alerting pipelines that improve detection and reduce operational load.
  • Collaborate with infrastructure, platform, and customer-facing teams to embed observability into workflows.
  • Contribute to broader infrastructure engineering projects beyond observability.

Requirements

  • 8+ years in infrastructure engineering, SRE, or observability roles. Strong experience with monitoring systems (Prometheus, Grafana, ELK, VictoriaMetrics, or similar).
  • Proficiency in Python, Go, or bash for automation and data integration.
  • Familiarity with container/Kubernetes observability.
  • Understanding of streaming telemetry pipelines (Kafka, OTEL, Promtail, or equivalent).
  • Strong written and verbal communication skills.

Benefits

  • You enjoy working with a small, highly motivated team.
  • You’re comfortable balancing autonomy with company-wide priorities.
  • You value clarity, documentation, and actionable insights in observability systems.
Voltage Park logo

Voltage Park

Voltage Park is a pioneering company dedicated to democratizing access to machine learning infrastructure for a diverse range of clients, including large enterprises, research universities, seed-stage startups, and nonprofits. The company stands out as the only cloud provider that offers a platform showcasing all available GPUs for rent, complete with transparent, market-based pricing and long-term reserve contracts. As a rapidly growing startup in the AI infrastructure sector, Voltage Park is committed to providing seamless compute access and fostering innovation in the field of artificial intelligence.

Share This Job!

Save This Job!

Similar Jobs:

Join Descript as an Infrastructure Engineer to enhance the reliability and performance of core production infrastructure.

CA, USA
Full-time
DevOps / Sysadmin
$191K - $232K/year
Hatch IT logo

Infrastructure Engineer - Remote

Hatch IT

4 weeks ago

Join Lastwall as an Infrastructure Engineer to enhance and maintain secure, scalable infrastructure in a cloud-native environment.

Worldwide
Full-time
DevOps / Sysadmin
Libertex Group logo

Infrastructure Engineer - Remote

Libertex Group

5 weeks ago

Join Libertex Group as an Infrastructure Engineer to design and maintain secure AWS infrastructure with a focus on automation.

Serbia
Full-time
DevOps / Sysadmin

Roboflow

Infrastructure Engineer - Remote

Roboflow

6 weeks ago

Join Roboflow as an Infrastructure Engineer to design and maintain robust cloud infrastructure for AI-driven applications.

USA
Full-time
DevOps / Sysadmin
$180,000 - $200,000/year
Hexa People logo

Infrastructure Engineer - Remote

Hexa People

7 weeks ago

Join our team as an Infrastructure Engineer responsible for managing and optimizing cloud infrastructure.

Worldwide
Full-time
DevOps / Sysadmin