Remote Otter LogoRemoteOtter

Senior Site Reliability Engineer - Remote

Posted 6 weeks ago

Overview

As a Senior Site Reliability Engineer at Heidi, you'll be instrumental in establishing and scaling our reliability practices while ensuring robust, secure, and observable systems.

In Short

  • Design and implement comprehensive observability strategies using Datadog.
  • Implement OpenTelemetry instrumentation across our backend and frontend services.
  • Set up real user monitoring (RUM) and application performance monitoring (APM).
  • Create and maintain dashboards for different stakeholders.
  • Establish and implement incident management processes.
  • Define and implement SLOs that align with business requirements.
  • Optimise observability costs through efficient logging and metrics collection.
  • Experience with cloud infrastructure (AWS required).
  • Proven track record in implementing SRE practices.
  • Flexible work with a 50% hybrid environment.

Requirements

  • Extensive experience with observability platforms (Datadog preferred).
  • Strong knowledge of OpenTelemetry and modern instrumentation practices.
  • Experience implementing APM and RUM in Python and React/React Native.
  • Track record of establishing incident management processes.
  • Experience defining and implementing SLAs/SLOs for enterprise customers.
  • Strong background in monitoring distributed systems.
  • Experience with cloud infrastructure (AWS required).
  • Proven track record in implementing SRE practices.

Benefits

  • Additional paid day off for your birthday and wellness days.
  • Special corporate rates at Anytime Fitness.
  • A generous personal development budget of $500 per annum.
  • Learn from some of the best engineers and creatives.
  • Become an owner, with shares (equity) in the company.
  • The rare chance to create a global impact.
  • Opportunity to fast track your startup career.

Similar Jobs:

Joinpaxos logo

Senior Site Reliability Engineer - Remote

Joinpaxos

4 days ago

Join Paxos as a Senior Site Reliability Engineer to enhance cloud infrastructure reliability and performance.

AWS
RDS
PostgreSQL
Aurora
USA
Full-time
DevOps / Sysadmin
$157,254 - $185,005 USD/year

P.W

Senior Site Reliability Engineer - Remote

Point Wild

6 days ago

Join Point Wild as a Senior Site Reliability Engineer to maintain and enhance the reliability and performance of our systems.

Site Reliability Engineering
DevOps
AWS
Azure
Worldwide
Full-time
DevOps / Sysadmin

M.M

Senior Site Reliability Engineer - Remote

Modernizing Medicine

7 days ago

Join ModMed as a Senior Site Reliability Engineer to enhance cloud infrastructure and empower developers.

AWS
Cloud Infrastructure
Site Reliability Engineering
DataDog
USA
Full-time
DevOps / Sysadmin

M.M

Senior Site Reliability Engineer - Remote

Modernizing Medicine

7 days ago

Join Modernizing Medicine as a Senior Site Reliability Engineer to enhance cloud infrastructure and mentor junior engineers.

AWS
DataDog
Kubernetes
Jenkins
India
Full-time
DevOps / Sysadmin
Datacom logo

Senior Site Reliability Engineer - Remote

Datacom

1 week ago

Join Datacom as a Senior Site Reliability Engineer to enhance platform performance and resilience in a collaborative environment.

Site Reliability Engineering
DevOps
Cloud Platforms
Azure
New Zealand
Full-time
DevOps / Sysadmin