Remote Otter LogoRemoteOtter

Site Reliability Engineer (SRE) - Remote

Posted Yesterday
DevOps / Sysadmin
Contract
India

Overview

We are looking for a Site Reliability Engineer (SRE) with a strong background in observability, automation, and platform resilience to drive the operability and reliability of our Disaster Recovery as a Service (DRaaS) solution.

In Short

  • Build and maintain observability dashboards and proactive alerting systems.
  • Define and track Service Level Indicators (SLIs) and Error Budgets.
  • Collaborate on runbook automation and validation pipelines.
  • Lead chaos engineering initiatives and game-day exercises.
  • Conduct post-incident reviews and implement feedback loops.
  • Work with DR architecture and engineering teams on infrastructure improvements.
  • Participate in quarterly failover/failback simulations.
  • Help define SLOs for protected application groups.
  • Advocate for best practices around toil reduction and incident response.

Requirements

  • 5+ years of experience in SRE, DevOps, or Platform Engineering roles.
  • Strong hands-on experience with observability tools.
  • Experience designing and maintaining SLIs/SLOs and availability dashboards.
  • Proficiency in at least one scripting or programming language.
  • Knowledge of disaster recovery principles and infrastructure failover practices.
  • Experience with incident response and tracking improvement actions.
  • Familiarity with IaC tools.
  • Experience with CI/CD and cloud-native deployments.
  • Strong problem-solving and collaboration skills.
  • Fluent in English (written and spoken).

Benefits

  • Work in a dynamic and innovative environment.
  • Collaborate with cross-functional teams.
  • Contribute to high-impact projects.
  • Opportunity for professional growth and development.
  • Flexible working arrangements.
Monks logo

Monks

Monks is a global, purely digital operating brand under S4Capital plc, known for its innovative approach and specialized expertise in marketing and technology services. The company focuses on accelerating business possibilities and redefining brand interactions through a unique integration of systems and workflows. Monks delivers high-quality content production, scalable experiences, and enterprise-grade technology powered by AI, all managed by a diverse team of digital talent. Recognized for its rapid growth and creative excellence, Monks has received numerous accolades, including being named Adweek’s first AI Agency of the Year in 2023 and earning a spot on Newsweek’s Top 100 Global Most Loved Workplaces in 2023. Committed to diversity and inclusion, Monks fosters an empowering work environment that values unique perspectives and encourages collaboration.

Share This Job!

Save This Job!

Similar Jobs:

OpenFX logo

Site Reliability Engineer (SRE) - Remote

OpenFX

6 days ago

Join OpenFX as a Site Reliability Engineer to ensure the reliability and performance of our cross-border payment systems.

Worldwide
Full-time
DevOps / Sysadmin
Meijer Great Lakes LP logo

Site Reliability Engineer (SRE) - Remote

Meijer Great Lakes LP

3 weeks ago

Meijer is looking for a Site Reliability Engineer to enhance reliability and scalability within their Supply Chain Support team.

USA
Full-time
DevOps / Sysadmin
Group 1001 Resources logo

Site Reliability Engineer (SRE) - Remote

Group 1001 Resources

4 weeks ago

Join GROUP1001 as a Site Reliability Engineer (SRE) to ensure the reliability and performance of systems and applications.

USA
Full-time
DevOps / Sysadmin
$180,000 - $200,000/year
Cognigy logo

Site Reliability Engineer (SRE) - Remote

Cognigy

4 weeks ago

Join Cognigy as a Site Reliability Engineer, focusing on automation, system stability, and mentoring within a dynamic engineering team.

Germany
Full-time
DevOps / Sysadmin
Motive logo

Site Reliability Engineer (SRE) - Remote

Motive

4 weeks ago

Join Motive as a Site Reliability Engineer to design and manage AWS-backed infrastructure.

India
Full-time
DevOps / Sysadmin