Remote Otter LogoRemoteOtter

Senior IT Monitoring Engineer / Site Reliability Engineer - Remote

Posted 1 week ago
DevOps / Sysadmin
Full Time
India

Overview

The CrowdStrike Information Technology team is looking for a skilled Sr. IT Monitoring Engineer/Site Reliability Engineer (SRE) to join our IT Operations team. In this role, you will be responsible for designing, implementing, and maintaining monitoring solutions that ensure the reliability, availability, and performance of our critical IT infrastructure and applications.

In Short

  • Design and maintain comprehensive monitoring solutions across infrastructure and applications.
  • Configure appropriate alerting thresholds to ensure timely response to potential issues.
  • Define and track SLOs and error budgets for critical services.
  • Create and maintain dashboards providing real-time visibility into system health.
  • Participate in on-call rotation to respond to alerts and incidents.
  • Lead incident response efforts and conduct thorough post-incident reviews.
  • Develop scripts and automation to streamline monitoring tasks.
  • Integrate monitoring tools with other operational systems.
  • Stay current with industry trends in monitoring and site reliability engineering.
  • Contribute to the evolution of the organization's monitoring strategy.

Requirements

  • 5+ years of experience with enterprise monitoring tools.
  • Strong proficiency in scripting languages for automation.
  • Experience with log management platforms.
  • Working knowledge of cloud services monitoring.
  • Experience with application performance monitoring.
  • Knowledge of SRE principles, SLOs, and incident management.
  • Strong incident triage and root cause analysis skills.
  • Familiarity with Infrastructure as Code and containerization.
  • Experience participating in on-call rotations.
  • Bonus Points: SRE certifications and ITIL Foundation certification.

Benefits

  • Remote-friendly and flexible work culture.
  • Market leader in compensation and equity awards.
  • Comprehensive physical and mental wellness programs.
  • Competitive vacation and holidays for recharge.
  • Paid parental and adoption leaves.
  • Professional development opportunities for all employees.
  • Employee Networks and volunteer opportunities.
  • Vibrant office culture with world-class amenities.
  • Great Place to Work Certified™ across the globe.
Humio ApS logo

Humio ApS

CrowdStrike, Inc. is a global leader in cybersecurity, dedicated to stopping breaches through its innovative, cloud-native platform that provides unparalleled protection against sophisticated cyberattacks. Founded in 2011, the company has transformed the cybersecurity landscape by combining advanced endpoint protection with expert intelligence. CrowdStrike is recognized for its commitment to an inclusive, remote-first culture that empowers employees with flexibility and autonomy. The company values innovation, customer commitment, and collaboration, making it a top workplace for those passionate about shaping the future of cybersecurity. With a focus on diversity and equity, CrowdStrike fosters a culture where every individual is valued and encouraged to succeed.

Share This Job!

Save This Job!

Similar Jobs:

Flywire logo

Senior Site Reliability Engineer I - Remote

Flywire

18 weeks ago

Join Flywire as a Senior Site Reliability Engineer I to enhance our development ecosystem and ensure compliance with fintech regulations.

Israel
Full-time
DevOps / Sysadmin
Careem logo

Senior Site Reliability Engineer I - Remote

Careem

27 weeks ago

Join Careem's infra monitoring team to develop and enhance their distributed monitoring system.

Jordan
Full-time
DevOps / Sysadmin
GitLab logo

Senior Site Reliability Engineer - Remote

GitLab

4 days ago

Join GitLab as a Senior Site Reliability Engineer to help build and optimize their next-generation platform.

Worldwide
Full-time
DevOps / Sysadmin
Clay Labs logo

Senior Site Reliability Engineer - Remote

Clay Labs

6 days ago

Join Clay as a Senior Site Reliability Engineer to enhance infrastructure and ensure service reliability.

USA
Full-time
DevOps / Sysadmin

Jobgether

Senior Site Reliability Engineer - Remote

Jobgether

1 week ago

Join Dremio as a Senior Site Reliability Engineer to enhance mission-critical systems in a cloud-native environment.

India
Full-time
DevOps / Sysadmin