Remote Otter LogoRemoteOtter

Site Reliability Engineer - Remote

Posted 11 weeks ago
DevOps / Sysadmin
Full Time
Worldwide

Overview

As a Site Reliability Engineer, you’ll lead the design, implementation, and management of highly available and scalable systems, applying industry best practices and reliability engineering principles.

In Short

  • Collaborate with cross-functional teams to identify performance bottlenecks and troubleshoot complex issues.
  • Design and implement monitoring, alerting, and incident response strategies.
  • Drive automation initiatives to streamline deployment and configuration management.
  • Develop and maintain comprehensive documentation for system configurations.
  • Participate in on-call rotations and respond to incidents.

Requirements

  • Active Secret U.S. Government security clearance or higher.
  • Bachelor’s degree in Computer Science, Information Technology, or related field.
  • Minimum of 3 years of professional experience in a Site Reliability Engineering role.
  • Strong experience with cloud technologies (AWS, Azure, GCP) and infrastructure as code (Terraform, Ansible).
  • Proficiency in managing incident and outage response.
  • Strong engineering experience in network protocols (TCP/IP, DNS, HTTP/HTTPS).
  • Proficiency in programming and scripting languages (Python, Go, Bash).
  • Deep understanding of containerization and orchestration technologies (Kubernetes, Docker).
  • Expertise in monitoring and logging solutions (Splunk, Prometheus, Grafana).
  • Familiarity with CI/CD pipeline development (GitLab CI, Azure DevOps).

Benefits

  • Generous benefits package.
  • Professional growth opportunities.
  • Valuable time to recharge.

MetroStar

MetroStar

MetroStar is a technology services company dedicated to building exceptional teams and delivering innovative solutions. With a two-decade legacy, the company emphasizes a strong commitment to its people, fostering a culture that values collaboration and professional growth. MetroStar's mission centers around a passion for its employees and a commitment to providing value for its customers, particularly in the fields of data engineering and AI/ML capabilities. The company is also dedicated to creating a diverse and inclusive environment, ensuring equal opportunity for all applicants.

Share This Job!

Save This Job!

Similar Jobs:

Software Mind logo

Site Reliability Engineer - Remote

Software Mind

6 weeks ago

Software Mind is looking for a Site Reliability Engineer to enhance the reliability of their software systems in a flexible and supportive work environment.

LATAM
Full-time
DevOps / Sysadmin
Jackbox Games logo

Site Reliability Engineer - Remote

Jackbox Games

7 weeks ago

Join Jackbox Games as a Site Reliability Engineer to maintain AWS infrastructure and develop applications in Go.

USA
Full-time
DevOps / Sysadmin
$103,326 - $190,465/year
Pinterest logo

Site Reliability Engineer - Remote

Pinterest

7 weeks ago

Pinterest is seeking a Site Reliability Engineer to ensure the reliability of its large-scale distributed systems.

USA
Full-time
Software Development
Printify logo

Site Reliability Engineer - Remote

Printify

7 weeks ago

Join our team as a Site Reliability Engineer, responsible for ensuring the reliability of our distributed systems and platforms in a dynamic international environment.

Worldwide
Full-time
DevOps / Sysadmin
Zepz logo

Site Reliability Engineer - Remote

Zepz

7 weeks ago

Join Zepz as a Site Reliability Engineer to enhance service stability and resilience through innovative automation and observability practices.

South Africa
Full-time
DevOps / Sysadmin