Remote Otter LogoRemoteOtter

Principal Site Reliability Engineer, Data Protection Products - Remote

Posted 17 hours ago
Software Development
Full Time
Worldwide

Overview

As a Site Reliability Engineer, you will work as an integral member of product teams, helping to build, deploy, and monitor cloud services reliably. You will contribute to complex software development projects to maintain essential, revenue-critical services.

In Short

  • Build systems and infrastructure to monitor complex, large-scale distributed systems.
  • Identify stability/performance issues and collaborate with developers.
  • Represent the SRE organization in design reviews.
  • Devise ways to actively monitor system throughput and reliability.
  • Debug complex systems without causing downtime.
  • Engage in service capacity planning and demand forecasting.
  • Drive standardization efforts across multiple disciplines.
  • Monitor and troubleshoot Elasticsearch performance issues.
  • Work independently and collaboratively in a fast-paced environment.
  • Strong communication and interpersonal skills.

Requirements

  • Bachelor’s degree in Computer Science or equivalent experience.
  • Fundamental knowledge of virtualization, storage, networking, and security.
  • Experience with monitoring and logging solutions like Prometheus and Grafana.
  • Proficiency in scripting languages such as Python.
  • Experience with infrastructure-as-code tools like Terraform.
  • Strong understanding of Linux system administration.
  • Excellent troubleshooting and problem-solving skills.
  • Demonstrable knowledge of Unix, TCP/IP, and web application security.
  • Experience analyzing logs and troubleshooting distributed systems.
  • Excellent organizational and time management skills.

Benefits

  • Medical Insurance
  • Flexible PTO
  • Flex Friday
  • Hybrid Work Option Available
  • Tuition Reimbursement
  • And more!

‎ConnectWise

‎ConnectWise

ConnectWise is a leading provider of cybersecurity solutions, dedicated to empowering partners with the tools and support they need to effectively manage and secure their technology environments. The company focuses on delivering exceptional customer service through a collaborative approach, working closely with cross-functional teams to troubleshoot and resolve product issues. With a commitment to innovation and continuous improvement, ConnectWise aims to enhance partner experiences by providing comprehensive support and resources, including a robust knowledge base and clear communication of new features and improvements.

Share This Job!

Save This Job!

Similar Jobs:

Jobgether

Principal Site Reliability Engineer - Remote

Jobgether

2 weeks ago

We are looking for a Principal Site Reliability Engineer to enhance the reliability and efficiency of large-scale distributed systems in a hybrid remote setup.

USA
Full-time
DevOps / Sysadmin
Upwork logo

Principal Site Reliability Engineer - Remote

Upwork

7 weeks ago

Join Upwork as a Principal Site Reliability Engineer to lead and innovate in SRE practices for a global team.

Worldwide
Full-time
DevOps / Sysadmin
Cribl logo

Principal Site Reliability Engineer - Remote

Cribl

12 weeks ago

Join Cribl as a Principal Site Reliability Engineer to enhance observability and reliability in software systems.

USA
Full-time
DevOps / Sysadmin
$240,000 - $400,000/year

Groupon

Principal Site Reliability Engineer - Remote

Groupon

25 weeks ago

Join Groupon as a Principal Site Reliability Engineer to enhance the reliability and scalability of mission-critical systems.

Colombia
Full-time
DevOps / Sysadmin

Groupon

Principal Site Reliability Engineer - Remote

Groupon

25 weeks ago

Join Groupon as a Principal Site Reliability Engineer to enhance the reliability and scalability of mission-critical systems.

Worldwide
Full-time
DevOps / Sysadmin