RemoteOtter

Senior Site Reliability Engineer - Remote

Posted Yesterday
DevOps / Sysadmin
Full Time
USA

Overview

Stack is developing revolutionary AI and advanced autonomous systems designed to enhance the safety, reliability, and efficiency of modern operations. Stack's autonomous technology incorporates cutting-edge advancements in artificial intelligence, robotics, machine learning, and cloud technologies, empowering us to create innovative solutions that address the needs and challenges of the dynamic trucking industry. With decades of experience creating and deploying real-world systems for demanding environments, the Stack team is dedicated to developing an autonomous solution ecosystem tailored to the trucking industry's unique demands.

In Short

  • Monitor and maintain mission-critical production services to ensure maximum uptime.
  • Design and implement scalable distributed systems to facilitate the development of self-driving vehicles.
  • Design and implement an incident management framework and build a culture of blameless postmortems and continuous learning.
  • Scale the reliability and velocity of our systems and processes through increased automation.
  • Document actions to build a comprehensive library of runbooks, which will act as a knowledge base and foundation for automation.
  • Participate in an on-call rotation to uphold the SLOs and SLAs of production services.

Requirements

  • Expertise in at least one scripting language (e.g. Bash, Python).
  • Fundamental understanding of Linux operating system internals, TCP/IP networking, and storage subsystems.
  • Experience scaling and securing services in the cloud (AWS, GCP) or cloud native environments.
  • Experience using infrastructure-as-code principles to automate the creation of infrastructure resources (e.g. Terraform, CloudFormation).
  • Understanding of engineering design limitations and the ability to guide teams in scaling their services to reach the desired performance within budget.
  • Strong experience implementing and debugging cloud native and open source tools such as Kubernetes, etcd, Prometheus, OpenTelemetry, and Istio.
  • Strong communication skills and the ability to work effectively in a diverse and distributed team.

Benefits

  • Equal opportunity workplace committed to diversity and inclusion.
  • Culture of entrepreneurship and innovation.

Stack AV

Stack AV is at the forefront of developing revolutionary AI and advanced autonomous systems aimed at enhancing the safety, reliability, and efficiency of modern operations, particularly within the trucking transportation industry. Leveraging decades of experience, Stack AV integrates cutting-edge advancements in artificial intelligence, robotics, machine learning, and cloud technologies to create innovative solutions tailored to the unique demands of the industry. The company is committed to building an autonomous solution ecosystem that addresses the challenges faced by the dynamic trucking sector.

Similar Jobs:

Senior Site Reliability Engineer - Remote

Motive

Yesterday

Motive is seeking a Senior Site Reliability Engineer to enhance their infrastructure and services for cloud-native solutions.

USA
Full-time
DevOps / Sysadmin
$126,000 - $193,000 USD/year

Senior Site Reliability Engineer - Remote

Jobgether

Yesterday

Join Tempo Software as a Senior Site Reliability Engineer to build and maintain secure and scalable infrastructure in a remote-first environment.

United Kingdom
Full-time
DevOps / Sysadmin

Senior Site Reliability Engineer - Remote

Ditto Job Board

4 days ago

Join Ditto as a Senior Site Reliability Engineer to ensure the reliability and performance of our cloud infrastructure.

APAC
Full-time
DevOps / Sysadmin

Senior Site Reliability Engineer - Remote

GitLab

1 week ago

Join GitLab as a Senior Site Reliability Engineer to help build and optimize their next-generation platform.

Worldwide
Full-time
DevOps / Sysadmin

Senior Site Reliability Engineer - Remote

Clay Labs

1 week ago

Join Clay as a Senior Site Reliability Engineer to enhance infrastructure and ensure service reliability.

USA
Full-time
DevOps / Sysadmin