Remote Otter LogoRemoteOtter

Founding Site Reliability Engineer - Remote

Posted 21 hours ago
DevOps / Sysadmin
Full Time
USA

Overview

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Founding Site Reliability Engineer in the United States.

This is a unique opportunity to join a rapidly growing AI company as the first SRE hire in the San Francisco office. In this role, you will define and scale the Site Reliability Engineering discipline, ensuring the platform is reliable, secure, and performant at enterprise scale. You will work closely with engineering leads, product teams, and company founders to build infrastructure, establish best practices, and drive the organization’s reliability culture. The role involves hands-on system design, automation, and observability work, while providing leadership and strategic input to shape long-term operational excellence. Ideal candidates are technically strong, highly collaborative, and motivated by building world-class systems from the ground up.

In Short

  • Establish and scale the SRE discipline, including best practices, tooling, and culture.
  • Ensure >99.9% uptime of production systems and maintain global platform reliability.
  • Architect, automate, and manage AWS infrastructure using Terraform, CI/CD pipelines, and Infrastructure as Code.
  • Design and implement observability systems across microservices, APIs, and vector workloads, including metrics, tracing, and logging.
  • Lead incident management, reducing MTTR through runbooks, alerts, and postmortems.
  • Collaborate with engineering teams to embed reliability principles into the software development lifecycle.
  • Influence organizational strategy and culture as a founding voice in the engineering team.

Requirements

  • Strong technical skills in Site Reliability Engineering.
  • Experience with AWS and Infrastructure as Code tools like Terraform.
  • Familiarity with CI/CD processes and observability techniques.
  • Ability to lead incident management and improve system reliability.
  • Strong collaboration and communication skills.
  • Motivated to build and scale systems from the ground up.

Benefits

  • Opportunity to shape the SRE culture from the beginning.
  • Work with a rapidly growing AI company.
  • Collaborate with experienced engineering leads and product teams.
  • Competitive compensation package.
  • Flexible working environment.

Jobgether

Jobgether

Jobgether is a global platform dedicated to connecting job seekers with fully remote job opportunities. The company focuses on matching candidates to roles where they are most likely to succeed, providing valuable feedback on applications to enhance the job search experience. Jobgether aims to eliminate common frustrations in the job market, such as application black holes and recruiter ghosting, by offering a supportive and transparent approach to remote employment.

Share This Job!

Save This Job!

Similar Jobs:

Zapier logo

Site Reliability Engineer - Remote

Zapier

2 weeks ago

Zapier is seeking a Site Reliability Engineer to enhance its reliability systems and improve observability and incident response.

USA
Full-time
DevOps / Sysadmin

P.R

Site Reliability Engineer - Remote

Pay Retailers

2 weeks ago

Join PayRetailers as a Site Reliability Engineer to enhance platform reliability and performance in a fully remote role.

Worldwide
Full-time
Software Development
Deel logo

Site Reliability Engineer - Remote

Deel

2 weeks ago

Join Deel as a Site Reliability Engineer to ensure system reliability and scalability in a remote environment.

Worldwide
Full-time
DevOps / Sysadmin

Jobgether

Site Reliability Engineer - Remote

Jobgether

2 weeks ago

We are seeking a Site Reliability Engineer to ensure the reliability and performance of streaming and broadcast systems in a remote role.

USA
Full-time
DevOps / Sysadmin
Libertex Group logo

Site Reliability Engineer - Remote

Libertex Group

2 weeks ago

Join Libertex Group as a Site Reliability Engineer to ensure the stability and performance of our infrastructure.

Serbia
Full-time
DevOps / Sysadmin