Remote Otter LogoRemoteOtter

Site Reliability Engineer - Remote

Posted 17 weeks ago

Overview

Site Reliability Engineers (SREs) are responsible for keeping all production services running smoothly. SREs are a blend of pragmatic operators and software craftspeople that apply sound engineering principles, operational discipline, and mature automation to our operating environments.

In Short

  • Run our infrastructure with Terraform, CI/CD (Github and ArgoCD), and Kubernetes together with the devops team
  • Having a proactive approach to monitoring rather than a reactive approach
  • Participate in on-call rotations, along with every member of the engineering team
  • Improve and automate operational processes
  • Constantly improve the security of the product and security operation
  • Debug production issues across services and levels of the stack
  • Partner with engineering teams to ensure their products meet production standards
  • Be willing to go out of your comfort zone to solve unique issues
  • Help shape our company's engineering culture and keep high engineering standards

Requirements

  • 7+ years experience designing, building, and operating large-scale production systems
  • Experience with Google Cloud Platform
  • Experience with monitoring tools like datadog and preferably open source toolings like prometheus/grafana/jaeger(tracing)
  • Good to have elastic search experience
  • Experience with container orchestration tools like Kubernetes and tools that support Kubernetes deployment, like ArgoCD and helm
  • Strong programming skills in primarily GoLang and/or any other languages
  • Strong knowledge about database optimization
  • Good knowledge of ensuring good security practices within cloud infrastructure

Benefits

  • Generous compensation in cash and equity
  • Early exercise for all options, including pre-vested
  • Work from anywhere: Remote-first Culture
  • Flexible paid time off, Year-end break, Self care days off
  • Health insurance, dental, and vision coverage for employees and dependents - US and Canada specific
  • 4% matching in 401k / RRSP - US and Canada specific
  • MacBook Pro delivered to your door
  • One-time stipend to set up a home office — desk, chair, screen, etc.
  • Monthly meal stipend
  • Monthly social meet-up stipend
  • Annual health and wellness stipend
  • Annual Learning stipend
  • Unlimited access to an expert financial advisory

Similar Jobs:

Software Mind logo

Site Reliability Engineer - Remote

Software Mind

3 days ago

Software Mind is looking for a Site Reliability Engineer to enhance the reliability of their software systems in a flexible and supportive work environment.

Site Reliability Engineering
Cloud Native Applications
Azure
AWS
LATAM
Full-time
DevOps / Sysadmin
Jackbox Games logo

Site Reliability Engineer - Remote

Jackbox Games

1 week ago

Join Jackbox Games as a Site Reliability Engineer to maintain AWS infrastructure and develop applications in Go.

Site Reliability Engineering
AWS
GO
ECS
USA
Full-time
DevOps / Sysadmin
$103,326 - $190,465/year
Pinterest logo

Site Reliability Engineer - Remote

Pinterest

1 week ago

Pinterest is seeking a Site Reliability Engineer to ensure the reliability of its large-scale distributed systems.

Site Reliability Engineering
Python
GO
Linux
USA
Full-time
Software Development
Printify logo

Site Reliability Engineer - Remote

Printify

1 week ago

Join our team as a Site Reliability Engineer, responsible for ensuring the reliability of our distributed systems and platforms in a dynamic international environment.

Site Reliability Engineering
System Design
Development
Configuration
Worldwide
Full-time
DevOps / Sysadmin
Zepz logo

Site Reliability Engineer - Remote

Zepz

1 week ago

Join Zepz as a Site Reliability Engineer to enhance service stability and resilience through innovative automation and observability practices.

SRE
DevOps
Automation
Monitoring
South Africa
Full-time
DevOps / Sysadmin