RemoteOtter

Staff Engineer - Site Reliability Engineering (SRE) - Remote

Posted 5 weeks ago

Overview

The Staff Engineer - Site Reliability Engineering (SRE) will be responsible for ensuring the reliability, scalability, stability, and performance of systems and services at Forbes Advisor. This role involves working closely with cross-functional teams to design, build, and maintain systems while troubleshooting issues as they arise.

In Short

  • Responsible for the reliability and performance of systems and services.
  • Work with cross-functional teams to design and maintain systems.
  • Define Service Level Objectives (SLOs) and Service Level Agreements (SLAs).
  • Deploy and manage monitoring tools for system health.
  • Analyze performance and implement solutions for scalability.
  • Develop scripts and automation frameworks.
  • Implement observability practices with development teams.
  • Conduct disaster recovery drills and maintain documentation.
  • Ensure security best practices are followed.
  • Publish KPI reports and system health updates.

Requirements

  • Bachelor's degree in CS or related field, or equivalent experience.
  • 12+ years of overall IT experience.
  • 7+ years as a Senior Site Reliability Engineer or similar role.
  • 5+ years of AWS Cloud experience with relevant certifications.
  • Experience with a broad range of AWS technologies.
  • 2+ years working with CDN and/or caching systems.
  • Strong experience with Cloud deployments (AWS/Docker/Kubernetes).
  • Knowledge of Infrastructure as Code (IaC) tools such as Terraform, Chef, and Ansible.
  • Experience with monitoring systems such as CloudWatch and New Relic.
  • Strong scripting skills in Bash and Python.

Benefits

  • Day off on the 3rd Friday of every month.
  • Monthly Wellness Reimbursement Program.
  • Paid paternity and maternity leave.
