Remote Otter LogoRemoteOtter

Staff Site Reliability Engineer - Remote

Posted 16 weeks ago
DevOps / Sysadmin
Full Time
USA

Overview

As a Staff Site Reliability Engineer (SRE), you will play a pivotal role in ensuring the reliability, scalability, and performance of our cloud-based services, driving best practices and contributing to the design and implementation of robust cloud infrastructures.

In Short

  • Lead and mentor a team of SREs, driving best practices and fostering a culture of reliability and performance
  • Provide strategic direction in the design, implementation, and management of scalable and resilient cloud-based infrastructure on AWS
  • Oversee the implementation and optimization of observability solutions using OpenTelemetry
  • Supervise the utilization of Prometheus and Grafana for effective monitoring
  • Manage the design and implementation of service meshes with Istio
  • Develop and enforce SRE best practices, including incident response and capacity planning
  • Collaborate with development teams to ensure alignment with reliability and performance goals
  • Write and maintain infrastructure as code for core systems
  • Automate operational tasks to save time and improve accuracy
  • Write clean and scalable scripts, software, and systems to manage platform infrastructure

Requirements

  • Minimum 12 years experience as a Site Reliability, DevOps, or Software Engineer
  • Proven leadership in SRE team settings
  • Expert Linux and troubleshooting skills
  • Experience in building high-availability cloud environments in AWS
  • Expertise in Infrastructure as code and deployment automation
  • Experience running Kubernetes and Istio in production
  • Advanced Observability skills with monitoring tools
  • Experience instrumenting code and creating instrumentation frameworks
  • Participate in an on-call rotation for after-hours incidents
  • Experience with SDLC, CI/CD, and related tooling

Benefits

  • Competitive salary and benefits package
  • Opportunity to work with cutting-edge technologies
  • Collaborative and innovative work environment
  • Professional development opportunities
  • Flexible work arrangements
Varo Bank logo

Varo Bank

Varo Bank is a pioneering all-digital bank that launched in 2017 with a mission to integrate the best of fintech into the regulated banking system. As the first consumer fintech to receive a national bank charter in 2020, Varo is dedicated to promoting financial inclusion and providing opportunities for all Americans. The bank offers a suite of customer-first features designed to meet diverse consumer needs, particularly for underserved communities historically excluded from traditional financial systems. With a focus on innovation and technology, Varo aims to empower its customers through accessible financial products and insights, all while maintaining a commitment to its core values of putting customers first, taking ownership, and fostering a culture of respect and curiosity.

Share This Job!

Save This Job!

Similar Jobs:

Wellhub logo

Staff Site Reliability Engineer - Remote

Wellhub

13 weeks ago

Join Wellhub as a Staff Site Reliability Engineer to build a secure and scalable cloud infrastructure.

Brazil
Full-time
DevOps / Sysadmin
Gemini logo

Staff Site Reliability Engineer - Remote

Gemini

15 weeks ago

Join Gemini as a Staff Site Reliability Engineer to lead engineering teams in adopting modern DevOps practices and enhancing system reliability.

USA
Full-time
DevOps / Sysadmin
$172,000 - $241,000/year
Syngenta Group logo

Staff Site Reliability Engineer - Remote

Syngenta Group

16 weeks ago

Join our team as a Staff Site Reliability Engineer to design and optimize large-scale distributed systems.

Brazil
Full-time
DevOps / Sysadmin
Earnest logo

Staff Site Reliability Engineer - Remote

Earnest

17 weeks ago

Join Earnest as a Staff Site Reliability Engineer to ensure the reliability and performance of systems while optimizing infrastructure.

USA
Full-time
DevOps / Sysadmin
$194,000 - $220,000 USD/year

Agiloft

Staff Site Reliability Engineer - Remote

Agiloft

20 weeks ago

Join Agiloft as a Staff Site Reliability Engineer to develop and implement reliable and scalable systems while collaborating with various teams.

USA
Full-time
DevOps / Sysadmin