Remote Otter LogoRemoteOtter

Site Reliability Engineer II - Tooling - Remote

Posted Yesterday
DevOps / Sysadmin
Full Time
USA
$111,500 - $184,000 USD

Overview

Toast is driven by building the platform that helps restaurants adapt, take control, and focus on what they do best: creating experiences their guests love. Tremendous business growth has spurred a need for significant investment in Toast's platform teams. The Site Reliability Engineering team at Toast is tasked with overseeing Toast production services, with a commitment to quality, reliability, and low latency — without needing heroics. The team accomplishes this goal by:

In Short

  • Building tooling to automate, monitor, and manage deployed services using reliability best practices
  • Developing and evangelizing patterns and best practices to improve the scalability, observability, and reliability of all Toast systems
  • Consulting with teams to improve product scalability, observability, security, and reliability
  • Participating in outage response and root cause analysis for critical systems and infrastructure incidents
  • Contribute to strategic organization-wide scalability, observability, and reliability initiatives
  • Guide teams to build and maintain systems that are reliable and available for Toast customers
  • Optimize existing processes, identify areas for improvement, and implement automated solutions to enhance efficiency and reliability of Toast systems
  • Enable low-risk, compliant releases with rapid rollback capability to maintain platform reliability

Requirements

  • Industry experience with at least 2 years engineering experience with a focus on SRE/building internal development tools
  • Bachelor’s Degree in Computer Science, engineering, or related field
  • Working knowledge of complex cloud environments (AWS, GCP, Azure, etc.)
  • Hands-on coding experience with multiple coding languages - Java/JVM required + one or more of Kotlin, Go, Python, etc.
  • Background completing complex engineering projects in a Scrum environment
  • Experience in building and running distributed systems
  • Experience participating in Incident Response
  • Well-developed written and verbal communication skills
  • Deep problem-solving skills and the ability to think strategically and analytically
  • Experience working with a diverse global team across multiple regions and time zones

Benefits

  • Competitive salary and total rewards components including cash compensation, benefits, and equity
  • Diversity, Equity, and Inclusion initiatives embedded in the company culture
  • Hybrid work model that fosters collaboration while valuing individual needs
Toast logo

Toast

Toast is a rapidly growing company that is transforming the restaurant industry by integrating technology with a strong commitment to customer success. Their platform combines restaurant point of sale systems, guest-facing technology, and award-winning customer support to help restaurants streamline operations, boost revenue, and enhance guest experiences. The company is dedicated to empowering the restaurant community, enabling them to delight guests and thrive in a competitive market. With a focus on diversity, equity, and inclusion, Toast values its employees as the key ingredient to its success and strives to create an inclusive environment that fosters authenticity and respect.

Share This Job!

Save This Job!

Similar Jobs:

Join Atlan as a Site Reliability Engineer II to enhance system reliability and incident management in a fully remote environment.

India
Full-time
DevOps / Sysadmin

Jobgether

Site Reliability Engineer II - Remote

Jobgether

5 weeks ago

Join as a Site Reliability Engineer II to enhance the reliability and performance of large-scale distributed systems.

USA
Full-time
Software Development
Humio ApS logo

Engineer II - Site Reliability - Remote

Humio ApS

11 weeks ago

Join CrowdStrike as an Engineer II in Site Reliability, focusing on automation and tooling for a leading cybersecurity platform.

India
Full-time
DevOps / Sysadmin
Fivetran logo

Site Reliability Engineer II - Remote

Fivetran

24 weeks ago

Fivetran is seeking a Site Reliability Engineer II to ensure the reliability and performance of its data platform while collaborating with various teams.

India
Full-time
DevOps / Sysadmin

W.D.M.S.D.R.D.C

Site Reliability Engineer II - Remote

WM de Mexico, S. de R.L. de C.V

25 weeks ago

Join Wood Mackenzie as a Site Reliability Engineer II to enhance our DevOps practices and support cloud-based service transitions.

Mexico
Full-time
DevOps / Sysadmin