RemoteOtter

Staff Site Reliability Engineer - Remote

Posted Yesterday
DevOps / Sysadmin
Full Time
USA
$201,000 - $287,100 USD/year

Overview

Veeam is launching a global Site Reliability Engineering (SRE) function to support the rollout and operation of our new SaaS offering: the Veeam Data Cloud. As a Staff Site Reliability Engineer, you will serve as a hands-on technical leader within the SRE team, guiding senior engineers, influencing product development teams, and ensuring the systems we operate are built to be reliable, scalable, and observable from the ground up.

In Short

  • Act as a technical authority in your area, mentoring senior engineers and guiding design choices that improve service reliability and resilience.
  • Lead the definition and enforcement of SLIs, SLOs, and error budgets; drive adherence across engineering teams.
  • Collaborate with Staff peers across teams to align strategy and champion shared reliability standards and goals.
  • Partner with development and product teams to proactively design for failure, build resilient architecture, and operationalize reliability from the start.
  • Drive company-wide adoption of observability best practices and tooling.
  • Ensure metrics, logs, and traces provide deep, actionable insights across systems.
  • Lead complex incident responses, postmortems, and systemic reliability improvements.
  • Promote and enforce a blameless culture of learning and continuous improvement.
  • Lead initiatives in infrastructure as code, deployment automation, and resilience testing.
  • Work closely with your peer Staff Engineers to plan, align, and deliver against reliability goals.

Requirements

  • 8+ years of experience in a Software Engineering or SRE role, including technical leadership.
  • Demonstrated experience mentoring and guiding senior engineers.
  • Deep expertise in building distributed systems on public cloud (Azure preferred).
  • Strong programming skills (e.g., JavaScript, Go, TypeScript, Java, or C#).
  • Hands-on experience with observability tooling (e.g., Prometheus, Grafana, OpenTelemetry).
  • Mastery of infrastructure automation tools (Terraform, Pulumi) and container orchestration (Kubernetes).
  • Ability to communicate clearly across geographies and disciplines.

Benefits

  • Be a core architect in the rollout of Veeam’s first global SaaS offering—the Veeam Data Cloud.
  • Help shape a modern, engineering-driven SRE practice from the ground up.
  • Influence long-term reliability and architecture across a global product portfolio.
  • Work in a collaborative environment with engineering leaders who value strategic thinking, hands-on problem solving, and customer empathy.
  • Enjoy competitive pay and benefits, flexible work arrangements, and a team culture built on learning, ownership, and impact.
  • Unlimited PTO and paid holidays.
  • Medical, dental, and vision coverage starting on day one.
  • 401(k) plan and professional training opportunities.

Veeam Software

Veeam Software is the leading global provider of data protection and ransomware recovery solutions, dedicated to empowering organizations to not only recover from data outages but to thrive in the face of challenges. With a focus on radical resilience, Veeam offers a comprehensive Data Platform that secures and ensures the availability of applications and data across hybrid cloud environments. Headquartered in Seattle and operating in over 30 countries, Veeam serves more than 450,000 customers worldwide, including a significant portion of the Global 2000, by delivering exceptional products and support experiences. The company is committed to simplicity and customer success, fostering a collaborative environment that encourages innovation and excellence.

Similar Jobs:

Staff Site Reliability Engineer - Remote

Tilt Finance

7 days ago

Join Tilt as a Staff Site Reliability Engineer to design scalable infrastructure and automate deployments in a remote-first environment.

USA
Full-time
DevOps / Sysadmin

Staff Site Reliability Engineer - Remote

CookUnity

2 weeks ago

CookUnity is seeking a Staff Site Reliability Engineer to manage and enhance their cloud-native infrastructure.

LATAM
Full-time
DevOps / Sysadmin

Staff Site Reliability Engineer - Remote

CookUnity

2 weeks ago

Join CookUnity as a Staff Site Reliability Engineer to lead technical projects and manage cloud infrastructure.

Argentina
Full-time
DevOps / Sysadmin

Staff Engineer, Site Reliability - Remote

Zapier

3 weeks ago

Zapier is seeking a Staff Site Reliability Engineer to lead observability and incident response initiatives while mentoring engineers and enhancing system reliability.

USA
Full-time
Software Development

Staff Site Reliability Engineer - Remote

Axon

4 weeks ago

Join Axon as a Staff Site Reliability Engineer to drive technical direction and ensure high-availability cloud applications.

USA
Full-time
DevOps / Sysadmin
$161,250 - $258,000 USD/year