Remote Otter LogoRemoteOtter

Senior Site Reliability Engineer - Remote

Posted 11 weeks ago
DevOps / Sysadmin
Full Time
USA

Overview

The Core Platform team maintains and optimizes the data, infrastructure, messaging, and services platform that powers Sift’s online systems. We ensure these systems are always available, reliable, and performing at their best to meet customer needs. In the event of an outage or failure, we follow well-practiced recovery plans to restore services swiftly. Managing such complex, large-scale systems requires continuous monitoring and proactive maintenance to uphold these standards.

In Short

  • Own the availability, performance, and scalability of Sift’s primary online storage systems and infrastructure.
  • Design and build immutable infrastructure and fault-tolerant, multi-AZ/multi-region systems that are resilient and self-healing.
  • Design and Implement multi-region deployments, such as BigTable clusters spanning multiple regions.
  • Solve complex problems that arise from our unique data volume and request rate.
  • Optimize local development and testing workflows to be fast, efficient, and seamless.
  • Design and implement services and libraries for components to interact with data stores.
  • Develop tools for monitoring, detecting faults, and automatically repairing distributed systems.
  • Provide design support to internal engineering teams for optimal usage of data stores.
  • Participate in on-call support and incident response activities.

Requirements

  • 8+ years of experience as a Software Engineer focused on infrastructure/platform services or in a Site Reliability Engineering (SRE) role.
  • Strong programming skills in languages such as Java, Scala, or Python.
  • Experience designing and implementing distributed systems.
  • Experience building and managing cloud infrastructure on AWS or GCP.
  • Expertise in building infrastructure as code and automating provisioning processes.
  • Proficiency in setting up and managing monitoring and alerting systems.
  • Familiarity with Docker and container orchestration technologies.
  • Strong experience troubleshooting and resolving production system issues.
  • Proven expertise in automation and a solid understanding of configuration management tools.

Benefits

  • Competitive total compensation package.
  • 401k plan.
  • Medical, dental, and vision coverage.
  • Wellness reimbursement.
  • Education reimbursement.
  • Flexible time off.
Sift logo

Sift

Sift is an AI-powered fraud prevention platform dedicated to securing digital trust for leading global businesses. With a robust focus on machine learning and user identity, Sift processes a data network scoring 1 trillion events annually, empowering over 700 customers, including well-known brands like DoorDash, Yelp, and Poshmark, to grow confidently and deliver seamless consumer experiences. The company is committed to long-term customer success and fostering a diverse, equitable, and inclusive workplace, believing that diversity drives innovation and inclusion is essential for building trust and creating a safer Internet.

Share This Job!

Save This Job!

Similar Jobs:

Dremio logo

Senior Site Reliability Engineer - Remote

Dremio

6 weeks ago

Join Dremio as a Senior Site Reliability Engineer to enhance the reliability and performance of cloud services.

India
Full-time
DevOps / Sysadmin
Airalo logo

Senior Site Reliability Engineer - Remote

Airalo

7 weeks ago

Join Airalo as a Senior Site Reliability Engineer to develop and maintain reliable systems in a remote-first environment.

Worldwide
Full-time
DevOps / Sysadmin
Joinpaxos logo

Senior Site Reliability Engineer - Remote

Joinpaxos

7 weeks ago

Join Paxos as a Senior Site Reliability Engineer to enhance cloud infrastructure reliability and performance.

USA
Full-time
DevOps / Sysadmin
$157,254 - $185,005 USD/year

P.W

Senior Site Reliability Engineer - Remote

Point Wild

7 weeks ago

Join Point Wild as a Senior Site Reliability Engineer to maintain and enhance the reliability and performance of our systems.

Worldwide
Full-time
DevOps / Sysadmin

M.M

Senior Site Reliability Engineer - Remote

Modernizing Medicine

7 weeks ago

Join Modernizing Medicine as a Senior Site Reliability Engineer to enhance cloud infrastructure and mentor junior engineers.

India
Full-time
DevOps / Sysadmin