RemoteOtter

Senior Site Reliability Engineer - Data - Remote

Posted Yesterday
DevOps / Sysadmin
Full Time
USA
$130,000 - $140,000/year

Overview

The Discogs Platform team is focused on several objectives: building and supporting performant, cost-effective, reliable infrastructure; providing developer experience tooling and mentorship; and creating "golden paths" for organization-wide standards and velocity. As a key member of the Platform team, the Senior Site Reliability Engineer - Data will work closely with other Discogs engineering squads to develop and optimize scalable, well-planned relational database architectures, drive best practices and stability for our use of Kafka and change data capture (CDC), and contribute to the Platform team’s operations.

In Short

  • Stewarding Discogs’ data stores as a key subject matter expert
  • Leading efforts on the reliability and design patterns of our Kafka and Kafka Connect implementations
  • Establishing data contracts and clear communication standards between CDC producers and consumers
  • Working closely with engineering squads to refactor and re-architect MySQL database schema and indexing for long-term scalability, performance, and cost effectiveness
  • Mentoring engineering squads on Platform best practices for MySQL, Kafka, and other software development lifecycle areas
  • Writing documentation and runbooks that contribute to the engineering organization’s knowledge base
  • Working in a containerized, orchestrated environment
  • Contributing to the Platform team’s disciplines of site reliability and operations, supporting both our squads and Platform’s central infrastructure
  • Participating in on-call rotation, responding to incidents, and troubleshooting data and other operations issues

Requirements

  • Experience with Site Reliability Engineering principles
  • Strong knowledge of Kafka and its ecosystem
  • Proficiency in MySQL and database design
  • Experience in containerization and orchestration tools
  • Strong communication skills for collaboration with engineering teams
  • Ability to write clear documentation and runbooks
  • Experience in incident response and troubleshooting

Benefits

  • Competitive salary and benefits package
  • Remote work flexibility
  • Opportunities for professional development and mentorship
  • Supportive team culture focused on collaboration
  • Access to resources for personal growth and learning

Discogs

Discogs is the largest crowd-sourced, community-driven database of recorded music information in the world, where millions of users connect to learn about music and buy or sell vinyl records, CDs, and cassettes. As a growing company, Discogs values individual contributions and emphasizes quality, critical thinking, and continuous improvement. The team operates collaboratively across geographical locations, utilizing open-source tools to enhance their work. Discogs is dedicated to serving the music community and is looking for motivated individuals to help realize its mission.
