Remote Otter LogoRemoteOtter

Reliability Engineer - Remote

Posted 15 weeks ago
Software Development
Full Time
USA

Overview

The Reliability Engineering team helps realize our vision by supporting Coinbase engineering teams to build software that is world-class in terms of its reliability. As a core service team, Coinbase Reliability Engineers work closely with the rest of engineering. We proactively seek out and gather the state-of-the-art, best practices from the industry at large. Through education and advocacy, we seek to ensure that reliability is a core value of our engineering culture. We level up other engineers by sharing deep knowledge, performing proactive analysis and improving processes, tools, and automation. Ultimately, Reliability Engineering succeeds when all engineering teams are able to build reliable software on their own.

In Short

  • Build automation and improve systems to eliminate toil and operations work.
  • Improve observability, reliability and availability by defining and measuring key metrics.
  • Collaborate with our core infrastructure team to performance tune and optimize our cloud deployments.
  • Collaborate with Coinbase product teams to reduce service disruptions and automate incident response.
  • Proactively find and analyze reliability problems across our business units and stack, then design and implement software to create step-function improvements.
  • Facilitate incident response, conduct root cause analysis and blameless retrospectives.
  • Educate, mentor and hold accountable the engineering team to improve the reliability of our systems.

Requirements

  • You have at least 5+ years of software engineering experience.
  • You have a strong understanding of data structures & algorithms, especially as they pertain to performance and reliability.
  • You are fluent in at least one programming language such as Golang, Ruby, Python or JavaScript.
  • You possess strong skills around observability, debugging and performance tuning.
  • You have the ability to debug complex systems and the willingness to dive into understanding, debugging, and improving any layer of the stack.
  • You have experience working with containers / container orchestration systems and monitoring tools.
  • You have deep knowledge of UNIX/Linux system internals.
  • You have strong communication skills and the ability to explain technical concepts clearly and simply.
  • You have demonstrated critical thinking under pressure.

Benefits

  • Crypto-forward experience, including familiarity with onchain activity.
  • Experience with AWS, GCP, Azure, or other cloud environment.
  • Experience designing and building reliable systems capable of handling high throughput and low latency.
  • Experience with observability and monitoring systems.
  • Experience working in a highly regulated environment.
  • Exposure to both NoSQL and SQL database technologies.
  • Familiarity with working in rapid growth environments.

R.O.B

Referrals Only Board

Referrals Only Board is a forward-thinking organization that is dedicated to harnessing the transformative power of onchain technology. They believe that the shift to onchain represents a significant technological advancement, comparable to the transition to the online world. The company is committed to fostering an open, free, and globally accessible onchain ecosystem that promotes innovation, creativity, and economic freedom. With a focus on user-centric design, they are actively seeking talented individuals to contribute to the development of self-custodial wallets and other onchain applications, ensuring that their solutions are intuitive and effective for users.

Share This Job!

Save This Job!

Similar Jobs:

Coinbase logo

Reliability Engineer - Remote

Coinbase

12 weeks ago

Join Coinbase as a Reliability Engineer to enhance software reliability and support engineering teams.

USA
Full-time
Software Development
$180,625 - $212,000/year
Software Mind logo

Site Reliability Engineer - Remote

Software Mind

6 weeks ago

Software Mind is looking for a Site Reliability Engineer to enhance the reliability of their software systems in a flexible and supportive work environment.

LATAM
Full-time
DevOps / Sysadmin
Jackbox Games logo

Site Reliability Engineer - Remote

Jackbox Games

7 weeks ago

Join Jackbox Games as a Site Reliability Engineer to maintain AWS infrastructure and develop applications in Go.

USA
Full-time
DevOps / Sysadmin
$103,326 - $190,465/year
PayNearMe logo

Data Reliability Engineer - Remote

PayNearMe

7 weeks ago

Join PayNearMe as a Data Reliability Engineer to design and maintain a reliable data infrastructure.

USA
Full-time
Software Development
$150,000 - 170,000/year
Pinterest logo

Site Reliability Engineer - Remote

Pinterest

7 weeks ago

Pinterest is seeking a Site Reliability Engineer to ensure the reliability of its large-scale distributed systems.

USA
Full-time
Software Development