RemoteOtter

Reliability Engineer - Remote

Posted 8 weeks ago

Overview

The Reliability Engineering team helps realize our vision by supporting Coinbase engineering teams in building software that is world-class in its reliability. As a core service team, Coinbase Reliability Engineers work closely with the rest of engineering. We proactively seek out and gather state-of-the-art best practices from the industry at large. Through education and advocacy, we seek to ensure that reliability is a core value of our engineering culture. We level up other engineers by sharing deep knowledge, performing proactive analysis, and improving processes, tools, and automation. Ultimately, Reliability Engineering succeeds when all engineering teams are able to build reliable software on their own.

In Short

  • Build automation and improve systems to eliminate toil and operations work.
  • Improve observability, reliability and availability by defining and measuring key metrics.
  • Collaborate with our core infrastructure team to tune and optimize the performance of our cloud deployments.
  • Collaborate with Coinbase product teams to reduce service disruptions and automate incident response.
  • Proactively find and analyze reliability problems across our business units and stack, then design and implement software to create step-function improvements.
  • Facilitate incident response, conduct root cause analysis and blameless retrospectives.
  • Educate and mentor the engineering team, and hold it accountable for improving the reliability of our systems.

Requirements

  • You have at least 5 years of software engineering experience.
  • You have a strong understanding of data structures & algorithms, especially as they pertain to performance and reliability.
  • You are fluent in at least one programming language such as Golang, Ruby, Python or JavaScript.
  • You possess strong skills around observability, debugging and performance tuning.
  • You have the ability to debug complex systems and the willingness to dive into understanding, debugging, and improving any layer of the stack.
  • You have experience working with containers / container orchestration systems and monitoring tools.
  • You have deep knowledge of UNIX/Linux system internals.
  • You have strong communication skills and the ability to explain technical concepts clearly and simply.
  • You have demonstrated critical thinking under pressure.

Nice to Haves

  • Crypto-forward experience, including familiarity with onchain activity.
  • Experience with AWS, GCP, Azure, or another cloud environment.
  • Experience designing and building reliable systems capable of handling high throughput and low latency.
  • Experience with observability and monitoring systems.
  • Experience working in a highly regulated environment.
  • Exposure to both NoSQL and SQL database technologies.
  • Familiarity with working in rapid growth environments.

Similar Jobs:

Reliability Engineer - Remote

Coinbase

6 weeks ago

Join Coinbase as a Reliability Engineer to enhance software reliability and support engineering teams.

Reliability Engineering
Software Engineering
Automation
Cloud Deployments
USA
Full-time
Software Development
$180,625 - $212,000/year

Site Reliability Engineer - Remote

Software Mind

2 days ago

Software Mind is looking for a Site Reliability Engineer to enhance the reliability of their software systems in a flexible and supportive work environment.

Site Reliability Engineering
Cloud Native Applications
Azure
AWS
LATAM
Full-time
DevOps / Sysadmin

Site Reliability Engineer - Remote

Jackbox Games

7 days ago

Join Jackbox Games as a Site Reliability Engineer to maintain AWS infrastructure and develop applications in Go.

Site Reliability Engineering
AWS
Go
ECS
USA
Full-time
DevOps / Sysadmin
$103,326 - $190,465/year

Data Reliability Engineer - Remote

PayNearMe

1 week ago

Join PayNearMe as a Data Reliability Engineer to design and maintain a reliable data infrastructure.

Data Reliability Engineering
AWS
MySQL
PostgreSQL
USA
Full-time
Software Development
$150,000 - $170,000/year

Site Reliability Engineer - Remote

Pinterest

1 week ago

Pinterest is seeking a Site Reliability Engineer to ensure the reliability of its large-scale distributed systems.

Site Reliability Engineering
Python
Go
Linux
USA
Full-time
Software Development