
Site Reliability Engineer - Remote

Posted 16 weeks ago
DevOps / Sysadmin
Full Time
United Kingdom

Overview

The SRE Team is responsible for managing Neon’s multi-region, multi-cloud deployment in close collaboration with the broader engineering team, as well as improving the reliability of the overall platform. All the features we want to implement can only reach our customers if the changes are delivered in a reliable way, which means the SRE team plays a significant role in defining our pace of development.

Successful candidates will get the opportunity to contribute to the effort of evolving Neon to become multi-cloud so that we can be as close as possible to our customers while also making decisions about how to best utilize different cloud technologies. They will also take part in refining and improving our existing infrastructure so that stability and scalability complement the delivery of new features and services.

Neon's foundations are built on open source software. If you want to look at what makes Neon work, feel free to browse https://github.com/neondatabase/neon (storage layer of databases) and https://github.com/neondatabase/autoscaling (autoscaling of databases), as well as our engineering blog. SREs frequently work with stakeholders in different teams, and these repos provide a sneak peek of what the Neon engineering team is capable of producing.

In Short

  • Join an experienced team and contribute to the foundation all of Neon is built upon
  • Contribute to building a stable and cost-efficient infrastructure foundation
  • Play a key role in ensuring we are proactive instead of reactive on infrastructure and reliability
  • Coach your fellow engineers on cloud, infrastructure, and reliability topics
  • Be ready to join an on-call rotation

Requirements

  • 4+ years of experience working in Site Reliability Engineering
  • Experience with cloud infrastructure components in Azure and/or AWS
  • Experience in a complex Linux infrastructure environment
  • Experience focusing on building repeatable and cost-efficient infrastructure
  • Experience building solutions for problems with no answers on Google
  • Experience working with monitoring solutions in the Prometheus ecosystem: Grafana, Loki, Tempo, VictoriaMetrics
  • Experience managing multi-cluster, multi-cloud Kubernetes deployments
  • Nice to have: Familiarity with Go, GitOps (e.g., Flux, ArgoCD), Postgres, Virtualization (QEMU/KVM)

Benefits

  • You have an opportunity to be an early employee in a fast-scaling, ambitious team
  • You can work 100% remote: we'll handle all formalities to arrange work from your home
  • We grant equity (stock options) for all full-time hires
  • We offer a competitive benefits package in line with other tech companies (top-notch equipment, unlimited vacation, paid parental leave, and much more)
  • We are distributed, yet we build our bonds during regular offsites (the last one was in Lisbon, Portugal)

Neon

Neon Inc is an innovative open-source company dedicated to creating a cloud-native PostgreSQL database service tailored for developers. With a focus on separating storage from compute, Neon enables serverless PostgreSQL solutions. Founded by a team of PostgreSQL experts and led by CEO Nikita Shamgunov, the company emphasizes open-source principles and aims to contribute back to the PostgreSQL and developer communities. Operating as a distributed team of over 90 professionals across more than 25 countries, Neon fosters a culture of transparency, collaboration, and diversity. Backed by top-tier investors, Neon is positioned as a fast-scaling startup with a commitment to operational excellence and cutting-edge database technology.
