Remote Otter LogoRemoteOtter

Site Reliability Engineer (SRE) - Remote

Posted 20 hours ago
DevOps / Sysadmin
Full Time
Worldwide

Overview

OpenFX is on a mission to move money as freely as data, unrestricted by time zones, banking hours, or legacy systems. We are building the infrastructure that powers the next generation of cross-border payment systems for institutions. Our early team comes with experience from J.P. Morgan, Goldman Sachs, FalconX, PayPal, Affirm, Kraken, and Nium, and we’re backed by Accel, Lightspeed, NfX, and other top-tier investors.

As a Site Reliability Engineer, you will ensure the reliability, availability, and performance of OpenFX’s systems. This is a hands-on, high-impact role at the intersection of DevOps and incident response. You will participate in on-call rotations covering U.S. operating hours, triage production issues in real time, and work with engineering pods to quickly resolve or escalate incidents.

In Short

  • Serve as first responder for production incidents during U.S. operating hours (±2h EST).
  • Lead triage during outages, analyzing logs, metrics, and traces to identify root causes.
  • Drive incident postmortems and follow-ups to prevent recurrence.
  • Communicate clearly and quickly during incidents to internal stakeholders.
  • Own reliability outcomes across all OpenFX systems, focusing on uptime, latency, and error budgets.
  • Enhance observability through logging, metrics, alerting, and dashboards.
  • Optimize on-call processes and ensure smooth handoffs across IST, EST, and PST coverage.
  • Proactively identify systemic reliability risks and propose improvements.
  • Contribute automation and tooling to reduce manual incident handling.
  • Champion best practices in reliability engineering and operational excellence.

Requirements

  • 5+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering.
  • Proven experience leading incident response, running postmortems, and communicating during outages.
  • Strong background with cloud infrastructure (AWS preferred), container orchestration (Kubernetes, ECS), and Infrastructure-as-Code (Terraform, CloudFormation).
  • Familiarity with observability stacks (e.g., Prometheus, Grafana, Datadog, ELK, OpenTelemetry).
  • Ability to triage errors at both the infrastructure and application level, and escalate effectively when deeper intervention is required.
  • Ownership mindset with strong communication skills in high-pressure situations.

Benefits

  • Competitive salary and benefits package.
  • Equity in a rapidly growing company.
  • Opportunity to work on mission-critical infrastructure in fintech.
  • A collaborative team culture with a bias toward ownership and outcomes.
  • The chance to make a direct impact on the resilience of global financial infrastructure.
OpenFX logo

OpenFX

OpenFX is a forward-thinking company dedicated to revolutionizing cross-border payment systems by enabling the seamless movement of money akin to data, free from the constraints of time zones and traditional banking systems. With a team comprised of seasoned professionals from prestigious financial institutions such as J.P. Morgan, Goldman Sachs, and PayPal, OpenFX is rapidly scaling its operations, supported by top-tier investors like Accel and Lightspeed. The company fosters a collaborative and dynamic work environment, focusing on building high-performance applications that enhance its financial platform.

Share This Job!

Save This Job!

Similar Jobs:

Meijer Great Lakes LP logo

Site Reliability Engineer (SRE) - Remote

Meijer Great Lakes LP

2 weeks ago

Meijer is looking for a Site Reliability Engineer to enhance reliability and scalability within their Supply Chain Support team.

USA
Full-time
DevOps / Sysadmin
Group 1001 Resources logo

Site Reliability Engineer (SRE) - Remote

Group 1001 Resources

3 weeks ago

Join GROUP1001 as a Site Reliability Engineer (SRE) to ensure the reliability and performance of systems and applications.

USA
Full-time
DevOps / Sysadmin
$180,000 - $200,000/year
Cognigy logo

Site Reliability Engineer (SRE) - Remote

Cognigy

3 weeks ago

Join Cognigy as a Site Reliability Engineer, focusing on automation, system stability, and mentoring within a dynamic engineering team.

Germany
Full-time
DevOps / Sysadmin
Motive logo

Site Reliability Engineer (SRE) - Remote

Motive

4 weeks ago

Join Motive as a Site Reliability Engineer to design and manage AWS-backed infrastructure.

India
Full-time
DevOps / Sysadmin
Cognigy logo

Site Reliability Engineer (SRE) - Remote

Cognigy

4 weeks ago

Join Cognigy as a Site Reliability Engineer (SRE) to automate processes and ensure the reliability of cutting-edge SaaS products in a dynamic team environment.

Germany
Full-time
DevOps / Sysadmin