Remote Otter LogoRemoteOtter

Site Reliability Engineer - Remote

Posted 7 days ago
DevOps / Sysadmin
Full Time
CA, USA
$245,000 - $385,000/year

Overview

Lambda is on a mission to be the world's top AI computing platform, equipping engineers with tools to deploy AI that is fast, secure, affordable, and built to scale.

In Short

  • Design and implement full-stack monitoring solutions.
  • Create robust CI/CD pipelines for reliable deployments.
  • Develop and maintain API gateway configurations.
  • Implement SLIs, SLOs, and error budgets.
  • Architect resilient application patterns.
  • Build and optimize caching strategies.
  • Collaborate with development teams to establish benchmarks.
  • Create runbooks and automated remediation.
  • Lead incident response for application issues.
  • Optimize resource utilization across servers.

Requirements

  • 7+ years in Site Reliability Engineering or DevOps roles.
  • Strong understanding of modern web application architectures.
  • Experience with React and Python frameworks like Django.
  • Knowledge of database performance tuning.
  • Experience with RESTful APIs and microservices.
  • Proficiency with Infrastructure as Code (Terraform).
  • Proficiency with CI/CD pipelines (Argo).
  • Understanding of web application security.
  • Experience with cloud platforms (AWS, GCP, Azure).
  • Knowledge of containerization and orchestration (Docker, Kubernetes).

Benefits

  • Generous cash & equity compensation.
  • Health, dental, and vision coverage for you and your dependents.
  • 401k Plan with 2% company match (USA employees).
  • Flexible Paid Time Off Plan.
  • Commuter/Work from home stipends for select roles.
Lambda logo

Lambda

Founded in 2012, Lambda is a rapidly growing AI computing platform that originated from a team of AI engineers dedicated to advancing machine learning. The company focuses on providing engineers with robust tools for deploying AI solutions that are fast, secure, and scalable, whether through powerful on-site GPU hardware or flexible cloud-based options. Lambda's AI Cloud is trusted by leading companies and research institutions, aiming to make computation as accessible and essential as electricity. With a commitment to innovation and high demand for its systems, Lambda offers competitive compensation, comprehensive benefits, and a collaborative work environment.

Share This Job!

Save This Job!

Similar Jobs:

Referral Board logo

Site Reliability Engineer - Remote

Referral Board

7 days ago

Join Elastic as a Site Reliability Engineer to enhance the reliability of their global infrastructure while working in a collaborative environment.

USA
Full-time
DevOps / Sysadmin
$149,900 - $195,600 USD/year

ZENVIA

Site Reliability Engineer - Remote

ZENVIA

1 week ago

Worldwide
Full-time
DevOps / Sysadmin
Pinterest logo

Site Reliability Engineer - Remote

Pinterest

2 weeks ago

Pinterest is seeking a Site Reliability Engineer to ensure the reliability of its large-scale distributed systems.

USA
Full-time
Software Development
Printify logo

Site Reliability Engineer - Remote

Printify

2 weeks ago

Join our team as a Site Reliability Engineer, responsible for ensuring the reliability of our distributed systems and platforms in a dynamic international environment.

Worldwide
Full-time
DevOps / Sysadmin
Zepz logo

Site Reliability Engineer - Remote

Zepz

2 weeks ago

Join Zepz as a Site Reliability Engineer to enhance service stability and resilience through innovative automation and observability practices.

South Africa
Full-time
DevOps / Sysadmin