Remote Otter LogoRemoteOtter

Lead Site Reliability Engineer (SRE) - Remote

Posted 1 week ago

Overview

As a Lead Site Reliability Engineer (SRE) at Corelight, you will ensure the stability, performance, and security of our Federal region’s cloud platform, managing infrastructure and operations with a focus on availability, latency, performance optimization, monitoring, incident response, and capacity planning.

In Short

  • Collaborate with software engineering teams to ensure the reliability, performance, and security of the Federal region’s infrastructure.
  • Design, implement, and manage FedRAMP-compliant infrastructure and systems.
  • Establish continuous monitoring, logging, and auditing processes to ensure compliance with FedRAMP controls.
  • Partner with security teams to conduct security assessments and implement necessary controls.
  • Design and implement scalable infrastructure solutions that support multi-region growth.
  • Drive automation efforts, enabling infrastructure and platforms to scale efficiently with a focus on compliance.
  • Stay up-to-date on best practices, evolving security threats, and FedRAMP guidelines to maintain a strong security posture.
  • Deploy and maintain cloud-native services in AWS that are resilient and elastic.
  • Participate in 24x7 incident response and on-call rotations.
  • Plan for capacity and work with teams to prepare for platform growth.

Requirements

  • 8+ years of experience building and operating FedRAMP environments or similarly regulated systems.
  • Expertise in AWS services (e.g., EC2, S3, RDS, Lambda, ECS/EKS, Glue, EMR, Redshift, OpenSearch, VPC).
  • Deep understanding of the FedRAMP framework, controls, and compliance requirements.
  • Proficiency in programming languages such as Python, Go, or Java.
  • Experience with big data technologies (Hadoop, Spark, Kafka).
  • Strong skills in Infrastructure as Code (IaC) tools like Terraform, CloudFormation, or Ansible.
  • Knowledge of containerization and orchestration tools like Docker and Kubernetes.
  • Experience with CI/CD tools such as Jenkins, GitLab CI, or CircleCI.
  • Proven track record in building and scaling platforms with high availability, resilience, and strict SLO objectives.
  • Strong experience with Unix/Linux systems and cloud providers, ideally AWS.

Benefits

  • Competitive salary and equity options.
  • Flexible work environment with options to work from home.
  • Comprehensive health benefits.
  • Opportunities for professional growth and development.
  • Collaborative and inclusive company culture.

Similar Jobs:

MongoDB

Site Reliability Engineer (SRE) Lead - Remote

MongoDB

4 weeks ago

Join MongoDB as a Site Reliability Engineer (SRE) Lead to build and maintain secure communication infrastructure in a multi-cloud environment.

Site Reliability Engineering
Networking
Distributed Systems
Automation
USA
Full-time
DevOps / Sysadmin
$147,000 - $289,000 USD/year

MongoDB

Site Reliability Engineer (SRE) Lead - Remote

MongoDB

4 weeks ago

Join MongoDB as a Site Reliability Engineer Lead to build and maintain secure communication infrastructure in a multi-cloud environment.

Site Reliability Engineering
Networking
Distributed Systems
Automation
USA
Full-time
DevOps / Sysadmin
$147,000 - $289,000 USD/year

MongoDB

Site Reliability Engineer (SRE) Lead - Remote

MongoDB

4 weeks ago

Join MongoDB as a Site Reliability Engineer (SRE) Lead to build and maintain secure communication infrastructure.

Site Reliability Engineering
Networking
Distributed Systems
Automation
Canada
Full-time
DevOps / Sysadmin
$159,000 - $221,000 CAD/year

MongoDB

Site Reliability Engineer (SRE) Lead - Remote

MongoDB

4 weeks ago

Join MongoDB as a Site Reliability Engineer (SRE) Lead to build and maintain secure communication infrastructure.

Site Reliability Engineering
Networking
Distributed Systems
Automation
Mexico
Full-time
DevOps / Sysadmin

MongoDB

Site Reliability Engineer (SRE) Lead - Remote

MongoDB

4 weeks ago

Join MongoDB as a Site Reliability Engineer (SRE) Lead to build and maintain secure communication infrastructure in a multi-cloud environment.

Site Reliability Engineering
Networking
Distributed Systems
Automation
USA
Full-time
DevOps / Sysadmin
$147,000 - $289,000 USD/year