RemoteOtter

Senior Site Reliability Engineer (SRE) - Remote

Posted 6 weeks ago
DevOps / Sysadmin
Full Time
Canada

Overview

We are seeking an experienced Senior Site Reliability Engineer (SRE) to join our team. In this role, you will design, implement, and maintain our infrastructure and CI/CD pipelines, with a focus on automation, scalability, and performance.

In Short

  • Design, build, and maintain highly scalable infrastructure using Terraform and Terragrunt.
  • Manage cloud environments, particularly in AWS, ensuring cost optimization, security, and high availability.
  • Work with Confluent Cloud and Kafka to manage and scale data streaming platforms.
  • Deploy and manage Redis instances for caching and real-time data processing.
  • Implement and maintain monitoring and alerting solutions using Prometheus, Grafana, Alertmanager, and Opsgenie.
  • Enable feature flag management and controlled rollouts using LaunchDarkly.
  • Manage Kubernetes clusters using Helm, Argo CD, Istio, and Kustomize.
  • Troubleshoot and resolve complex system issues, ensuring high performance and uptime.
  • Continuously improve automation tools, processes, and methodologies.
  • Stay up to date with emerging SRE trends and technologies.

Requirements

  • 8+ years of proven experience as a Senior SRE or in a similar role.
  • Expertise in Infrastructure as Code (IaC) using Terraform and Terragrunt.
  • Deep knowledge of AWS cloud services and best practices.
  • Hands-on experience with Confluent Cloud and Kafka.
  • Strong experience with Redis for caching and RDS for data storage.
  • Proficiency in monitoring and alerting using Prometheus and Grafana.
  • Experience with LaunchDarkly for feature flag management.
  • Extensive experience managing Kubernetes clusters.
  • Excellent problem-solving skills.
  • Strong communication and collaboration skills.

Benefits

  • Competitive Health, Vision, Dental, and Life Insurance plans.
  • Robust 401(k) plan.
  • Discretionary Time Off.
  • Other minor perks.

Blackpoint Cyber

Blackpoint Cyber is a leading provider of advanced cybersecurity solutions, specializing in threat hunting, detection, and remediation technology. Founded by former National Security Agency (NSA) cyber operations experts, the company leverages national security-grade technology to serve commercial customers globally. Currently experiencing rapid growth, Blackpoint Cyber has recently secured a $190 million Series C funding round, positioning itself as a key player in the cybersecurity industry.
