Remote Otter LogoRemoteOtter

Principal Site Reliability Engineer, ML Platform - Remote

Posted 9 hours ago
DevOps / Sysadmin
Full Time
NJ, USA
$164,500 - $235,000 USD/year

Overview

As a Principal Site Reliability Engineer - ML Platform at Zscaler, you will influence the AI strategy and drive innovation for the world's largest security cloud.

In Short

  • Architect and maintain large-scale distributed systems for AI pipelines.
  • Ensure performance and availability of AI-driven applications on AWS.
  • Collaborate on CI/CD pipeline design and implementation.
  • Integrate Kubernetes and ArgoCD into cloud environments.
  • Optimize hosting costs as the FinOps expert for ZAIRe.
  • 10+ years of experience in Site Reliability Engineering is required.
  • Strong programming skills in Python and SQL.
  • Hands-on experience with CI/CD and infrastructure-as-code tools.
  • Knowledge of cloud platforms, preferably AWS.
  • A Bachelor's degree in Computer Science or related field is needed.

Requirements

  • 10+ years in SRE, cloud infrastructure, or applications architecture.
  • Expertise in Kubernetes and Docker.
  • Proficiency in Python, SQL, and distributed processing technologies.
  • Experience with CI/CD and infrastructure-as-code.
  • Strong knowledge of AWS and cloud-native management.
  • Bachelor's degree in Computer Science or related field.

Benefits

  • Various health plans.
  • Time off for vacation and sick leave.
  • Parental leave options.
  • Retirement options.
  • Education reimbursement.
  • In-office perks and more!
Zscaler logo

Zscaler

Zscaler is a global leader in cloud security, dedicated to providing a secure, cloud-enabled digital future for its customers. The company prides itself on its Sales and Go-to-Market team, which consists of passionate professionals focused on nurturing trusted partnerships and delivering exceptional customer experiences. Zscaler's collaborative approach involves various teams, including Sales, Customer Success, and Technology Partnerships, working together to showcase the agility and power of cloud transformation. With a commitment to innovation and excellence, Zscaler aims to solidify its position as a frontrunner in the cloud security industry.

Share This Job!

Save This Job!

Similar Jobs:

Jobgether

Principal Site Reliability Engineer - Remote

Jobgether

8 weeks ago

We are looking for a Principal Site Reliability Engineer to enhance the reliability and efficiency of large-scale distributed systems in a hybrid remote setup.

USA
Full-time
DevOps / Sysadmin
Upwork logo

Principal Site Reliability Engineer - Remote

Upwork

14 weeks ago

Join Upwork as a Principal Site Reliability Engineer to lead and innovate in SRE practices for a global team.

Worldwide
Full-time
DevOps / Sysadmin
Cribl logo

Principal Site Reliability Engineer - Remote

Cribl

18 weeks ago

Join Cribl as a Principal Site Reliability Engineer to enhance observability and reliability in software systems.

USA
Full-time
DevOps / Sysadmin
$240,000 - $400,000/year

Groupon

Principal Site Reliability Engineer - Remote

Groupon

32 weeks ago

Join Groupon as a Principal Site Reliability Engineer to enhance the reliability and scalability of mission-critical systems.

Worldwide
Full-time
DevOps / Sysadmin

Groupon

Principal Site Reliability Engineer - Remote

Groupon

32 weeks ago

Join Groupon as a Principal Site Reliability Engineer to enhance the reliability and scalability of mission-critical systems.

Worldwide
Full-time
DevOps / Sysadmin