RemoteOtter

Site Reliability Engineer - Cloud Operations - Remote

Posted 2 days ago
DevOps / Sysadmin
Full Time
Worldwide

Overview

Join Nexthink's Madrid team as a Site Reliability Engineer. This role focuses on maintaining the high availability and performance of cloud infrastructure and services.

In Short

  • Manage and maintain Kubernetes clusters.
  • Automate routine tasks and streamline operations.
  • Participate in on-call rotation for critical incidents.
  • Proactively identify and troubleshoot system anomalies.
  • Set up monitoring and alerting systems.
  • Continuously assess performance and implement optimizations.
  • Maintain documentation of processes and procedures.

Requirements

  • Experience managing Kubernetes clusters in production.
  • Knowledge of configuration automation (Ansible), CI/CD (Jenkins), and infrastructure as code (Terraform).
  • Familiarity with GitHub, Bitbucket, and the Atlassian suite.
  • Experience in an on-call rotation environment.
  • Strong problem-solving skills.
  • Commitment to maintaining high system reliability.
  • Experience with AWS cloud services.
  • Basic knowledge of Kafka is a plus.
  • Excellent communication skills.

Benefits

  • Permanent contract with a competitive compensation package.
  • Hybrid work model with flexible hours.
  • Unlimited vacation plus 3 company-paid volunteer days.
  • Regular company and team events.
  • Bonuses for referring successful hires.

Nexthink

Nexthink is a leading provider of digital employee experience management software, giving IT leaders the insights to identify, diagnose, and resolve issues affecting employees across applications and networks before those issues become noticeable. With a proactive approach to IT management, Nexthink serves over 1,200 customers and enhances the digital experiences of more than 15 million employees globally. The company is dual-headquartered in Lausanne, Switzerland, and Boston, Massachusetts, and operates nine offices worldwide. As a pioneer in the Digital Employee Experience (DEX) market, Nexthink integrates real-time analytics, automation, and employee feedback to create productive workplaces and satisfied employees. The company values diversity and inclusion, employing over 1,000 individuals from more than 75 nationalities and fostering a collaborative and innovative work environment.

Similar Jobs:

Cloud Site Reliability Engineer (SRE) - Remote

BDR Solutions

10 weeks ago

Join BDR Solutions as a Cloud Site Reliability Engineer, focusing on building and automating infrastructure services for SaaS solutions on Azure and AWS.

USA
Full-time
DevOps / Sysadmin

Cloud Site Reliability Engineer (SRE) - Remote

Ryzlabs

14 weeks ago

RYZ is looking for a Cloud SRE to enhance system resiliency and availability for self-driving robotic carriers.

Argentina, Uruguay
Full-time
DevOps / Sysadmin

Site Reliability Engineer - Cloud Team - Remote

MongoDB

13 weeks ago

Join MongoDB's Cloud Team as a Site Reliability Engineer to design and build global infrastructure for cloud services.

USA
Full-time
DevOps / Sysadmin
$127,000 - $249,000 USD/year
