Remote Otter LogoRemoteOtter

Senior Site Reliability Engineer/DevOps - Remote

Posted 21 weeks ago
DevOps / Sysadmin
Full Time
Worldwide

Overview

An overview paragraph about the job

In Short

  • Manage day-to-day alerts, system checks, and issue escalation as necessary.
  • Provide 24x7 on-call support for critical SaaS events.
  • Document issues and remediation steps.
  • Proactively create monitors within the EKS/K8s ecosystem.
  • Deploy to EKS/K8s cluster using Terraform and Helm/Flux.
  • Enhance infrastructure health by implementing checks and scripts to address known issues.
  • Maintain and develop deployment code.
  • Implement/integrate new technologies into our Cloud Infrastructure.
  • Collaborate with other teams to provide top-notch support and assistance.
  • Prioritize customer focus in planning deployments/updates, ensuring minimal impact.
  • Conduct RCA and take necessary corrective actions to prevent issue recurrence.
  • Assign alert-related actions to the appropriate team after investigation.
  • Handle support requests for environment-specific actions.

Requirements

  • Strong experience with issue processing (RCA, Postmortems).
  • Proficiency in Kubernetes (deployment, scaling, troubleshooting).
  • Familiarity with AWS, Terraform, Docker, CI/CD.
  • Experience with monitoring tools like DataDog, Prometheus, Grafana, and logging solutions like Elasticsearch, Logstash, and Kibana (ELK Stack) or AWS CloudWatch.
  • Strong understanding of networking concepts and protocols.
  • Proficiency in at least one scripting language (e.g., Python, NodeJS, Go).
  • Experience with configuration management tools like FluxCD/ArgoCD.
  • Proficiency in Git or other version control systems.
  • Familiarity with incident response and management tools like PagerDuty, Opsgenie, or VictorOps.
  • Ownership, proactiveness, persistence, and passion for maintaining a high-traffic online platform.

Benefits

  • Quarterly Bonuses based on transparent and systematic evaluation.
  • Flexible Work Schedule.
  • Remote Work Option for Enhanced Flexibility.
  • Comprehensive Medical Insurance for you and your significant other.
  • Financial Support for Life Events.
  • Unlimited Paid Vacation.
  • Unlimited Paid Sick Leave.
  • Reimbursement for professional development courses and training.
Playson logo

Playson

Founded in 2012, Playson is a globally recognized leader in the iGaming industry, providing a high-end micro-service-based platform designed to handle billions of financial transactions daily. The company is dedicated to achieving zero latency and ensuring the best gaming experience for users, regardless of their internet conditions. Playson emphasizes innovation and technical excellence, making significant investments in technology to enhance game performance and connectivity. With a focus on continuous improvement and professional development, Playson offers a supportive and flexible work environment for its employees.

Share This Job!

Save This Job!

Similar Jobs:

Kiln logo

Senior Site Reliability Engineer / DevOps - Remote

Kiln

10 weeks ago

Join Kiln as a Senior Site Reliability Engineer / DevOps to enhance our blockchain infrastructure and services.

France, UK, Italy, Spain, Portugal, Netherlands
Full-time
DevOps / Sysadmin
90000 - 100000€/year

Exadel

Middle/Senior DevOps/Site Reliability Engineer - Remote

Exadel

7 weeks ago

Join Exadel as a Middle/Senior DevOps/Site Reliability Engineer to enhance system reliability and performance.

Hungary, Poland
Full-time
DevOps / Sysadmin

FeverUp

Site Reliability Engineer / DevOps Engineer - Remote

FeverUp

16 weeks ago

Join Fever as a Site Reliability Engineer / DevOps Engineer to design and maintain scalable infrastructures with a focus on automation.

Spain
Full-time
DevOps / Sysadmin
35,000 - 70,000EUR/year
Playson logo

Site Reliability Engineer/DevOps - Remote

Playson

45 weeks ago

Join Playson as a Site Reliability Engineer/DevOps to manage and enhance our cloud infrastructure.

Worldwide
Full-time
DevOps / Sysadmin
Cherre logo

Senior DevOps and Site Reliability Engineer, remote

Cherre

328 weeks ago

Cherre is seeking experienced DevOps and Site Reliability Engineers to build and support infrastructure for real estate data services.

US
Full-time
DevOps / Sysadmin