Remote Otter LogoRemoteOtter

Site Reliability Engineer - SRE - Remote

Posted Yesterday
DevOps / Sysadmin
Full Time
Worldwide

Overview

The team is responsible for ensuring the reliability, availability, and scalability of solutions supporting one of Zenvia's most strategic products. We work in an on-premise environment, applying best practices in Infrastructure as Code (IaC) and automation to maintain robust and efficient operations.

In Short

  • Support the development of projects in collaboration with all technology teams.
  • Build relationships with peers.
  • Participate in designing and implementing improvements to increase availability, scalability, reliability, and security.
  • Support and/or create automated routines for infrastructure provisioning using automation tools.
  • Assist in incident resolution and postmortem documentation.
  • Work on reliability and performance aspects of databases within SRE squads.
  • Analyze solutions and implement best practices for backing services clusters (Kubernetes, Kafka, MongoDB, Redis).
  • Work on observability metrics and ensure objectives are met.
  • Collaborate with SRE peers on production environment implementations or changes to mitigate incidents.
  • Support infrastructure automation to facilitate and speed up the availability of new infrastructure components.

Requirements

  • Experience with on-premise infrastructure.
  • Experience with Linux operating systems (Redhat, Debian & Ubuntu).
  • Knowledge of private (VMWare) and public (AWS) cloud.
  • Knowledge in IaC (Infrastructure as Code using Ansible and Gitlab).
  • Knowledge in Kubernetes and container orchestration.
  • Knowledge of observability tools (Prometheus, Grafana, Elastic & Zabbix).
  • Knowledge in shell scripting and programming languages (Bash and Shell).
  • Creativity, willingness to learn new technologies, and good communication.

Benefits

  • 100% remote work with flexible hours.
  • Zenvia Care program for you and your family.
  • Healthcare plan without co-participation for employees and the option to include dependents.
  • Wellness care programs including sports coaching and telepsychology.
  • Extended parental leave and support for new parents.
  • Remote Care policy including financial assistance for home office setup.
  • Career development programs and internal mobility opportunities.
  • Participation in results based on contributions.
  • Informal work environment promoting knowledge sharing.
  • Transparency and open dialogue are essential values.

ZENVIA

ZENVIA

Zenvia is a forward-thinking technology company that specializes in providing a unified, multichannel customer communication platform known as Zenvia Customer Cloud. This innovative solution enables businesses to create personalized and seamless customer experiences through various channels, including WhatsApp, SMS, and chatbots. Zenvia is recognized for fostering a collaborative environment that values employee autonomy and encourages continuous development. The company is committed to inclusivity and diversity, offering a fully remote and flexible work culture, comprehensive benefits, and opportunities for professional growth. With a focus on innovation and strategic transformation, Zenvia is dedicated to impacting millions of people globally through cutting-edge technologies.

Share This Job!

Save This Job!

Similar Jobs:

Top Hat logo

Site Reliability Engineer (SRE) - Remote

Top Hat

7 weeks ago

Join our Core Platform team as a Site Reliability Engineer to enhance software delivery performance and mentor teams in DevOps practices.

Canada
Full-time
DevOps / Sysadmin
Gorilla Logic logo

Site Reliability Engineer (SRE) - Remote

Gorilla Logic

7 weeks ago

Gorilla Logic is looking for a Site Reliability Engineer (SRE) to lead observability and monitoring initiatives using Dynatrace.

Colombia
Full-time
DevOps / Sysadmin
PRAGMATIKE logo

Site Reliability Engineer (SRE) - Remote

PRAGMATIKE

7 weeks ago

Join us as a Site Reliability Engineer (SRE) to ensure the reliability and scalability of our AWS infrastructure in a fully remote role.

Worldwide
Full-time
DevOps / Sysadmin

Blackfluo.ai

Site Reliability Engineer (SRE) - Remote

Blackfluo.ai

8 weeks ago

Join our team as a Site Reliability Engineer (SRE) to enhance the reliability and scalability of our AWS infrastructure.

Worldwide
Full-time
DevOps / Sysadmin
Arista Networks logo

Site Reliability Engineer (SRE) - Remote

Arista Networks

8 weeks ago

Join Arista Networks as a Site Reliability Engineer to manage and enhance the global CloudVision service fleet.

Ireland
Full-time
DevOps / Sysadmin