Remote Otter LogoRemoteOtter

Senior Site Reliability Engineer (SRE) - Remote

Posted 5 days ago
DevOps / Sysadmin
Full Time
Brazil

Overview

We are seeking an experienced Site Reliability Engineer (SRE) to join our team and help ensure the reliability, performance, and scalability of our GenAI SaaS platform. As an SRE, you will bridge the gap between development and operations, implementing automation and best practices to maintain our service reliability objectives while supporting rapid innovation.

In Short

  • Architect and maintain scalable, highly available infrastructure for our GenAI platform.
  • Design and implement robust monitoring, alerting, and observability solutions.
  • Automate deployment, scaling, and management of cloud-native infrastructure.
  • Define, measure, and improve Service Level Objectives (SLOs) and Service Level Indicators (SLIs).
  • Participate in on-call rotations and respond to production incidents.
  • Collaborate with development teams for reliable and efficient systems.
  • Lead incident response efforts and champion continuous improvement.
  • Optimize infrastructure for performance and cost-effectiveness.
  • Implement security best practices across all systems.
  • Create and maintain comprehensive documentation.

Requirements

  • Bachelor's degree in Computer Science, Engineering, or related field.
  • 5+ years of experience in DevOps, SRE, or similar roles.
  • Strong experience with cloud platforms (AWS, GCP, or Azure).
  • Proficiency in at least one programming/scripting language.
  • Hands-on experience with infrastructure as code tools.
  • Solid background in containerization technologies.
  • Proven experience with monitoring and observability tools.
  • Strong understanding of CI/CD pipelines and automation.
  • Exceptional troubleshooting and problem-solving skills.

Benefits

  • Opportunity to work with cutting-edge technology in AI.
  • Collaborative and innovative team environment.
  • Flexible working hours and remote work options.
  • Professional development opportunities.
  • Competitive salary and benefits package.
Articul8 logo

Articul8

Articul8 AI is a forward-thinking company dedicated to creating exceptional AI products that surpass customer expectations. With a strong focus on excellence, the team at Articul8 AI is committed to making a positive impact on the world through innovative solutions. They emphasize collaboration and creativity, fostering an environment that encourages personal and professional growth. By leveraging their expertise in AI and financial services, Articul8 AI aims to transform customer experiences and drive enterprise-level outcomes in the financial industry.

Share This Job!

Save This Job!

Similar Jobs:

Plasma logo

Senior Site Reliability Engineer (SRE) - Remote

Plasma

1 week ago

Seeking a Senior Site Reliability Engineer (SRE) to enhance the reliability and performance of our blockchain protocol in the APAC region.

APAC
Full-time
DevOps / Sysadmin
Plum Fintech logo

Senior Site Reliability Engineer (SRE) - Remote

Plum Fintech

2 weeks ago

Join Plum as a Senior Site Reliability Engineer to enhance system resilience and support growth.

Greece
Full-time
DevOps / Sysadmin
Veeva Systems logo

Senior Site Reliability Engineer - SRE - Remote

Veeva Systems

6 weeks ago

Veeva Systems is seeking a Senior Site Reliability Engineer to enhance the scalability and reliability of their cloud applications.

USA
Full-time
Software Development
$110,000 - $270,000/year
Sailpoint Technologies logo

Senior Site Reliability Engineer (SRE) - Remote

Sailpoint Technologies

7 weeks ago

Join an Identity Security Cloud software development team as a Senior Site Reliability Engineer (SRE) to ensure the reliability, scalability, and performance of services.

Mexico
Full-time
DevOps / Sysadmin
CookUnity logo

Senior Site Reliability Engineer (SRE) - Remote

CookUnity

10 weeks ago

CookUnity is looking for a Senior Site Reliability Engineer to manage and enhance their cloud-native infrastructure.

Latam
Full-time
DevOps / Sysadmin