Remote Otter LogoRemoteOtter

Java Site Reliability Engineer (SRE) - Remote

Posted 179 weeks ago
Software Development
Full Time
Worldwide

Overview

This role is focused on ensuring the reliability and scalability of a large-scale distributed streaming platform using Java, Apache Kafka, and Apache Flink.

In Short

  • Work on a distributed streaming platform handling over 4M messages per second.
  • Architect and implement software solutions to improve system stability and scalability.
  • Participate in critical technical decision-making within the team.
  • Monitor system health and handle outages effectively.
  • Collaborate with multi-functional teams.
  • Own multiple services and ensure reliability across data centers.
  • Conduct capacity tests to manage system growth.
  • Share on-call rotation responsibilities for incident escalation.
  • Utilize hands-on experience in Linux administration.
  • Bring creative problem-solving skills to the team.

Requirements

  • Experience in building and maintaining complex, scalable systems.
  • Strong understanding of Software Engineering and Computer Science principles.
  • Ability to create software solutions from scratch.
  • Fluency in English, both spoken and written.
  • Experience with networking, security, and storage is a plus.
  • Hands-on experience with Linux administration.
  • Familiarity with defining SLIs and SLOs.
  • Experience with Apache Kafka and/or Apache Flink is an advantage.

Benefits

  • Work in a fully remote environment.
  • Opportunity to work with cutting-edge technologies.
  • Collaborative and supportive team culture.
  • Flexible working hours.
  • Professional development opportunities.

I.R

InterContinental Recruiting

InterContinental Recruiting's client is a dynamic digital products and solutions company that delivers impactful results across various industries, including Consumer Lending, Manufacturing, Media & Entertainment, and Retail. With a global presence in North America, Europe, and Asia-Pacific, the company focuses on building and maintaining complex, scalable, and distributed systems, emphasizing software engineering principles and innovative problem-solving. They are dedicated to enhancing the reliability and scalability of large-scale distributed streaming platforms, utilizing technologies like Apache Kafka and Apache Flink, while fostering collaboration across multi-functional teams.

Share This Job!

Save This Job!

Similar Jobs:

Top Hat logo

Site Reliability Engineer (SRE) - Remote

Top Hat

10 weeks ago

Join our Core Platform team as a Site Reliability Engineer to enhance software delivery performance and mentor teams in DevOps practices.

Canada
Full-time
DevOps / Sysadmin
Gorilla Logic logo

Site Reliability Engineer (SRE) - Remote

Gorilla Logic

11 weeks ago

Gorilla Logic is looking for a Site Reliability Engineer (SRE) to lead observability and monitoring initiatives using Dynatrace.

Colombia
Full-time
DevOps / Sysadmin
PRAGMATIKE logo

Site Reliability Engineer (SRE) - Remote

PRAGMATIKE

11 weeks ago

Join us as a Site Reliability Engineer (SRE) to ensure the reliability and scalability of our AWS infrastructure in a fully remote role.

Worldwide
Full-time
DevOps / Sysadmin

Blackfluo.ai

Site Reliability Engineer (SRE) - Remote

Blackfluo.ai

11 weeks ago

Join our team as a Site Reliability Engineer (SRE) to enhance the reliability and scalability of our AWS infrastructure.

Worldwide
Full-time
DevOps / Sysadmin
Arista Networks logo

Site Reliability Engineer (SRE) - Remote

Arista Networks

12 weeks ago

Join Arista Networks as a Site Reliability Engineer to manage and enhance the global CloudVision service fleet.

Ireland
Full-time
DevOps / Sysadmin