Remote Otter LogoRemoteOtter

Site Reliability Engineer (SRE) - Remote

Posted 3 days ago
DevOps / Sysadmin
Full Time
Canada

Overview

Be part of a global team that ensures the performance, scalability, and reliability of critical cloud-based applications. As part of the Global Investor and Distribution Solutions (GIDS) Platform Services team, you’ll play a key role in keeping our systems running smoothly and efficiently—while helping shape the future of our platform.

In Short

  • Collaborate with global teams as part of a follow-the-sun support model.
  • Respond to, troubleshoot, and resolve Level 2 application incidents.
  • Ensure critical applications are effectively monitored using tools like Prometheus and Grafana.
  • Create and maintain dashboards and alerts to enhance visibility into application health.
  • Define, implement, and track key SRE metrics (SLOs, SLIs, error budgets).
  • Partner with development teams to improve application reliability and resilience.
  • Analyze incident trends and recommend improvements to reduce recurrence.
  • Automate repetitive support tasks to improve efficiency.
  • Participate in post-incident reviews and drive reliability initiatives.
  • Perform infrastructure and application patching as part of regular maintenance cycles.
  • Support security vulnerability remediation efforts across both infrastructure and application layers.

Requirements

  • Bachelor’s degree in Computer Science, Computer Engineering, IT, or related field.
  • 5+ years of experience for senior roles; fresh graduates welcome for junior roles.
  • Proficiency in one or more programming languages, preferably Java, JavaScript or Python.
  • Proven ability to troubleshoot complex systems.
  • Skilled in debugging, code optimization, and automation.
  • Experience with relational databases and data analysis.
  • Experience working in Site Reliable Engineer (SRE) roles or incident response environments.
  • Hands-on experience with cloud infrastructure, preferably AWS.
  • Familiarity with observability tools such as Grafana, ELK Stack, or similar.
  • Experience deploying and managing applications on Kubernetes platforms.
  • Strong skills in analyzing and troubleshooting issues in large-scale, distributed systems.
  • Familiarity with PostgreSQL and its performance tuning, monitoring, and troubleshooting.

Benefits

  • Flexibility: Hybrid Work Model & a Business Casual Dress Code, including jeans.
  • Your Future: RRSP Matching Program, Professional Development Reimbursement.
  • Work/Life Balance: Flexible Personal/Vacation Time Off, Sick Leave, Paid Holidays.
  • Your Wellbeing: Medical, Dental, Vision, Employee Assistance Program, Parental Leave.
  • Diversity & Inclusion: Committed to Welcoming, Celebrating and Thriving on Diversity.
  • Training: Hands-On, Team-Customized, including SS&C Learning Institute.
  • Extra Perks: Discounts on fitness clubs, travel and more!
  • Wide-Ranging Perspectives: Committed to Celebrating the Variety of Backgrounds, Talents and Experiences of Our Employees.
SS&C Technologies Canada Corp logo

SS&C Technologies Canada Corp

SS&C Technologies Canada Corp is a global leader in providing software and services for the financial services industry. The company specializes in delivering innovative cloud-based solutions that enhance the performance, scalability, and reliability of critical applications. With a commitment to diversity and inclusion, SS&C fosters a collaborative work environment where employees can thrive. The organization emphasizes professional development and offers a range of benefits to support work-life balance and employee wellbeing.

Share This Job!

Save This Job!

Similar Jobs:

Mattermost logo

Site Reliability Engineer (SRE) - Remote

Mattermost

1 week ago

Join Mattermost as a Site Reliability Engineer (SRE) to enhance the reliability and performance of our secure collaboration platform.

USA
Full-time
DevOps / Sysadmin
$150,000 - $190,000 USD
Valtech logo

Site Reliability Engineer (SRE) - Remote

Valtech

1 week ago

Valtech is seeking a Site Reliability Engineer to enhance system reliability and collaborate with global teams.

Mexico
Full-time
DevOps / Sysadmin
Articul8 logo

Site Reliability Engineer (SRE) - Remote

Articul8

3 weeks ago

Seeking an experienced Site Reliability Engineer to ensure the reliability and scalability of our GenAI SaaS platform.

Brazil
Full-time
DevOps / Sysadmin
OneImaging logo

Site Reliability Engineer (SRE) - Remote

OneImaging

3 weeks ago

Join our infrastructure team as a Site Reliability Engineer (SRE) responsible for the scalability, reliability, and performance of our cloud-based services.

USA
Full-time
DevOps / Sysadmin
Tempo logo

Site Reliability Engineer (SRE) - Remote

Tempo

5 weeks ago

Join Tempo as a Site Reliability Engineer to build and maintain infrastructure for innovative time management solutions.

Worldwide
Full-time
DevOps / Sysadmin