Remote Otter LogoRemoteOtter

Assistant Vice President, Site Reliability Engineering - Remote

Posted 8 weeks ago
DevOps / Sysadmin
Full Time
Worldwide

Overview

The Site Reliability Engineering team at iCapital is fundamental to ensuring our platform delivers consistent, reliable service to our client base. As an Assistant Vice President, you'll work at the intersection of software engineering and operations, applying engineering principles to infrastructure challenges.

In Short

  • Design, implement, and maintain service level objectives (SLOs) that align with business goals and customer expectations
  • Develop observability strategies, focusing on meaningful metrics that drive actionable insights
  • Architect and implement scalable infrastructure solutions using cloud-native technologies and infrastructure as code
  • Drive automation initiatives to eliminate toil and improve system reliability
  • Champion reliability best practices across development teams through consultation and tooling
  • Design and operation of a Kubernetes environment for container management and orchestration
  • Lead incident response, conduct thorough postmortems, and drive systematic improvements
  • Participate in on-call rotations with a focus on continuous service improvement

Requirements

  • 5+ years of SRE experience or related experience with 3+ years in AWS
  • Strong experience with container orchestration platforms like Kubernetes and related ecosystem tools
  • Working knowledge of databases such as MongoDB, Postgres, DynamoDB
  • Strong foundation in reliability engineering principles and distributed systems behavior
  • Experience defining and implementing SLOs/SLIs and using them to drive system improvements
  • Demonstrated ability to design and implement observability solutions that provide actionable insights while minimizing alert fatigue
  • Coding abilities in at least one IaC language (Terraform strongly preferred) and one programming language such as Python, Ruby or Java with a focus on maintainable, tested code
  • Understanding of modern observability practices and experience implementing and maintaining monitoring solutions (Prometheus/Grafana, Splunk, NewRelic, CloudWatch, and ELK in the cloud)
  • Strong incident response skills with experience leading incident retrospectives and driving improvements
  • Excellent problem-solving abilities and experience debugging distributed systems
  • Track record of successfully automating operations and reducing toil
  • Strong communication skills with ability to explain complex technical concepts to diverse audiences
  • A desire to share, teach, and learn as part of a team

Benefits

  • Comprehensive benefits package including competitive salary, annual performance bonus, and equity for all full-time employees
  • Healthcare with 100% employer-paid health and dental insurance
  • Generous paid time off (PTO)
iCapital logo

iCapital

iCapital is a leading financial technology platform that revolutionizes the alternative investment marketplace, enabling advisors, wealth management firms, asset managers, and banks to effectively evaluate and recommend tailored public and private market strategies for high-net-worth clients. With approximately $209 billion in global client assets invested across 1,690 funds as of November 2024, iCapital has earned recognition as a top fintech company, being named to the Forbes Fintech 50 for seven consecutive years and receiving multiple awards for its innovative solutions. The company is committed to providing exceptional client service and fostering inclusive workplace practices.

Share This Job!

Save This Job!

Similar Jobs:

Graylog logo

Vice President, Engineering - Remote

Graylog

15 weeks ago

Graylog is looking for a Vice President of Engineering to lead global engineering teams and deliver innovative security solutions.

US, Germany, UK
Full-time
Software Development
Citibank, N.A logo

Senior Data Engineer - Assistant Vice President - Remote

Citibank, N.A

7 weeks ago

The Senior Data Engineer will develop high-quality data products to support regulatory requirements and data-driven decision-making.

India
Full-time
Data Analysis
Forbes Advisor logo

Staff Engineer - Site Reliability Engineering (SRE) - Remote

Forbes Advisor

11 weeks ago

Join Forbes Advisor as a Staff Engineer in Site Reliability Engineering, focusing on system reliability and performance.

India
Full-time
DevOps / Sysadmin
Komodo Health logo

Vice President, Sales Engineering - Remote

Komodo Health

14 weeks ago

The Vice President of Sales Engineering will lead a team to deliver innovative healthcare data solutions and drive customer success.

USA
Full-time
Sales / Business
$230,000 - $311,000 USD/year

Jerry

Vice President of Engineering - Remote

Jerry

7 weeks ago

Join a pre-IPO startup as the Vice President of Engineering, leading a talented team and driving product innovation.

CA, USA
Full-time
Software Development