Site Reliability Engineer - Remote

Posted 72 weeks ago

Software Development

Full Time

USA

Site Reliability Engineering

Infrastructure AS Code

AWS

Distributed Systems

Overview

The Site Reliability Engineering organization at Pinterest is accountable for ensuring overall Pinterest availability as well as enhancing Engineering teams’ capability to design, build and operate robust systems at scale. Pinterest’s applications and infrastructure that handle billions of monthly page views and petabytes of data as Pinterest continues to grow and scale. As a Pinterest SRE, you will design and build systems, platforms, tools, frameworks and methodologies to assure the reliability of our large-scale distributed systems.

In Short

Develop software solutions to enable reliability and operability of large scale distributed systems handling petabytes of data and serving
Build a deep understanding of how Pinterest’s systems behave, scale, interact and fail, and use that insight to identify risks and opportunities for remediation
Build tools and automation to eliminate toil and reduce operational overhead. Create frameworks, processes and best practices to be used across Pinterest Engineering
Build meaningful, insightful and actionable SLIs
Automate critical portions of Pinterest’s engineering processes, to minimize risk and maximize the speed of innovation
Manage capacity and performance to help scale our infrastructure both on public and private clouds around the world

Requirements

5+ years of industry experience, building and operating large scale, high performance distributed systems
Experience programming with Python or Go
Strong knowledge of Linux/Unix/BSD internals and experience working with open source software (e.g. MySQL, Hadoop, Envoy, HAProxy, Nginx)
Experience with technologies such as ElasticSearch, ZooKeeper, HBase, Hadoop, Memcache and Kafka with a focus on reliability, automation, operability and performance
Infrastructure as code a plus (e.g. Terraform, Puppet, Chef, Ansible, Salt, Fabric, Docker, etc)
Bonus points if experienced with deploying web apps to cloud infrastructure (AWS, etc.) and working with distributed, service-oriented architecture
Bachelor’s degree in a relevant field such as Computer Science, or equivalent experience

Benefits

Flexible work model with in-office collaboration 1-2 times every 6 months
Opportunity to work on large-scale systems
Support for professional growth and development
Inclusive and diverse work environment

Pinterest is a global platform where millions of users come to discover new ideas, find inspiration, and plan for what matters most in their lives. The company's mission is to help individuals create a life they love by providing a positive and engaging online experience. Pinterest fosters a culture of growth and collaboration, encouraging employees to bring their unique perspectives to the table. With a progressive work model called PinFlex, Pinterest emphasizes flexibility in work arrangements while maintaining a strong focus on user engagement and product innovation.

Share This Job!

Save This Job!

Jobs from Pinterest:

Senior Paralegal

Litigation Support

Case Management

Discovery Coordination

Data Science Team Lead

Data Science

Team Management

Business Acumen

Programmatic Ads Sales Lead

Programmatic Advertising

Digital Media Sales

AD Tech

Associate Creative Director, Performance Marketing

Performance Marketing

Creative Direction

Advertising

Global Mobility & Immigration Lead

Global Mobility

Immigration

Project Management