Remote Otter LogoRemoteOtter

Sr Site Reliability Engineer, Platform Engineering - Remote

Posted 10 weeks ago
Kentik logo

Kentik

DevOps / Sysadmin
Full Time
USA
$186,000 - $251,000/year

Share This Job!

Overview

Kentik is the network observability company. Our platform is a must-have for the network front line, whether digital business, corporate IT, or service provider. Network professionals turn to the Kentik Network Observability Cloud to plan, run, and fix any network, relying on our infinite granularity, AI-driven insights, and insanely fast search.

In Short

  • Ensure our real-time, scalable, microservices-based infrastructure is set up for growth and working efficiently.
  • Work on tools and processes to better monitor our platform as well ensure its stability through our rapid growth.
  • Deep-diving into diverse topics, from NetFlow and IP routing, to database replication strategies or HTTP optimization.
  • Collaborate with engineering and infrastructure teams on finding solutions from an operational perspective.
  • Contribute code, code reviews and tools or patches to all kinds of existing code.
  • Write design documents or collaborate on colleagues’ docs to introduce new features or changes into our infrastructure.
  • Provide valuable feedback on team goals, projects, and processes.

Requirements

  • 5+ years of experience in Systems Administration, Datacenter/IT and/or SRE related projects.
  • Experience working with *nix system command line (e.g. ssh, grep, awk).
  • Detailed understanding of major internet protocols works (tcp/ip, dns, http, TLS).
  • Experience with or desire to learn about microservices, containers and orchestration.
  • Networking administration experience: concepts such as routing, firewalls (iptables), peering sound familiar.
  • A passion for documenting code, processes, and infrastructure in runbooks and wikis.
  • Strong collaboration and communication skills.
  • Worked with a configuration management (infrastructure as code) platform such as: Ansible, Puppet, Chef, SaltStack or CFEngine.
  • Worked with metrics monitoring solutions such as grafana, prometheus, and OpenTelemetry.
  • A strong preference towards automation - coding in Bash, Python, Ruby, or Go.
  • Experience with public cloud (AWS, GCP, Azure, etc.) architectures and technologies management using Terraform.

Benefits

  • 100% of premiums are paid by company for health, vision and dental coverage for you and your dependents.
  • Additionally, an annual Health Reimbursement Account (HRA) of $3,000 for an individual or $4,500 for a family.
  • Paid family & medical leave.
  • Open PTO, a quarterly Wellness Day, and a minimum of 10 paid holidays.
  • 401(k) retirement account.
  • Home office reimbursement.
  • Stock options.

Similar Jobs:

Codurance logo

Platform Engineer / Site Reliability Engineer - Remote

Codurance

7 weeks ago

Join Codurance as a Platform Engineer to work on cloud migration and CI/CD projects in a collaborative environment.

DevOps
GitOps
CI/CD
Cloud Migration
Worldwide
Contract
DevOps / Sysadmin
PriceHubble logo

Platform & Site Reliability Engineer - Remote

PriceHubble

12 weeks ago

Join PriceHubble as a Senior Engineer to shape cloud architecture and promote DevOps practices in a dynamic team.

Site Reliability Engineering
Platform Engineering
DevOps
Terraform
Germany
Full-time
DevOps / Sysadmin

Algolia

Site Reliability Engineer - AI Platform - Remote

Algolia

3 weeks ago

Join Algolia as a Site Reliability Engineer to enhance the AI Platform's infrastructure and ensure reliable AI product delivery.

Kubernetes
Cloud Providers
GCP
AWS
Worldwide
Full-time
DevOps / Sysadmin

Algolia

Site Reliability Engineer - AI Platform - Remote

Algolia

3 weeks ago

Join Algolia as a Site Reliability Engineer to enhance the AI Platform and support cloud-based deployments.

Kubernetes
Container Orchestration
Infrastructure AS Code
Terraform
Worldwide
Full-time
DevOps / Sysadmin
Crunchyroll logo

Staff Site Reliability Engineer - Data Engineering, Platform - Remote

Crunchyroll

13 weeks ago

Join Crunchyroll as a Staff Site Reliability Engineer to enhance the reliability of our data infrastructure.

Site Reliability Engineering
Data Infrastructure
AWS
Monitoring Tools
United States
Full-time
DevOps / Sysadmin
$191,000 - $239,000/year