Sr Site Reliability Engineer, Platform Engineering - Remote

Posted 16 weeks ago
DevOps / Sysadmin
Full Time
USA
$186,000 - $251,000/year

Overview

Kentik is the network observability company. Our platform is a must-have for the network front line, whether digital business, corporate IT, or service provider. Network professionals turn to the Kentik Network Observability Cloud to plan, run, and fix any network, relying on our infinite granularity, AI-driven insights, and insanely fast search.

In Short

  • Ensure our real-time, scalable, microservices-based infrastructure is set up for growth and working efficiently.
  • Work on tools and processes to better monitor our platform as well as ensure its stability through our rapid growth.
  • Deep-dive into diverse topics, from NetFlow and IP routing to database replication strategies or HTTP optimization.
  • Collaborate with engineering and infrastructure teams on finding solutions from an operational perspective.
  • Contribute code, code reviews and tools or patches to all kinds of existing code.
  • Write design documents or collaborate on colleagues’ docs to introduce new features or changes into our infrastructure.
  • Provide valuable feedback on team goals, projects, and processes.

Requirements

  • 5+ years of experience in Systems Administration, Datacenter/IT and/or SRE related projects.
  • Experience working with *nix system command line (e.g. ssh, grep, awk).
  • Detailed understanding of how major internet protocols work (TCP/IP, DNS, HTTP, TLS).
  • Experience with or desire to learn about microservices, containers and orchestration.
  • Networking administration experience: concepts such as routing, firewalls (iptables), and peering sound familiar.
  • A passion for documenting code, processes, and infrastructure in runbooks and wikis.
  • Strong collaboration and communication skills.
  • Worked with a configuration management (infrastructure as code) platform such as Ansible, Puppet, Chef, SaltStack, or CFEngine.
  • Worked with metrics monitoring solutions such as Grafana, Prometheus, and OpenTelemetry.
  • A strong preference towards automation - coding in Bash, Python, Ruby, or Go.
  • Experience with public cloud (AWS, GCP, Azure, etc.) architectures and with managing cloud technologies using Terraform.

Benefits

  • 100% of premiums are paid by the company for health, vision, and dental coverage for you and your dependents.
  • Additionally, an annual Health Reimbursement Account (HRA) of $3,000 for an individual or $4,500 for a family.
  • Paid family & medical leave.
  • Open PTO, a quarterly Wellness Day, and a minimum of 10 paid holidays.
  • 401(k) retirement account.
  • Home office reimbursement.
  • Stock options.

Kentik

Kentik is a leading network observability company that provides a powerful platform for network professionals across digital businesses, corporate IT, and service providers. Their Network Observability Cloud offers AI-driven insights and rapid search capabilities, enabling users to effectively plan, run, and troubleshoot networks. Kentik specializes in analyzing network, cloud, host, and container flow, as well as Internet routing and performance metrics, ensuring that businesses can maintain optimal network performance, health, and security. Trusted by major market leaders like IBM, Box, and Zoom, Kentik is committed to delivering comprehensive network observability solutions. The company fosters a remote-friendly culture and values collaboration, independence, and continuous improvement among its world-class engineering and network expert teams.

Similar Jobs:

Platform Engineer / Site Reliability Engineer - Remote

Codurance

13 weeks ago

Join Codurance as a Platform Engineer to work on cloud migration and CI/CD projects in a collaborative environment.

Worldwide
Contract
DevOps / Sysadmin

Platform & Site Reliability Engineer - Remote

PriceHubble

18 weeks ago

Join PriceHubble as a Senior Engineer to shape cloud architecture and promote DevOps practices in a dynamic team.

Germany
Full-time
DevOps / Sysadmin

Site Reliability Engineer - AI Platform - Remote

Algolia

9 weeks ago

Join Algolia as a Site Reliability Engineer to enhance the AI Platform's infrastructure and ensure reliable AI product delivery.

Worldwide
Full-time
DevOps / Sysadmin

Site Reliability Engineer - AI Platform - Remote

Algolia

9 weeks ago

Join Algolia as a Site Reliability Engineer to enhance the AI Platform and support cloud-based deployments.

Worldwide
Full-time
DevOps / Sysadmin

Staff Site Reliability Engineer - Data Engineering, Platform - Remote

Crunchyroll

20 weeks ago

Join Crunchyroll as a Staff Site Reliability Engineer to enhance the reliability of our data infrastructure.

United States
Full-time
DevOps / Sysadmin
$191,000 - $239,000/year