RemoteOtter

System Reliability Engineer - Contract - Remote

Posted 41 weeks ago
DevOps / Sysadmin
Contract
Mexico

Overview

As a System Reliability Engineer, you will play a key role in managing Linux and Windows environments, automating processes, and implementing robust monitoring and security practices. Your expertise will help us maintain high availability and performance across our client's systems. If you thrive on solving complex problems and optimizing systems, we want to hear from you!

In Short

  • Manage, configure, and maintain Linux and Windows Server environments.
  • Perform regular system updates, patches, and security configurations.
  • Implement and maintain monitoring tools to track system performance, availability, and reliability.
  • Analyze performance metrics and logs to identify and resolve issues proactively.
  • Collaborate with stakeholders to create dashboards and alerts for proactive performance monitoring.
  • Develop and maintain automation scripts for routine tasks, deployments, and incident responses.
  • Use configuration management tools to ensure consistent and repeatable system setups.
  • Implement and enforce security best practices for system configurations and network setups.
  • Conduct regular vulnerability assessments and apply necessary patches to mitigate risks.
  • Work closely with development, DevSecOps, and cloud engineering teams to support application deployments and infrastructure changes.
  • Provide technical guidance and support for resolving complex system issues.
  • Create and maintain detailed documentation for system configurations, procedures, and incident reports.
  • Identify opportunities for process improvements and implement changes to enhance system reliability and performance.

Requirements

  • Proficiency in managing and troubleshooting Linux (e.g., Amazon Linux, CentOS) and Windows Server systems.
  • Experience with system configuration, management, and maintenance.
  • Experience with automation tools such as Ansible, Puppet, or Chef.
  • Familiarity with monitoring solutions such as AWS CloudWatch, Dynatrace, or Datadog.
  • Ability to analyze system performance metrics and implement optimizations.
  • Experience with patch management, vulnerability assessment, and remediation.
  • Proficiency in scripting languages such as Bash, Python, and PowerShell for automating administrative tasks.
  • Experience with version control systems like Git.
  • Familiarity with AWS, specifically in managing EC2 instances, Lambda functions, and containers.
  • Familiarity with AWS Systems Manager features, specifically Patch Manager and Run Command.
  • Familiarity with incident response, troubleshooting, and root cause analysis.
  • Familiarity with infrastructure as code (IaC) tools like Terraform or AWS CloudFormation.

Benefits

  • Remote Work Opportunities
  • Flexible Work Hours

Techholding

Techholding is a full-service consulting firm dedicated to delivering predictable outcomes and high-quality solutions to its clients. Founded by industry veterans with experience in both startups and Fortune 50 companies, Techholding emphasizes deep expertise, integrity, transparency, and dependability in its operations. The company fosters a culture of innovation and operational excellence, providing opportunities for employees to contribute to the enhancement of DevOps and security practices. Committed to diversity and inclusion, Techholding welcomes applicants from all backgrounds and experiences.
