RemoteOtter

Senior Python Systems Engineer (Agent & Infrastructure) - Remote

Posted 22 hours ago
DevOps / Sysadmin
Full Time
Italy

Overview

We are looking for a Senior Systems Engineer to own the execution layer of the ClearML platform, including the critical components that manage containers and GPU resources.

In Short

  • Design and optimize the clearml-agent, a Python service for executing ML pipelines.
  • Interact with Kubernetes APIs to manage Pod lifecycles and CRDs.
  • Implement dynamic resource allocation for GPU/CPU/Memory.
  • Build robust daemons and services for OS-level interactions.
  • Troubleshoot and optimize networking for seamless connectivity.
  • Work at the intersection of Software Engineering and DevOps.
  • Contribute to infrastructure management across the AI lifecycle.
  • Collaborate with a team focused on solving critical challenges.
  • Support various environments for ClearML operations.
  • Engage in a mission-driven company with a focus on AI.

Requirements

  • Strong experience in Python programming.
  • Proficiency with Docker and Kubernetes.
  • Experience in systems programming and networking.
  • Knowledge of resource management and orchestration.
  • Ability to troubleshoot and optimize system connectivity.
  • Self-driven and curious mindset.
  • Experience in AI or ML environments is a plus.
  • Strong problem-solving skills.
  • Ability to work in a fast-paced environment.
  • Effective communication skills.

Benefits

  • Opportunity to work on cutting-edge AI infrastructure.
  • Collaborative and innovative work environment.
  • Impactful work on global challenges.
  • Professional growth and development opportunities.
  • Flexible work arrangements.

ClearML

ClearML is a rapidly growing company dedicated to simplifying infrastructure management throughout the AI lifecycle, from model development to large-scale deployment. With a mission to empower over 2,000 organizations, including AI builders and IT teams, ClearML's platform supports a wide range of applications, from early-stage research and development to critical public sector and enterprise AI pipelines. The company is committed to addressing significant global challenges, such as advancing healthcare, discovering new medicines, enhancing financial security, safeguarding national security, and protecting the environment. ClearML seeks innovative and self-motivated individuals to help shape the future of AI and its underlying infrastructure.
