Remote Otter LogoRemoteOtter

AI Research Engineer (Kernel & Inference Optimization) - Remote

Posted Yesterday
Software Development
Full Time
Worldwide

Overview

As a member of our AI model team, you will drive innovation in model serving and inference architectures for advanced AI systems. Your work will focus on optimizing model deployment and inference strategies to deliver highly responsive, efficient, and scalable performance across real-world applications.

In Short

  • Design and deploy state-of-the-art model serving architectures.
  • Build, run, and monitor controlled inference tests.
  • Identify and prepare high-quality test datasets and simulation scenarios.
  • Analyze computational efficiency and diagnose bottlenecks.
  • Work closely with cross-functional teams to integrate optimized frameworks.
  • A degree in Computer Science or related field, ideally a PhD.
  • Knowledge of Metal Shading Language (MSL).
  • Experience in low-level kernel optimizations and inference optimization.
  • Strong expertise in writing GPU kernels for mobile devices.
  • Understanding of advanced model architectures and inference techniques.

Requirements

  • Proven experience in AI R&D with good publications.
  • Deep understanding of model serving architectures.
  • Experience in developing and deploying end-to-end inference pipelines.
  • Ability to apply empirical research to optimize model serving.
  • Knowledge of Distributed Inference Systems.
  • Understanding of advanced techniques like Pruning and Quantization.

Benefits

  • Work remotely from anywhere in the world.
  • Collaborate with a global team of talented professionals.
  • Opportunity to contribute to cutting-edge fintech solutions.
  • Be part of a pioneering company in the digital finance sector.
  • Engage in a culture of innovation and continuous improvement.
Tether Operations Limited logo

Tether Operations Limited

Tether Operations Limited is at the forefront of the digital finance revolution, providing innovative solutions that empower businesses to integrate reserve-backed tokens across various blockchains. With a commitment to transparency and security, Tether offers a suite of products including the widely trusted stablecoin USDT, energy optimization solutions for Bitcoin mining, and advanced data sharing applications. The company is dedicated to democratizing access to digital education and fostering sustainable growth through cutting-edge technology. Operating as a global team, Tether is focused on pushing the boundaries of fintech and creating a future where technology and human potential converge.

Share This Job!

Save This Job!

Similar Jobs:

BentoML logo

Inference Optimization Engineer - Remote

BentoML

43 weeks ago

Join BentoML as an Inference Optimization Engineer to enhance the efficiency of large language models and contribute to open-source projects.

CA, USA
Full-time
Software Development
SFR3 logo

Operations Research / Optimization Engineer - Remote

SFR3

60 weeks ago

Join a boutique real estate investment fund as an Operations Research / Optimization Engineer, focusing on resource management and operational efficiency.

BR
Full-time
Software Development
CloudZero logo

Cloud Optimization Research Engineer - Remote

CloudZero

63 weeks ago

Join CloudZero as a Cloud Optimization Research Engineer to drive cloud cost efficiency and innovation.

USA
Full-time
Software Development
CloudZero logo

Cloud Optimization Research Engineer - Remote

CloudZero

63 weeks ago

Join CloudZero as a Cloud Optimization Research Engineer to analyze cloud workloads and design algorithms for cost optimization.

USA
Full-time
Software Development
Axelera logo

AI Research Engineer – Data Generation & Optimization - Remote

Axelera

66 weeks ago

Join Axelera as an AI Research Engineer focusing on model compression and optimization for high-performance AI applications.

Italy
Full-time
Software Development