Remote Otter LogoRemoteOtter

Member of Technical Staff - LLM Inference - Remote

Posted 6 weeks ago
Software Development
Full Time
Worldwide

Overview

At .txt, our mission is to make AI reliable. We are the authors of outlines and outlines-core, both leading open source libraries (+10k ⭐️) for structured generation. We raised $11.9 million, which is fueling the efforts of our global, fully remote team to create software that goes beyond simple conversation.

In Short

  • Looking for strong engineers to drive breakthroughs in LLM inference optimization.
  • Focus on backend engineering to cut latency, boost throughput, and reduce resource costs.
  • Experience deploying inference engines like vLLM, SGLang, or TensorRT is required.
  • Experience with distributed inference and low-latency communication (NCCL).
  • Hands-on knowledge of NVIDIA GPU architecture (CUDA, CUDA cores, memory hierarchy).
  • Track record of improving inference performance.
  • Background in LLM MLOps is beneficial.
  • Proficiency in Python and willingness to learn Rust.
  • Understanding of containerization (Docker, Kubernetes) and Linux systems.
  • Remote first work culture.

Requirements

  • Proven experience deploying inference engines like vLLM, SGLang, or TensorRT.
  • Experience with distributed inference (multi-GPU, single node) and low-latency communication (NCCL).
  • Hands-on knowledge of NVIDIA GPU architecture (CUDA, CUDA cores, memory hierarchy).
  • Track record of improving inference performance (e.g., improving throughput by 20% through kernel optimizations).
  • Background in LLM MLOps (monitoring, scaling, fault tolerance for inference services).
  • Proficiency in Python and familiarity with (or willingness to learn) Rust.
  • Understanding of containerization (Docker, Kubernetes) and Linux systems.

Benefits

  • Cutting-edge technology in structured generation.
  • Remote first work culture.
  • Competitive compensation and benefits.
  • Health and dental insurance offered.
  • 401k available for US employees.
  • Provision of GPU if not already owned.
dottxt logo

dottxt

.txt is a pioneering company focused on making AI reliable through structured outputs. With a mission to redefine the capabilities of AI technology, .txt has developed leading open-source libraries, outlines and outlines-core, which have garnered significant recognition in the developer community. The company has successfully raised $11.9 million in funding to support its global, fully remote team in creating innovative software that transcends basic conversational AI. .txt offers proprietary products such as dotjson, dotgrammar, and dotlambda, which enhance the potential of structured generation technology. The company fosters a culture of written communication and emphasizes work-life balance, making it an attractive workplace for those passionate about advancing AI reliability.

Share This Job!

Save This Job!

Similar Jobs:

H Company logo

Member of technical staff (Inference) - Remote

H Company

39 weeks ago

Join H as a member of the technical staff to develop advanced AI inference pipelines.

France, UK, US
Full-time
Software Development
anchorage logo

Member of Technical Staff - Remote

anchorage

48 weeks ago

Join Anchorage Digital as a Member of Technical Staff to work on cloud infrastructure and build systems for a leading digital asset platform.

Worldwide
Full-time
Software Development
Moonvalley AI logo

Member of Technical Staff - Remote

Moonvalley AI

54 weeks ago

Join Moonvalley as a Member of Technical Staff to work on cutting-edge AI technology in a fully remote role.

UK
Full-time
Software Development
anchorage logo

Member of Technical Staff - Remote

anchorage

81 weeks ago

Join Anchorage Digital as a Member of Technical Staff to support and integrate new crypto assets into a leading digital asset platform.

USA
Full-time
Software Development
anchorage logo

Member of Technical Staff - Remote

anchorage

100 weeks ago

Join Anchorage Digital as a Member of Technical Staff to build tools for managing blockchain assets in a collaborative environment.

USA
Full-time
Software Development