Remote Otter LogoRemoteOtter

Research Scientist / Engineer – Performance Optimization - Remote

Posted 26 weeks ago
Software Development
Full Time
CA, USA
$180,000 - $250,000/year

Overview

The Performance Optimization team at Luma is dedicated to maximizing the efficiency and performance of our AI models. Working closely with both research and engineering teams, this group ensures that our cutting-edge multimodal models can be trained efficiently and deployed at scale while maintaining the highest quality standards.

In Short

  • Profile and optimize GPU/CPU/Accelerator code for maximum utilization and minimal latency
  • Write high-performance PyTorch, Triton, CUDA, deferring to custom PyTorch operations if necessary
  • Develop fused kernels and leverage tensor cores and modern hardware features for optimal hardware utilization on different hardware platforms
  • Optimize model architectures and implementations for distributed multi-node production deployment
  • Build performance monitoring and analysis tools and automation
  • Research and implement cutting-edge optimization techniques for transformer model

Requirements

  • Expert-level proficiency in Triton/CUDA programming and GPU optimization
  • Strong PyTorch skills
  • Experience with PyTorch kernel development and custom operations
  • Proficiency with profiling tools (NVIDIA Nsight, torch profiler, custom tooling)
  • Deep understanding of transformer architectures and attention mechanisms
  • (Preferred) Experience with compilers/exporters such as torch.compile, TensorRT, ONNX, XLA
  • (Preferred) Experience optimizing inference workloads for latency and throughput
  • (Preferred) Experience with Triton compiler and kernel fusion techniques
  • (Preferred) Knowledge of warp-level intrinsics and advanced CUDA optimization
  • (Preferred) Background in compiler optimization or hardware-software co-design

Benefits

  • Competitive equity packages in the form of stock options
  • Comprehensive benefits plan
Luma AI logo

Luma AI

Luma AI is dedicated to advancing multimodal artificial intelligence to enhance human creativity and capabilities. The company believes that integrating various modalities is essential for developing intelligent systems that surpass traditional language models. Luma AI focuses on training and scaling multimodal foundation models that can perceive, comprehend, and interact with the world, aiming to create systems that are not only aware but also capable of effecting meaningful change. The team is committed to optimizing performance across diverse hardware platforms, ensuring that their state-of-the-art models are accessible to a wide audience at the best performance-to-cost ratio.

Share This Job!

Save This Job!

Similar Jobs:

Blend360 logo

Data Scientist - AI Performance Optimization - Remote

Blend360

1 week ago

Join Blend as a Data Scientist to optimize AI performance and tackle computational challenges.

USA
Full-time
Data Analysis
Eclipse logo

Performance Research Engineer - Remote

Eclipse

34 weeks ago

Join Eclipse as a Research Engineer to enhance the performance of the fastest Ethereum Layer 2 execution environment.

Worldwide
Full-time
Software Development
$300,000 - $550,000/year
SFR3 logo

Operations Research / Optimization Engineer - Remote

SFR3

21 weeks ago

Join a boutique real estate investment fund as an Operations Research / Optimization Engineer, focusing on resource management and operational efficiency.

BR
Full-time
Software Development
Anthropic logo

Research Engineer/Scientist - Remote

Anthropic

25 weeks ago

Join Anthropic as a Research Engineer/Scientist to build large scale machine learning systems focused on safety and trustworthiness.

USA
Full-time
Software Development
$280,000 - $425,000 USD/year
Avra logo

Research Engineer / Scientist - Remote

Avra

63 weeks ago

Join Avra as a Research Engineer / Scientist to enhance AI models and drive technical excellence in a fully remote role.

Brazil
Full-time
Software Development