Remote Otter LogoRemoteOtter

Research Scientist / Engineer – Multimodal Capabilities - Remote

Posted 23 weeks ago
Software Development
Full Time
CA, USA
$200,000 - $300,000/year

Overview

The Multimodal Capabilities team at Luma focuses on unlocking advanced capabilities in our foundation models through strategic research into multimodal understanding and generation. This team tackles fundamental research questions around how different modalities can be combined to enable new behaviors and capabilities, working on the open-ended challenges of what makes multimodal AI systems truly powerful and versatile.

In Short

  • Collaborate with the Foundation Models team to identify capability gaps and research solutions
  • Design datasets, experiments, and methodologies to systematically improve model capabilities across vision, audio, and language
  • Develop evaluation frameworks and benchmarking approaches for multimodal AI capabilities
  • Create prototypes and demonstrations that showcase new multimodal capabilities

Requirements

  • Strong programming skills in Python and PyTorch
  • Experience with multimodal data processing pipelines and large-scale dataset curation
  • Understanding of computer vision, audio processing, and / or natural language processing techniques
  • (Preferred) Expertise working with interleaved multimodal data
  • (Preferred) Hands-on experience with Vision Language Models, Audio Language Models, or generative video models

Benefits

  • Competitive equity packages in the form of stock options
  • Comprehensive benefits plan
Luma AI logo

Luma AI

Luma Ai is dedicated to advancing the field of artificial intelligence through the development of multimodal systems that enhance human creativity and capabilities. The company believes that integrating various forms of data, particularly visual information, is essential for creating more intelligent and interactive AI systems. Luma Ai focuses on training and scaling multimodal foundation models that can perceive, understand, and engage with the world, aiming to deliver high-performance AI solutions across diverse hardware platforms.

Share This Job!

Save This Job!

Similar Jobs:

Anthropic logo

Research Engineer / Research Scientist, Multimodal - Remote

Anthropic

31 weeks ago

Join Anthropic to work on cutting-edge AI systems with a focus on safety and societal impact.

Switzerland
Full-time
Software Development
Anthropic logo

Research Engineer / Research Scientist, Multimodal - Remote

Anthropic

31 weeks ago

Join Anthropic as a Research Engineer to build safe and efficient large-scale machine learning systems.

United Kingdom
Full-time
Software Development
£250,000 - £270,000/year
Anthropic logo

Research Engineer / Research Scientist, Multimodal - Remote

Anthropic

31 weeks ago

Join Anthropic's Multimodal team to work on innovative AI systems and contribute to foundational research in machine learning.

United Kingdom
Full-time
Software Development
£250,000 - £270,000/year
Anthropic logo

Research Engineer/Scientist - Remote

Anthropic

25 weeks ago

Join Anthropic as a Research Engineer/Scientist to build large scale machine learning systems focused on safety and trustworthiness.

USA
Full-time
Software Development
$280,000 - $425,000 USD/year
Avra logo

Research Engineer / Scientist - Remote

Avra

62 weeks ago

Join Avra as a Research Engineer / Scientist to enhance AI models and drive technical excellence in a fully remote role.

Brazil
Full-time
Software Development