RemoteOtter

Gen AI Audio Researcher - Remote

Posted 21 weeks ago
Software Development
Full Time
India

Overview

We are looking for a Gen AI Researcher for Audio to join our team and help develop next-generation voice synthesis models. You'll research and build deep learning systems that can generate expressive, natural-sounding speech from text or audio prompts, and collaborate with cross-functional teams to integrate your work into production-ready pipelines.

In Short

  • Research and develop state-of-the-art voice synthesis models (e.g., TTS, voice cloning, speech-to-speech).
  • Build and fine-tune models using frameworks like PyTorch and HuggingFace.
  • Design training pipelines and datasets for scalable voice model training.
  • Explore techniques for emotional expressiveness, multilingual synthesis, and speaker adaptation.
  • Work closely with product and creative teams to ensure models meet quality and production constraints.
  • Stay on top of academic and industrial trends in speech synthesis and related fields.

Requirements

  • Strong background in machine learning and deep learning, with a focus on speech/audio.
  • Hands-on experience with TTS, voice cloning, or related voice synthesis tasks.
  • Proficiency with Python and PyTorch; experience with libraries like torchaudio, ESPnet, or similar.
  • Experience training models at scale and working with large audio datasets.
  • Familiarity with vocoders and transformer-based architectures.
  • Strong problem-solving skills and the ability to work autonomously in a remote-first environment.

Nice to Have

  • PhD in Computer Science or Machine Learning, with publications in top venues.
  • Contributions to open-source speech research or participation in relevant benchmarks.
  • Familiarity with adjacent areas like lip-syncing, audio-driven animation, or expressive speech control.
  • Experience with voice datasets or proprietary pipelines.

BRAHMA

BRAHMA is an innovative technology company focused on advancing the field of generative video models with a strong emphasis on human-centric applications. The company is dedicated to pushing the boundaries of neural rendering and avatar animation, creating lifelike talking-head videos that can be generated from text, audio, or motion signals. With a world-class team of researchers and engineers, BRAHMA is committed to developing cutting-edge solutions that enhance expression control, lip synchronization, and overall realism in video synthesis. The company operates in a remote-first environment, hiring talent across the EMEA region to collaborate on groundbreaking projects in machine learning and deep learning.

