Remote Otter LogoRemoteOtter

Voice AI Evaluation Lead - Remote

Posted 8 weeks ago
Software Development
Full Time
CA, USA

Overview

Deepgram is looking for a Voice AI Evaluation Lead to take ownership of how we benchmark and evaluate the performance of our voice AI models. This role is pivotal to the integrity and impact of our AI offerings.

In Short

  • Build and maintain scalable benchmarking pipelines for model evaluations.
  • Run regular evaluations of production and pre-release models.
  • Partner with Research, Data, and Engineering teams for new methodologies.
  • Design and refine evaluation metrics reflecting product goals.
  • Author comprehensive model cards and internal reports.
  • Work closely with Data Labeling Ops for evaluation datasets.
  • Collaborate with QA Engineers for model tests integration.
  • Support Marketing and Product with data-backed comparisons.
  • Track market developments and maintain competitive benchmarks.
  • Support GTM teams with benchmarking best practices.

Requirements

  • Experience designing evaluation pipelines for ML models.
  • Proficiency in Python and data analysis libraries.
  • Ability to develop automated evaluation systems.
  • Comfort with large-scale datasets and performance metrics.
  • Experience using LLMs for analysis or pipeline prototyping.
  • Strong communication skills for translating data insights.
  • Proven success working cross-functionally.

Benefits

  • Opportunity to work on cutting-edge technology.
  • Collaborative and inclusive work environment.
  • Equal opportunity employer.
  • Support for applicants needing accommodations.
Deepgram logo

Deepgram

Deepgram is a pioneering AI company dedicated to revolutionizing human-machine interaction through natural language processing. They provide developers with access to a powerful voice AI platform that includes advanced models for speech-to-text, text-to-speech, and spoken language understanding via a simple API call. With a focus on applications ranging from transcription to sentiment analysis and voice synthesis, Deepgram is the go-to partner for those building innovative voice AI solutions. Backed by notable investors and having raised over $85 million in funding, Deepgram is committed to fostering a diverse and inclusive workplace while driving significant advancements in the AI industry.

Share This Job!

Save This Job!

Similar Jobs:

Kiddom logo

Impact & Evaluation Leader - Remote

Kiddom

19 weeks ago

Kiddom is seeking an Impact & Evaluation Leader to validate educational products and enhance student learning outcomes.

USA
Full-time
All others
P 1ai logo

AI Evaluation Engineer - Remote

P 1ai

11 weeks ago

Join P-1 AI as an AI Evaluation Engineer to develop and implement evaluation benchmarks for our AI systems.

USA, Canada
Full-time
Software Development
Superside logo

AI Motion Lead - Remote

Superside

16 weeks ago

Join Superside as an AI Motion Lead to innovate motion design production through AI integration.

Worldwide
Full-time
Design
Rocket Lawyer logo

AI Legal Evaluator - Remote

Rocket Lawyer

15 weeks ago

Rocket Lawyer is looking for an AI Legal Evaluator to enhance and evaluate their AI-driven legal solutions.

Worldwide
Full-time
All others
£23,500 - £37,250 GBP/year

Join Altarum as an Evaluation Analyst to support projects improving population health and addressing health equity.

USA
Full-time
Data Analysis