RemoteOtter

AI Language Model Task Evaluator - Remote

Posted Yesterday
Writing
Contract
USA, UK, Canada, Australia

Overview

This role involves enhancing large-language-model performance on realistic reasoning tasks by creating and evaluating tasks and solutions.

In Short

  • Write challenging, creative, and realistic tasks for AI models to solve
  • Write gold-standard solutions to those tasks
  • Evaluate AI-generated responses for clarity, accuracy, and comprehensiveness
  • Collaborate directly with Mercor coordinators and the client’s research leads
  • Availability of up to 40 hours of work per week
  • Fully remote and asynchronous work, flexible to your schedule
  • Project length is a minimum of 6 weeks with potential extensions
  • Legally classified as an hourly contractor for Mercor
  • Payment at the end of each week via Stripe Connect
  • Complete a short interview and participate in a paid work trial

Requirements

  • Experience in technical writing, editing, research analysis, or related fields
  • Demonstrable experience creating Q&A, summaries, or problem-solving tasks
  • Ability to read deeply and synthesize information
  • Strong fact-checking and editing skills
  • Analytical judgment to spot inconsistencies in draft outputs
  • Clear and concise written communication with high attention to detail

Benefits

  • Flexible work schedule
  • Remote work opportunity
  • Weekly payment
  • Opportunity for project extensions
  • Experience with top AI labs

Mercor

HelixRecruit is a forward-thinking recruitment firm specializing in connecting talent with innovative companies. They focus on providing opportunities for individuals to engage in data annotation projects that enhance artificial intelligence systems. With a commitment to flexibility, HelixRecruit offers remote and asynchronous work arrangements, allowing contractors to set their own schedules while contributing to meaningful projects. The company values detail-oriented generalists and encourages applicants from diverse educational backgrounds, including students and early career professionals.

