AI Language Model Task Evaluator - Remote

Posted 39 weeks ago

Writing

Contract

USA, UK, Canada, Australia

Technical Writing

Content Reviewing

Research Analysis

Communication Skills

Overview

This role involves enhancing large-language-model performance on realistic reasoning tasks by creating and evaluating tasks and solutions.

In Short

Write challenging, creative, and realistic tasks for AI models to solve
Write gold-standard solutions to those tasks
Evaluate AI-generated responses for clarity, accuracy, and comprehensiveness
Collaborate directly with Mercor coordinators and client’s research leads
Availability of up to 40 hours of work per week
Fully remote and asynchronous work, flexible to your schedule
Project length is a minimum of 6 weeks with potential extensions
Legally classified as an hourly contractor for Mercor
Payment at the end of each week via Stripe Connect
Complete a short interview and participate in a paid work trial

Requirements

Experience in technical writing, editing, research analysis, or related fields
Demonstrable experience creating Q&A, summaries, or problem-solving tasks
Ability to excel at deep reading and synthesize information
Strong fact-checking and editing skills
Analytical judgment to spot inconsistencies in draft outputs
Clear and concise written communication with high attention to detail

Benefits

Flexible work schedule
Remote work opportunity
Weekly payment
Opportunity for project extensions
Experience with top AI labs

Mercor

HelixRecruit is a forward-thinking recruitment firm specializing in connecting talent with innovative companies. They focus on providing opportunities for individuals to engage in data annotation projects that enhance artificial intelligence systems. With a commitment to flexibility, HelixRecruit offers remote and asynchronous work arrangements, allowing contractors to set their own schedules while contributing to meaningful projects. The company values detail-oriented generalists and encourages applicants from diverse educational backgrounds, including students and early career professionals.

Share This Job!

Save This Job!

Jobs from Mercor:

Medical Secretary and Administrative Assistant (Contractor)

Medical Secretary

Administrative Assistant

Asynchronous Work

Administrative Services Manager (Contractor)

Administrative Services

Project Management

Financial Manager (Independent Contractor)

Financial Management

Asynchronous Work

Computer and Information Systems Manager (Independent Contractor)

Computer AND Information Systems Management

Asynchronous Work

Securities, Commodities, and Financial Services Sales Agent

Financial Services

Mercor

HelixRecruit is a forward-thinking recruitment firm specializing in connecting talent with innovative companies. They focus on providing opportunities for individuals to engage in data annotation projects that enhance artificial intelligence systems. With a commitment to flexibility, HelixRecruit offers remote and asynchronous work arrangements, allowing contractors to set their own schedules while contributing to meaningful projects. The company values detail-oriented generalists and encourages applicants from diverse educational backgrounds, including students and early career professionals.

Share This Job!

Save This Job!

Jobs from Mercor:

Medical Secretary and Administrative Assistant (Contractor)

Medical Secretary

Administrative Assistant

Asynchronous Work

Administrative Services Manager (Contractor)

Administrative Services

Project Management

Financial Manager (Independent Contractor)

Financial Management

Asynchronous Work

Computer and Information Systems Manager (Independent Contractor)

Computer AND Information Systems Management

Asynchronous Work

Securities, Commodities, and Financial Services Sales Agent

Financial Services

Similar Jobs:

I.A

AI Language Model Evaluator and Translator - Remote

Invisible Agency

71 weeks ago

Invisible Agency

Join a remote team to evaluate and translate mathematical texts for AI language models.

Worldwide

Contract

All others

$30/hour

71 weeks ago

I.A

AI Language Model Trainer - Remote

Invisible Agency

71 weeks ago

Invisible Agency

Join a short-term project to translate mathematical texts to Indonesian and assess AI-generated translations.

Worldwide

Contract

All others

$30/hour

71 weeks ago

AI Language Data Evaluator - German Talent Hub - Remote

Welocalize

58 weeks ago

Welocalize

German Language Proficiency

Machine Translation Evaluation

Data Evaluation

Join Welo Data as an AI Language Data Evaluator to enhance AI models through evaluating German language data in a flexible, remote role.

German Language Proficiency

Machine Translation Evaluation

Data Evaluation

Germany, Remote, Worldwide, Spain

Freelance

All others

58 weeks ago

Language Model Engineer - Remote

Utilidata

80 weeks ago

Utilidata

Machine Learning

Utilidata is seeking a highly skilled Language Model Engineer to design and implement advanced AI applications for edge devices in a remote setting.

Machine Learning

USA

Full-time

Software Development

$140,000 - $170,000/year

80 weeks ago

AI Agent Evaluator - Remote

TSMG

111 weeks ago

TSMG

Flexible Schedule

Detail-oriented

Join our team as an AI Agent Evaluator, reviewing and evaluating AI responses to ensure safety and quality.

Flexible Schedule

Detail-oriented

Worldwide

Full-time

All others

111 weeks ago