English LLM Evaluator - AI Output Review - Remote

Posted 1 week ago
All others
Contract
USA

Overview

Join Project Hermes 2.0, a global initiative focused on enhancing LLM output across a range of domains!

In Short

  • Review input-output pairs, assessing AI-generated responses for quality and accuracy
  • Evaluate content based on criteria such as clarity, tone, relevance, and alignment with intent
  • Provide structured responses to evaluation questions to support model improvement
  • Maintain high attention to detail and quality across all assigned tasks
  • Project-based opportunity with CrowdGen

Requirements

  • Native or near-native fluency in English
  • Strong reading comprehension and writing skills
  • Minimum commitment of 20 hours per week
  • Bachelor’s degree or relevant work experience in content evaluation, writing, linguistics, or a related field
  • High attention to detail and ability to follow detailed guidelines
  • Comfortable reviewing content that may include sensitive or potentially harmful material

Benefits

  • Make an impact on the future of AI
  • Start contributing from the comfort of your home
  • Flexible working hours
  • Opportunity to join the CrowdGen Community
  • Limited slots that are filling quickly; apply early to secure a spot

Appen

Appen is a global leader in AI enablement, specializing in critical tasks such as model improvement, supervision, and evaluation. With over 25 years of experience, Appen leverages a diverse crowd of more than one million skilled contractors from 130 countries, speaking over 180 languages and dialects. The company utilizes an advanced AI-assisted data annotation platform to collect and label various data types, including images, text, speech, audio, and video. Trusted by the world's largest technology companies, Appen plays a crucial role in building and enhancing innovative AI systems across multiple sectors, including automotive, finance, retail, healthcare, and government. Committed to creating an inclusive and diverse workplace, Appen fosters a learn-it-all culture that values growth, innovation, and customer obsession.


Similar Jobs:

Music Evaluator – English - Remote

CrowdGen by Appen

11 weeks ago

Join the CrowdGen LLM Pre-Qualification Community to gain access to high-paying LLM projects and enhance your skills.

USA
Freelance
All others

Danish LLM Evaluator - Remote

CrowdGen by Appen

8 weeks ago

Join Project Spearmint as a Danish LLM Evaluator to assess AI model outputs focused on Tone and Fluency.

Denmark
Contract
All others

AI Response Evaluator - Remote

Welocalize

41 weeks ago

Join us as an AI Response Evaluator to enhance AI technologies and gain valuable experience in a flexible freelance role.

Worldwide
Freelance
All others
15 USD/hour

AI Response Evaluator - Remote

Welocalize

42 weeks ago

Join us as an AI Response Evaluator to enhance AI technologies and gain valuable experience in a flexible freelance role.

Worldwide
Freelance
All others