Remote Otter LogoRemoteOtter

AI Preference Ranking Evaluator - Remote

Posted 2 days ago
All others
Freelance
USA
$25 - $35/hour

Overview

Mercor is collaborating with a leading AI lab on a short-term project focused on improving preference ranking models for conversational AI systems. We’re seeking detail-oriented generalists—ideally with prior experience in data labeling or content evaluation—to assess and rank model outputs across a variety of domains.

In Short

  • Evaluate and compare AI-generated responses based on quality, coherence, and helpfulness
  • Assign preference rankings to pairs or sets of model outputs
  • Follow detailed labeling guidelines and adjust based on evolving criteria
  • Provide brief written explanations for ranking decisions when required
  • Flag edge cases or inconsistencies in task design or model output
  • Prior experience in data labeling, content moderation, or preference ranking tasks
  • Excellent critical thinking and reading comprehension skills
  • Comfort working with evolving guidelines and ambiguity
  • Strong attention to detail and consistency across repetitive tasks
  • Availability for regular part-time work on a weekly basis

Requirements

  • Remote and asynchronous — set your own hours
  • Expected commitment: 10–20 hours/week
  • Flexible workload depending on your availability and performance

Benefits

  • $25–35/hour depending on experience and location
  • Payments issued weekly via Stripe Connect
  • This is a freelance engagement; you’ll be classified as an independent contractor
Moonlight logo

Moonlight

Moonlight is a San Francisco-based startup dedicated to creating an engaging and empowering online tarot experience for hobbyists and professionals around the world. The company aims to make tarot accessible and enjoyable, offering innovative features such as real-time multiplayer tarot rooms, a professional reader marketplace, and a digital deck marketplace. With a focus on collaboration and technical excellence, Moonlight is building a platform that fosters meaningful connections through tarot, while providing a flexible and remote-friendly work environment for its team.

Share This Job!

Save This Job!

Similar Jobs:

TSMG logo

AI Agent Evaluator - Remote

TSMG

76 weeks ago

Join our team as an AI Agent Evaluator, reviewing and evaluating AI responses to ensure safety and quality.

Worldwide
Full-time
All others
TSMG logo

AI Agent Evaluator - Remote

TSMG

76 weeks ago

Join our team as an AI Agent Evaluator, reviewing and evaluating AI responses to ensure safety and quality.

Worldwide
Full-time
All others
TSMG logo

AI Agent Evaluator - Remote

TSMG

76 weeks ago

Join our team as an AI Agent Evaluator, reviewing and evaluating AI responses to ensure safety and quality.

Worldwide
Full-time
All others
TSMG logo

AI Agent Evaluator - Remote

TSMG

76 weeks ago

Join our team as an AI Agent Evaluator, reviewing and evaluating AI responses to ensure safety and quality.

Worldwide
Full-time
All others
TSMG logo

AI Agent Evaluator - Remote

TSMG

76 weeks ago

Join our team as an AI Agent Evaluator, reviewing and evaluating AI responses to ensure safety and quality.

Worldwide
Full-time
All others