Remote Otter LogoRemoteOtter

Evaluation Scenario Writer - QA - Remote

Posted Yesterday
QA
Part Time
USA

Overview

At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI.

In Short

  • Flexible, project-based opportunity for QA Evaluation Scenario Writer.
  • Focus on ensuring the quality and correctness of evaluation scenarios for LLM agents.
  • Blend of manual scenario validation and automated test thinking.
  • Collaborate with writers and engineers.
  • Review and validate test scenarios from Evaluation Writers.
  • Spot logical inconsistencies and suggest improvements.
  • Opportunity for ongoing collaboration on future projects.
  • Open to analysts, researchers, consultants, and students.
  • Work on your own schedule.
  • Contribute to a project aligned with your skills.

Requirements

  • Strong critical thinking skills.
  • Experience in QA or related fields.
  • Ability to work with ambiguity and complexity.
  • Interest in AI systems testing and evaluation.
  • Proactive and detail-oriented mindset.
  • Ability to collaborate effectively.

Benefits

  • Flexible working hours.
  • Opportunity to shape the future of AI.
  • Potential for ongoing projects based on performance.
  • Work from anywhere.
Mindrift logo

Mindrift

Mindrift is an innovative platform at the forefront of artificial intelligence development, dedicated to advancing the field through collaborative online projects. The company provides a unique opportunity for freelancers to contribute to Generative AI by creating data and refining AI responses, all from the comfort of their own locations. Mindrift emphasizes the importance of collective intelligence in ethically shaping the future of AI, allowing users to engage in diverse tasks that enhance AI capabilities. With a focus on making AI models more adept at complex reasoning and specialized inquiries, Mindrift fosters an inclusive environment where individuals can participate in meaningful projects that align with their professional commitments.

Share This Job!

Save This Job!

Similar Jobs:

Mindrift logo

Evaluation Scenario Writer - AI Agent Testing Specialist - Remote

Mindrift

6 weeks ago

Join Mindrift as an Evaluation Scenario Writer to design and test evaluation scenarios for AI agents.

Mexico
Contract
All others
Flat Branch Home Loans logo

Scenario Desk Underwriter - Remote

Flat Branch Home Loans

19 weeks ago

Join our team as a Scenario Desk Underwriter to manage complex mortgage inquiries and ensure accurate underwriting decisions.

USA
Full-time
Finance / Legal
P 1ai logo

AI Evaluation Engineer - Remote

P 1ai

20 weeks ago

Join P-1 AI as an AI Evaluation Engineer to develop and implement evaluation benchmarks for our AI systems.

USA, Canada
Full-time
Software Development