All roles

Professional Evaluator - Fully Remote | Upto $35/hr Hourly

Remote · USA Full-time New today

About The Job Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey. Position: AI Model Evaluation Contractor Type: Contract Compensation: $25–$35/hour Commitment: 20 hours/week Role Responsibilities

  • Write realistic prompts reflecting professional and consumer domain-specific guidance.
  • Evaluate AI-generated responses for factual accuracy, regulatory correctness, and practical usefulness.
  • Identify fabricated claims, incorrect references, or misleading reasoning in model outputs.
  • Score and rank multiple model responses using structured rubrics across dimensions.
  • Provide written justifications with specific evidence for each evaluation.

Qualifications

Must-Have

  • Professional experience applying domain expertise in a practitioner or advisory capacity.
  • Familiarity with industry-specific standards, regulations, or clinical guidelines.
  • Strong written communication and critical reasoning skills.

Application Process (Takes 20–30 mins to complete)

  • Submit your resume to begin.
  • Complete the Model Response Evaluation assessment.

Resources & Support

  • For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome
  • For any help or support, reach out to: [email protected]

PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity. Apply tot his job Apply To this Job

Related roles

Audio Evaluator - Fully Remote | Upto $50/hr Hourly

Remote · USA Full-time

Special Investigations Unit, Investigator- Remote

Remote · USA Full-time

Healthcare Fraud Investigator - Case Development- Remote

Remote · USA Full-time

Client Service Advisor

Remote · USA Full-time

(US) Customer Success Manager, Senior Living – Remote, USA

Remote · USA Full-time

Logistics Coordinator (Entry Level)

Remote · USA Full-time

Coordinator, Talent

Remote · USA Full-time

Remote | travel logistics coordinator

Remote · USA Full-time

Remote Backend Data Entry Jobs for College Students

Remote · USA Full-time

Remote Customer Onboarding Specialist – Tech Services

Remote · USA Full-time

Bilingual NLP Engineer (Japanese)- Remote

Remote · USA Full-time

Experienced Full Stack Business Analyst – Forecasting and Operations Optimization

Remote · USA Full-time

Experienced Full Stack Customer Engineer – Dynamics CRM & Power Platform

Remote · USA Full-time

Epic Analyst, Security

Remote · USA Full-time

Experienced Virtual Data Entry Clerk – Flexible Work-from-Home Opportunity with arenaflex

Remote · USA Full-time

Book reviewer

Remote · USA Full-time

Remote Grading Assistant - Engineering Math - College of Engineering and Technology

Remote · USA Full-time

Experienced Remote Live Chat Assistant – Unlock Your Potential in a Dynamic and Flexible Role

Remote · USA Full-time

Experienced Luxury Brand Customer Service Representative + Shipping And Receiving Specialist – Join arenaflex Today

Remote · USA Full-time

Coach – Applied AI Engineering (Level 6)

Remote · USA Full-time