Job Description
About the job
Mercor connects elite creative and technical talent with leading AI research labs. We are headquartered in San Francisco, and our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.
Position
Position: Language Model Evaluator
Type: Full-time or Part-time Contract Work
Compensation: $23/hour
Location: Restricted to Egypt, Saudi Arabia, UAE, and USA
Role Responsibilities
- Evaluate LLM-generated responses on their ability to effectively answer user queries.
- Conduct fact-checking using trusted public sources and external tools.
- Generate high-quality human evaluation data by annotating response strengths, areas for improvement, and factual inaccuracies.
- Assess reasoning quality, clarity, tone, and completeness of responses.