Data Annotator / AI Trainer — Outlier AI (Remote, 2024 – Present)
Performed AI response evaluation on Outlier AI, rating model outputs for accuracy, helpfulness, coherence, and instruction-following to support RLHF pipelines. Wrote and evaluated prompts for technical STEM topics by verifying factual correctness, reasoning quality, and hallucination presence using engineering domain knowledge. Created preference labels by comparing paired AI responses and providing written rationales selecting the better response against task criteria. • Rated responses with rubric-based consistency across safety, accuracy, and adherence dimensions • Provided justification text to explain why one response better met the stated criteria • Adapted to new project types and updated guidelines while maintaining reproducible decisions • Maintained high precision with a zero-tolerance approach to factual errors