AI Trainer & LLM Evaluator
Evaluated large language model outputs for accuracy and logical consistency. Conducted structured analysis to identify edge cases, annotation errors, and areas for improvement in dataset quality. Processed and cleaned CSV and JSON datasets to ensure data integrity for AI workflows. • Performed detailed reviews using specified evaluation guidelines • Conducted prompt analysis and response ranking tasks • Ensured high-quality training data for language models • Collaborated remotely with project teams