LLM Evaluation & Human Feedback for AI Training
Contributed to multiple AI training projects focused on improving large language model performance through high-quality human feedback. Responsibilities included evaluating AI-generated responses for accuracy, reasoning, relevance, tone, and instruction adherence. Provided structured ratings and written feedback aligned with detailed project guidelines to support RLHF and supervised fine-tuning workflows. Worked on prompt–response analysis, identifying logical errors, hallucinations, bias, and safety issues, and suggesting improved responses where required. Also participated in red-teaming style tasks to test model robustness and safe behavior. Demonstrated strong attention to detail, consistency across annotations, and the ability to adapt quickly to evolving task requirements and annotation standards.