LLM Evaluation & Text Annotation
This project focused on large-scale text annotation and evaluation to support the training and fine-tuning of multilingual large language models. My responsibilities included rating AI-generated responses for relevance, factual accuracy, safety, and quality. I also completed prompt–response writing and supervised fine-tuning tasks to improve model instruction.