AI Data Labeling & LLM Response Evaluation
Annotated and evaluated large-scale text datasets to support training of natural language processing and large language models. Tasks included response ranking, intent classification, prompt-response evaluation, and quality review of AI-generated outputs. Followed strict annotation guidelines to ensure high data accuracy and consistency while contributing to improved model performance.