Advanced Data Annotation and RLHF for LLMs
In the Pegasus HLE project, I created advanced PhD/Master’s-level prompts to evaluate and improve LLM reasoning in high-complexity tasks. Responsibilities included writing challenging questions, crafting ideal ground-truth answers, and providing strategic hints to guide model thinking. This work supported alignment and fine-tuning for models in academic and professional domains.