Aether Generalist
Performed high-fidelity data labeling and Reinforcement Learning from Human Feedback (RLHF) tasks to optimize multimodal AI models. Evaluated and annotated video and image datasets: identifying subtle visual discrepancies, checking temporal consistency, and tagging entities. Authored detailed "Gold Standard" justifications for model rankings, ensuring alignment with strict truthfulness, safety, and helpfulness guidelines. Consistently maintained a high quality score by auditing model hallucinations and providing granular feedback on complex reasoning tasks.