Graduate Research Assistant (AI & Clinical NLP) - Georgia Institute of Technology
Worked as a graduate research assistant developing LLM evaluation pipelines for clinical question-answering and medical AI assistant use cases. Created multi-stage prompt evaluation frameworks grounded in established clinical guidelines and evidence-based practices. Measured hallucination rates, calibration reliability, reasoning consistency, and fairness characteristics across healthcare datasets while collaborating on AI safety and compliance efforts. • Designed and benchmarked LLM evaluation pipelines for healthcare NLP. • Developed prompt evaluation frameworks using CDC, AHA, and clinical guidance. • Performed bias detection and fairness analysis in AI-generated clinical summaries and triage outputs. • Supported research on RAG, human-in-the-loop evaluation, and AI safety/compliance initiatives.