AI Evaluator & Prompt Engineering Specialist (Independent / Freelance)
Evaluated AI-generated responses for logical consistency, factual accuracy, reasoning quality, and instruction adherence across technical and analytical domains. Designed and iterated structured prompts for engineering analysis and multi-step workflow testing based on output quality. Detected and documented hallucinations, weak reasoning patterns, and inconsistencies, providing structured feedback for model refinement. • Logical consistency and factual accuracy checks • Instruction adherence and reasoning quality evaluation • Comparative analysis versus expected STEM and knowledge outcomes • Prompt iteration based on model output performance