AI Prompt Engineering & Red Teaming (Self-directed)
From 2022 to the present, the self-directed role focuses on testing frontier LLMs through adversarial prompt crafting and jailbreak experimentation. The work evaluates model boundary behaviors, safety mechanisms, and response patterns while documenting failure modes such as hallucinations and logical inconsistencies. The goal is to generate human feedback that can support AI safety improvements and alignment training through rigorous red-teaming. • Tested multiple models (ChatGPT, Grok, Claude, Gemini) and AI agents to compare safety behavior • Designed and iterated jailbreak prompts to probe constraints and robustness • Assessed outputs for hallucination likelihood and logical or factual inconsistency • Performed prompt/response evaluation to support adversarial testing and feedback quality