Generalist
Evaluate AI systems by stress-testing them with complex prompts and reviewing outputs for accuracy and quality. Review audio and visual data by confirming captions, identifying languages, labeling entities, sourcing visual matches, and adjusting images to required formats. Ensure high standards through detailed quality checks, especially on complex content with many elements.