MediConfusion: Can you trust your AI radiologist? Probing the reliability of multimodal medical foundation models
2024-09-01
Full analysis loading… Code implementations, benchmark data, and reproduction guides are being assembled. Please check back shortly.
Need human evaluators for your AI research? Scale annotation with expert AI Trainers.