Everyone (Public)

🤖 AI Frontiers: Hosted by NLP/LLM Interest Group

Name: 🤖 AI Frontiers: Hosted by NLP/LLM Interest Group
Start: 2026-06-08T20:00:00.0000000Z
End: 2026-06-08T21:00:00.0000000Z
Location: Yale University

Are Multimodal LLMs Ready for Clinical Dermatology? A Real-World Evaluation in Dermatology

101 College Street

Join our mailing list to receive Zoom Link & Passcode: https://mailman.yale.edu/mailman/listinfo/nlp-llm-ig

Add event to Calendar

Abstract: Multimodal large language models (MLLMs) have shown promising performance on dermatology benchmark datasets, but their ability to support real-world clinical decision-making remains unclear. In this seminar, I will present findings from a large-scale evaluation of MLLMs across both public dermatology benchmarks and a multi-site hospital dermatology consultation cohort comprising. I will discuss model performance for differential diagnosis generation and urgent dermatology triage, the impact of clinical context on diagnostic accuracy, and key limitations related to visual grounding and context integration. These results highlight the importance of realistic evaluation frameworks for assessing the clinical readiness of AI systems in dermatology.