Expert of Experts Verification and Alignment (EVAL) Framework for Large Language Models Safety in Gastroenterology
Giuffrè M, You K, Pang Z, Kresevic S, Chung S, Chen R, Ko Y, Chan C, Saarinen T, Ajcevic M, Crocè L, Garcia-Tsao G, Gralnek I, Sung J, Barkun A, Laine L, Sekhon J, Stadie B, Shung D. Expert of Experts Verification and Alignment (EVAL) Framework for Large Language Models Safety in Gastroenterology. Npj Digital Medicine 2025, 8: 242. PMID: 40319106, PMCID: PMC12049514, DOI: 10.1038/s41746-025-01589-z.
Peer-Reviewed Original Research
Keywords: Reward model, Similarity-based ranking, Zero-shot baseline, Supervised fine-tuning, Rejection sampling, Language model, Similarity metric, Model safety, Human performance, Fine-tuning, Human grading, Expert verification, Time-consuming, Decision-making, Medical decision-making, Medical questions, EVAL, Accuracy, Language, Dataset, Metrics, Assess accuracy, Reward, Verification, Sets