Vision-language foundation model for generalizable nasal disease diagnosis using unlabeled endoscopic records
Liu X, Gong W, Chen X, Li Z, Liu Y, Wang L, Liu Q, Sun X, Liu X, Chen X, Shi Y, Yu H. Vision-language foundation model for generalizable nasal disease diagnosis using unlabeled endoscopic records. Pattern Recognition 2025, 165: 111646. DOI: 10.1016/j.patcog.2025.111646.Books
Labeled dataGeneralization performanceExpert annotationsArtificial intelligencePre-training datasetSuperior generalization performanceState-of-the-artMedical artificial intelligencePerformance of AI modelsNasal endoscopic imagesLearning frameworkAI modelsMultiple imagesSemantic representationDiagnostic tasksFine-tuningTask-specificUniversal representationDatasetExperimental resultsDisease classificationEndoscopic imagesDiagnosis of diseasesAnnotationFoundation model