Calibrating Multi-modal Representations: A Pursuit of Group Robustness without Annotations
You C, Min Y, Dai W, Sekhon J, Staib L, Duncan J. Calibrating Multi-modal Representations: A Pursuit of Group Robustness without Annotations. 2015 IEEE Conference On Computer Vision And Pattern Recognition (CVPR) 2024, 00: 26140-26150. PMID: 39640960, PMCID: PMC11620289, DOI: 10.1109/cvpr52733.2024.02470.Peer-Reviewed Original ResearchDiverse downstream tasksVision-language modelsPre-trained modelsRepresentation of samplesContrastive learningDownstream tasksFeature reweightingTraining dataFeature patternsModel generalizationGroup annotationsPain pointsGroup labelsAnnotationRobustnessClassifierClipsFeaturesDeepDeploymentBenchmarksTime-intensiveCodeTaskLearning