A multimodal vision transformer for interpretable fusion of functional and structural neuroimaging data
Bi Y, Abrol A, Fu Z, Calhoun V. A multimodal vision transformer for interpretable fusion of functional and structural neuroimaging data. Human Brain Mapping 2024, 45: e26783. PMID: 39600159, PMCID: PMC11599617, DOI: 10.1002/hbm.26783.Peer-Reviewed Original ResearchConceptsCross-attention mechanismVision transformerDeep learning modelsBrain disordersCharacteristics of schizophreniaDiagnosis of schizophreniaStructural neuroimaging dataNetwork connectivity matrixData fusion approachAttention mapsMultimodal baselinesFunctional network connectivityFuse informationDeep learningICA algorithmFusion approachGrey matter mapsAI algorithmsFunctional network connectivity matricesLeverage multiple sources of informationGray matter imagesLearning modelsMultiple sources of informationBrain imaging modalitiesNetwork connectivity