Tagged-MRI Sequence to Audio Synthesis via Self Residual Attention Guided Heterogeneous Translator
Liu X, Xing F, Prince J, Zhuo J, Stone M, El Fakhri G, Woo J. Tagged-MRI Sequence to Audio Synthesis via Self Residual Attention Guided Heterogeneous Translator. Lecture Notes In Computer Science 2022, 13436: 376-386. PMID: 36820764, PMCID: PMC9942274, DOI: 10.1007/978-3-031-16446-0_36.Peer-Reviewed Original ResearchAudio waveformEnd-to-end deep learning frameworkAdversarial training approachDeep learning frameworkEnd-to-endTwo-dimensional spectrogramAdversarial networkIntermediate representationLearning frameworkResidual attentionDisentanglement strategyAudio synthesisDataset sizeImprove realismHeterogeneous representationsHeterogeneous translationAttentional strategiesTraining approachExperimental resultsMuscle deformationIntelligible speechMotor control theoriesTagged-MRIRelated-disordersSpeech acousticsTagged-MRI to audio synthesis with a pairwise heterogeneous deep translator
Liu X, Xing F, Stone M, Prince J, Kim J, Fakhri G, Woo J. Tagged-MRI to audio synthesis with a pairwise heterogeneous deep translator. The Journal Of The Acoustical Society Of America 2022, 151: a133-a133. DOI: 10.1121/10.0010891.Peer-Reviewed Original ResearchLatent space featuresEncoder-decoder structureCNN-based encoderSpace featuresDeep learning frameworkTagged MRI sequencesKullback-Leibler divergenceMel-spectrogramSpeech-related disordersLearning frameworkAudio synthesisAudio waveformSpeech productionKullback-LeiblerHeterogeneous representationsEvaluation strategiesIntelligible speechFrameworkTagged-MRISpeechDecodingAudioVisual movementEncodingUtterances