Synthesizing audio from tongue motion during speech using tagged MRI via transformer
Liu X, Xing F, Prince J, Stone M, Fakhri G, Woo J. Synthesizing audio from tongue motion during speech using tagged MRI via transformer. Proceedings Of SPIE--the International Society For Optical Engineering 2023, 12464: 1246410-1246410-5. PMID: 38009135, PMCID: PMC10669779, DOI: 10.1117/12.2653345.
Peer-Reviewed Original Research
Motion field, Audio waveform, Adversarial training approach, Improve synthesis quality, Convolutional decoder, Audio data, Synthesis quality, Translation network, Data structure, Speech waveform, Temporal model, Tagged MRI, Tongue motion, Training approach, Spectrogram, Muscle deformation, Source of information, Speech, Intelligible speech, Framework, Decoding, Information, Predictive information, Encoding, Network