2024
Speech motion anomaly detection via cross-modal translation of 4D motion fields from tagged MRI
Liu X, Xing F, Zhuo J, Stone M, Prince J, El Fakhri G, Woo J. Speech motion anomaly detection via cross-modal translation of 4D motion fields from tagged MRI. Proceedings Of SPIE--the International Society For Optical Engineering 2024, 12926: 129262w-129262w-5. PMID: 39238547, PMCID: PMC11377028, DOI: 10.1117/12.3006874.Peer-Reviewed Original ResearchCross-modal translationHealthy individualsTongue cancer patientsMotion fieldOut-of-distributionOne-class SVMPatient dataAnomaly detectionAnomaly detectorCancer patientsTagged MRISpeech-related disordersGeneralization capabilityReconstruction qualitySpeech qualityArticulatory-acoustic relationsPatientsSpeech waveformTraining setInnovative treatmentsMRITest setMotion patternsArticulatory featuresTraining translators
2023
Speech Audio Synthesis from Tagged MRI and Non-negative Matrix Factorization via Plastic Transformer
Liu X, Xing F, Stone M, Zhuo J, Fels S, Prince J, El Fakhri G, Woo J. Speech Audio Synthesis from Tagged MRI and Non-negative Matrix Factorization via Plastic Transformer. Lecture Notes In Computer Science 2023, 14226: 435-445. PMID: 38651032, PMCID: PMC11034915, DOI: 10.1007/978-3-031-43990-2_41.Peer-Reviewed Original ResearchWeight mapAudio waveformEnd-to-end deep learning frameworkMatrix factorization-based approachesFactorization-based approachDeep learning frameworkNon-negative matrix factorizationEnd-to-endAdversarial trainingProcess of speech productionTwo-dimensional spectrogramConventional convolutionLearning frameworkMotion featuresTraining samplesAudio synthesisDimension expansionMatrix inputMatrix factorizationTagged MRISpeech productionTransformation modelExperimental resultsSpectrogramPlastic transformationSynthesizing audio from tongue motion during speech using tagged MRI via transformer
Liu X, Xing F, Prince J, Stone M, Fakhri G, Woo J. Synthesizing audio from tongue motion during speech using tagged MRI via transformer. Proceedings Of SPIE--the International Society For Optical Engineering 2023, 12464: 1246410-1246410-5. PMID: 38009135, PMCID: PMC10669779, DOI: 10.1117/12.2653345.Peer-Reviewed Original ResearchMotion fieldAudio waveformAdversarial training approachImprove synthesis qualityConvolutional decoderAudio dataSynthesis qualityTranslation networkData structureSpeech waveformTemporal modelTagged MRITongue motionTraining approachSpectrogramMuscle deformationSource of informationSpeechIntelligible speechFrameworkDecodingInformationPredictive informationEncodingNetwork