2024
Introduction to Natural Language Processing of Clinical Text
Demner-Fushman D, Xu H. Introduction to Natural Language Processing of Clinical Text. Cognitive Informatics In Biomedicine And Healthcare 2024, 3-11. DOI: 10.1007/978-3-031-55865-8_1. Peer-Reviewed Original Research
Concepts: Natural language processing; Language processing; Complex language processing; Biomedical natural language processing; Clinical natural language processing; Language generation tasks; Clinical language processing; Biomedical language processing; Language model; Clinical text; Generation task; Machine learning; Delivery of information; Clinical language; Language
Development of Clinical NLP Systems
Xu H, Demner-Fushman D. Development of Clinical NLP Systems. Cognitive Informatics In Biomedicine And Healthcare 2024, 301-324. DOI: 10.1007/978-3-031-55865-8_11. Peer-Reviewed Original Research
Large language models for biomedicine: foundations, opportunities, challenges, and best practices
Sahoo S, Plasek J, Xu H, Uzuner Ö, Cohen T, Yetisgen M, Liu H, Meystre S, Wang Y. Large language models for biomedicine: foundations, opportunities, challenges, and best practices. Journal Of The American Medical Informatics Association 2024, ocae074. PMID: 38657567, DOI: 10.1093/jamia/ocae074. Peer-Reviewed Original Research
Concepts: Natural language processing; Prompt tuning; NLP applications; Language model; State-of-the-art performance; NLP practitioners; Natural language processing applications; Biomedical NLP applications; Pre-training dataset; Natural language understanding; Neural network architecture model; Natural language generation; Biomedical informatics community; Network architecture model; American Medical Informatics Association (AMIA); Prompt-tuning; Few-shot; Zero-Shot; NLP challenge; NLP tasks; Reinforcement learning; Human feedback; Language generation; Language understanding; Evaluation metrics
Ensemble pretrained language models to extract biomedical knowledge from literature
Li Z, Wei Q, Huang L, Li J, Hu Y, Chuang Y, He J, Das A, Keloth V, Yang Y, Diala C, Roberts K, Tao C, Jiang X, Zheng W, Xu H. Ensemble pretrained language models to extract biomedical knowledge from literature. Journal Of The American Medical Informatics Association 2024, ocae061. PMID: 38520725, DOI: 10.1093/jamia/ocae061. Peer-Reviewed Original Research
Concepts: Natural language processing; Natural language processing systems; Language model; Expansion of biomedical literature; Zero-shot setting; Manually annotated corpus; Knowledge graph development; Task-specific models; Domain-specific models; Zero-Shot; Entity recognition; Billion parameters; Ensemble learning; Location information; Knowledge bases; Biomedical entities; Language processing; Free text; Graph development; Biomedical concepts; Automated technique; Biomedical literature; Detection method; Predictive performance; Biomedical knowledge
Advancing entity recognition in biomedicine via instruction tuning of large language models
Keloth V, Hu Y, Xie Q, Peng X, Wang Y, Zheng A, Selek M, Raja K, Wei C, Jin Q, Lu Z, Chen Q, Xu H. Advancing entity recognition in biomedicine via instruction tuning of large language models. Bioinformatics 2024, 40: btae163. PMID: 38514400, PMCID: PMC11001490, DOI: 10.1093/bioinformatics/btae163. Peer-Reviewed Original Research
Concepts: Named Entity Recognition; Sequence labeling task; Natural language processing; Biomedical NER datasets; Language model; NER datasets; Entity recognition; Labeling task; Text generation; Field of natural language processing; Biomedical NER; Few-shot learning capability; Reasoning tasks; Multi-domain scenarios; Domain-specific models; End-to-end; Minimal fine-tuning; SOTA performance; F1 score; Healthcare applications; Biomedical entities; Biomedical domain; Language processing; Multi-tasking; PubMedBERT model
Prompt Tuning in Biomedical Relation Extraction
He J, Li F, Li J, Hu X, Nian Y, Xiang Y, Wang J, Wei Q, Li Y, Xu H, Tao C. Prompt Tuning in Biomedical Relation Extraction. Journal Of Healthcare Informatics Research 2024, 8: 206-224. PMID: 38681754, PMCID: PMC11052745, DOI: 10.1007/s41666-024-00162-9. Peer-Reviewed Original Research
Concepts: Few-shot scenarios; Biomedical relation extraction; Natural language processing; Biomedical RE; Relation extraction; Prompt tuning; State-of-the-art performance; Text mining applications; Tuning model; BioCreative VI; SemEval-2013; Knowledge graph; Language model; Mining applications; Biomedical text; Original input; Computational resources; Language processing; External knowledge; Specific texts; Superior performance; Dataset; Efficient approach; Task; Model performance
Improving large language models for clinical named entity recognition via prompt engineering
Hu Y, Chen Q, Du J, Peng X, Keloth V, Zuo X, Zhou Y, Li Z, Jiang X, Lu Z, Roberts K, Xu H. Improving large language models for clinical named entity recognition via prompt engineering. Journal Of The American Medical Informatics Association 2024, ocad259. PMID: 38281112, DOI: 10.1093/jamia/ocad259. Peer-Reviewed Original Research
Concepts: Clinical NER tasks; NER task; Task-specific prompts; Entity recognition; Language model; Training samples; State-of-the-art models; Few-shot learning; State-of-the-art; Minimal training data; Task-specific knowledge; F1-score; Annotated samples; Concept extraction; Model performance; Annotated datasets; Training data; F1 score; Task description; Format specifications; Complex clinical data; Optimal performance; Task; Evaluation schema; GPT model
Standardizing Multi-site Clinical Note Titles to LOINC Document Ontology: A Transformer-based Approach.
Zuo X, Zhou Y, Duke J, Hripcsak G, Shah N, Banda J, Reeves R, Miller T, Waitman L, Natarajan K, Xu H. Standardizing Multi-site Clinical Note Titles to LOINC Document Ontology: A Transformer-based Approach. AMIA Annual Symposium Proceedings 2024, 2023: 834-843. PMID: 38222429, PMCID: PMC10785935. Peer-Reviewed Original Research
2023
Leveraging Generative AI and Large Language Models: A Comprehensive Roadmap for Healthcare Integration
Yu P, Xu H, Hu X, Deng C. Leveraging Generative AI and Large Language Models: A Comprehensive Roadmap for Healthcare Integration. Healthcare 2023, 11: 2776. PMID: 37893850, PMCID: PMC10606429, DOI: 10.3390/healthcare11202776. Peer-Reviewed Original Research
Concepts: Large language models; Generative artificial intelligence; Artificial intelligence; Language model; Information retrieval; AI systems; Shot learning; Data management; Human feedback; Reinforcement learning; Information management; System implementation; Co-design process; Data acquisition; Comprehensive roadmap; Decision-making process; Learning; Technology; Full potential; Healthcare; Intelligence; Healthcare quality; Retrieval; Integration; Promising advancement
2020
Relation Extraction from Clinical Narratives Using Pre-trained Language Models.
Wei Q, Ji Z, Si Y, Du J, Wang J, Tiryaki F, Wu S, Tao C, Roberts K, Xu H. Relation Extraction from Clinical Narratives Using Pre-trained Language Models. AMIA Annual Symposium Proceedings 2020, 2019: 1236-1245. PMID: 32308921, PMCID: PMC7153059. Peer-Reviewed Original Research
Concepts: Pre-trained language models; Natural language processing; Language model; RE tasks; NLP tasks; Clinical narratives; Recent deep learning methods; Deep learning methods; Clinical NLP tasks; Relation extraction task; Traditional word embeddings; Traditional machine; Extraction task; Art performance; Relation extraction; BERT model; Language processing; Learning methods; Word embeddings; Shared Task; Previous state; Biomedical literature; Different implementations; Task; Open domain