2024
Ascle—A Python Natural Language Processing Toolkit for Medical Text Generation: Development and Evaluation Study
Yang R, Zeng Q, You K, Qiao Y, Huang L, Hsieh C, Rosand B, Goldwasser J, Dave A, Keenan T, Ke Y, Hong C, Liu N, Chew E, Radev D, Lu Z, Xu H, Chen Q, Li I. Ascle—A Python Natural Language Processing Toolkit for Medical Text Generation: Development and Evaluation Study. Journal Of Medical Internet Research 2024, 26: e60601. PMID: 39361955, PMCID: PMC11487205, DOI: 10.2196/60601.
Peer-Reviewed Original Research
Concepts: Natural language processing; Natural language processing toolkit; Question-answering task; Language model; Text generation; Text processing; Domain-specific language models; Natural language processing functions; Minimal programming expertise; Text generation tasks; Medical knowledge graph; Machine translation tasks; ROUGE-L score; Domain-specific challenges; All-in-one solution; ROUGE-L; Text summarization; BLEU score; Knowledge graph; Machine translation; Unstructured text; Question-answering; Hugging Face; Processing toolkit; Language processing
2023
MedCPT: Contrastive Pre-trained Transformers with large-scale PubMed search logs for zero-shot biomedical information retrieval
Jin Q, Kim W, Chen Q, Comeau D, Yeganova L, Wilbur W, Lu Z. MedCPT: Contrastive Pre-trained Transformers with large-scale PubMed search logs for zero-shot biomedical information retrieval. Bioinformatics 2023, 39: btad651. PMID: 37930897, PMCID: PMC10627406, DOI: 10.1093/bioinformatics/btad651.
Peer-Reviewed Original Research
MeSH Keywords: Information Storage and Retrieval; Language; Natural Language Processing; PubMed; Review Literature as Topic; Semantics
Concepts: Information retrieval; IR tasks; User click logs; Semantic information retrieval; Biomedical information retrieval; Biomedical knowledge acquisition; Pre-trained Transformer; Clinical decision support; Click logs; Search logs; Contrastive learning; Lexical matching; Art performance; IR systems; Semantic retrieval; Biomedical articles; Decision support; Sentence representation; Model encoder; Knowledge acquisition; Large models; Semantic evaluation; Retrieval; Transformer model; Unprecedented scale
2021
Artificial Intelligence in Action: Addressing the COVID-19 Pandemic with Natural Language Processing
Chen Q, Leaman R, Allot A, Luo L, Wei C, Yan S, Lu Z. Artificial Intelligence in Action: Addressing the COVID-19 Pandemic with Natural Language Processing. Annual Review Of Biomedical Data Science 2021, 4: 1-27. PMID: 34465169, DOI: 10.1146/annurev-biodatasci-021821-061045.
Peer-Reviewed Original Research
MeSH Keywords: Communication; COVID-19; Data Mining; Datasets as Topic; Emotions; Humans; Information Storage and Retrieval; Knowledge Discovery; Natural Language Processing; Pandemics; Periodicals as Topic; Software
Concepts: Natural language processing; Artificial intelligence; Language processing; Information needs; Literature-based discovery; Information retrieval; Entity recognition; Misinformation detection; Information overload; NLP studies; NLP tasks; Emotion analysis; Topic modeling; COVID-19 pandemic; Intelligence; Additional tasks; Human language; Public health measures; Task; Health measures; Processing; Serious health effects; Health effects; Retrieval; Dataset
2019
ML-Net: multi-label classification of biomedical texts with deep neural networks
Du J, Chen Q, Peng Y, Xiang Y, Tao C, Lu Z. ML-Net: multi-label classification of biomedical texts with deep neural networks. Journal Of The American Medical Informatics Association 2019, 26: 1279-1285. PMID: 31233120, PMCID: PMC7647240, DOI: 10.1093/jamia/ocz085.
Peer-Reviewed Original Research
MeSH Keywords: Benchmarking; Classification; Computational Biology; Data Mining; Deep Learning; Machine Learning; Natural Language Processing; Neural Networks, Computer
Concepts: Multi-label classification; ML-Net; Biomedical text; End deep learning framework; Multi-label text classification; Deep learning framework; Deep neural networks; Traditional machine; Document context; Feature engineering; Text classification; Textual documents; Machine learning; Novel end; Learning framework; Prediction network; Individual classifiers; Neural network; Human effort; Target documents; F-measure; Art methods; Prediction mechanism; Contextual information; Label counts
BioWordVec, improving biomedical word embeddings with subword information and MeSH
Zhang Y, Chen Q, Yang Z, Lin H, Lu Z. BioWordVec, improving biomedical word embeddings with subword information and MeSH. Scientific Data 2019, 6: 52. PMID: 31076572, PMCID: PMC6510737, DOI: 10.1038/s41597-019-0055-0.
Peer-Reviewed Original Research
Concepts: Word embeddings; Subword information; Word representations; Biomedical natural language processing; Natural language processing; Multiple NLP tasks; Biomedical word embeddings; Information retrieval; Unlabeled text; Biomedical text; Text mining; Biomedical domain; Language processing; NLP tasks; Structured resources; Challenging task; Previous state; Benchmarking results; Large corpus; Embedding; Word level; BioWordVec; Such information; Task; Information