2023
Opportunities and challenges for ChatGPT and large language models in biomedicine and health
Tian S, Jin Q, Yeganova L, Lai P, Zhu Q, Chen X, Yang Y, Chen Q, Kim W, Comeau D, Islamaj R, Kapoor A, Gao X, Lu Z. Opportunities and challenges for ChatGPT and large language models in biomedicine and health. Briefings In Bioinformatics 2023, 25: bbad493. PMID: 38168838, PMCID: PMC10762511, DOI: 10.1093/bib/bbad493.Peer-Reviewed Original ResearchConceptsLarge language modelsLanguage modelSensitive patient dataBiomedical information retrievalText generation tasksInformation retrievalPrivacy concernsDomain expertsInformation extractionText summarizationBiomedical domainArt methodsDiverse applicationsPrevious stateBiomedical researchersGeneration taskPatient dataSuch methodsTaskDistinct complexityGeneration capabilityExtensive literature surveySummarizationRecent rapid progressChallengesMedCPT: Contrastive Pre-trained Transformers with large-scale PubMed search logs for zero-shot biomedical information retrieval
Jin Q, Kim W, Chen Q, Comeau D, Yeganova L, Wilbur W, Lu Z. MedCPT: Contrastive Pre-trained Transformers with large-scale PubMed search logs for zero-shot biomedical information retrieval. Bioinformatics 2023, 39: btad651. PMID: 37930897, PMCID: PMC10627406, DOI: 10.1093/bioinformatics/btad651.Peer-Reviewed Original ResearchMeSH KeywordsInformation Storage and RetrievalLanguageNatural Language ProcessingPubMedReview Literature as TopicSemanticsConceptsInformation retrievalIR tasksUser click logsSemantic information retrievalBiomedical information retrievalBiomedical knowledge acquisitionPre-trained TransformerClinical decision supportClick logsSearch logsContrastive learningLexical matchingArt performanceIR systemsSemantic retrievalBiomedical articlesDecision supportSentence representationModel encoderKnowledge acquisitionLarge modelsSemantic evaluationRetrievalTransformer modelUnprecedented scale
2021
Artificial Intelligence in Action: Addressing the COVID-19 Pandemic with Natural Language Processing
Chen Q, Leaman R, Allot A, Luo L, Wei C, Yan S, Lu Z. Artificial Intelligence in Action: Addressing the COVID-19 Pandemic with Natural Language Processing. Annual Review Of Biomedical Data Science 2021, 4: 1-27. PMID: 34465169, DOI: 10.1146/annurev-biodatasci-021821-061045.Peer-Reviewed Original ResearchMeSH KeywordsCommunicationCOVID-19Data MiningDatasets as TopicEmotionsHumansInformation Storage and RetrievalKnowledge DiscoveryNatural Language ProcessingPandemicsPeriodicals as TopicSoftwareConceptsNatural language processingArtificial intelligenceLanguage processingInformation needsLiterature-based discoveryInformation retrievalEntity recognitionMisinformation detectionInformation overloadNLP studiesNLP tasksEmotion analysisTopic modelingCOVID-19 pandemicIntelligenceAdditional tasksHuman languagePublic health measuresTaskHealth measuresProcessingSerious health effectsHealth effectsRetrievalDataset
2020
Better synonyms for enriching biomedical search
Yeganova L, Kim S, Chen Q, Balasanov G, Wilbur W, Lu Z. Better synonyms for enriching biomedical search. Journal Of The American Medical Informatics Association 2020, 27: 1894-1902. PMID: 33083825, PMCID: PMC7727334, DOI: 10.1093/jamia/ocaa151.Peer-Reviewed Original ResearchMeSH KeywordsAlgorithmsBiomedical ResearchInformation Storage and RetrievalLinguisticsProbabilityPubMedTerminology as TopicDeep learning with sentence embeddings pre-trained on biomedical corpora improves the performance of finding similar sentences in electronic medical records
Chen Q, Du J, Kim S, Wilbur W, Lu Z. Deep learning with sentence embeddings pre-trained on biomedical corpora improves the performance of finding similar sentences in electronic medical records. BMC Medical Informatics And Decision Making 2020, 20: 73. PMID: 32349758, PMCID: PMC7191680, DOI: 10.1186/s12911-020-1044-0.Peer-Reviewed Original ResearchMeSH KeywordsData MiningDeep LearningElectronic Health RecordsHumansInformation Storage and RetrievalLanguageMachine LearningPubMedConceptsEnd deep learning modelEncoder networkDeep learning modelsSentence embeddingsBiomedical corporaLearning modelRandom forestTraditional machineText mining applicationsDeep learning approachSimilar sentencesMachine learning modelsHigh performanceMining applicationsRelated datasetsClinical notesLearning approachSentence semanticsPubMed abstractsChallenge taskEnsembled modelBest submissionSentence pairsNetworkTest set