2024
Ascle—A Python Natural Language Processing Toolkit for Medical Text Generation: Development and Evaluation Study
Yang R, Zeng Q, You K, Qiao Y, Huang L, Hsieh C, Rosand B, Goldwasser J, Dave A, Keenan T, Ke Y, Hong C, Liu N, Chew E, Radev D, Lu Z, Xu H, Chen Q, Li I. Ascle—A Python Natural Language Processing Toolkit for Medical Text Generation: Development and Evaluation Study. Journal Of Medical Internet Research 2024, 26: e60601. PMID: 39361955, PMCID: PMC11487205, DOI: 10.2196/60601.Peer-Reviewed Original ResearchConceptsNatural language processingNatural language processing toolkitQuestion-answering taskLanguage modelText generationText processingDomain-specific language modelsNatural language processing functionsMinimal programming expertiseText generation tasksMedical knowledge graphMachine translation tasksROUGE-L scoreDomain-specific challengesAll-in-one solutionROUGE-LText summarizationBLEU scoreKnowledge graphMachine translationUnstructured textQuestion-answeringHugging FaceProcessing toolkitLanguage processing
2023
Opportunities and challenges for ChatGPT and large language models in biomedicine and health
Tian S, Jin Q, Yeganova L, Lai P, Zhu Q, Chen X, Yang Y, Chen Q, Kim W, Comeau D, Islamaj R, Kapoor A, Gao X, Lu Z. Opportunities and challenges for ChatGPT and large language models in biomedicine and health. Briefings In Bioinformatics 2023, 25: bbad493. PMID: 38168838, PMCID: PMC10762511, DOI: 10.1093/bib/bbad493.Peer-Reviewed Original ResearchConceptsLarge language modelsLanguage modelSensitive patient dataBiomedical information retrievalText generation tasksInformation retrievalPrivacy concernsDomain expertsInformation extractionText summarizationBiomedical domainArt methodsDiverse applicationsPrevious stateBiomedical researchersGeneration taskPatient dataSuch methodsTaskDistinct complexityGeneration capabilityExtensive literature surveySummarizationRecent rapid progressChallenges
2019
Evaluation of Five Sentence Similarity Models on Electronic Medical Records
Chen Q, Du J, Kim S, Wilbur W, Lu Z. Evaluation of Five Sentence Similarity Models on Electronic Medical Records. 2019, 533-533. DOI: 10.1145/3307339.3343239.Peer-Reviewed Original ResearchSentence similarity modelSimilarity modelLarge biomedical corporaLarge public datasetsTraditional machineClinical domainsBiomedical corporaText summarizationBidirectional transformersPublic datasetsSemantic similaritySmall datasetsSentence similarityDataset consistingSentence pairsDatasetElectronic medical recordsPrimary applicationCNNSummarizationBERTVital roleMachineDomainEmbedding
2018
Sentence Similarity Measures Revisited
Chen Q, Kim S, Wilbur W, Lu Z. Sentence Similarity Measures Revisited. 2018, 531-532. DOI: 10.1145/3233547.3233640.Peer-Reviewed Original ResearchSentence similaritySimilarity measureNatural language processingMultiple similarity measuresSentence similarity measureNDCG scoresText summarizationBiomedical domainLanguage processingLarge-scale benchmark setPubMed abstractsComputational biologySemantic measuresBenchmark setExperimental resultsSummarizationSentencesDatasetCrucial componentDocumentsProcessingSimilaritySet