2024
Advancing entity recognition in biomedicine via instruction tuning of large language models
Keloth V, Hu Y, Xie Q, Peng X, Wang Y, Zheng A, Selek M, Raja K, Wei C, Jin Q, Lu Z, Chen Q, Xu H. Advancing entity recognition in biomedicine via instruction tuning of large language models. Bioinformatics 2024, 40: btae163. PMID: 38514400, PMCID: PMC11001490, DOI: 10.1093/bioinformatics/btae163.Peer-Reviewed Original ResearchNamed Entity RecognitionSequence labeling taskNatural language processingBiomedical NER datasetsLanguage modelNER datasetsEntity recognitionLabeling taskText generationField of natural language processingBiomedical NERFew-shot learning capabilityReasoning tasksMulti-domain scenariosDomain-specific modelsEnd-to-endMinimal fine-tuningSOTA performanceF1 scoreHealthcare applicationsBiomedical entitiesBiomedical domainLanguage processingMulti-taskingPubMedBERT model
2023
Opportunities and challenges for ChatGPT and large language models in biomedicine and health
Tian S, Jin Q, Yeganova L, Lai P, Zhu Q, Chen X, Yang Y, Chen Q, Kim W, Comeau D, Islamaj R, Kapoor A, Gao X, Lu Z. Opportunities and challenges for ChatGPT and large language models in biomedicine and health. Briefings In Bioinformatics 2023, 25: bbad493. PMID: 38168838, PMCID: PMC10762511, DOI: 10.1093/bib/bbad493.Peer-Reviewed Original ResearchConceptsLarge language modelsLanguage modelSensitive patient dataBiomedical information retrievalText generation tasksInformation retrievalPrivacy concernsDomain expertsInformation extractionText summarizationBiomedical domainArt methodsDiverse applicationsPrevious stateBiomedical researchersGeneration taskPatient dataSuch methodsTaskDistinct complexityGeneration capabilityExtensive literature surveySummarizationRecent rapid progressChallenges
2019
BioWordVec, improving biomedical word embeddings with subword information and MeSH
Zhang Y, Chen Q, Yang Z, Lin H, Lu Z. BioWordVec, improving biomedical word embeddings with subword information and MeSH. Scientific Data 2019, 6: 52. PMID: 31076572, PMCID: PMC6510737, DOI: 10.1038/s41597-019-0055-0.Peer-Reviewed Original ResearchConceptsWord embeddingsSubword informationWord representationsBiomedical natural language processingNatural language processingMultiple NLP tasksBiomedical word embeddingsInformation retrievalUnlabeled textBiomedical textText miningBiomedical domainLanguage processingNLP tasksStructured resourcesChallenging taskPrevious stateBenchmarking resultsLarge corpusEmbeddingWord levelBioWordVecSuch informationTaskInformation
2018
Sentence Similarity Measures Revisited
Chen Q, Kim S, Wilbur W, Lu Z. Sentence Similarity Measures Revisited. 2018, 531-532. DOI: 10.1145/3233547.3233640.Peer-Reviewed Original ResearchSentence similaritySimilarity measureNatural language processingMultiple similarity measuresSentence similarity measureNDCG scoresText summarizationBiomedical domainLanguage processingLarge-scale benchmark setPubMed abstractsComputational biologySemantic measuresBenchmark setExperimental resultsSummarizationSentencesDatasetCrucial componentDocumentsProcessingSimilaritySet