2024
PubTator 3.0: an AI-powered literature resource for unlocking biomedical knowledge
Wei C, Allot A, Lai P, Leaman R, Tian S, Luo L, Jin Q, Wang Z, Chen Q, Lu Z. PubTator 3.0: an AI-powered literature resource for unlocking biomedical knowledge. Nucleic Acids Research 2024, 52: W540-W546. PMID: 38572754, PMCID: PMC11223843, DOI: 10.1093/nar/gkae235.
Peer-Reviewed Original Research
Concepts: State-of-the-art AI techniques; State-of-the-art; Complex information needs; Advanced search capabilities; Pairs queries; Entity relations; Retrieval quality; Search capability; AI techniques; Literature resources; PubTator; Information needs; PubMed abstracts; Biomedical literature; Online interface; Large-scale analysis; Genetic variants; Biomedical knowledge; API; Scientific discovery; Comprehensive set; ChatGPT; Query; Verifiability; Retrieval
2020
Deep learning with sentence embeddings pre-trained on biomedical corpora improves the performance of finding similar sentences in electronic medical records
Chen Q, Du J, Kim S, Wilbur W, Lu Z. Deep learning with sentence embeddings pre-trained on biomedical corpora improves the performance of finding similar sentences in electronic medical records. BMC Medical Informatics and Decision Making 2020, 20: 73. PMID: 32349758, PMCID: PMC7191680, DOI: 10.1186/s12911-020-1044-0.
Peer-Reviewed Original Research
Concepts: End deep learning model; Encoder network; Deep learning models; Sentence embeddings; Biomedical corpora; Learning model; Random forest; Traditional machine; Text mining applications; Deep learning approach; Similar sentences; Machine learning models; High performance; Mining applications; Related datasets; Clinical notes; Learning approach; Sentence semantics; PubMed abstracts; Challenge task; Ensembled model; Best submission; Sentence pairs; Network; Test set
BioConceptVec: Creating and evaluating literature-based biomedical concept embeddings on a large scale
Chen Q, Lee K, Yan S, Kim S, Wei C, Lu Z. BioConceptVec: Creating and evaluating literature-based biomedical concept embeddings on a large scale. PLOS Computational Biology 2020, 16: e1007617. PMID: 32324731, PMCID: PMC7237030, DOI: 10.1371/journal.pcbi.1007617.
Peer-Reviewed Original Research
Concepts: Concept embeddings; NER tools; Learning model; Biomedical text mining applications; Advanced deep learning models; Different machine learning models; Evaluation results; Text mining applications; Deep learning models; Semantics of concepts; Machine learning models; Literature-based discovery; Concept recognition; Different machine; Protein-protein interaction prediction; PubMed abstracts; Recognition tools; Massive number; Vector representation; Biomedical concepts; Large margin; Extrinsic evaluation; Biomedical literature; Intrinsic evaluation; Semantic relatedness
2018
Sentence Similarity Measures Revisited
Chen Q, Kim S, Wilbur W, Lu Z. Sentence Similarity Measures Revisited. 2018, 531-532. DOI: 10.1145/3233547.3233640.
Peer-Reviewed Original Research
Concepts: Sentence similarity; Similarity measure; Natural language processing; Multiple similarity measures; Sentence similarity measure; NDCG scores; Text summarization; Biomedical domain; Language processing; Large-scale benchmark set; PubMed abstracts; Computational biology; Semantic measures; Benchmark set; Experimental results; Summarization; Sentences; Dataset; Crucial component; Documents; Processing; Similarity; Set