2023
MedCPT: Contrastive Pre-trained Transformers with large-scale PubMed search logs for zero-shot biomedical information retrieval
Jin Q, Kim W, Chen Q, Comeau D, Yeganova L, Wilbur W, Lu Z. MedCPT: Contrastive Pre-trained Transformers with large-scale PubMed search logs for zero-shot biomedical information retrieval. Bioinformatics 2023, 39: btad651. PMID: 37930897, PMCID: PMC10627406, DOI: 10.1093/bioinformatics/btad651.
Peer-Reviewed Original Research
Concepts: Information retrieval, IR tasks, User click logs, Semantic information retrieval, Biomedical information retrieval, Biomedical knowledge acquisition, Pre-trained Transformer, Clinical decision support, Click logs, Search logs, Contrastive learning, Lexical matching, Art performance, IR systems, Semantic retrieval, Biomedical articles, Decision support, Sentence representation, Model encoder, Knowledge acquisition, Large models, Semantic evaluation, Retrieval, Transformer model, Unprecedented scale
2020
BioConceptVec: Creating and evaluating literature-based biomedical concept embeddings on a large scale
Chen Q, Lee K, Yan S, Kim S, Wei C, Lu Z. BioConceptVec: Creating and evaluating literature-based biomedical concept embeddings on a large scale. PLOS Computational Biology 2020, 16: e1007617. PMID: 32324731, PMCID: PMC7237030, DOI: 10.1371/journal.pcbi.1007617.
Peer-Reviewed Original Research
Concepts: Concept embeddings, NER tools, Learning model, Biomedical text mining applications, Advanced deep learning models, Different machine learning models, Evaluation results, Text mining applications, Deep learning models, Semantics of concepts, Machine learning models, Literature-based discovery, Concept recognition, Different machine, Protein-protein interaction prediction, PubMed abstracts, Recognition tools, Massive number, Vector representation, Biomedical concepts, Large margin, Extrinsic evaluation, Biomedical literature, Intrinsic evaluation, Semantic relatedness