Deep learning with sentence embeddings pre-trained on biomedical corpora improves the performance of finding similar sentences in electronic medical records
Chen Q, Du J, Kim S, Wilbur W, Lu Z. Deep learning with sentence embeddings pre-trained on biomedical corpora improves the performance of finding similar sentences in electronic medical records. BMC Medical Informatics And Decision Making 2020, 20: 73. PMID: 32349758, PMCID: PMC7191680, DOI: 10.1186/s12911-020-1044-0.Peer-Reviewed Original ResearchMeSH KeywordsData MiningDeep LearningElectronic Health RecordsHumansInformation Storage and RetrievalLanguageMachine LearningPubMedConceptsEnd deep learning modelEncoder networkDeep learning modelsSentence embeddingsBiomedical corporaLearning modelRandom forestTraditional machineText mining applicationsDeep learning approachSimilar sentencesMachine learning modelsHigh performanceMining applicationsRelated datasetsClinical notesLearning approachSentence semanticsPubMed abstractsChallenge taskEnsembled modelBest submissionSentence pairsNetworkTest setBioConceptVec: Creating and evaluating literature-based biomedical concept embeddings on a large scale
Chen Q, Lee K, Yan S, Kim S, Wei C, Lu Z. BioConceptVec: Creating and evaluating literature-based biomedical concept embeddings on a large scale. PLOS Computational Biology 2020, 16: e1007617. PMID: 32324731, PMCID: PMC7237030, DOI: 10.1371/journal.pcbi.1007617.Peer-Reviewed Original ResearchMeSH KeywordsAlgorithmsComputational BiologyData MiningDatabases, ProteinDeep LearningDrug InteractionsElectronic Health RecordsHumansProtein Interaction MappingPublicationsPubMedSemanticsConceptsConcept embeddingsNER toolsLearning modelBiomedical text mining applicationsAdvanced deep learning modelsDifferent machine learning modelsEvaluation resultsText mining applicationsDeep learning modelsSemantics of conceptsMachine learning modelsLiterature-based discoveryConcept recognitionDifferent machineProtein-protein interaction predictionPubMed abstractsRecognition toolsMassive numberVector representationBiomedical conceptsLarge marginExtrinsic evaluationBiomedical literatureIntrinsic evaluationSemantic relatedness