2022
Recurrent neural network models (CovRNN) for predicting outcomes of patients with COVID-19 on admission to hospital: model development and validation using electronic health record data
Rasmy L, Nigo M, Kannadath B, Xie Z, Mao B, Patel K, Zhou Y, Zhang W, Ross A, Xu H, Zhi D. Recurrent neural network models (CovRNN) for predicting outcomes of patients with COVID-19 on admission to hospital: model development and validation using electronic health record data. The Lancet Digital Health 2022, 4: e415-e425. PMID: 35466079, PMCID: PMC9023005, DOI: 10.1016/s2589-7500(22)00049-8.
Peer-Reviewed Original Research
Concepts: Light Gradient Boost Machine, Feature engineering, Gradient-boosting machine, Multiple machine learning models, Electronic health record data, Neural network-based model, Real-world datasets, Recurrent neural network model, Complex feature engineering, Machine learning models, Binary classification task, Specific feature selection, Logistic regression algorithm, Neural network model, Health record data, Recurrent neural network-based model, Binary classification model, Network-based model, Traditional machine, Extensive data preprocessing, High prediction accuracy, Multiple external datasets, Classification task, Data preprocessing, Feature selection
2019
Recognizing software names in biomedical literature using machine learning
Wei Q, Zhang Y, Amith M, Lin R, Lapeyrolerie J, Tao C, Xu H. Recognizing software names in biomedical literature using machine learning. Health Informatics Journal 2019, 26: 21-33. PMID: 31566474, PMCID: PMC7334865, DOI: 10.1177/1460458219869490.
Peer-Reviewed Original Research
Concepts: Software names, F-measure, Natural language processing methods, Biomedical literature, Word representation features, Language processing methods, Entity recognition system, Software catalog, Software repositories, Feature engineering, Biomedical software, Recognition system, Software tools, Biomedical domain, Representation features, MEDLINE abstracts, Word embeddings, Knowledge features, Manual curation, Software, Machine, Processing methods, Best system, Repository, System
2017
Entity recognition from clinical texts via recurrent neural network
Liu Z, Yang M, Wang X, Chen Q, Tang B, Wang Z, Xu H. Entity recognition from clinical texts via recurrent neural network. BMC Medical Informatics And Decision Making 2017, 17: 67. PMID: 28699566, PMCID: PMC5506598, DOI: 10.1186/s12911-017-0468-7.
Peer-Reviewed Original Research
Concepts: Recurrent neural network, Natural language processing, Entity recognition, Clinical text, Traditional machine, Neural network, Clinical natural language processing, Medical concept extraction, Hand-crafted features, Clinical entity recognition, Deep learning methods, Clinical event detection, Conditional Random Fields, Support vector machine, I2b2 NLP challenge, Performance of LSTM, Types of entities, Clinical domains, Context information, Feature engineering, Concept extraction, De-identification, Event detection, Knowledge bases, LSTM layers
2016
Chemical named entity recognition in patents by domain knowledge and unsupervised feature learning
Zhang Y, Xu J, Chen H, Wang J, Wu Y, Prakasam M, Xu H. Chemical named entity recognition in patents by domain knowledge and unsupervised feature learning. Database 2016, 2016: baw049. PMID: 27087307, PMCID: PMC4834204, DOI: 10.1093/database/baw049.
Peer-Reviewed Original Research
Concepts: Machine learning-based systems, Learning-based system, Conditional Random Fields, Domain knowledge, Entity recognition, Matthews correlation coefficient, Drug Named Entity Recognition, BioCreative V challenge, Information extraction system, Word representation features, Unsupervised feature learning, Unsupervised learning algorithm, Named Entity Recognition, Semantic type information, Support vector machine, Precision-recall curve, Brown clustering, Feature learning, Feature engineering, Unsupervised feature, Individual subtasks, Mining system, NER task, Learning algorithm, CPD task
2013
Applying active learning to high-throughput phenotyping algorithms for electronic health records data
Chen Y, Carroll R, Hinz E, Shah A, Eyler A, Denny J, Xu H. Applying active learning to high-throughput phenotyping algorithms for electronic health records data. Journal Of The American Medical Informatics Association 2013, 20: e253-e259. PMID: 23851443, PMCID: PMC3861916, DOI: 10.1136/amiajnl-2013-001945.
Peer-Reviewed Original Research
Concepts: Active learning, Unrefined features, Supervised Machine Learning Algorithms, Refined features, Phenotyping algorithm, Electronic health record data, Machine Learning Algorithms, Health record data, Venous thromboembolism, Rheumatoid arthritis, Feature engineering, Domain experts, Domain knowledge, Phenotyping tasks, Learning algorithm, Feature sets, Learning approach, Colorectal cancer, AL approach, Curve score, Passive learning approach, High-throughput phenotyping methods, Algorithm, Small set, Record data