2014
PhenDisco: phenotype discovery system for the database of genotypes and phenotypes
Doan S, Lin K, Conway M, Ohno-Machado L, Hsieh A, Feupe S, Garland A, Ross M, Jiang X, Farzaneh S, Walker R, Alipanah N, Zhang J, Xu H, Kim H. PhenDisco: phenotype discovery system for the database of genotypes and phenotypes. Journal Of The American Medical Informatics Association 2014, 21: 31-36. PMID: 23989082, PMCID: PMC3912702, DOI: 10.1136/amiajnl-2013-001882.Peer-Reviewed Original ResearchConceptsNew information retrieval systemInformation retrieval systemsInformation retrieval toolsDatabase of GenotypesText processing toolsRetrieval systemSearch scenariosDiscovery systemRetrieval toolsAuthorized usersNon-standardized wayCross-study validationSearch comparisonProcessing toolsPromising performanceUsersPhenotype informationDatabaseInformationBiotechnology InformationQueriesMetadataEntrezResourcesSystem
2011
Detecting abbreviations in discharge summaries using machine learning methods.
Wu Y, Rosenbloom S, Denny J, Miller R, Mani S, Giuse D, Xu H. Detecting abbreviations in discharge summaries using machine learning methods. AMIA Annual Symposium Proceedings 2011, 2011: 1541-9. PMID: 22195219, PMCID: PMC3243185.Peer-Reviewed Original ResearchConceptsNatural language processingMachine learning methodsHighest F-measureF-measureClinical natural language processingLexical resourcesClinical abbreviationsTraining setPre-defined featuresRandom forest classifierDomain expertsML algorithmsML classifiersLanguage processingVoting schemeLearning methodsDischarge summariesForest classifierTest setClassifierCorpus-based methodSetResourcesAlgorithmAbbreviations