2024
Artificial intelligence-powered pharmacovigilance: A review of machine and deep learning in clinical text-based adverse drug event detection for benchmark datasets
Li Y, Tao W, Li Z, Sun Z, Li F, Fenton S, Xu H, Tao C. Artificial intelligence-powered pharmacovigilance: A review of machine and deep learning in clinical text-based adverse drug event detection for benchmark datasets. Journal Of Biomedical Informatics 2024, 152: 104621. PMID: 38447600, DOI: 10.1016/j.jbi.2024.104621.Peer-Reviewed Original ResearchNamed-entity recognitionEnd-to-end tasksEnd-to-endMachine learningBenchmark datasetsAdverse drug event extractionNamed-entity recognition taskLearning modelsAdverse drug event detectionBidirectional Encoder RepresentationsDeep learning techniquesDeep learning methodsDeep learning modelsEffectiveness of machine learningDeep learning methodologyMachine learning modelsSocial media dataEncoder RepresentationsEvent detectionDeep learningLearning techniquesMultilayer perceptronLearning methodsMedia dataRC task
2020
Relation Extraction from Clinical Narratives Using Pre-trained Language Models.
Wei Q, Ji Z, Si Y, Du J, Wang J, Tiryaki F, Wu S, Tao C, Roberts K, Xu H. Relation Extraction from Clinical Narratives Using Pre-trained Language Models. AMIA Annual Symposium Proceedings 2020, 2019: 1236-1245. PMID: 32308921, PMCID: PMC7153059.Peer-Reviewed Original ResearchConceptsPre-trained language modelsNatural language processingLanguage modelRE tasksNLP tasksClinical narrativesRecent deep learning methodsDeep learning methodsClinical NLP tasksRelation extraction taskTraditional word embeddingsTraditional machineExtraction taskArt performanceRelation extractionBERT modelLanguage processingLearning methodsWord embeddingsShared TaskPrevious stateBiomedical literatureDifferent implementationsTaskOpen domain
2019
Time-sensitive clinical concept embeddings learned from large electronic health records
Xiang Y, Xu J, Si Y, Li Z, Rasmy L, Zhou Y, Tiryaki F, Li F, Zhang Y, Wu Y, Jiang X, Zheng W, Zhi D, Tao C, Xu H. Time-sensitive clinical concept embeddings learned from large electronic health records. BMC Medical Informatics And Decision Making 2019, 19: 58. PMID: 30961579, PMCID: PMC6454598, DOI: 10.1186/s12911-019-0766-3.Peer-Reviewed Original ResearchConceptsConcept similarity measurePositive pointwise mutual informationConcept embeddingsSimilarity measurePredictive modeling tasksLarge electronic health recordTime-sensitive informationPointwise mutual informationImportant research areaDeep learningElectronic health recordsMedical domainLarge electronic health record databaseWord2vec embeddingsTemporal dependenciesLearning methodsFastText algorithmModeling tasksResultsOur experimentsExtrinsic evaluationIntrinsic evaluationMutual informationHealth recordsDistributional representationsEmbedding
2017
Clinical Word Sense Disambiguation with Interactive Search and Classification.
Wang Y, Zheng K, Xu H, Mei Q. Clinical Word Sense Disambiguation with Interactive Search and Classification. AMIA Annual Symposium Proceedings 2017, 2016: 2062-2071. PMID: 28269966, PMCID: PMC5333264.Peer-Reviewed Original ResearchConceptsDomain knowledgeHuman expertsWSD modelClinical textCurrent active learning methodsWord sense disambiguation systemNatural language processing applicationsMachine learning processLanguage processing applicationsWord sense disambiguationActive learning methodsContextual wordsInteractive searchWord ambiguityLearning methodsSense disambiguationProcessing applicationsAmbiguous instancesSearch processDisambiguation systemEvaluation corpusLearning processExpertsQueriesClassifier
2015
A study of active learning methods for named entity recognition in clinical text
Chen Y, Lasko T, Mei Q, Denny J, Xu H. A study of active learning methods for named entity recognition in clinical text. Journal Of Biomedical Informatics 2015, 58: 11-18. PMID: 26385377, PMCID: PMC4934373, DOI: 10.1016/j.jbi.2015.09.010.Peer-Reviewed Original ResearchConceptsClinical NER tasksMachine learningAnnotation costF-measureEntity recognitionNER taskActive learningLearning methodsI2b2/VA NLP challengeNatural language processing systemsPerformance of MLClinical natural language processing (NLP) systemsSequential labeling tasksSupervised machine learningAL methodsLanguage processing systemDiversity-based methodReal-time settingActive learning methodsNew AL methodsNER corpusDomain expertsUncertainty samplingAnnotation effortClinical textNamed Entity Recognition in Chinese Clinical Text Using Deep Neural Network.
Wu Y, Jiang M, Lei J, Xu H. Named Entity Recognition in Chinese Clinical Text Using Deep Neural Network. 2015, 216: 624-8. PMID: 26262126, PMCID: PMC4624324.Peer-Reviewed Original ResearchConceptsDeep neural networksLarge unlabeled corpusNamed Entity RecognitionWord embeddingsUnlabeled corpusUnsupervised learningEntity recognitionNeural networkNatural language processing technologyNovel deep learning methodLanguage processing technologyDeep learning methodsUnsupervised feature learningFeature engineering approachImportant healthcare informationChinese clinical textTypes of entitiesFeature learningNER taskClinical textLearning methodsClinical documentsCRF modelHealthcare informationFree text
2012
Genetic studies of complex human diseases: Characterizing SNP-disease associations using Bayesian networks
Han B, Chen X, Talebizadeh Z, Xu H. Genetic studies of complex human diseases: Characterizing SNP-disease associations using Bayesian networks. BMC Systems Biology 2012, 6: s14. PMID: 23281790, PMCID: PMC3524021, DOI: 10.1186/1752-0509-6-s3-s14.Peer-Reviewed Original ResearchMeSH KeywordsAlgorithmsAlzheimer DiseaseArtificial IntelligenceAutistic DisorderBayes TheoremComputational BiologyComputer SimulationDatabases, GeneticEpistasis, GeneticGenome-Wide Association StudyHumansMacular DegenerationMarkov ChainsModels, GeneticMonte Carlo MethodPolymorphism, Single NucleotideConceptsEpistatic interaction detectionBayesian network structure learning methodTwo-layer Bayesian networkBayesian network-based methodBayesian networkInteraction detectionMarkov chain Monte Carlo methodsStructure learning methodReal disease dataNetwork-based methodReal GWAS datasetMonte Carlo methodHigh-order epistatic interactionsMachine learningSearch spaceLearning methodsDisease datasetCarlo methodTarget nodeModel complexityStatistical methodsReal dataNew scoring functionComplex human diseasesDataset
2011
Detecting abbreviations in discharge summaries using machine learning methods.
Wu Y, Rosenbloom S, Denny J, Miller R, Mani S, Giuse D, Xu H. Detecting abbreviations in discharge summaries using machine learning methods. AMIA Annual Symposium Proceedings 2011, 2011: 1541-9. PMID: 22195219, PMCID: PMC3243185.Peer-Reviewed Original ResearchConceptsNatural language processingMachine learning methodsHighest F-measureF-measureClinical natural language processingLexical resourcesClinical abbreviationsTraining setPre-defined featuresRandom forest classifierDomain expertsML algorithmsML classifiersLanguage processingVoting schemeLearning methodsDischarge summariesForest classifierTest setClassifierCorpus-based methodSetResourcesAlgorithmAbbreviations
2006
Machine learning and word sense disambiguation in the biomedical domain: design and evaluation issues
Xu H, Markatou M, Dimova R, Liu H, Friedman C. Machine learning and word sense disambiguation in the biomedical domain: design and evaluation issues. BMC Bioinformatics 2006, 7: 334. PMID: 16822321, PMCID: PMC1550263, DOI: 10.1186/1471-2105-7-334.Peer-Reviewed Original ResearchConceptsNatural language processingBiomedical domainInformation retrieval systemsML methodsWSD classifierSense disambiguationMachine learning methodsVector machine classifierError rateWord sense disambiguationRetrieval systemMachine learningML techniquesText miningBiomedical abbreviationsLanguage processingLearning methodsCross-validation methodWSD problemMachine classifierAccurate accessSense distributionClassifierBiomolecular entitiesWSD task