2024
Mapping Clinical Documents to the Logical Observation Identifiers, Names and Codes (LOINC) Document Ontology using Electronic Health Record Systems Structured Metadata.
Khan H, Mosa A, Paka V, Rana M, Mandhadi V, Islam S, Xu H, McClay J, Sarker S, Rao P, Waitman L. Mapping Clinical Documents to the Logical Observation Identifiers, Names and Codes (LOINC) Document Ontology using Electronic Health Record Systems Structured Metadata. AMIA Annual Symposium Proceedings 2024, 2023: 1017-1026. PMID: 38222329, PMCID: PMC10785913.Peer-Reviewed Original ResearchConceptsDocument ontologyElectronic health recordsBag-of-words approachNatural language processing techniquesFree-text documentsLanguage processing techniquesClinical documentationLogical Observation IdentifiersText documentsStructured metadataWords approachComputational scalabilityMetadataHealth recordsEHR documentationElectronic health record fieldsProcessing techniquesOntologyDocumentsAutomated pipelineNLPScalabilityClinical careFrameworkLOINC
2022
A comparative study of pre-trained language models for named entity recognition in clinical trial eligibility criteria from multiple corpora
Li J, Wei Q, Ghiasvand O, Chen M, Lobanov V, Weng C, Xu H. A comparative study of pre-trained language models for named entity recognition in clinical trial eligibility criteria from multiple corpora. BMC Medical Informatics And Decision Making 2022, 22: 235. PMID: 36068551, PMCID: PMC9450226, DOI: 10.1186/s12911-022-01967-7.Peer-Reviewed Original ResearchConceptsPre-trained language modelsNER taskUnstructured textEntity recognitionLanguage modelNatural language processing techniquesClinical trial eligibility criteriaLanguage processing techniquesData augmentation resultsData augmentation approachDomain-specific corpusBetter performanceTransformer modelCross-validation showMultiple data sourcesEligibility criteria textBiomedical domainEmbedding modelsNER performanceAugmentation approachContextual embeddingsMeaningful informationEvaluation resultsSuch documentsProcessing techniques
2021
From Tokenization to Self-Supervision: Building a High-Performance Information Extraction System for Chemical Reactions in Patents
Wang J, Ren Y, Zhang Z, Xu H, Zhang Y. From Tokenization to Self-Supervision: Building a High-Performance Information Extraction System for Chemical Reactions in Patents. Frontiers In Research Metrics And Analytics 2021, 6: 691105. PMID: 35005421, PMCID: PMC8727901, DOI: 10.3389/frma.2021.691105.Peer-Reviewed Original ResearchEvent extractionEntity recognitionNatural language processing techniquesAccurate information extractionInformation extraction systemLanguage processing techniquesKnowledge-based rulesInformation extractionAutomatic toolEnd systemArt resultsSemantic rolesLanguage modelSelf-SupervisionFree textChemical patentsSubtask 1Reaction extractionDifferent semantic rolesHybrid approachEvent triggersProcessing techniquesSubtasksTokenizationHigh performance
2020
Opioid2FHIR: A system for extracting FHIR-compatible opioid prescriptions from clinical text
Wang J, Mathews W, Pham H, Xu H, Zhang Y. Opioid2FHIR: A system for extracting FHIR-compatible opioid prescriptions from clinical text. 2020, 00: 1748-1751. DOI: 10.1109/bibm49941.2020.9313258.Peer-Reviewed Original ResearchFast Healthcare Interoperability ResourcesInformation extractionNatural language processing techniquesLanguage processing techniquesMedical concept normalizationOpioid informationPost-processing rulesClinical decision supportManual effortConcept normalizationClinical textF-measureNLP applicationsPrescription recordsClinical data standardsData standardsDecision supportFree textProcessing toolsPrescription drug monitoring programsNational public health emergencyProcessing techniquesPrescription opioid overdoseDrug monitoring programsDrug overdose deaths
2018
Adapting Word Embeddings from Multiple Domains to Symptom Recognition from Psychiatric Notes.
Zhang Y, Li H, Wang J, Cohen T, Roberts K, Xu H. Adapting Word Embeddings from Multiple Domains to Symptom Recognition from Psychiatric Notes. AMIA Joint Summits On Translational Science Proceedings 2018, 2017: 281-289. PMID: 29888086, PMCID: PMC5961810.Peer-Reviewed Original ResearchWord embeddingsClinical textTarget domainSource domainNatural language processing techniquesLanguage processing techniquesMultiple word embeddingsBaseline methodsBiomedical literatureFirst workProcessing techniquesEmbeddingPsychiatric notesMultiple domainsExperimental resultsDifferent weightsSuch informationImportant topicRecognitionDifferent approachesWikipediaInformationPersonalizationDomainText