2024
Development of Clinical NLP Systems
Xu H, Demner Fushman D. Development of Clinical NLP Systems. Cognitive Informatics In Biomedicine And Healthcare 2024, 301-324. DOI: 10.1007/978-3-031-55865-8_11.Peer-Reviewed Original Research
2023
Towards precise PICO extraction from abstracts of randomized controlled trials using a section-specific learning approach
Hu Y, Keloth V, Raja K, Chen Y, Xu H. Towards precise PICO extraction from abstracts of randomized controlled trials using a section-specific learning approach. Bioinformatics 2023, 39: btad542. PMID: 37669123, PMCID: PMC10500081, DOI: 10.1093/bioinformatics/btad542.Peer-Reviewed Original ResearchNatural language processingMicro-F1 scoreCOVID-19 datasetNLP pipelineF1 scoreEntity recognition modelAD datasetPICO elementsSentence classificationNER modelRecognition modelLanguage processingLearning approachLearning modelEnd evaluationSupplementary dataDatasetPipelineExtractionInformationRCT abstractsAnnotationSentencesBioinformaticsComplexity
2020
Efficient and Accurate Extracting of Unstructured EHRs on Cancer Therapy Responses for the Development of RECIST Natural Language Processing Tools: Part I, the Corpus
Li Y, Luo Y, Wampfler J, Rubinstein S, Tiryaki F, Ashok K, Warner J, Xu H, Yang P. Efficient and Accurate Extracting of Unstructured EHRs on Cancer Therapy Responses for the Development of RECIST Natural Language Processing Tools: Part I, the Corpus. JCO Clinical Cancer Informatics 2020, 4: cci.19.00147. PMID: 32364754, PMCID: PMC7265793, DOI: 10.1200/cci.19.00147.Peer-Reviewed Original ResearchConceptsNatural language processing toolsElectronic health recordsLanguage processing toolsGold standard dataUnstructured electronic health recordsProcessing toolsAmount of dataClinical notesStandard dataMayo Clinic electronic health recordsClinic's electronic health recordEnvironment toolsAccurate annotationHealth recordsInformatics toolsEffective analysisData setsTextual sourcesCorpusToolInformationData extractionSetExtractingAnnotation
2019
Cost-aware active learning for named entity recognition in clinical text
Wei Q, Chen Y, Salimi M, Denny J, Mei Q, Lasko T, Chen Q, Wu S, Franklin A, Cohen T, Xu H. Cost-aware active learning for named entity recognition in clinical text. Journal Of The American Medical Informatics Association 2019, 26: 1314-1322. PMID: 31294792, PMCID: PMC6798575, DOI: 10.1093/jamia/ocz102.Peer-Reviewed Original ResearchConceptsAnnotation costUser studyActive learningAL methodsAL algorithmCost-CAUSEReal-world environmentsAnnotation taskAnnotation timeAnnotation accuracyEntity recognitionClinical textAnnotation dataPassive learningInformative examplesCurve scoreMost approachesSimulation areaUsersSyntactic featuresLearningCost measuresAlgorithmCostAnnotation
2018
Clinical text annotation - what factors are associated with the cost of time?
Wei Q, Franklin A, Cohen T, Xu H. Clinical text annotation - what factors are associated with the cost of time? AMIA Annual Symposium Proceedings 2018, 2018: 1552-1560. PMID: 30815201, PMCID: PMC6371268.Peer-Reviewed Original ResearchConceptsAnnotation timeClinical textNatural language processing modelsClinical corpusIndividual user behaviorEntity recognition taskLanguage processing modelsPractice of annotationCharacteristics of sentencesClinical Text AnnotationText annotationsUser behaviorIndividual usersCost of timeActive learning researchRecognition taskLearning researchProcessing modelCost modelAnnotationUsersLimited workCorpusTextTask
2007
A study of abbreviations in clinical notes.
Xu H, Stetson P, Friedman C. A study of abbreviations in clinical notes. AMIA Annual Symposium Proceedings 2007, 2007: 821-5. PMID: 18693951, PMCID: PMC2655910.Peer-Reviewed Original ResearchConceptsUnified Medical Language SystemNatural language processing systemsLanguage processing systemNarrative clinical notesDetection methodClinical notesDifferent knowledge sourcesSense inventoryDomain expertsNLP systemsCorrect sensesDecision supportText corporaKnowledge sourcesError detectionProcessing systemBiomedical literatureStudy of abbreviationsLanguage systemPatient informationAmbiguity rateBetter detection methodsDatabaseAnnotationAbbreviations