From Tokenization to Self-Supervision: Building a High-Performance Information Extraction System for Chemical Reactions in Patents
Wang J, Ren Y, Zhang Z, Xu H, Zhang Y. From Tokenization to Self-Supervision: Building a High-Performance Information Extraction System for Chemical Reactions in Patents. Frontiers in Research Metrics and Analytics 2021, 6: 691105. PMID: 35005421, PMCID: PMC8727901, DOI: 10.3389/frma.2021.691105.
Peer-Reviewed Original Research
Concepts: Event extraction; Entity recognition; Natural language processing techniques; Accurate information extraction; Information extraction system; Language processing techniques; Knowledge-based rules; Information extraction; Automatic tool; End system; Art results; Semantic roles; Language model; Self-Supervision; Free text; Chemical patents; Subtask 1; Reaction extraction; Different semantic roles; Hybrid approach; Event triggers; Processing techniques; Subtasks; Tokenization; High performance

A Discrete Joint Model for Entity and Relation Extraction from Clinical Notes
Ji Z, Ghiasvand O, Wu S, Xu H. A Discrete Joint Model for Entity and Relation Extraction from Clinical Notes. AMIA Joint Summits on Translational Science Proceedings 2021, 2021: 315-324. PMID: 34457146, PMCID: PMC8378610.
Peer-Reviewed Original Research
Concepts: Relation classification; Pipeline architecture; Clinical natural language processing; Natural language processing; Entity recognition; Beam search; Relation extraction; Clinical notes; Language processing; Classification step; Entity pairs; Structured perceptron; Fundamental task; Clinical narratives; Traditional solutions; Recognition step; Error propagation; Architecture; Joint model; Task; Subtasks; Perceptron; Clinical concepts; Entities; Classification