Arman Cohan
Assistant ProfessorCards
About
Research
Publications
2025
RouterRetriever: Routing over a Mixture of Expert Embedding Models
Lee H, Soldaini L, Cohan A, Seo M, Lo K. RouterRetriever: Routing over a Mixture of Expert Embedding Models. Proceedings Of The AAAI Conference On Artificial Intelligence 2025, 39: 11995-12003. DOI: 10.1609/aaai.v39i11.33306.Peer-Reviewed Original ResearchEmbedding modelRouting mechanismGeneral domain datasetsMulti-task trainingDomain-specific dataInformation retrieval methodsMulti-task modelDomain-specific expertsExpert retrievalInformation retrievalLanguage modelRouting techniquesRetrieval modelUnderperforming modelsRetrieval methodRetrievalSpecialized domainsDatasetGeneration researchExpertsQueryInformationLanguageTrainingEmbeddingmFollowIR: A Multilingual Benchmark for Instruction Following in Retrieval
Weller O, Chang B, Yang E, Yarmohammadi M, Barham S, MacAvaney S, Cohan A, Soldaini L, Van Durme B, Lawrie D. mFollowIR: A Multilingual Benchmark for Instruction Following in Retrieval. Lecture Notes In Computer Science 2025, 15573: 295-310. DOI: 10.1007/978-3-031-88711-6_19.Peer-Reviewed Original ResearchFollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
Weller O, Chang B, MacAvaney S, Lo K, Cohan A, Van Durme B, Lawrie D, Soldaini L. FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions. 2025, 11926-11942. DOI: 10.18653/v1/2025.naacl-long.597.Peer-Reviewed Original ResearchReIFE: Re-evaluating Instruction-Following Evaluation
Liu Y, Shi K, Fabbri A, Zhao Y, Wang P, Wu C, Joty S, Cohan A. ReIFE: Re-evaluating Instruction-Following Evaluation. 2025, 12247-12287. DOI: 10.18653/v1/2025.naacl-long.610.Peer-Reviewed Original ResearchSciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature
Wadden D, Shi K, Morrison J, Li A, Naik A, Singh S, Barzilay N, Lo K, Hope T, Soldaini L, Shen S, Downey D, Hajishirzi H, Cohan A. SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature. 2025, 6083-6120. DOI: 10.18653/v1/2025.emnlp-main.310.Peer-Reviewed Original Research
2024
L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models
Ni A, Yin P, Zhao Y, Riddell M, Feng T, Shen R, Yin S, Liu Y, Yavuz S, Xiong C, Joty S, Zhou Y, Radev D, Cohan A, Cohan A. L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models. Transactions Of The Association For Computational Linguistics 2024, 12: 1311-1329. DOI: 10.1162/tacl_a_00705.Peer-Reviewed Original ResearchLanguage modelNatural language inputSemantic parsingHuman evaluationPretraining dataModel architectureModel sizeGeneration capabilityConfidence calibrationLearning paradigmPython programProject websiteTaskCapabilityLanguage inputParsingComprehensive evaluationLanguagePythonArchitectureCodeEvaluationLLMFramework1LearningBenchmarking Generation and Evaluation Capabilities of Large Language Models for Instruction Controllable Summarization
Liu Y, Fabbri A, Chen J, Zhao Y, Han S, Joty S, Liu P, Radev D, Wu C, Cohan A. Benchmarking Generation and Evaluation Capabilities of Large Language Models for Instruction Controllable Summarization. 2024, 4481-4501. DOI: 10.18653/v1/2024.findings-naacl.280.Peer-Reviewed Original ResearchP-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains
Han S, Yu A, Shen R, Qi Z, Riddell M, Zhou W, Qiao Y, Zhao Y, Yavuz S, Liu Y, Joty S, Zhou Y, Xiong C, Radev D, Ying R, Cohan A. P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains. 2024, 16553-16565. DOI: 10.18653/v1/2024.findings-emnlp.966.Peer-Reviewed Original ResearchTESS: Text-to-Text Self-Conditioned Simplex Diffusion
Karimi Mahabadi R, Ivison H, Tae J, Henderson J, Beltagy I, Peters M, Cohan A. TESS: Text-to-Text Self-Conditioned Simplex Diffusion. 2024, 2347-2361. DOI: 10.18653/v1/2024.eacl-long.144.Peer-Reviewed Original ResearchOn Evaluating the Integration of Reasoning and Action in LLM Agents with Database Question Answering
Nan L, Zhang E, Zou W, Zhao Y, Zhou W, Cohan A. On Evaluating the Integration of Reasoning and Action in LLM Agents with Database Question Answering. 2024, 4556-4579. DOI: 10.18653/v1/2024.findings-naacl.284.Peer-Reviewed Original Research
News
News
Get In Touch
Contacts
Email