Kalpana Raja, PhD, MRSB, CSci
Instructor of Biomedical Informatics and Data ScienceCards
Contact Info
Biomedical Informatics & Data Science
100 College St
New Haven, CT 06510
United States
About
Titles
Instructor of Biomedical Informatics and Data Science
Biography
Kalpana Raja, PhD joined the Section of Biomedical Informatics & Data Science (BIDS) at Yale School of Medicine in February of 2023. Before moving to New Haven, CT, Kalpana worked as an assistant professor at the School of Biomedical Informatics, University of Texas Health Science Center (UTHealth) at Houston, TX. She also worked as a scientist at Sema4, a patient centered healthcare company located in Stamford, CT. Kalpana completed her bachelor’s degree in pharmacy from Tamil Nadu Dr. M.G.R. Medical University at Chennai, India. She is a registered pharmacist with the Indian Pharmacy Council. With a vision to develop software for biological applications, she completed her master’s degree in computing with a focus in software technology from The Robert Gordon University, Aberdeen, UK. She developed ProfileSKiM, an intelligent document retrieval tool, and submitted the findings in her MSc thesis. ProfileSKiM received a reward from the Robert Gordon University in 2005 and the Technology Award from the British Computer Society, London, UK in 2006. Kalpana completed her second master’s degree in bioinformatics and her PhD in computing: software technology – bioinformatics (inter-disciplinary) from Bharathiar University, Coimbatore, India. She presented her findings from the PhD research work at the 2012 Asia Pacific Bioinformatics Conference (APBC) held in Melbourne, Australia, and BioCreative Conference V held at Washington DC.
Kalpana’s research interests include natural language processing (NLP) and machine learning. She developed methodologies and software for information retrieval, information extraction, knowledge summarization, literature-based discovery, and automated hypothesis generation. She applied her approaches on various biological domains such as protein-protein interaction, protein phosphorylation, drug-drug interactions, adverse drug events, drug repurposing, and disease comorbidity. She also provided the NLP support for various genomics and transcriptomics projects. Kalpana has published more than 70 articles in peer reviewed journals, books, and conference proceedings. She has reviewed several research articles submitted to prestigious journals such as Briefings in Bioinformatics and serves as an associate editor in the Journal of Embryology & Stem Cell Research.
Kalpana was elected as a “Member of Royal Society of Biology” (MRSB) in 2019 by the Royal Society of Biology, London, UK. Recently, she was honored as the “Chartered Scientist” (CSci) by the Royal Society of Biology, London, UK. Kalpana also received the “2019 Women Scientist Award” from the Society for Bioinformatics and Biological Sciences, a non-profit professional society based in India.
Areas of Expertise
- Natural Language Processing
- Artificial Intelligence (AI)
- Large Language Models (LLMs)
- Deep Learning
- Machine Learning
- Biomedical informatics
Google scholar
Appointments
Biomedical Informatics & Data Science
InstructorPrimary
Other Departments & Organizations
- Biomedical Informatics & Data Science
- Clinical NLP Lab
Education & Training
- PhD
- Bharathiar University
- MSc
- Bharathiar University
- MSc
- The Robert Gordon University
- BPharm
- Tamilnadu Dr M G R Medical University
Research
Overview
Medical Research Interests
ORCID
0000-0002-3156-4197
Research at a Glance
Yale Co-Authors
Publications Timeline
Research Interests
Hua Xu, PhD
Vipina K. Keloth, PhD
Qingyu Chen, PhD
William K. Oh, MD
Jeffrey Zhang
Ron Adelman, MD, MPH, MBA, FACS
Natural Language Processing
Phosphorylation
Comorbidity
Machine Learning
Publications
2025
Gut‑lung axis microbiome: Towards precision medicine in respiratory disorders (Review)
Manoharan S, Iyappan O, Prabahar A, Bhasuran B, Raja K. Gut‑lung axis microbiome: Towards precision medicine in respiratory disorders (Review). World Academy Of Sciences Journal 2025, 7: 1-15. DOI: 10.3892/wasj.2025.376.Peer-Reviewed Original ResearchBenchmarking large language models for biomedical natural language processing applications and recommendations
Chen Q, Hu Y, Peng X, Xie Q, Jin Q, Gilson A, Singer M, Ai X, Lai P, Wang Z, Keloth V, Raja K, Huang J, He H, Lin F, Du J, Zhang R, Zheng W, Adelman R, Lu Z, Xu H. Benchmarking large language models for biomedical natural language processing applications and recommendations. Nature Communications 2025, 16: 3280. PMID: 40188094, PMCID: PMC11972378, DOI: 10.1038/s41467-025-56989-2.Peer-Reviewed Original ResearchCitationsAltmetricMeSH Keywords and ConceptsConceptsLanguage modelNatural language processing applicationsBiomedical natural language processingMedical question answeringLanguage processing applicationsNatural language processingGrowth of biomedical literatureMissing informationFew-shotQuestion AnsweringZero-ShotKnowledge curationLanguage processingProcessing applicationsBioNLPBART modelPerformance gapBiomedical literatureGeneral domainTaskBenchmarksBERTInformationPerformanceLLM
2024
Tomato Disease Classification Using CNN
Archanaa N, Daniel V, Divya S, Raja K, Oviya I. Tomato Disease Classification Using CNN. Smart Innovation, Systems And Technologies 2024, 392: 259-272. DOI: 10.1007/978-981-97-3690-4_20.Peer-Reviewed Original ResearchCitationsConceptsCapabilities of convolutional neural networksCNN modelRelevant hyper-parametersReLU activation functionConvolutional neural networkMachine learning techniquesPlant disease detectionRaw image dataTomato diseasesTomato yellow leaf curl virusYellow leaf curl virusImage prepossessingClass imbalanceFeature extractionData augmentationImage quality variabilityHyper-parametersNeural networkActivation functionLearning techniquesLeaf curl virusPlant imagesSeptoria leaf spotTwo-spotted spider miteEnhancement techniquesLarge Language Models and Genomics for Summarizing the Role of microRNA in Regulating mRNA Expression
Bhasuran B, Manoharan S, Iyyappan O, Murugesan G, Prabahar A, Raja K. Large Language Models and Genomics for Summarizing the Role of microRNA in Regulating mRNA Expression. Biomedicines 2024, 12: 1535. PMID: 39062108, PMCID: PMC11274411, DOI: 10.3390/biomedicines12071535.Peer-Reviewed Original ResearchCitationsConceptsMiRNA-mRNA interactionsRegulation of gene expressionMaintenance of cellular homeostasisMicroRNA (miRNA)-messenger RNAGenomic approachesRegulate mRNA expressionCellular homeostasisCellular differentiationGene expressionBiological processesPathogenesis of numerous diseasesMiRNA-mRNAPotential therapeutic targetGenomeDisease mechanismsNumerous diseasesLLM modelTherapeutic targetMetabolic conditionsMRNA expressionExpressionMicroRNAsRNAApoptosisLlamasPredicting Protein-Protein Interactions Using Self-Attention-Based Deep Neural Networks and FastText Embeddings
Oviya I, Sravya N, Raja K. Predicting Protein-Protein Interactions Using Self-Attention-Based Deep Neural Networks and FastText Embeddings. 2024, 00: 1-6. DOI: 10.1109/icccnt61001.2024.10725821.Peer-Reviewed Original ResearchCitationsConceptsPredicting Protein-Protein InteractionsProtein-protein interactionsProtein sequencesRepresentation of protein sequencesEncoded protein sequencesK-mer sequencesLearning modelsAmino acid segmentSelf-attention-basedNetwork feature extractionDeep neural networksDeep learning modelsPredicting PPIsK-mersMachine learning modelsProtein interactionsCellular functionsFastText embeddingsSelf-attentionTransfer learningAcid segmentFeature extractionHuman bacillusEncoding techniqueNeural networkRelation Extraction
Devarakonda M, Raja K, Xu H. Relation Extraction. Cognitive Informatics In Biomedicine And Healthcare 2024, 101-135. DOI: 10.1007/978-3-031-55865-8_5.Peer-Reviewed Original ResearchNamed Entity Recognition
Devarakonda M, Raja K, Xu H. Named Entity Recognition. Cognitive Informatics In Biomedicine And Healthcare 2024, 79-99. DOI: 10.1007/978-3-031-55865-8_4.Peer-Reviewed Original ResearchCitationsA Study of Biomedical Relation Extraction Using GPT Models.
Zhang J, Wibert M, Zhou H, Peng X, Chen Q, Keloth V, Hu Y, Zhang R, Xu H, Raja K. A Study of Biomedical Relation Extraction Using GPT Models. AMIA Joint Summits On Translational Science Proceedings 2024, 2024: 391-400. PMID: 38827097, PMCID: PMC11141827.Peer-Reviewed Original ResearchCitationsComorbidity-Guided Text Mining and Omics Pipeline to Identify Candidate Genes and Drugs for Alzheimer’s Disease
Oviya I, Sankar D, Manoharan S, Prabahar A, Raja K. Comorbidity-Guided Text Mining and Omics Pipeline to Identify Candidate Genes and Drugs for Alzheimer’s Disease. Genes 2024, 15: 614. PMID: 38790243, PMCID: PMC11121575, DOI: 10.3390/genes15050614.Peer-Reviewed Original ResearchCitationsAltmetricMeSH Keywords and ConceptsConceptsAlzheimer's diseaseMultifactorial neurodegenerative disorderComplex traitsCandidate genesMultiple genesOmics approachesGenesMG-132AD treatmentOmics pipelineNeurodegenerative disordersOmicsHybrid pipelineAlzheimerMutationsType 2 diabetesFood and Drug AdministrationComorbid diseasesTraitsLINCPathwayElevating Ocular Diagnosis: Harnessing the Power of EfficientNet for Eye Disease Classification
Mohith V, Raja K, Oviya I. Elevating Ocular Diagnosis: Harnessing the Power of EfficientNet for Eye Disease Classification. 2024, 00: 1-6. DOI: 10.1109/aiiot58432.2024.10574627.Peer-Reviewed Original ResearchConceptsNeural network designDeep learning methodsMedical image processingAutomated diagnostic toolLearning methodsImage processingAccuracy rateField of ophthalmologyNetwork designHealthcare technologiesOverall accuracy rateSpectrum of medical conditionsDiabetic retinopathyOcular illnessEye conditionsTreatment planningDisease classificationMedical conditionsDiagnostic toolImprove patient careEfficientNetDiverse spectrumEyesPreprocessingTechnology integration
Academic Achievements & Community Involvement
Honors
honor Full Member
12/04/2023National AwardSigma Xi, NC, USAhonor Outstanding Reviewer
11/30/2023International AwardMultidisciplinary Digital Publishing Institute (MDPI), Basel, SwitzerlandDetailsSwitzerlandhonor Chartered Scientist (CSci)
04/01/2022International AwardThe Royal Society, London, UKDetailsUnited Kingdomhonor 2019 Women Scientist Award
12/18/2020International AwardThe Society for Bioinformatics and Biological Sciences, Indiahonor Chair Person
12/14/2020International AwardInternational Conference on Agriculture and Biological Sciences at Kathmandu (Lalitpur), Nepal
News
News
- September 16, 2025Source: NIH
Yale Team Recognized in NIH $1M Data Sharing Challenge
- September 27, 2024
Biomedical Informatics and Data Science (BIDS) Secures a $7.88 Million NIH Grant to Advance Mental Health Research Using AI Technology
- June 17, 2024
Hot off the Press: Natural Language Processing in Biomedicine
Get In Touch
Contacts
Biomedical Informatics & Data Science
100 College St
New Haven, CT 06510
United States