Apply
Request for Project Proposals
Copy Link
Fellows accepted into the program will gain insight into research in both academia and industry. Our prestigious fellowships are fully funded for a three-year period and provide the fellows’ salaries, fringe benefits, and a generous travel allowance. Our fellowships are open to outstanding candidates seeking career advancement in biomedical data sciences.
The Yale-BI Joint Steering Committee (JSC) has established the following data-driven research theme for the 2026 program. The steering committee will consider how well proposed research projects align with the prioritized theme when judging submissions and postdoctoral applications. Selected applications will be further reviewed for approval.
Advancing AI-Driven Biomedical Research for Drug Discovery
Proposal of Research Themes for the Yale-Boehringer Biomedical Data Science Fellowship 2026
The Yale-BI Biomedical Data Science Fellowship Program for 2026 invites innovative research proposals that harness cutting-edge AI to tackle critical challenges in drug development. This initiative combines Yale’s leadership in AI and Foundation Models (FM) with Boehringer Ingelheim’s expertise in translational science to accelerate discovery and patient impact.
The 2026 program focuses on three transformative areas designed to drive innovation at the intersection of computational biology, AI, and drug discovery:
- Foundation Models for Patient Stratification: Enabling precision medicine by integrating multi-omics, Real-World Data (RWD), and clinical data.
- Foundation Models for Safety Prediction: Addressing drug development bottlenecks by improving early toxicity assessment.
- Agentic AI: Leveraging autonomous agents to generate scientific hypotheses and recommend subsequent experiments.
These themes aim to transform treatment strategies for chronic liver and kidney diseases, inflammatory diseases, and cancer.
Project 1: Foundation Models for Patient Stratification, Selection, and Predictive Biomarker Discovery
The Challenge: A major hurdle in drug development is the lack of comprehensive, data-driven insights to guide patient selection across the translational pipeline. Traditional approaches often rely on operational criteria which, while necessary for logistics, fail to fully capture the complexity of diverse patient populations. While current methods have seen success in cancer and genetic mutation-based rare diseases, they have had limited success in other therapeutic areas.
The Goal: This project seeks to develop foundation model-based methodologies that synthesize multi-omics, real-world evidence, and clinical data to generate actionable insights for intelligent patient segmentation and risk stratification.
Key Focus: Proposals should focus on developing novel methods that move beyond diseases driven by single-gene mutations. The objective is to enable broadly applicable strategies that segment patient populations and disease progression to inform precision therapeutics.
Project 2: Foundation Models for Target and Drug Safety Prediction
The Challenge: Drug development faces high attrition rates due to safety failures in early clinical trials, largely driven by the poor translatability of preclinical toxicity findings to humans.
The Goal: This project aims to develop foundation-model-based approaches to quantify the likelihood of unmanageable human toxicity early in the preclinical pipeline—potentially before the synthesis of a molecule—to reduce the risk of costly failures later in the clinic.
Key Focus: Research should leverage recent technological advances in AI foundation models and utilize a wide range of data sources and modalities to learn complex biological relationships for toxicity assessment.
Project 3: Agentic AI for Hypothesis Generation and Experimental Design
The Challenge: Standard research methodologies often face prolonged timelines in hypothesis testing and experimental planning.
The Goal: Agentic AI systems, powered by Large Language Models (LLMs) and multi-agent architectures, have the potential to redefine scientific research by automating hypothesis generation, experimental design, and execution. These systems can close the loop of the scientific method, significantly reducing preclinical timelines and enabling novel approaches to trial optimization.
Key Focus: This project will develop multi-agent AI frameworks to autonomously generate scientific hypotheses that enhance disease understanding and identify novel therapeutic targets by demonstrating a strong Target to Disease Link (T2DL). It will leverage advanced agentic architectures, such as "AI co-scientist" prototypes, to integrate multi-omics, in vitro, in vivo, and clinical data for experimental automation.
Potential Applications Include:
- Autonomous Genetic Perturbation Design: Accelerating disease mechanism studies and therapeutic target identification.
- Novel Target Identification and Prioritization: Synthesizing heterogeneous data to speed up upstream drug discovery.
Frequently Asked Questions
Copy Link
- 1. What is the application timeline?
- Submission portal opens: February 2, 2026
- Submission deadline: March 30, 2026, at 11:59 PM ET
- Notification to applicants: July 1, 2026
- Program start date: September 1, 2026
- 2. How long is the fellowship program?
- Three years.
- 3. What are the qualifications and responsibilities of the fellows in this program?
- Qualifications
- PhD degree in the areas of computational biology, bioinformatics, data science, or relevant scientific disciplines, as well as excellent computing skills
- Good analytical and written communication skills, including the ability to effectively describe scientific material to both specialized and lay audiences
- A strong CV, preferably having published in high-impact journals and presented at international meetings
- Fellow Responsibilities
- Conduct high quality research through developing novel computational methods/tools to address significant biomedical problems
- Publish research results in high-impact journals
- Communicate research progress at regular intervals with Yale and Boehringer mentors
- Participate in required program activities
- Qualifications
- 4. How many fellows will you accept this year?
The program will allow up to three new competitive fellows this year, including but not limited to Yale Postdocs. We encourage eligible candidates globally to apply.
- 5. What is the contact information for questions about the program?
- 6. Who will evaluate my application?
A Yale-BI Joint Steering Committee (JSC) comprising representatives of Yale and Boehringer Ingelheim will evaluate applications and make decisions.
- 7. How many mentors will I have?
Each fellow will have two mentors, one from Yale and the other from Boehringer Ingelheim.
- 8. Is this a full-time position?
Yes, this is a full-time position once accepted into the fellowship program.
- 9. Will there be any networking events to learn more about other fellows’ work?
- Yes, there will be an annual symposium for fellows to report on and share with each other their research progress. Mentors and scientists from both Yale and BI will be invited to attend.
- 10. Will there be opportunities to learn more about Boehringer Ingelheim?
- Yes, the program will sponsor corporate campus visits for fellows to meet with BI employees and learn about their projects.
- 11. What benefits do fellows receive?
- Fellows will receive salaries and fringe benefits (such as health insurance).
- 12. Is travel to relevant conferences covered?
- Travel expenses related to attending conferences or workshops that are approved by mentors and joint committees will be covered.