Qingyu Chen, PhD

Assistant Professor of Biomedical Informatics and Data Science

DownloadHi-Res Photo

Appointments

Biomedical Informatics & Data Science

Primary

Ophthalmology

Secondary

Contact Info

qingyu.chen@yale.edu

About

Titles

Assistant Professor of Biomedical Informatics and Data Science

Biography

We always have open positions and look for talents. Please read this before emailing me. Without reading this, your email will be redirected to my best friend, Reviewer #2, without a reply :)

Dr. Qingyu Chen is a tenure-track Assistant Professor at the Department of Biomedical Informatics & Data Science, School of Medicine, starting in 2024. Prior to this, he completed the postdoctoral training at the National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health. Dr. Chen holds a PhD in Computer Science (Biomedical Informatics) from the University of Melbourne.

His research focuses on data science and artificial intelligence (AI) in biomedicine and healthcare. Dr. Chen has led significant milestones from data generation to method development and practical applications. His research interests can be broadly categorized into three areas:

Biomedical Natural Language Processing and Large Language Models
Medical Image and Multimodal Analysis in Healthcare
Downstream Accountability and Trustworthy AI for Medical Applications

Dr. Chen is the Principal Investigator of the R01 grant on improving factuality of LLMs in medicine (see news) and K99/R00 grant on multimodal AI-assisted disease diagnosis (see news). I have published over 40 first/last-author papers out of a total of 80+ publications within these research areas. My work has been in venues such as Nature, Nature Medicine, Nature Machine Intelligence, Nature Aging, NPJ Digital Medicine, Nucleic Acids Research, among others.

Dr. Chen has also taught over 20 courses and mentored more than 10 trainees. His teaching and mentoring have been recognized with the NIH Summer Research Mentor Award (four times) and Excellence in Teaching Awards (twice).

For more information, please visit https://sites.google.com/view/qingyuchen/home.

Appointments

Biomedical Informatics & Data Science
Assistant Professor
Primary
Biomedical Informatics & Data Science
Ophthalmology
Assistant Professor
Secondary
Ophthalmology

Education & Training

PhD: University of Melbourne, Computer Science and Biomedical Informatics (Microsoft Innovation Award; Excellence in Teaching Award; Top-ranked performance in AI challenge tasks

BS (Hon): The Royal Melbourne Institute of Technology, Computer Science (GPA ranked 1st; First-class Honor; Academic Excellence Award) (2013)

Research

Overview

Dr. Chen's research focuses on data science and artificial intelligence (AI) in biomedicine and healthcare. He has led significant milestones from data generation to method development and practical applications. His research interests can be broadly categorized into three areas:

Biomedical Natural Language Processing and Large Language Models
Medical Image and Multimodal Analysis in Healthcare
Downstream Accountability and Trustworthy AI for Medical Applications

Dr. Chen is the Principal Investigator of the K99/R00 grant on multimodal AI-assisted disease diagnosis. He has authored over 40 first/last-author papers out of a total of 80+ publications within these research areas. My work has been featured in venues such as Nature, Nature Medicine, Nature Machine Intelligence, Nature Aging, NPJ Digital Medicine, Nucleic Acids Research, among others.

His research has been recognized with several awards, including the NIH Fellows Award for Research Excellence (twice), AI Talent Scholar (Top 50 in AI across disciplines, selected by Baidu Scholar), NLM Honor Award (twice), and top-ranked performances in biomedical and clinical NLP challenges (four times, three as first author).

He has also taught over 20 courses and mentored more than 10 trainees. My teaching and mentoring have been recognized with the NIH Summer Research Mentor Award (four times) and Excellence in Teaching Awards (twice).

Selected recent work

Outpatient Reception via Collaboration Between Nurses and a Large Language Model: A Randomized Controlled Trial. Nature Medicine, 2024.
GeneGPT: Augmenting Large Language Models with Domain Tools for Improved Access to Biomedical Information. Bioinformatics, 2024.
Advancing Entity Recognition in Biomedicine via Instruction Tuning of Large Language Models. Bioinformatics, 2024.
Improving Large Language Models for Clinical Named Entity Recognition via Prompt Engineering. Journal of the American Medical Informatics Association, 2024.
Large Language Models in Biomedical Natural Language Processing: Benchmarks, Baselines, and Recommendations. 2023.
LitCovid in 2022: An Information Resource for the COVID-19 Literature. Nucleic Acids Research, 2023. (LitCovid has been accessed millions of times per month.)
DeepLensNet: Deep Learning Automated Diagnosis and Quantitative Classification of Cataract Type and Severity. Ophthalmology, 2022.
Detecting Visually Significant Cataract Using Retinal Photograph-Based Deep Learning. Nature Aging, 2022.
Predicting Myocardial Infarction Through Retinal Scans and Minimal Personal Information. Nature Machine Intelligence, 2022.
Multimodal, Multitask, Multiattention (M3) Deep Learning Detection of Reticular Pseudodrusen: Toward Automated and Accessible Classification of Age-Related Macular Degeneration. Journal of the American Medical Informatics Association, 2021. (NLM Honor Award)
LitCovid: An Open Database of COVID-19 Literature. Nucleic Acids Research, 2021.
Keep Up with the Latest Coronavirus Research. Nature, 2021.

Large Language Models in Biomedicine: Selected Paper Series of Our Research

Outpatient Reception via Collaboration Between Nurses and a Large Language Model: A Randomized Controlled Trial. Nature Medicine, 2024.
Opportunities and Challenges for ChatGPT and Large Language Models in Biomedicine and Health. Briefings in Bioinformatics, 2024.
GeneGPT: Augmenting Large Language Models with Domain Tools for Improved Access to Biomedical Information. Bioinformatics, 2024.
Advancing Entity Recognition in Biomedicine via Instruction Tuning of Large Language Models. Bioinformatics, 2024.
Improving Large Language Models for Clinical Named Entity Recognition via Prompt Engineering. Journal of the American Medical Informatics Association, 2024.
Large Language Models in Biomedical Natural Language Processing: Benchmarks, Baselines, and Recommendations. 2023.
Large Language Models and the Retina: A Review of Current Applications and Future Directions. 2023.

Biomedical Foundation Models: Selected Paper Series of Our Research

Me llama: Foundation large language models for medical applications. 2024.
Bioformer: an efficient transformer language model for biomedical text mining. 2023.
MedCPT: Contrastive Pre-trained Transformers with large-scale PubMed search logs for zero-shot biomedical information retrieval. Bioinformatics. 2023.
BioConceptVec: Creating and evaluating literature-based biomedical concept embeddings on a large scale. PLoS computational biology. 2020
BioSentVec: creating sentence embeddings for biomedical texts. IEEE International Conference on Healthcare Informatics. 2019
BioWordVec, improving biomedical word embeddings with subword information and MeSH. Scientific data. 2019

AI in Healthcare: Selected Paper Series of Our Research

A deep network DeepOpacityNet for detection of cataracts from color fundus photographs. Nature Communications Medicine. 2023.
DeepLensNet: Deep Learning Automated Diagnosis and Quantitative Classification of Cataract Type and Severity. Ophthalmology, 2022.
Detecting Visually Significant Cataract Using Retinal Photograph-Based Deep Learning. Nature Aging, 2022.
Predicting Myocardial Infarction Through Retinal Scans and Minimal Personal Information. Nature Machine Intelligence, 2022.
Learning Structure from Visual Semantic Features and Radiology Ontology for Lymph Node Classification on MRI, MLMI, 2021.
Predicting risk of late age-related macular degeneration using deep learning. NPJ digital medicine, 2020.
DeepSeeNet: a deep learning model for automated classification of patient-based age-related macular degeneration severity from color fundus photographs, Ophthalmology, 2019.

For more information, please visit https://sites.google.com/view/qingyuchen/home.

Qingyu Chen Lab
View Lab Website

Research at a Glance

Yale Co-Authors

Frequent collaborators of Qingyu Chen's published research.

Hua Xu, PhD
View Full Profile
View 4 Common Publications
Ron Adelman, MD, MPH, MBA, FACS
View Full Profile
View 2 Common Publications
Vipina K. Keloth, PhD
View Full Profile
View 2 Common Publications

Publications

2024

See All Publications

Academic Achievements & Community Involvement

honor
National Library of Medicine Honor Award
National AwardNational Institutes of Health
Details
honor
National Library of Medicine Data Science and Informatics Mentor Awards
National AwardNational Institutes of Health
Details
honor
Summer Research Mentor Award
National AwardNational Institutes of Health
Details
honor
Fellows Award for Research Excellence
National AwardNational Institutes of Health
Details
honor
Summer Research Mentor Award
National AwardNational Institutes of Health
Details

See All Achievements

News

Get In Touch

Contacts

qingyu.chen@yale.edu

Administrative Support

Catherine Barabas
Research Assistant 1 MS

Appointments

Contact Info

Titles

Biography

Appointments

Biomedical Informatics & Data Science

Ophthalmology

Other Departments & Organizations

Education & Training

Overview

Qingyu Chen Lab

Research at a Glance

Yale Co-Authors

Hua Xu, PhD

Ron Adelman, MD, MPH, MBA, FACS

Vipina K. Keloth, PhD

Publications

2024

National Library of Medicine Honor Award

National Library of Medicine Data Science and Informatics Mentor Awards

Summer Research Mentor Award

Fellows Award for Research Excellence

Summer Research Mentor Award

News

Yale BIDS Presenting at the AMIA 2024 Annual Symposium

Advancing AI-Assisted Diagnosis of Ophthalmic Diseases

NIH Awards $1.5 Million Grant to Improve Factual Correctness in Large Language Models in Health Care

What Does Natural Language Processing Mean for Biomedicine?

Contacts

Administrative Support