Skip to Main Content

Xiting Yan, PhD

Associate Professor of Medicine (Pulmonary, Critical Care and Sleep Medicine)
DownloadHi-Res Photo

Additional Titles

Director of Data Analysis and Bioinformatics Hub, The Center for Precision Pulmonary Medicine (P2MED)

Assistant Professor, Biostatistics

About

Titles

Associate Professor of Medicine (Pulmonary, Critical Care and Sleep Medicine)

Director of Data Analysis and Bioinformatics Hub, The Center for Precision Pulmonary Medicine (P2MED); Assistant Professor, Biostatistics

Biography

Dr. Yan received doctoral degrees in both applied statistics and computational biology and bioinformatics. She is interested in genetics, genomics, computational biology, biostatistics, system biology and bioinformatics. Her current research topics include (1) understanding disease heterogeneity and pathogenesis using large-scale omics data at both bulk and single cell resolution and (2) developing novel statistical and computational methods for analyses of different types of omics data and the integration of them with drug perturbation data for potential personalized treatment design.

Last Updated on April 12, 2023.

Appointments

Other Departments & Organizations

Education & Training

Postdoctoral Associate
Yale School of Medicine (2010)
PhD
University of Southern California, Biological Science Department/Computational Biology and Bioinformatics (2009)
PhD
Peking University, Department of Probability and Statistics, School of Mathematical Sciences/Applied Statistics (2006)
BS
Peking University, Department of Probability and Statistics, School of Mathematical Sciences/Probability and Statistics (2001)

Research

Overview

Understanding the pathogenesis and progression of chronic lung diseases is critical for therapeutic development. Different types of OMICs data, including genetic, genomic, transcriptomic, epigenetic data and so on, provide rich, reproducible and mechanism indicating information for understanding disease pathogenesis and progression. However, OMICs data usually have high dimension, complicated data structure, high noise level, and complex interactions between features (genes, proteins, metabolites, etc.). The corresponding data analysis is challenging but critical to obtain biologically meaningful and reproducible discoveries.

My current research interest focus on two parts: (1) developing novel statistical and computational models to analyze large scale omics and drug perturbation data to better understand disease pathogenesis and precision medicine, and (2) understanding the heterogeneity, pathogenesis and progression of pulmonary diseases, such as asthma, idiopathic pulmonary fibrosis (IPF), sarcoidosis, pediatric cystic fibrosis and so on, by tailoring statistical and computational methods based on existing biological knowledge of the diseases.

My team has been involved in multiple transcriptomic studies of asthma, IPF, sarcoidosis, cystic fibrosis and lung injuries in pediatric patients undertaking cardio bypass procedure. These studies generated various types of large-scale transcriptomic data including microarray gene expression data, bulk RNA sequencing data, single cell RNA sequencing data, T cell receptor repertoire data, 16s rRNA sequencing data, spatial transcriptomic data and single-cell chromotin structural data. For each study, we tailed our computational and statistical analysis of the data based on existing biological knowledge of the corresponding disease or condition. These analyses have made various discoveries in asthma pathogenesis heterogeneity, cell type specific changes in asthma patients, heterogeneity and molecular biomarker of sarcoidosis, cell populations specific to IPF and COPD, potential antigen specific T cell clones for SARS-CoV-2 infection (COVID19) in adults and so on. My team is currently closely working with physicians and basic scientists to make further and more translational discoveries for the aforementioned and other pulmonary diseases.

Through the extensive analyses of various types of omics data generated by our collaborators, my team also identifies computational and statistical needs and develops novel methods to address these needs. Topics of computational tools we have developed include imputation of single-cell RNA sequencing data (G2S3), identifying differentially expressed genes from scRNA-seq data with mutliple subjects (iDESC), cell type deconvolution of spatial transcriptomic data (SDePER), identifying spatial domains from spatial transcriptomic data using large language model (LLMiniST) and so on. The development of these computational tools further boosted our capacity and ability to analyze different types of OMICS data to better understand disease heterogeneity, pathogenesis and progression.

Medical Research Interests

Biostatistics; Computational Biology; Genetics; Genomics; Lung Diseases; Molecular Biology; Molecular Medicine; Respiratory Hypersensitivity

Public Health Interests

Bioinformatics; Biomarkers; Genetics, Genomics, Epigenetics; Microbial Ecology; Modeling

Research at a Glance

Yale Co-Authors

Frequent collaborators of Xiting Yan's published research.

Publications

Featured Publications

Academic Achievements & Community Involvement

Activities

  • activity

    The Journal of Allergy and Clinical Immunology

  • activity

    BMC Bioinformatics

  • activity

    Bioinformatics

  • activity

    Computational Methods for Single-cell RNA Sequencing and Spatial Transcriptomic Data Analysis for Precision Medicine

  • activity

    Understanding disease heterogeneity of asthma using a pathway-based distance score for gene expression data

Honors

  • honor

    The YCCI Scholar Award, Yale University School of Medicine

Teaching & Mentoring

Mentoring

  • Siming Zheng

    Postdoc
    2024 - Present
  • Huanhuan Wei

    Postdoc
    2023 - 2026
  • Yuening Zhang

    Postdoc
    2023 - Present

Get In Touch

Contacts

Academic Office Number