Skip to Main Content

Introduction to Yale Center for Genomic Health

April 30, 2021
  • 00:00And I'm just really happy to
  • 00:02be here today to talk to you.
  • 00:04You know, for the first time since
  • 00:07I joined the community at Yale.
  • 00:09And so today's you know Workshop
  • 00:11is focused on human genomics,
  • 00:13data science and precision medicine,
  • 00:15and it's sort of designed to highlight
  • 00:17some of the plans and activities
  • 00:19for Center for genomic health.
  • 00:23And you know, I'm going to start things
  • 00:25off here by giving you an introduction
  • 00:26to sort of what we're trying to do,
  • 00:28and I'm going to keep it
  • 00:30at a really high level.
  • 00:31I have no data and very few slides.
  • 00:33I really want to sort of outline,
  • 00:34you know where the field is at and
  • 00:36where we think that we can make
  • 00:38a difference in the long term.
  • 00:40But before I could start like to reiterate,
  • 00:43you know what Antonio said,
  • 00:44which is, you know,
  • 00:45this center is really the product
  • 00:47of a joint effort from the school
  • 00:49and the hospital and really was born
  • 00:51out of a shared vision and support
  • 00:53of the prior Dean bulb output in
  • 00:54the prior President Rick Tequila.
  • 00:56And you know,
  • 00:57really is the product of a lot
  • 00:59of people's work over the course
  • 01:00of the last couple years.
  • 01:02You know,
  • 01:02long before I got here and I really
  • 01:04appreciate that and I appreciate
  • 01:06the ongoing support of the current
  • 01:07of the current leadership.
  • 01:09You know in this.
  • 01:10This joint leadership is really emblematic
  • 01:13of the two prong mission of our center,
  • 01:15which on one hand is to leave
  • 01:17cutting edge genomic research.
  • 01:19And on the other hand,
  • 01:21is to do our very best to implement
  • 01:23these technologies into the clinic
  • 01:25to make meaningful improvements
  • 01:27in health care and and both of
  • 01:29these arms need to work together,
  • 01:31and they both need to be strong if
  • 01:33we're going to be successful in our vision,
  • 01:36and so you know, I'm a basic scientist,
  • 01:39genome biologist,
  • 01:40Human Genetics.
  • 01:40Bring that sort of expertise to the table,
  • 01:43but there's really lots of different
  • 01:44perspectives that are important
  • 01:46here and I look forward to working
  • 01:48with everybody for a long time
  • 01:49on these important issues.
  • 01:50And so, without further ado, you know.
  • 01:53So what is what is genomic health, right?
  • 01:55So it's an emerging medical discipline
  • 01:58that involves anomic information about an
  • 02:00individual as part of their clinical care.
  • 02:03So that substantial fraction of the
  • 02:05human disease burden has a genetic component,
  • 02:07right?
  • 02:08So 5% of the world's population
  • 02:10suffers from a rare disease.
  • 02:12Many of these are caused by
  • 02:14rare pathogenic mutations.
  • 02:15Most people at some point in your life
  • 02:17will suffer from a common disease,
  • 02:19and we know that these show
  • 02:23substantial heritability.
  • 02:24Now we can collect genomic data.
  • 02:26You know affordably in that scale, right?
  • 02:28So the obvious thing that we want to do is,
  • 02:31you know sequence everybody's
  • 02:32genome and collect lots of other
  • 02:34types of OMICS data as well,
  • 02:36and use these data to inform
  • 02:37health care in the process.
  • 02:39We want to improve care and
  • 02:41obviously reduce costs,
  • 02:42and so this vision, you know,
  • 02:43is not controversial.
  • 02:44This has been the vision for the
  • 02:47past 30 years, and you know,
  • 02:49there's lots of really exciting applications
  • 02:51we won't have time to cover them all.
  • 02:54And there's been some really nice
  • 02:56great success stories along the way,
  • 02:58right?
  • 02:58But I think it's fair to say that you
  • 03:01know, for the vast majority of
  • 03:04heritable conditions, you know we're
  • 03:05barely scratching the surface of
  • 03:07what we could be doing. OK and.
  • 03:12You know we need to do our best
  • 03:14to push the envelope here because
  • 03:16it's an important problem, right?
  • 03:18And so so why is that so?
  • 03:19It's worth sort of taking a step back
  • 03:22and thinking about the big picture about
  • 03:24where we are as a field right now and
  • 03:26where we need to go because really motivates.
  • 03:29Sort of how we're thinking about the center.
  • 03:32And so you know,
  • 03:34we've come a long way right?
  • 03:35So you're 20 years ago we didn't even
  • 03:38notice single human genome look like OK,
  • 03:40and the Human Genome Project, you know,
  • 03:42sort of gave us this solid foundation
  • 03:44of human genome structure and function,
  • 03:46and it allowed us to sort of start to do
  • 03:49Human Genetics in a in a systematic way.
  • 03:51OK, and the next big landmark was the
  • 03:53development of these high throughput
  • 03:55DNA sequencing methods that allowed
  • 03:57us to go beyond one genome start.
  • 03:59Look at many genomes,
  • 04:00start to implement these technologies
  • 04:01into the clinic.
  • 04:02I mean,
  • 04:03you know a couple years ago we finally
  • 04:05reached that that long awaited landmark
  • 04:06of being able to sequence a genome for $1000.
  • 04:09And this was sort of what
  • 04:10everybody was waiting for,
  • 04:11right?
  • 04:13So these first two steps notice there's many
  • 04:15more important things that we need to do,
  • 04:17but you know,
  • 04:18we're on pretty solid ground right now.
  • 04:20And right now we're kind of in the
  • 04:22middle of these second, third,
  • 04:24and fourth steps where we're
  • 04:26trying to take these technologies,
  • 04:28apply them at scale across the
  • 04:30human population,
  • 04:31learn about how genetic variation looks
  • 04:33across all different ancestry groups,
  • 04:34learn about how genetic variants
  • 04:36operate in cells,
  • 04:37and obviously to take.
  • 04:40You know to look at genetic variation
  • 04:42in the context of lock the whole
  • 04:44companion of human diseases and start to
  • 04:46catalog all the different variants that
  • 04:48actually have an effect on disease risk,
  • 04:51right?
  • 04:51And we're kind of in the very beginning
  • 04:53of this last stage or sort of taking
  • 04:56this information and trying to
  • 04:58implement it in the health care system,
  • 05:00right?
  • 05:00Some of the really exciting technologies
  • 05:02are using polygenic risk scores to
  • 05:04partition people by common disease risk.
  • 05:06Using this technology is to
  • 05:07increase the diagnostics diagnostic
  • 05:09yield for rare disease.
  • 05:10And using these knowledge to
  • 05:12drive drug discovery or CRISPR
  • 05:15based therapies right and so the.
  • 05:17There's a lot to do here.
  • 05:19The possibilities are really immense,
  • 05:21right?
  • 05:21And I think you know we've been
  • 05:24trying to do this for 30 years and I
  • 05:27think 30 years from now we'll look at
  • 05:29the moment that we're in right now.
  • 05:32As you know, maybe the Golden age, right?
  • 05:34Maybe the inflection point between
  • 05:36what came before and what came after.
  • 05:38But like right now,
  • 05:39it's it's kind of moving slow,
  • 05:41actually,
  • 05:42and we're sort of in this hard slog
  • 05:44of trying to lay the foundation
  • 05:46of knowledge and technologies
  • 05:48that allow us to do this.
  • 05:50Not in an anecdotal way,
  • 05:51but in a systematic way in a real way.
  • 05:54OK so. We're going to slide for awhile,
  • 05:57so get comfortable.
  • 05:57I mean, I want to discuss or some of
  • 05:59the some of the challenges here because
  • 06:01these challenges really would motivate
  • 06:02like what we're trying to do, OK?
  • 06:04So the first first challenge
  • 06:06here is genome analysis. OK,
  • 06:08so you know we can produce genomic data now.
  • 06:11Incredible scale,
  • 06:11but we're still not there.
  • 06:13Still a lot of challenges in how
  • 06:15we analyze and interpret it,
  • 06:17so there's types of genetic variants
  • 06:19that are very difficult to detect.
  • 06:21There's parts of the genome that
  • 06:23are really hard for us to look at.
  • 06:26It's very difficult for us to
  • 06:29predict the function or the impact
  • 06:32of genetic variants computationally.
  • 06:34You know this is the famous variants
  • 06:36of unknown significance problem,
  • 06:38and it's a huge problem not playing
  • 06:40field and there's no easy solution,
  • 06:42and it's something that that
  • 06:44we need to solve.
  • 06:45You know, one approaches, you know better,
  • 06:47fancier machine learning algorithms,
  • 06:49and this is important.
  • 06:51It helps to just have a
  • 06:53lot more genomes around,
  • 06:54so that's important too.
  • 06:56But we also need a couple this
  • 06:59effort with efforts to produce.
  • 07:01Catalogs of what variants do
  • 07:02in cells using high throughput
  • 07:04functional genomics methods so that
  • 07:05we have good data to train the next
  • 07:07generation of AI based methods for
  • 07:09for interpreting genetic variation.
  • 07:10And this is what we need to do if
  • 07:13we're going to have these technologies
  • 07:15being the clinic in a robust way.
  • 07:17I'm at least questions.
  • 07:18These are questions that our
  • 07:20center is very interested in.
  • 07:21Is something in my own lab,
  • 07:22has worked on for a long time and we think
  • 07:26it's going to really push the needle.
  • 07:28So the second big challenge here
  • 07:30is that this effort to catalog
  • 07:32variants that cause disease is.
  • 07:33It's just really hard.
  • 07:35OK,
  • 07:35that's fair to say that it's a lot
  • 07:37harder than people appreciated
  • 07:3910 or 20 years ago,
  • 07:40and there's lots of reasons for that,
  • 07:42but I'll but I'll go into a few of them,
  • 07:46right? So on one hand.
  • 07:48We now know that common diseases,
  • 07:50and in fact most human traits
  • 07:51are highly polygenic, right?
  • 07:53Which means we have we need to
  • 07:54study very large sample sizes in
  • 07:56the range of 10s to hundreds of
  • 07:58thousands of people if not millions
  • 07:59of people you know to really get
  • 08:02a handle on the genetics.
  • 08:04And even for rare Mendelian
  • 08:05diseases where we sort of think
  • 08:07about them as being sort of simple,
  • 08:09they can also be quite complicated due
  • 08:11to the effects of incomplete penetrance.
  • 08:13I'm very well expressivity.
  • 08:16And this can also require
  • 08:17larger sample sizes,
  • 08:18and we specially need that if
  • 08:20we want to map the modifyers.
  • 08:22The protective alleles that
  • 08:23suggest drug targets, right?
  • 08:24So for both of these reasons,
  • 08:26no one institution,
  • 08:27no one lab,
  • 08:28maybe not even any one nation
  • 08:30can really do this on their own.
  • 08:32We need to be participating in large
  • 08:34scale consortia and team science that
  • 08:36really that really get it that we
  • 08:38also need to be more clever about how
  • 08:41we assemble human cohorts and how we
  • 08:43incorporate deep phenotype information.
  • 08:44So every health system needs to be a biobank.
  • 08:47And and every bio bank you know needs
  • 08:49to be connected to every other bio
  • 08:50bank in a network that allows us to
  • 08:53communicate and identify patients
  • 08:54that have similar genomic profiles
  • 08:55and similar phenotypic profiles.
  • 08:56Just something that we need to do.
  • 09:00And the third thing we need to do
  • 09:02is make every effort to make sure
  • 09:03that we do a better job at including
  • 09:06diverse ancestry groups.
  • 09:07In the studies that we do,
  • 09:09for historical reasons,
  • 09:09you know most of our knowledge
  • 09:11is built upon large studies of
  • 09:13European descent individuals.
  • 09:14This is a real problem because
  • 09:15it can actually as this trickles
  • 09:17down into the health care arena,
  • 09:19the algorithms that we use for risk
  • 09:21prediction and clinical decision
  • 09:22making are going to be biased, right?
  • 09:24So we all need to do our part to
  • 09:26alleviate this potential serious issue.
  • 09:28And so this general question of how do we do?
  • 09:30Gene discovery,
  • 09:31the next generation of gene
  • 09:33discovery projects that are bigger,
  • 09:34use better technologies,
  • 09:35and there are more diverse is
  • 09:37a real key goal of our center,
  • 09:39and in fact you know many of our
  • 09:42members are participating in if not
  • 09:44leading some of the most high profile
  • 09:47high impact studies in the world right now.
  • 09:50And the last thing I'll mention here,
  • 09:52I'll do this a little bit faster.
  • 09:54Maybe is that you know the
  • 09:56last challenge here is,
  • 09:58is disease mechanism?
  • 09:58OK,
  • 09:59so all of the things I've talked
  • 10:01about this far oftentimes at the end
  • 10:03of that you still have a correlation.
  • 10:06You still just have an Association
  • 10:07you don't necessarily know how that
  • 10:09impacts the biology of the disease,
  • 10:11and so it's really important that
  • 10:13we take the results of these
  • 10:15large scale studies and these
  • 10:17clinical sequencing efforts,
  • 10:18and we try to translate them into concrete
  • 10:20knowledge about disease mechanism.
  • 10:22And this is really hard because the
  • 10:24approach will vary a lot depending on
  • 10:26which disease you're talking about,
  • 10:28and so we need to engage with
  • 10:30disease experts.
  • 10:31People who know exactly how
  • 10:32how that disease works.
  • 10:34We need to, you know,
  • 10:35engage with animal models.
  • 10:37We need to use stem cell models,
  • 10:39organoid models.
  • 10:41And we need high throughput
  • 10:42functional methods that allow us to
  • 10:44interrogate what these genes do in
  • 10:45cells in a high throughput ways.
  • 10:47A lot of results to parse through.
  • 10:50And then another solution that
  • 10:52we're really interested in from the
  • 10:54standpoint of getting up disease
  • 10:56biology is using health systems as a
  • 10:58platform for learning about this right?
  • 11:00And so if you have,
  • 11:02if you have a lot of people where you have,
  • 11:05you know genomic information
  • 11:06and you also have well organized
  • 11:08electronic health records,
  • 11:09you can start to design studies
  • 11:11where you select groups of people
  • 11:13based on their genotype and do a
  • 11:15better job at looking for phenotype
  • 11:18and doing focused investigations.
  • 11:19And this is really crucial.
  • 11:21I think for taking this types
  • 11:23of studies to the next level,
  • 11:24and of course this is something that
  • 11:26we're trying to build here at yo
  • 11:28with the generations project in the
  • 11:30computational health platform and you'll
  • 11:31hear more about that later today.
  • 11:33So I think look, we covered it.
  • 11:36There's a lot of ground covered here.
  • 11:39I think you know there's a few
  • 11:41take home messages you know one.
  • 11:43This is a really hard problem.
  • 11:46It requires collaboration.
  • 11:48It requires input from lots of
  • 11:51different types of expertise.
  • 11:54Right, and actually,
  • 11:55you know the most exciting projects.
  • 11:57The most impactful projects and
  • 11:59initiatives are going to come at the
  • 12:01intersection of areas that I just mentioned.
  • 12:04OK, and that's really the motivation
  • 12:06for forming the center is to have a
  • 12:09team to have a venue for combining
  • 12:11people with lots of different expertise
  • 12:14that can tackle these questions
  • 12:16in a in a really impactful way.
  • 12:21And so we sort of formed this center.
  • 12:23Now you know there's been going
  • 12:25on for a couple years on the
  • 12:28clinical side's been very active.
  • 12:30We've now sort of assembled the
  • 12:32first initial group of members who
  • 12:34sort of span the whole range of
  • 12:37expertise that I just talked about.
  • 12:39And this is just an initial group.
  • 12:41You know.
  • 12:42I'm new here so I don't know everybody yet.
  • 12:44And so if you know if you're doing
  • 12:46relevant work and you want to get involved,
  • 12:49you know, please contact me.
  • 12:50So what are we going to wear?
  • 12:52Center of Excellence in genomics?
  • 12:54Data science in precision medicine?
  • 12:55You know we're trying to harness
  • 12:57these technologies to improve
  • 12:58healthcare and our principles are to
  • 13:00collaborate on high impact projects,
  • 13:01to share data and tools and to
  • 13:03really be a global partner.
  • 13:05To participate in these very large
  • 13:07population scale efforts that
  • 13:08we really need to do.
  • 13:09To push the envelope here,
  • 13:11but to bring all that technology,
  • 13:13all that knowledge, all that data,
  • 13:14all those tools to bear in
  • 13:16our local population.
  • 13:17OK, that's the mission.
  • 13:18And we've got a lot of
  • 13:20important partners in this.
  • 13:22You know,
  • 13:22we're not doing this alone,
  • 13:24most notably the Yale
  • 13:25Center for Genome Analysis,
  • 13:26which is super important partner.
  • 13:27And what we're trying to do Center for
  • 13:29outcomes evaluation of her research and
  • 13:31evaluation core is also another one,
  • 13:32and there's probably others I left off here.
  • 13:34They don't know about more
  • 13:36when I've been here longer.
  • 13:38So very,
  • 13:38very excited to do all this just to
  • 13:41sort of be a little bit more explicit
  • 13:43about what we're trying to do.
  • 13:45You know the the the one of the main
  • 13:48goals is to sort of build shares,
  • 13:50core technology platforms for the
  • 13:51integrative analysis of genomic data,
  • 13:53and the HR data.
  • 13:54This is super important,
  • 13:55and it's something that's going
  • 13:57to benefit everybody.
  • 13:58This is supposed to be a a shared
  • 14:00resource that anybody at the school
  • 14:02medicine can access for research projects.
  • 14:04This includes generations,
  • 14:05project, highly cloud project,
  • 14:06led by Mike Murray that we hear about.
  • 14:09Computational health platform led
  • 14:10by Wade Schultz as part of core,
  • 14:12and we're also my group in collaboration
  • 14:14with Jim Knight and Y CG or building
  • 14:17in Genomic Data Science platform,
  • 14:18which is essentially a set of pipeline
  • 14:21of really cutting edge genome analysis
  • 14:23tools that are designed to really
  • 14:25get the most out of the genomic data
  • 14:27that we're producing here at Yale.
  • 14:29And to make sure that all of these
  • 14:31three things get integrated really well
  • 14:34together to really push the science.
  • 14:36I think that you know big goal here
  • 14:38is to catalyze collaborative projects,
  • 14:41focus on all the areas that I just talked
  • 14:44about to work on with the hospital to
  • 14:47implement these technologies into the clinic,
  • 14:49and then just more generally
  • 14:51to build a bigger,
  • 14:52stronger genomics community here at Yale.
  • 14:54From the standpoint of recruitment,
  • 14:56training, workshops,
  • 14:57seminars and so really excited to do this,
  • 15:00really excited to be here,
  • 15:02I can't wait to get started.
  • 15:04Work with all of you here an.
  • 15:07With that, I'll say thank you.