Featured Publications
A draft human pangenome reference
Liao W, Asri M, Ebler J, Doerr D, Haukness M, Hickey G, Lu S, Lucas J, Monlong J, Abel H, Buonaiuto S, Chang X, Cheng H, Chu J, Colonna V, Eizenga J, Feng X, Fischer C, Fulton R, Garg S, Groza C, Guarracino A, Harvey W, Heumos S, Howe K, Jain M, Lu T, Markello C, Martin F, Mitchell M, Munson K, Mwaniki M, Novak A, Olsen H, Pesout T, Porubsky D, Prins P, Sibbesen J, Sirén J, Tomlinson C, Villani F, Vollger M, Antonacci-Fulton L, Baid G, Baker C, Belyaeva A, Billis K, Carroll A, Chang P, Cody S, Cook D, Cook-Deegan R, Cornejo O, Diekhans M, Ebert P, Fairley S, Fedrigo O, Felsenfeld A, Formenti G, Frankish A, Gao Y, Garrison N, Giron C, Green R, Haggerty L, Hoekzema K, Hourlier T, Ji H, Kenny E, Koenig B, Kolesnikov A, Korbel J, Kordosky J, Koren S, Lee H, Lewis A, Magalhães H, Marco-Sola S, Marijon P, McCartney A, McDaniel J, Mountcastle J, Nattestad M, Nurk S, Olson N, Popejoy A, Puiu D, Rautiainen M, Regier A, Rhie A, Sacco S, Sanders A, Schneider V, Schultz B, Shafin K, Smith M, Sofia H, Abou Tayoun A, Thibaud-Nissen F, Tricomi F, Wagner J, Walenz B, Wood J, Zimin A, Bourque G, Chaisson M, Flicek P, Phillippy A, Zook J, Eichler E, Haussler D, Wang T, Jarvis E, Miga K, Garrison E, Marschall T, Hall I, Li H, Paten B. A draft human pangenome reference. Nature 2023, 617: 312-324. PMID: 37165242, PMCID: PMC10172123, DOI: 10.1038/s41586-023-05896-x.Peer-Reviewed Original ResearchThe Human Pangenome Project: a global resource to map genomic diversity
Wang T, Antonacci-Fulton L, Howe K, Lawson HA, Lucas JK, Phillippy AM, Popejoy AB, Asri M, Carson C, Chaisson MJP, Chang X, Cook-Deegan R, Felsenfeld AL, Fulton RS, Garrison EP, Garrison N, Graves-Lindsay TA, Ji H, Kenny EE, Koenig BA, Li D, Marschall T, McMichael JF, Novak AM, Purushotham D, Schneider VA, Schultz BI, Smith MW, Sofia HJ, Weissman T, Flicek P, Li H, Miga KH, Paten B, Jarvis ED, Hall IM, Eichler EE, Haussler D. The Human Pangenome Project: a global resource to map genomic diversity. Nature 2022, 604: 437-446. PMID: 35444317, PMCID: PMC9402379, DOI: 10.1038/s41586-022-04601-8.Peer-Reviewed Original ResearchConceptsHuman reference genomeReference genomeGenomic diversityGenomic variationHuman genomic variationGlobal genomic diversitySingle nucleotide variantsGene-disease associationsDiploid genomeGenetic resourcesGenomeGenomic researchFuture biomedical researchHigh-quality referenceStructural variantsHuman geneticsRoutine assemblyCommon variantsFunctional elementsPolymorphic regionDiversityBiomedical researchVariantsMajor updateGeneticsGenomic Analysis in the Age of Human Genome Sequencing
Lappalainen T, Scott AJ, Brandt M, Hall IM. Genomic Analysis in the Age of Human Genome Sequencing. Cell 2019, 177: 70-84. PMID: 30901550, PMCID: PMC6532068, DOI: 10.1016/j.cell.2019.02.032.Peer-Reviewed Original ResearchMeSH KeywordsBiological Specimen BanksChromosome MappingGenetic Predisposition to DiseaseGenetic TestingGenetic VariationGenome, HumanGenome-Wide Association StudyGenomicsHigh-Throughput Nucleotide SequencingHuman Genome ProjectHumansPolymorphism, Single NucleotideSequence Analysis, DNAWhole Genome SequencingConceptsFunctional genomics approachAllele frequency spectrumHuman genome sequencingGene mapping studiesGenome sequencing technologiesRare human diseasesWhole-genome sequencingGenomic approachesGenetic variant discoveryGenome variationHuman genomeGenome analysisGenomic analysisSequencing technologiesGenome sequencingVariant discoveryHuman diseasesHuman geneticsGenomeFunctional interpretationMapping studiesFunctional effectsSequencingGermline variantsGeneticsSVScore: an impact prediction tool for structural variation
Ganel L, Abel HJ, , Hall IM. SVScore: an impact prediction tool for structural variation. Bioinformatics 2017, 33: 1083-1085. PMID: 28031184, PMCID: PMC5408916, DOI: 10.1093/bioinformatics/btw789.Peer-Reviewed Original ResearchCharacterizing complex structural variation in germline and somatic genomes
Quinlan AR, Hall IM. Characterizing complex structural variation in germline and somatic genomes. Trends In Genetics 2011, 28: 43-53. PMID: 22094265, PMCID: PMC3249479, DOI: 10.1016/j.tig.2011.10.002.Peer-Reviewed Original ResearchConceptsComplex structural variationsStructural variationsNext-generation DNA sequencingHallmarks of cancerSomatic genomeGenetic diversityMultiple chromosomesSingle locusDistinct lociRecombination eventsComplex variantsSingle mutationMapping experimentsDNA sequencingComplicated rearrangementsMammalsCurrent knowledgeMapping studiesLociSubtle alterationsVariantsGenomeSurprising numberChromosomesGermlineBEDTools: a flexible suite of utilities for comparing genomic features
Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 2010, 26: 841-842. PMID: 20110278, PMCID: PMC2832824, DOI: 10.1093/bioinformatics/btq033.Peer-Reviewed Original ResearchConceptsBrowser Extensible DataComprehensive user manualNew software suiteUNIX commandsMassive datasetsLarge genomic datasetsSource codeExtensible dataGenomics tasksSoftware suiteFundamental taskLarge datasetsBAM formatUser manualAnnotation tracksWeb-based methodsSupplementary dataEfficient mannerFlexible suiteDatasetFlexible toolBEDToolsCurrent sequencing technologiesGenomic datasetsTask
2022
Semi-automated assembly of high-quality diploid human reference genomes
Jarvis E, Formenti G, Rhie A, Guarracino A, Yang C, Wood J, Tracey A, Thibaud-Nissen F, Vollger M, Porubsky D, Cheng H, Asri M, Logsdon G, Carnevali P, Chaisson M, Chin C, Cody S, Collins J, Ebert P, Escalona M, Fedrigo O, Fulton R, Fulton L, Garg S, Gerton J, Ghurye J, Granat A, Green R, Harvey W, Hasenfeld P, Hastie A, Haukness M, Jaeger E, Jain M, Kirsche M, Kolmogorov M, Korbel J, Koren S, Korlach J, Lee J, Li D, Lindsay T, Lucas J, Luo F, Marschall T, Mitchell M, McDaniel J, Nie F, Olsen H, Olson N, Pesout T, Potapova T, Puiu D, Regier A, Ruan J, Salzberg S, Sanders A, Schatz M, Schmitt A, Schneider V, Selvaraj S, Shafin K, Shumate A, Stitziel N, Stober C, Torrance J, Wagner J, Wang J, Wenger A, Xiao C, Zimin A, Zhang G, Wang T, Li H, Garrison E, Haussler D, Hall I, Zook J, Eichler E, Phillippy A, Paten B, Howe K, Miga K. Semi-automated assembly of high-quality diploid human reference genomes. Nature 2022, 611: 519-531. PMID: 36261518, PMCID: PMC9668749, DOI: 10.1038/s41586-022-05325-5.Peer-Reviewed Original ResearchConceptsDiploid genome assemblyGenome assemblyProtein-coding genesGlobal genetic variationCurrent human reference genomeDiploid human genomeHigh-quality assemblyAccurate long readsNon-synonymous amino acid changesHuman reference genomeAmino acid changesMost chromosomesReference assemblyReference genomeHuman genomeCentromeric regionsGenetic variationHigh diversityGenome sequencingLong readsSingle nucleotideGenomeAcid changesManual curationBiological genomes
2017
The impact of rare variation on gene expression across tissues
Aguet F, Ardlie K, Cummings B, Gelfand E, Getz G, Hadley K, Handsaker R, Huang K, Kashin S, Karczewski K, Lek M, Li X, MacArthur D, Nedzel J, Nguyen D, Noble M, Segrè A, Trowbridge C, Tukiainen T, Abell N, Balliu B, Barshir R, Basha O, Battle A, Bogu G, Brown A, Brown C, Castel S, Chen L, Chiang C, Conrad D, Cox N, Damani F, Davis J, Delaneau O, Dermitzakis E, Engelhardt B, Eskin E, Ferreira P, Frésard L, Gamazon E, Garrido-Martín D, Gewirtz A, Gliner G, Gloudemans M, Guigo R, Hall I, Han B, He Y, Hormozdiari F, Howald C, Kyung Im H, Jo B, Yong Kang E, Kim Y, Kim-Hellmuth S, Lappalainen T, Li G, Li X, Liu B, Mangul S, McCarthy M, McDowell I, Mohammadi P, Monlong J, Montgomery S, Muñoz-Aguirre M, Ndungu A, Nicolae D, Nobel A, Oliva M, Ongen H, Palowitch J, Panousis N, Papasaikas P, Park Y, Parsana P, Payne A, Peterson C, Quan J, Reverter F, Sabatti C, Saha A, Sammeth M, Scott A, Shabalin A, Sodaei R, Stephens M, Stranger B, Strober B, Sul J, Tsang E, Urbut S, van de Bunt M, Wang G, Wen X, Wright F, Xi H, Yeger-Lotem E, Zappala Z, Zaugg J, Zhou Y, Akey J, Bates D, Chan J, Chen L, Claussnitzer M, Demanelis K, Diegel M, Doherty J, Feinberg A, Fernando M, Halow J, Hansen K, Haugen E, Hickey P, Hou L, Jasmine F, Jian R, Jiang L, Johnson A, Kaul R, Kellis M, Kibriya M, Lee K, Billy Li J, Li Q, Li X, Lin J, Lin S, Linder S, Linke C, Liu Y, Maurano M, Molinie B, Montgomery S, Nelson J, Neri F, Oliva M, Park Y, Pierce B, Rinaldi N, Rizzardi L, Sandstrom R, Skol A, Smith K, Snyder M, Stamatoyannopoulos J, Stranger B, Tang H, Tsang E, Wang L, Wang M, Van Wittenberghe N, Wu F, Zhang R, Nierras C, Branton P, Carithers L, Guan P, Moore H, Rao A, Vaught J, Gould S, Lockart N, Martin C, Struewing J, Volpi S, Addington A, Koester S, Little A, Brigham L, Hasz R, Hunter M, Johns C, Johnson M, Kopen G, Leinweber W, Lonsdale J, McDonald A, Mestichelli B, Myer K, Roe B, Salvatore M, Shad S, Thomas J, Walters G, Washington M, Wheeler J, Bridge J, Foster B, Gillard B, Karasik E, Kumar R, Miklos M, Moser M, Jewell S, Montroy R, Rohrer D, Valley D, Davis D, Mash D, Undale A, Smith A, Tabor D, Roche N, McLean J, Vatanian N, Robinson K, Sobin L, Barcus M, Valentino K, Qi L, Hunter S, Hariharan P, Singh S, Um K, Matose T, Tomaszewski M, Barker L, Mosavel M, Siminoff L, Traino H, Flicek P, Juettemann T, Ruffier M, Sheppard D, Taylor K, Trevanion S, Zerbino D, Craft B, Goldman M, Haeussler M, Kent W, Lee C, Paten B, Rosenbloom K, Vivian J, Zhu J. The impact of rare variation on gene expression across tissues. Nature 2017, 550: 239-243. PMID: 29022581, PMCID: PMC5877409, DOI: 10.1038/nature24267.Peer-Reviewed Original ResearchLandscape of X chromosome inactivation across human tissues
Aguet F, Ardlie K, Cummings B, Gelfand E, Getz G, Hadley K, Handsaker R, Huang K, Kashin S, Karczewski K, Lek M, Li X, MacArthur D, Nedzel J, Nguyen D, Noble M, Segrè A, Trowbridge C, Tukiainen T, Abell N, Balliu B, Barshir R, Basha O, Battle A, Bogu G, Brown A, Brown C, Castel S, Chen L, Chiang C, Conrad D, Cox N, Damani F, Davis J, Delaneau O, Dermitzakis E, Engelhardt B, Eskin E, Ferreira P, Frésard L, Gamazon E, Garrido-Martín D, Gewirtz A, Gliner G, Gloudemans M, Guigo R, Hall I, Han B, He Y, Hormozdiari F, Howald C, Kyung Im H, Jo B, Yong Kang E, Kim Y, Kim-Hellmuth S, Lappalainen T, Li G, Li X, Liu B, Mangul S, McCarthy M, McDowell I, Mohammadi P, Monlong J, Montgomery S, Muñoz-Aguirre M, Ndungu A, Nicolae D, Nobel A, Oliva M, Ongen H, Palowitch J, Panousis N, Papasaikas P, Park Y, Parsana P, Payne A, Peterson C, Quan J, Reverter F, Sabatti C, Saha A, Sammeth M, Scott A, Shabalin A, Sodaei R, Stephens M, Stranger B, Strober B, Sul J, Tsang E, Urbut S, van de Bunt M, Wang G, Wen X, Wright F, Xi H, Yeger-Lotem E, Zappala Z, Zaugg J, Zhou Y, Akey J, Bates D, Chan J, Chen L, Claussnitzer M, Demanelis K, Diegel M, Doherty J, Feinberg A, Fernando M, Halow J, Hansen K, Haugen E, Hickey P, Hou L, Jasmine F, Jian R, Jiang L, Johnson A, Kaul R, Kellis M, Kibriya M, Lee K, Li J, Li Q, Li X, Lin J, Lin S, Linder S, Linke C, Liu Y, Maurano M, Molinie B, Montgomery S, Nelson J, Neri F, Oliva M, Park Y, Pierce B, Rinaldi N, Rizzardi L, Sandstrom R, Skol A, Smith K, Snyder M, Stamatoyannopoulos J, Stranger B, Tang H, Tsang E, Wang L, Wang M, Van Wittenberghe N, Wu F, Zhang R, Nierras C, Branton P, Carithers L, Guan P, Moore H, Rao A, Vaught J, Gould S, Lockart N, Martin C, Struewing J, Volpi S, Addington A, Koester S, Little A, Brigham L, Hasz R, Hunter M, Johns C, Johnson M, Kopen G, Leinweber W, Lonsdale J, McDonald A, Mestichelli B, Myer K, Roe B, Salvatore M, Shad S, Thomas J, Walters G, Washington M, Wheeler J, Bridge J, Foster B, Gillard B, Karasik E, Kumar R, Miklos M, Moser M, Jewell S, Montroy R, Rohrer D, Valley D, Davis D, Mash D, Undale A, Smith A, Tabor D, Roche N, McLean J, Vatanian N, Robinson K, Sobin L, Barcus M, Valentino K, Qi L, Hunter S, Hariharan P, Singh S, Um K, Matose T, Tomaszewski M, Barker L, Mosavel M, Siminoff L, Traino H, Flicek P, Juettemann T, Ruffier M, Sheppard D, Taylor K, Trevanion S, Zerbino D, Craft B, Goldman M, Haeussler M, Kent W, Lee C, Paten B, Rosenbloom K, Vivian J, Zhu J. Landscape of X chromosome inactivation across human tissues. Nature 2017, 550: 244-248. PMID: 29022598, PMCID: PMC5685192, DOI: 10.1038/nature24265.Peer-Reviewed Original Research
2014
Population-based structural variation discovery with Hydra-Multi
Lindberg MR, Hall IM, Quinlan AR. Population-based structural variation discovery with Hydra-Multi. Bioinformatics 2014, 31: 1286-1289. PMID: 25527832, PMCID: PMC4393510, DOI: 10.1093/bioinformatics/btu771.Peer-Reviewed Original ResearchConceptsNumber of genomesSV analysisStructural variation discoveryStructural variant detectionHuman genomeCancer Genome AtlasVariation discoveryGenomic rearrangementsGenome ProjectSequence alignmentSignal integrationIndel discoveryGenome AtlasGenomeMultiple individualsSupplementary dataSize variabilityVariant detectionCommodity hardwarePoor scalabilityAvailable datasetsAnalysis workflowScalabilityExtant toolsDiscoverySAMBLASTER: fast duplicate marking and structural variant read extraction
Faust GG, Hall IM. SAMBLASTER: fast duplicate marking and structural variant read extraction. Bioinformatics 2014, 30: 2503-2505. PMID: 24812344, PMCID: PMC4147885, DOI: 10.1093/bioinformatics/btu314.Peer-Reviewed Original Research