Featured Publications
Exploring the Big Data Paradox for various estimands using vaccination data from the global COVID-19 Trends and Impact Survey (CTIS)
Yang Y, Dempsey W, Han P, Deshmukh Y, Richardson S, Tom B, Mukherjee B. Exploring the Big Data Paradox for various estimands using vaccination data from the global COVID-19 Trends and Impact Survey (CTIS). Science Advances 2024, 10: eadj0266. PMID: 38820165, PMCID: PMC11314312, DOI: 10.1126/sciadv.adj0266.
Peer-Reviewed Original Research
Statistical Inference for Association Studies Using Electronic Health Records: Handling Both Selection Bias and Outcome Misclassification
Beesley L, Mukherjee B. Statistical Inference for Association Studies Using Electronic Health Records: Handling Both Selection Bias and Outcome Misclassification. Biometrics 2020, 78: 214-226. PMID: 33179768, DOI: 10.1111/biom.13400.
Peer-Reviewed Original Research
Concepts: Electronic health records; Health records; Electronic health record data analysis; Electronic health record settings; Selection bias; Michigan Genomics Initiative; Association studies; EHR-linked; Health research; Inverse probability weighting method; Study sample; Effect estimates; Probability weighting method; Lack of representativeness; Type I error; Survey sampling literature; Standard error estimates; Gold standard labels; Disease status; Error estimates; Statistical inference; Misclassification; Inference strategy; Sampling literature; Standard labels
Association of Polygenic Risk Scores for Multiple Cancers in a Phenome-wide Study: Results from The Michigan Genomics Initiative
Fritsche L, Gruber S, Wu Z, Schmidt E, Zawistowski M, Moser S, Blanc V, Brummett C, Kheterpal S, Abecasis G, Mukherjee B. Association of Polygenic Risk Scores for Multiple Cancers in a Phenome-wide Study: Results from The Michigan Genomics Initiative. American Journal Of Human Genetics 2018, 102: 1048-1061. PMID: 29779563, PMCID: PMC5992124, DOI: 10.1016/j.ajhg.2018.04.001.
Peer-Reviewed Original Research
Concepts: Polygenic risk scores; Electronic health records; Associations of polygenic risk scores; Phenome-wide significant associations; Polygenic risk score associations; Longitudinal biorepository effort; Non-cancer diagnoses; Patients' electronic health records; Phenome-wide association study; Analysis of temporal order; Michigan Genomics Initiative; Risk score; Associated with multiple phenotypes; Female breast cancer; NHGRI-EBI Catalog; Risk profile; Genetic risk profiles; Measures of genomic variation; Cancer traits; Case-control study; PheWAS analysis; Health records; Health system; Michigan Medicine; Cancer diagnosis
Characteristics Associated With Racial/Ethnic Disparities in COVID-19 Outcomes in an Academic Health Care System
Gu T, Mack J, Salvatore M, Sankar S, Valley T, Singh K, Nallamothu B, Kheterpal S, Lisabeth L, Fritsche L, Mukherjee B. Characteristics Associated With Racial/Ethnic Disparities in COVID-19 Outcomes in an Academic Health Care System. JAMA Network Open 2020, 3: e2025197. PMID: 33084902, PMCID: PMC7578774, DOI: 10.1001/jamanetworkopen.2020.25197.
Peer-Reviewed Original Research
MeSH Keywords: Adult; Aged; Betacoronavirus; Black or African American; Comorbidity; Coronavirus Infections; COVID-19; Diabetes Mellitus, Type 2; Female; Health Status Disparities; Hospitalization; Humans; Intensive Care Units; Kidney Diseases; Male; Michigan; Middle Aged; Neoplasms; Obesity; Odds Ratio; Pandemics; Pneumonia, Viral; Population Density; Retrospective Studies; Risk Factors; SARS-CoV-2; White People
Concepts: Associated with higher risk; International Classification of Diseases; Risk of hospitalization; Preexisting type 2 diabetes; Higher risk of hospitalization; Classification of Diseases; Type 2 diabetes; COVID-19 outcomes; Racial/ethnic disparities; White patients; Black patients; Intensive care unit; International Classification; Residential-level socioeconomic characteristics; Odds ratio; Statistically significant racial differences; High risk; Associated with higher risk of hospitalization; Non-Hispanic blacks; Association of risk factors; Non-Hispanic whites; Michigan Department of Health; Associated with increased risk of hospitalization; Comorbidity score; Department of Health
A meta-inference framework to integrate multiple external models into a current study.
Gu T, Taylor J, Mukherjee B. A meta-inference framework to integrate multiple external models into a current study. Biostatistics 2021, 24: 406-424. PMID: 34269371, PMCID: PMC10102901, DOI: 10.1093/biostatistics/kxab017.
Peer-Reviewed Original Research
Concepts: Accuracy of statistical inference; Empirical Bayes estimators; Summary-level information; Bias-variance trade-off; Relevant external information; Bayes estimators; Statistical inference; External information; External estimates; Naive analysis; Naive combination; International data; Weight estimation; External model; Meta-analysis framework; Individual-level data; Efficiency gains; Estimation; Influence of information; Trade-offs; Information; Framework
Incorporating functional annotation with bilevel continuous shrinkage for polygenic risk prediction
Zhuang Y, Kim N, Fritsche L, Mukherjee B, Lee S. Incorporating functional annotation with bilevel continuous shrinkage for polygenic risk prediction. BMC Bioinformatics 2024, 25: 65. PMID: 38336614, PMCID: PMC11323637, DOI: 10.1186/s12859-024-05664-2.
Peer-Reviewed Original Research
Concepts: Predictive performance of polygenic risk scores; Functional annotation; Genetic architecture; Performance of polygenic risk scores; PRS-CS; Annotation information; Polygenic risk prediction; Genetic risk prediction; Polygenic risk scores; Functional annotation information; Kyoto Encyclopedia of Genes; Risk prediction; Proportion of variants; Encyclopedia of Genes; Genomes (KEGG; Source of annotation; Trait heritability; Annotation groups; Pathway information; Quantitative traits; Kyoto Encyclopedia; Functional categories; BackgroundGenetic variants; Heritable contribution; Real world data sources
Methods for mediation analysis with high-dimensional DNA methylation data: Possible choices and comparisons
Clark-Boucher D, Zhou X, Du J, Liu Y, Needham B, Smith J, Mukherjee B. Methods for mediation analysis with high-dimensional DNA methylation data: Possible choices and comparisons. PLOS Genetics 2023, 19: e1011022. PMID: 37934796, PMCID: PMC10655967, DOI: 10.1371/journal.pgen.1011022.
Peer-Reviewed Original Research
Concepts: Bayesian Sparse Linear Mixed Model; Mediation analysis; High-dimensional mediation analysis; Multi-ethnic cohort; Epigenetic research; Health outcomes; High-dimensional DNA methylation data; Linear mixed models; DNA methylation data; Continuous outcomes; Evaluate DNA methylation; DNA methylation; Methylation data; DNAm data; Mixed models; Diverse simulations; Seamless implementation; Modern statistical methods; Mediation effect; R package; United States; Outcomes
Exploiting Gene-Environment Independence for Analysis of Case–Control Studies: An Empirical Bayes-Type Shrinkage Estimator to Trade-Off Between Bias and Efficiency
Mukherjee B, Chatterjee N. Exploiting Gene-Environment Independence for Analysis of Case–Control Studies: An Empirical Bayes-Type Shrinkage Estimator to Trade-Off Between Bias and Efficiency. Biometrics 2007, 64: 685-694. PMID: 18162111, DOI: 10.1111/j.1541-0420.2007.00953.x.
Peer-Reviewed Original Research
Concepts: Gene-environment independence; Shrinkage estimators; Log odds ratio parameters; Case-control data; Gene-environment independence assumption; Odds ratio parameters; Case-control estimators; Data-adaptive fashion; Data example; Prospective logistic regression analysis; Binary exposure; Gene-environment associations; Independence assumption; Logistic regression analysis; Case-only; Maximum likelihood framework; Estimation; Sample size; Binary genes; Regression analysis; Chatterjee; Examples; Weighted average; Assumptions
Risk of Non-Melanoma Cancers in First-Degree Relatives of CDKN2A Mutation Carriers
Mukherjee B, DeLancey J, Raskin L, Everett J, Jeter J, Begg C, Orlow I, Berwick M, Armstrong B, Kricker A, Marrett L, Millikan R, Culver H, Rosso S, Zanetti R, Kanetsky P, From L, Gruber S, Investigators F. Risk of Non-Melanoma Cancers in First-Degree Relatives of CDKN2A Mutation Carriers. Journal Of The National Cancer Institute 2012, 104: 953-956. PMID: 22534780, PMCID: PMC3379723, DOI: 10.1093/jnci/djs221.
Peer-Reviewed Original Research
Concepts: First-degree relatives of carriers; CDKN2A mutation carriers; First-degree relatives; Mutation carriers; Non-melanoma cancers; First-degree relatives of melanoma patients; First-degree relatives of mutation carriers; Kin-cohort method; Confidence intervals; Risk of cancer; Melanoma patients; Lifetime risk; Proband's genotype; Non-melanoma; Family members; Increased risk; Gastrointestinal cancer; CDKN2A mutations; Wilms tumor; Risk; Melanoma Study; Pancreatic cancer; Noncarriers; Genotype distribution; Melanoma
Set‐based tests for genetic association in longitudinal studies
He Z, Zhang M, Lee S, Smith J, Guo X, Palmas W, Kardia S, Diez Roux A, Mukherjee B. Set‐based tests for genetic association in longitudinal studies. Biometrics 2015, 71: 606-615. PMID: 25854837, PMCID: PMC4601568, DOI: 10.1111/biom.12310.
Peer-Reviewed Original Research
Concepts: Multi-Ethnic Study of Atherosclerosis; Genome-wide association studies; Joint effect of multiple variants; Linkage disequilibrium; Association studies; Effects of multiple variants; Markers of chronic disease; Genetic variants; Set-based test; Gene-based tests; Longitudinal outcomes; Multi-Ethnic Study; Genetic association studies; Study of Atherosclerosis; Chronic diseases; Phenotypic variation; Genetic association; Observational study; Longitudinal analysis; Within-subject correlation; Multiple variants; Score type tests; Joint test; Joint effects; Marker tests
2024
Improving prediction of linear regression models by integrating external information from heterogeneous populations: James–Stein estimators
Han P, Li H, Park S, Mukherjee B, Taylor J. Improving prediction of linear regression models by integrating external information from heterogeneous populations: James–Stein estimators. Biometrics 2024, 80: ujae072. PMID: 39101548, PMCID: PMC11299067, DOI: 10.1093/biomtc/ujae072.
Peer-Reviewed Original Research
Concepts: James-Stein estimator; Linear regression models; Individual-level data; Comprehensive simulation study; Regression models; Numerical performance; Simulation study; Shrinkage method; Coefficient estimates; Predictive mean; Reduced model; Study population heterogeneity; Internal model; Estimation; Study population; Blood lead levels; International studies; Covariates; Patella bone; Published literature; Lead levels; External studies; Summary information; Population; Subsets
Cross-shift changes in pulmonary function and occupational exposure to particulate matter among e-waste workers in Ghana
Laskaris Z, O'Neill M, Batterman S, Mukherjee B, Fobil J, Robins T. Cross-shift changes in pulmonary function and occupational exposure to particulate matter among e-waste workers in Ghana. Frontiers In Public Health 2024, 12: 1368112. PMID: 38784567, PMCID: PMC11111984, DOI: 10.3389/fpubh.2024.1368112.
Peer-Reviewed Original Research
Concepts: E-waste workers; Exposure to particulate matter; E-waste; Particulate matter; Agbogbloshie e-waste site; Inhalation exposure to particulate matter; E-waste sites; Burning e-waste; Concentrations of PM; Health-based guidelines; Exposure to airborne pollutants; Exposure to PM; Occupational exposure to particulate matter; Cross-shift changes; Electronic-waste; Forced vital capacity; Personal PM; PM exposure; Airborne pollutants; Linear mixed models; Breathing zone concentrations; Pulmonary function; Comparison population; Recovery workers; Mixed models
Cross-Sectional Associations between Prenatal Per- and Poly-Fluoroalkyl Substances and Bioactive Lipids in Three Environmental Influences on Child Health Outcomes (ECHO) Cohorts
Suthar H, Manea T, Pak D, Woodbury M, Eick S, Cathey A, Watkins D, Strakovsky R, Ryva B, Pennathur S, Zeng L, Weller D, Park J, Smith S, DeMicco E, Padula A, Fry R, Mukherjee B, Aguiar A, Geiger S, Ng S, Huerta-Montanez G, Vélez-Vega C, Rosario Z, Cordero J, Zimmerman E, Woodruff T, Morello-Frosch R, Schantz S, Meeker J, Alshawabkeh A, Aung M, Outcomes O. Cross-Sectional Associations between Prenatal Per- and Poly-Fluoroalkyl Substances and Bioactive Lipids in Three Environmental Influences on Child Health Outcomes (ECHO) Cohorts. Environmental Science And Technology 2024, 58: 8264-8277. PMID: 38691655, PMCID: PMC11097396, DOI: 10.1021/acs.est.4c00094.
Peer-Reviewed Original Research
Concepts: PFAS mixture; Linear mixed models; Bioactive lipids; Child health outcomes; Cross-sectional associations; Prenatal PFAS exposure; Bioactive lipid levels; Poly-fluoroalkyl substances; Quantile g-computation; Mixed models; Gestational outcomes; Health outcomes; Pregnancy outcomes; Pregnant women; Combined cohort; G-computation; Cohort analysis; Program cohort; Quartile increase; Lipid levels; Cohort; Positive association; Meta-analysis; Environmental influences; PFAS exposure
Associations of maternal blood metal concentrations with plasma eicosanoids among pregnant women in Puerto Rico
Kim C, Cathey A, Park S, Watkins D, Mukherjee B, Rosario-Pabón Z, Vélez-Vega C, Alshawabkeh A, Cordero J, Meeker J. Associations of maternal blood metal concentrations with plasma eicosanoids among pregnant women in Puerto Rico. The Science Of The Total Environment 2024, 928: 172295. PMID: 38588744, DOI: 10.1016/j.scitotenv.2024.172295.
Peer-Reviewed Original Research
Concepts: Adverse birth outcomes; Sex-specific associations; Birth outcomes; Blood metal concentrations; Metal concentrations; Pregnant women; Infant sex; Eicosanoid profile; Metal exposure; Plasma eicosanoids; Weeks of pregnancy; Decreased concentrations of Cd; Concentrations of Cd; Concentrations of Cu; Effect modification; Regulating inflammatory responses; Birth cohort; Assessed associations; Associated with increased concentrations; Pregnancy outcomes; Female fetuses; Effect size; Inflammatory activity; Significant association; Inflammatory response
Avocational exposure associations with ALS risk, survival, and phenotype: A Michigan-based case-control study
Goutman S, Boss J, Jang D, Piecuch C, Farid H, Batra M, Mukherjee B, Feldman E, Batterman S. Avocational exposure associations with ALS risk, survival, and phenotype: A Michigan-based case-control study. Journal Of The Neurological Sciences 2024, 457: 122899. PMID: 38278093, PMCID: PMC11060628, DOI: 10.1016/j.jns.2024.122899.
Peer-Reviewed Original Research
Concepts: ALS risk; Lower educational attainment; Associated with ALS risk; Case-control study; Exercise 5; Onset age; Self-completion; Exposure variables; Yard work; Exposure associations; Recreational dance; Identified exposure; Exercise; Educational attainment; AL burden; Environmental exposures; Participants; AL factor; Personal participation; Avocational exposure; Risk; Exposome; Hobbies; ALS onset; Comparison correction
2023
Design and analysis heterogeneity in observational studies of COVID-19 booster effectiveness: A review and case study
Meah S, Shi X, Fritsche L, Salvatore M, Wagner A, Martin E, Mukherjee B. Design and analysis heterogeneity in observational studies of COVID-19 booster effectiveness: A review and case study. Science Advances 2023, 9: eadj3747. PMID: 38117882, PMCID: PMC10732535, DOI: 10.1126/sciadv.adj3747.
Peer-Reviewed Original Research
Uncovering associations between pre-existing conditions and COVID-19 Severity: A polygenic risk score approach across three large biobanks
Fritsche L, Nam K, Du J, Kundu R, Salvatore M, Shi X, Lee S, Burgess S, Mukherjee B. Uncovering associations between pre-existing conditions and COVID-19 Severity: A polygenic risk score approach across three large biobanks. PLOS Genetics 2023, 19: e1010907. PMID: 38113267, PMCID: PMC10763941, DOI: 10.1371/journal.pgen.1010907.
Peer-Reviewed Original Research
Concepts: Polygenic risk scores; Michigan Genomics Initiative; UK Biobank; Pre-existing conditions; Phenome-wide association study; Association studies; Cohort-specific analyses; Polygenic risk score approach; UK Biobank cohort; Meta-analysis; Increased risk of hospitalization; Genome-wide association studies; Body mass index; Risk of hospitalization; Identified novel associations; Risk score approach; COVID-19 outcome data; COVID-19 hospitalization; COVID-19; Mass index; Risk score; Biobank; Cardiovascular conditions; COVID-19 severity; Increased risk
Mortality and Severe Complications Among Newly Graduated Surgeons in the United States
Howard R, Thelen A, Chen X, Gates R, Krumm A, Millis M, Gupta T, Brown C, Bandeh-Ahmadi H, Wnuk G, Yee C, Ryan A, Mukherjee B, Dimick J, George B. Mortality and Severe Complications Among Newly Graduated Surgeons in the United States. Annals Of Surgery 2023, 279: 555-560. PMID: 37830271, PMCID: PMC10939969, DOI: 10.1097/sla.0000000000006128.
Peer-Reviewed Original Research
Concepts: Years of practice; Severe complications; Relative risk; General surgeons; Independent practice; Years of independent practice; Career surgeons; Relative risk of mortality; American Board of Surgery; Surgeon years; Mixed models; Surgeon characteristics; Rate of mortality; Medicare claims; Complications; Surgeons; Patient outcomes; Patients; Mortality; American Board; Outcomes
Environmental risk scores of persistent organic pollutants associate with higher ALS risk and shorter survival in a new Michigan case/control cohort
Goutman S, Boss J, Jang D, Mukherjee B, Richardson R, Batterman S, Feldman E. Environmental risk scores of persistent organic pollutants associate with higher ALS risk and shorter survival in a new Michigan case/control cohort. Journal Of Neurology Neurosurgery & Psychiatry 2023, 95: 241-248. PMID: 37758454, PMCID: PMC11060633, DOI: 10.1136/jnnp-2023-332121.
Peer-Reviewed Original Research
Concepts: Environmental risk score; Amyotrophic lateral sclerosis risk; Persistent organic pollutants; ALS risk; Interquartile increase; Higher ALS risk; Associated with ALS risk; Modify disease risk; Organochlorine pesticides; Associated with risk; Risk reduction strategies; Persistent organic pollutant analysis; Individual persistent organic pollutants; Persistent organic pollutant mixtures; Hazard ratio; Disease risk; Risk score; Case/control cohort; Environmental exposures; Control participants; Genetic susceptibility; Polychlorinated biphenyls; Alpha-hexachlorocyclohexane; Organic pollutants; Mortality rate
Prenatal per- and polyfluoroalkyl substances (PFAS) exposure in relation to preterm birth subtypes and size-for-gestational age in the LIFECODES cohort 2006–2008
Siwakoti R, Cathey A, Ferguson K, Hao W, Cantonwine D, Mukherjee B, McElrath T, Meeker J. Prenatal per- and polyfluoroalkyl substances (PFAS) exposure in relation to preterm birth subtypes and size-for-gestational age in the LIFECODES cohort 2006–2008. Environmental Research 2023, 237: 116967. PMID: 37634691, PMCID: PMC10913455, DOI: 10.1016/j.envres.2023.116967.
Peer-Reviewed Original Research
Concepts: Large-for-gestational age; Preterm birth subtypes; Bayesian kernel machine regression; Size-for-gestational age; Small-for-gestational age; Preterm birth; Fetal sex; Pregnancy outcomes; Sex-specific estimates; Increased risk of adverse pregnancy outcomes; Interquartile range increase; Risk of adverse pregnancy outcomes; Bayesian kernel machine regression analysis; Early pregnancy samples; Adverse pregnancy outcomes; Case-control study; Prenatal PFAS exposure; Associations of polyfluoroalkyl substances; BW z-score; Effects of polyfluoroalkyl substances; Kernel machine regression; Effect modification; Effects of prenatal exposure; Range increase; Stratified analysis