2011
Regression and data mining methods for analyses of multiple rare variants in the Genetic Analysis Workshop 17 mini‐exome data
Bailey‐Wilson J, Brennan JS, Bull SB, Culverhouse R, Kim Y, Jiang Y, Jung J, Li Q, Lamina C, Liu Y, Mägi R, Niu YS, Simpson CL, Wang L, Yilmaz YE, Zhang H, Zhang Z. Regression and data mining methods for analyses of multiple rare variants in the Genetic Analysis Workshop 17 mini‐exome data. Genetic Epidemiology 2011, 35: s92-s100. PMID: 22128066, PMCID: PMC3360949, DOI: 10.1002/gepi.20657.Peer-Reviewed Original ResearchConceptsData mining methodsUse of machineMachine learning methodsMining methodsLearning methodsNovel methodGenetic Analysis Workshop 17 mini-exome dataGenetic Analysis Workshop 17Extreme locus heterogeneityDNA sequence dataLocus-specific heritabilityMultiple rare variantsPopulation-specific analysesRare variantsIndividual rare variantsRare genetic variantsRare causal variantsSubset of predictorsLarge numberMultiple variantsComplex traitsMachineSequence dataCausal variantsCausal mutations
2004
A High Productivity/Low Maintenance Approach to High-performance Computation for Biomedicine: Four Case Studies
Carriero N, Osier MV, Cheung KH, Miller PL, Gerstein M, Zhao H, Wu B, Rifkin S, Chang J, Zhang H, White K, Williams K, Schultz M. A High Productivity/Low Maintenance Approach to High-performance Computation for Biomedicine: Four Case Studies. Journal Of The American Medical Informatics Association 2004, 12: 90-98. PMID: 15492032, PMCID: PMC543832, DOI: 10.1197/jamia.m1571.Peer-Reviewed Original ResearchMeSH KeywordsAmino Acid SequenceComputational BiologyComputing MethodologiesMass SpectrometryMicroarray AnalysisPhenotypeSequence AnalysisConceptsHigh performance computationLow-maintenance approachBioinformatics applicationsRepresentative bioinformatics applicationsIntensive bioinformatics applicationsBioinformatics case studyGenome-wide sequence comparisonHPC expertsHPC platformsComplex genetic analysisBioinformatics researchersSignificant speedupMass spectrometry data setsHigh-throughput biotechnologiesSequence comparisonVast amountProteomic dataOrdinal phenotypesSpectrum of techniquesDNA microarraysGenetic analysisGene expressionCase studyIterative refinementMaintenance approach