netMUG: a novel network-guided multi-view clustering workflow for dissecting genetic and facial heterogeneity
Li Z, Melograna F, Hoskens H, Duroux D, Marazita M, Walsh S, Weinberg S, Shriver M, Müller-Myhsok B, Claes P, Van Steen K. netMUG: a novel network-guided multi-view clustering workflow for dissecting genetic and facial heterogeneity. Frontiers In Genetics 2023, 14: 1286800. PMID: 38125750, PMCID: PMC10731261, DOI: 10.3389/fgene.2023.1286800.Peer-Reviewed Original ResearchMulti-view clustering frameworkHeterogeneous data sourcesSingle-view dataMulti-view dataMultiple Canonical Correlation AnalysisMulti-view clusteringMulti-view featuresClustering frameworkTrue labelsData structureFacial imagesExtraneous dataRand indexBenchmark methodsNetwork representationSynthetic dataSparse multiple canonical correlation analysisData sourcesHierarchical clusteringClusteringCanonical correlation analysisSuperior performanceNetworkGenomic dataReal data analysisILIAD: a suite of automated Snakemake workflows for processing genomic data for downstream applications
Herrick N, Walsh S. ILIAD: a suite of automated Snakemake workflows for processing genomic data for downstream applications. BMC Bioinformatics 2023, 24: 424. PMID: 37940870, PMCID: PMC10633908, DOI: 10.1186/s12859-023-05548-x.Peer-Reviewed Original ResearchConceptsRaw genomic dataSoftware toolsHigh-performance computing clusterOwn big dataRaw data typesVariant call format filesOpen-source suiteBioinformatics software toolsDownstream applicationsDocker containersGenomic dataLocal machineComputing clusterBig dataConfiguration filesJob executionWindows platformData workflowIntermediate filesSingle commandFormat fileReproducible workflowsVCF filesData typesStorage limitations