Generalized Correlation Functions and Their Applications in Selection of Optimal Multiple Spaced Seeds for Homology Search
Kong Y. Generalized Correlation Functions and Their Applications in Selection of Optimal Multiple Spaced Seeds for Homology Search. Journal of Computational Biology 2007, 14: 238-254. PMID: 17456017, DOI: 10.1089/cmb.2006.0008.
Peer-Reviewed Original Research
MeSH Keywords: Animals; Computational Biology; Escherichia coli; Genome; Haemophilus influenzae; Humans; Mice; Models, Genetic; Sequence Analysis, DNA; Sequence Homology, Nucleic Acid
Concepts: Generalized correlation function; Correlation functions; Higher order approximations; Goulden–Jackson cluster method; Heuristic search methods; Order approximation; Probability q; Average properties; Search method; Cluster method; Large genomic data; Probability of occurrence; Theoretical background; Multiple seeds; Spaced seeds; Powerful method; Optimal seed; Approximation; Empirical observations; Number of wildcards; Set of patterns; Probability; Problem; Function; Matrix