2009
Statistical Distributions of Sequencing by Synthesis with Probabilistic Nucleotide Incorporation
Kong Y. Statistical Distributions of Sequencing by Synthesis with Probabilistic Nucleotide Incorporation. Journal Of Computational Biology 2009, 16: 817-827. PMID: 19522665, DOI: 10.1089/cmb.2008.0215.
Peer-Reviewed Original Research
Statistical Distributions of Pyrosequencing
Kong Y. Statistical Distributions of Pyrosequencing. Journal Of Computational Biology 2009, 16: 31-42. PMID: 19072582, DOI: 10.1089/cmb.2008.0106.
Peer-Reviewed Original Research
2007
Generalized Correlation Functions and Their Applications in Selection of Optimal Multiple Spaced Seeds for Homology Search
Kong Y. Generalized Correlation Functions and Their Applications in Selection of Optimal Multiple Spaced Seeds for Homology Search. Journal Of Computational Biology 2007, 14: 238-254. PMID: 17456017, DOI: 10.1089/cmb.2006.0008.
Peer-Reviewed Original Research
Concepts: Generalized correlation function, Correlation functions, Higher order approximations, Goulden–Jackson cluster method, Heuristic search methods, Order approximation, Probability q, Average properties, Search method, Cluster method, Large genomic data, Probability of occurrence, Theoretical background, Multiple seeds, Spaced seeds, Powerful method, Optimal seed, Approximation, Empirical observations, Number of wildcards, Set of patterns, Probability, Problem, Function, Matrix