Skip to Main Content

FAQ About YPED

Description of YPED Parameters

  • The top box contains information on the sample: the search engine, version of the search engine, search title, database searched and MS file name.
  • The protein threshold score: Protein scores are derived from ions scores as a non-probabilistic basis for ranking protein hits. (see http://www.matrixscience.com/help/scoring_help.html for details on the scoring)
  • Score: The protein score in a Peptide Summary is derived from the ions scores. For a search that contains a small number of queries, the protein score is the sum of the unique ions scores. That is, excluding the scores for duplicate matches. A small correction is applied to reduce the contribution of low-scoring random matches. This correction is a function of the total number of molecular mass matches for each query and the width of the peptide tolerance window. This correction is usually very small, except in no enzyme searches
  • Decoy database search: During the normal search, every time a protein sequence from the target database is tested, a random sequence of the same length is automatically generated and tested. The average amino acid composition of the random sequences is the same as the average composition of the target database. The matches and scores for the random sequences are recorded separately in the result file. When the search is complete, the statistics for matches to the random sequences, which are effectively sequences from a decoy database, are reported in the result header.
  • Expectation: Expectation value for the peptide match. (The number of times we would expect to obtain an equal or higher score, purely by chance. The lower this value, the more significant the result).
  • % coverage: is the coverage of the known protein that was identified by peptide matches

PEPTIDES

  • m/z: is the observed mass in the mass spectra. This might be singly, doubly, triply etc. charged. The charge is listed in the last column
  • Score: is the peptide score
  • Ion mass: is the mass determined from the m/z and the charge state
  • Ion mass calculated is the mass of the peptide form the theoretical sequence
  • Delta is the mass difference between the Ion mass in the spectra and the calculated ion mass
  • Ppm: is the parts per million determined for the peptide match (base don’t he Delta value). In the LTQ Orbitrap, this should be better than ~ 5 ppm
  • Peptides > or less than the identity threshold:
    • The identity threshold is calculated from the number of trials If there are 5000 precursor matches, a 1 in a 20 chance of getting a false positive match is a probability of P = 1 / (20 x 5000) which is a score of S = -10LogP = 50

In Mascot, the score for an MS/MS match is based on the absolute probability (P) that the observed match between the experimental data and the database sequence is a random event. The reported score is -10Log(P). So, during a search, if 1.5 x 10^5 peptides fell within the mass tolerance window about the precursor mass, and the significance threshold was chosen to be 0.05, (a 1 in 20 chance of a false positive), this would translate into a score threshold of 65.