Header

UZH-Logo

Maintenance Infos

A fast SEQUEST cross correlation algorithm


Eng, J K; Fischer, B; Grossmann, J; Maccoss, M J (2008). A fast SEQUEST cross correlation algorithm. Journal of Proteome Research, 7(10):4598-4602.

Abstract

The SEQUEST program was the first and remains one of the most widely used tools for assigning a peptide sequence within a database to a tandem mass spectrum. The cross correlation score is the primary score function implemented within SEQUEST and it is this score that makes the tool particularly sensitive. Unfortunately, this score is computationally expensive to calculate, and thus, to make the score manageable, SEQUEST uses a less sensitive but fast preliminary score and restricts the cross correlation to just the top 500 peptides returned by the preliminary score. Classically, the cross correlation score has been calculated using Fast Fourier Transforms (FFT) to generate the full correlation function. We describe an alternate method of calculating the cross correlation score that does not require FFTs and can be computed efficiently in a fraction of the time. The fast calculation allows all candidate peptides to be scored by the cross correlation function, potentially mitigating the need for the preliminary score, and enables an E-value significance calculation based on the cross correlation score distribution calculated on all candidate peptide sequences obtained from a sequence database.

Abstract

The SEQUEST program was the first and remains one of the most widely used tools for assigning a peptide sequence within a database to a tandem mass spectrum. The cross correlation score is the primary score function implemented within SEQUEST and it is this score that makes the tool particularly sensitive. Unfortunately, this score is computationally expensive to calculate, and thus, to make the score manageable, SEQUEST uses a less sensitive but fast preliminary score and restricts the cross correlation to just the top 500 peptides returned by the preliminary score. Classically, the cross correlation score has been calculated using Fast Fourier Transforms (FFT) to generate the full correlation function. We describe an alternate method of calculating the cross correlation score that does not require FFTs and can be computed efficiently in a fraction of the time. The fast calculation allows all candidate peptides to be scored by the cross correlation function, potentially mitigating the need for the preliminary score, and enables an E-value significance calculation based on the cross correlation score distribution calculated on all candidate peptide sequences obtained from a sequence database.

Statistics

Citations

Dimensions.ai Metrics
163 citations in Web of Science®
176 citations in Scopus®
Google Scholar™

Altmetrics

Downloads

0 downloads since deposited on 06 Jan 2010
0 downloads since 12 months

Additional indexing

Item Type:Journal Article, refereed, original work
Communities & Collections:04 Faculty of Medicine > Functional Genomics Center Zurich
08 Research Priority Programs > Systems Biology / Functional Genomics
Dewey Decimal Classification:570 Life sciences; biology
610 Medicine & health
Scopus Subject Areas:Life Sciences > Biochemistry
Physical Sciences > General Chemistry
Language:English
Date:2008
Deposited On:06 Jan 2010 07:19
Last Modified:03 Dec 2023 02:44
Publisher:American Chemical Society
ISSN:1535-3893
OA Status:Closed
Publisher DOI:https://doi.org/10.1021/pr800420s
PubMed ID:18774840