Faculty Publications
Optimizing The Size Of The Sequence Profiles To Increase The Accuracy Of Protein Sequence Alignments Generated By Profile-Profile Algorithms
Document Type
Article
Journal/Book/Conference Title
Bioinformatics
Volume
24
Issue
9
First Page
1145
Last Page
1153
Abstract
Motivation: Profile-based protein homology detection algorithms are valuable tools in genome annotation and protein classification. By utilizing information present in the sequences of homologous proteins, profile-based methods are often able to detect extremely weak relationships between protein sequences, as evidenced by the large-scale benchmarking experiments such as CASP and LiveBench. Results: We study the relationship between the sensitivity of a profile-profile method and the size of the sequence profile, which is defined as the average number of different residue types observed at the profile's positions. We also demonstrate that improvements in the sensitivity of a profile-profile method can be made by incorporating a profile-dependent scoring scheme, such as position-specific background frequencies. The techniques presented in this article are implemented in an alignment algorithm UNI-FOLD. When tested against other well-established methods for fold recognition, UNI-FOLD shows increased sensitivity and specificity in detecting remote relationships between protein sequences. © The Author 2008. Published by Oxford University Press. All rights reserved.
Department
Department of Computer Science
Original Publication Date
5-1-2008
DOI of published version
10.1093/bioinformatics/btn097
Recommended Citation
Poleksic, Aleksandar and Fienup, Mark, "Optimizing The Size Of The Sequence Profiles To Increase The Accuracy Of Protein Sequence Alignments Generated By Profile-Profile Algorithms" (2008). Faculty Publications. 2439.
https://scholarworks.uni.edu/facpub/2439