The challenge of annotating protein sequences: The tale of eight domains of unknown function in Pfam
Domains of unknown function, Function prediction, Pfam, Sequence homology
Computational Biology and Chemistry
The Pfam database is an important tool in genome annotation, since it provides a collection of curated protein families. However, a subset of these families, known as domains of unknown function (DUFs), remains poorly characterized. We have related sequences from DUF404, DUF407, DUF482, DUF608, DUF810, DUF853, DUF976 and DUF1111 to homologs in PDB, within the midnight zone (9-20%) of sequence identity. These relationships were extended to provide functional annotation by sequence analysis and model building. Also described are examples of residue plasticity within enzyme active sites, and change of function within homologous sequences of a DUF. © 2010 Elsevier Ltd.
Original Publication Date
DOI of published version
Goonesekere, Nalin C.W.; Shipely, Krysten; and O'Connor, Kevin, "The challenge of annotating protein sequences: The tale of eight domains of unknown function in Pfam" (2010). Faculty Publications. 2087.