Faculty Publications

Title

The challenge of annotating protein sequences: The tale of eight domains of unknown function in Pfam

Document Type

Article

Keywords

Domains of unknown function, Function prediction, Pfam, Sequence homology

Journal/Book/Conference Title

Computational Biology and Chemistry

Volume

34

Issue

3

First Page

210

Last Page

214

Abstract

The Pfam database is an important tool in genome annotation, since it provides a collection of curated protein families. However, a subset of these families, known as domains of unknown function (DUFs), remains poorly characterized. We have related sequences from DUF404, DUF407, DUF482, DUF608, DUF810, DUF853, DUF976 and DUF1111 to homologs in PDB, within the midnight zone (9-20%) of sequence identity. These relationships were extended to provide functional annotation by sequence analysis and model building. Also described are examples of residue plasticity within enzyme active sites, and change of function within homologous sequences of a DUF. © 2010 Elsevier Ltd.

Original Publication Date

6-1-2010

DOI of published version

10.1016/j.compbiolchem.2010.04.001

Share

COinS