
Genome analysis: Assigning protein coding regions to three-dimensional structures

Published online by Cambridge University Press: 01 April 1999

ASAF A. SALAMOV
Affiliation:
Helix Research Institute, 1532-3 Yana, Kisarazu-shi, Chiba, 292, Japan
MAKIKO SUWA
Affiliation:
Helix Research Institute, 1532-3 Yana, Kisarazu-shi, Chiba, 292, Japan
CHRISTINE A. ORENGO
Affiliation:
Biomolecular Structure and Modelling Unit, Department of Biochemistry, University College London, Gower Street, London, United Kingdom
MARK B. SWINDELLS
Affiliation:
Helix Research Institute, 1532-3 Yana, Kisarazu-shi, Chiba, 292, Japan; Inpharmatica Ltd., 60 Charlotte Street, London W1P 2AX, United Kingdom

Abstract

We describe the results of a procedure for maximizing the number of sequences that can be reliably linked to a protein of known three-dimensional structure. Unlike other methods, which try to increase sensitivity through the use of fold recognition software, we use only conventional sequence alignment tools, but apply them in a manner that significantly increases the number of relationships detected. We analyzed 11 genomes and found that, depending on the genome, between 23% and 32% of the ORFs had significant matches to proteins of known structure. In all cases, the aligned region consisted of either >100 residues or >50% of the smaller sequence. Slightly higher percentages could be attained if smaller motifs were also included. This coverage is significantly higher than that reported for most previous methods, even those that have a fold-recognition component. We survey the biochemical and structural characteristics of the most frequently occurring proteins, and discuss the extent to which alignment methods can realistically assign function to gene products.
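
The length criterion stated in the abstract (an aligned region of >100 residues or >50% of the smaller sequence) can be illustrated with a minimal sketch. This is not the authors' code: the function name `passes_coverage` and its parameters are hypothetical, and it assumes the aligned region is measured as a count of aligned residues and that both sequence lengths are known.

```python
def passes_coverage(aligned_len: int, orf_len: int, structure_len: int) -> bool:
    """Check the length criterion quoted in the abstract.

    Assumption (not from the source): aligned_len is the number of
    residues in the aligned region; orf_len and structure_len are the
    full lengths of the two sequences being compared.
    """
    if aligned_len < 0 or orf_len <= 0 or structure_len <= 0:
        raise ValueError("lengths must be positive and aligned_len non-negative")

    smaller = min(orf_len, structure_len)
    # Either more than 100 aligned residues, or more than half of the
    # smaller sequence. Integer arithmetic avoids float rounding.
    return aligned_len > 100 or 2 * aligned_len > smaller


if __name__ == "__main__":
    # A 40-residue alignment covers more than half of a 70-residue sequence.
    print(passes_coverage(40, 70, 300))
    # A 100-residue alignment between two long sequences meets neither bound.
    print(passes_coverage(100, 400, 500))
```

Both comparisons are strict, following the ">" signs in the abstract. The sketch covers only the length filter; the statistical significance of a match would have to be assessed separately.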

Type
Research Article
Copyright
© 1999 The Protein Society
