Statements in which the resource exists as a subject.
PredicateObject
rdf:type
lifeskim:mentions
pubmed:issue
2
pubmed:dateCreated
2000-8-11
pubmed:abstractText
The CATH database of protein structures contains approximately 18000 domains organized according to their (C)lass, (A)rchitecture, (T)opology and (H)omologous superfamily. Relationships between evolutionary related structures (homologues) within the database have been used to test the sensitivity of various sequence search methods in order to identify relatives in Genbank and other sequence databases. Subsequent application of the most sensitive and efficient algorithms, gapped blast and the profile based method, Position Specific Iterated Basic Local Alignment Tool (PSI-BLAST), could be used to assign structural data to between 22 and 36 % of microbial genomes in order to improve functional annotation and enhance understanding of biological mechanism. However, on a cautionary note, an analysis of functional conservation within fold groups and homologous superfamilies in the CATH database, revealed that whilst function was conserved in nearly 55% of enzyme families, function had diverged considerably, in some highly populated families. In these families, functional properties should be inherited far more cautiously and the probable effects of substitutions in key functional residues carefully assessed.
pubmed:language
eng
pubmed:journal
pubmed:citationSubset
IM
pubmed:status
MEDLINE
pubmed:month
Feb
pubmed:issn
0300-5127
pubmed:author
pubmed:issnType
Print
pubmed:volume
28
pubmed:owner
NLM
pubmed:authorsComplete
Y
pubmed:pagination
269-75
pubmed:dateRevised
2006-11-15
pubmed:meshHeading
pubmed:year
2000
pubmed:articleTitle
Using the CATH domain database to assign structures and functions to the genome sequences.
pubmed:affiliation
Department of Biochemistry and Molecular Biology, University College, London, UK.
pubmed:publicationType
Journal Article, Research Support, Non-U.S. Gov't