Source:http://linkedlifedata.com/resource/pubmed/id/10906342
Switch to
Predicate | Object |
---|---|
rdf:type | |
lifeskim:mentions | |
pubmed:issue |
7
|
pubmed:dateCreated |
2000-10-12
|
pubmed:abstractText |
We examine how effectively simple potential functions previously developed can identify compatibilities between sequences and structures of proteins for database searches. The potential function consists of pairwise contact energies, repulsive packing potentials of residues for overly dense arrangement and short-range potentials for secondary structures, all of which were estimated from statistical preferences observed in known protein structures. Each potential energy term was modified to represent compatibilities between sequences and structures for globular proteins. Pairwise contact interactions in a sequence-structure alignment are evaluated in a mean field approximation on the basis of probabilities of site pairs to be aligned. Gap penalties are assumed to be proportional to the number of contacts at each residue position, and as a result gaps will be more frequently placed on protein surfaces than in cores. In addition to minimum energy alignments, we use probability alignments made by successively aligning site pairs in order by pairwise alignment probabilities. The results show that the present energy function and alignment method can detect well both folds compatible with a given sequence and, inversely, sequences compatible with a given fold, and yield mostly similar alignments for these two types of sequence and structure pairs. Probability alignments consisting of most reliable site pairs only can yield extremely small root mean square deviations, and including less reliable pairs increases the deviations. Also, it is observed that secondary structure potentials are usefully complementary to yield improved alignments with this method. Remarkably, by this method some individual sequence-structure pairs are detected having only 5-20% sequence identity.
|
pubmed:language |
eng
|
pubmed:journal | |
pubmed:citationSubset |
IM
|
pubmed:status |
MEDLINE
|
pubmed:month |
Jul
|
pubmed:issn |
0269-2139
|
pubmed:author | |
pubmed:issnType |
Print
|
pubmed:volume |
13
|
pubmed:owner |
NLM
|
pubmed:authorsComplete |
Y
|
pubmed:pagination |
459-75
|
pubmed:dateRevised |
2006-11-15
|
pubmed:meshHeading |
pubmed-meshheading:10906342-Algorithms,
pubmed-meshheading:10906342-Amino Acid Sequence,
pubmed-meshheading:10906342-Databases, Factual,
pubmed-meshheading:10906342-Energy Metabolism,
pubmed-meshheading:10906342-Models, Molecular,
pubmed-meshheading:10906342-Molecular Sequence Data,
pubmed-meshheading:10906342-Probability,
pubmed-meshheading:10906342-Protein Conformation,
pubmed-meshheading:10906342-Protein Folding,
pubmed-meshheading:10906342-Protein Structure, Secondary,
pubmed-meshheading:10906342-Sequence Alignment,
pubmed-meshheading:10906342-Sequence Homology, Amino Acid
|
pubmed:year |
2000
|
pubmed:articleTitle |
Identifying sequence-structure pairs undetected by sequence alignments.
|
pubmed:affiliation |
Faculty of Technology, Gunma University, Kiryu, Gunma 376, Japan and Room B-116, Bldg 12B, MSC 5677, Laboratory of Experimental and Computational Biology, DBS, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892-5677,USA.
|
pubmed:publicationType |
Journal Article,
Comparative Study
|