Source:http://linkedlifedata.com/resource/pubmed/id/12784210
Switch to
Predicate | Object |
---|---|
rdf:type | |
lifeskim:mentions | |
pubmed:issue |
4
|
pubmed:dateCreated |
2003-6-4
|
pubmed:abstractText |
An important problem in computational biology is predicting the structure of the large number of putative proteins discovered by genome sequencing projects. Fold-recognition methods attempt to solve the problem by relating the target proteins to known structures, searching for template proteins homologous to the target. Remote homologs that may have significant structural similarity are often not detectable by sequence similarities alone. To address this, we incorporated predicted local structure, a generalization of secondary structure, into two-track profile hidden Markov models (HMMs). We did not rely on a simple helix-strand-coil definition of secondary structure, but experimented with a variety of local structure descriptions, following a principled protocol to establish which descriptions are most useful for improving fold recognition and alignment quality. On a test set of 1298 nonhomologous proteins, HMMs incorporating a 3-letter STRIDE alphabet improved fold recognition accuracy by 15% over amino-acid-only HMMs and 23% over PSI-BLAST, measured by ROC-65 numbers. We compared two-track HMMs to amino-acid-only HMMs on a difficult alignment test set of 200 protein pairs (structurally similar with 3-24% sequence identity). HMMs with a 6-letter STRIDE secondary track improved alignment quality by 62%, relative to DALI structural alignments, while HMMs with an STR track (an expanded DSSP alphabet that subdivides strands into six states) improved by 40% relative to CE.
|
pubmed:language |
eng
|
pubmed:journal | |
pubmed:citationSubset |
IM
|
pubmed:chemical | |
pubmed:status |
MEDLINE
|
pubmed:month |
Jun
|
pubmed:issn |
1097-0134
|
pubmed:author | |
pubmed:copyrightInfo |
Copyright 2003 Wiley-Liss, Inc.
|
pubmed:issnType |
Electronic
|
pubmed:day |
1
|
pubmed:volume |
51
|
pubmed:owner |
NLM
|
pubmed:authorsComplete |
Y
|
pubmed:pagination |
504-14
|
pubmed:dateRevised |
2006-11-15
|
pubmed:meshHeading |
pubmed-meshheading:12784210-Algorithms,
pubmed-meshheading:12784210-Computational Biology,
pubmed-meshheading:12784210-Markov Chains,
pubmed-meshheading:12784210-Protein Folding,
pubmed-meshheading:12784210-Protein Structure, Secondary,
pubmed-meshheading:12784210-Proteins,
pubmed-meshheading:12784210-Reproducibility of Results,
pubmed-meshheading:12784210-Sequence Alignment
|
pubmed:year |
2003
|
pubmed:articleTitle |
Hidden Markov models that use predicted local structure for fold recognition: alphabets of backbone geometry.
|
pubmed:affiliation |
Center for Biomolecular Science and Engineering, Baskin School of Engineering, University of California, Santa Cruz 95064, USA. rachelk@soe.ucsc.edu
|
pubmed:publicationType |
Journal Article,
Research Support, U.S. Gov't, Non-P.H.S.,
Research Support, Non-U.S. Gov't
|