Statements in which the resource exists as a subject.
Predicate Object
rdf:type
lifeskim:mentions
pubmed:issue
3
pubmed:dateCreated
2007-4-12
pubmed:abstractText
Statistical distance-dependent pair potentials are frequently used in a variety of folding, threading, and modeling studies of proteins. The applicability of these types of potentials is tightly connected to the reliability of statistical observations. We explored the possible origin and extent of false positive signals in statistical potentials by analyzing their distance dependence in a variety of randomized protein-like models. While on average potentials derived from such models are expected to equal zero at any distance, we demonstrate that systematic and significant distortions exist. These distortions originate from the limited statistical counts in local environments of proteins and from the limited size of protein structures at large distances. We suggest that these systematic errors in statistical potentials are connected to the dependence of amino acid composition on protein size and to variation in protein sizes. Additionally, atom-based potentials are dominated by a false positive signal that is due to correlation among distances measured from atoms of one residue to atoms of another residue. The significance of residue-based pairwise potentials at various spatial pair separations was assessed in this study and it was found that as few as approximately 50% of potential values were statistically significant at distances below 4 Å, and only at most approximately 80% of them were significant at larger pair separations. A new definition for reference state, free of the observed systematic errors, is suggested. It has been demonstrated to generate statistical potentials that compare favorably to other publicly available ones.
pubmed:grant
pubmed:language
eng
pubmed:journal
pubmed:citationSubset
IM
pubmed:chemical
pubmed:status
MEDLINE
pubmed:month
May
pubmed:issn
1097-0134
pubmed:author
pubmed:copyrightInfo
2007 Wiley-Liss, Inc.
pubmed:issnType
Electronic
pubmed:day
15
pubmed:volume
67
pubmed:owner
NLM
pubmed:authorsComplete
Y
pubmed:pagination
559-68
pubmed:dateRevised
2007-12-3
pubmed:meshHeading
pubmed:year
2007
pubmed:articleTitle
Effects of amino acid composition, finite size of proteins, and sparse statistics on distance-dependent statistical pair potentials.
pubmed:affiliation
Department of Biochemistry, Seaver Center for Bioinformatics, Albert Einstein College of Medicine, Bronx, New York 10461, USA.
pubmed:publicationType
Journal Article, Research Support, Non-U.S. Gov't, Research Support, N.I.H., Extramural