Statements in which the resource exists as a subject.
PredicateObject
rdf:type
lifeskim:mentions
pubmed:issue
22
pubmed:dateCreated
2005-10-27
pubmed:abstractText
In this work we introduce a postprocessing filter (PostDOCK) that distinguishes true binding ligand-protein complexes from docking artifacts (that are created by DOCK 4.0.1). PostDOCK is a pattern recognition system that relies on (1) a database of complexes, (2) biochemical descriptors of those complexes, and (3) machine learning tools. We use the protein databank (PDB) as the structural database of complexes and create diverse training and validation sets from it based on the "families of structurally similar proteins" (FSSP) hierarchy. For the biochemical descriptors, we consider terms from the DOCK score, empirical scoring, and buried solvent accessible surface area. For the machine-learners, we use a random forest classifier and logistic regression. Our results were obtained on a test set of 44 structurally diverse protein targets. Our highest performing descriptor combinations obtained approximately 19-fold enrichment (39 of 44 binding complexes were correctly identified, while only allowing 2 of 44 decoy complexes), and our best overall accuracy was 92%.
pubmed:language
eng
pubmed:journal
pubmed:citationSubset
IM
pubmed:chemical
pubmed:status
MEDLINE
pubmed:month
Nov
pubmed:issn
0022-2623
pubmed:author
pubmed:issnType
Print
pubmed:day
3
pubmed:volume
48
pubmed:owner
NLM
pubmed:authorsComplete
Y
pubmed:pagination
6821-31
pubmed:dateRevised
2006-11-15
pubmed:meshHeading
pubmed:year
2005
pubmed:articleTitle
PostDOCK: a structural, empirical approach to scoring protein ligand complexes.
pubmed:affiliation
Sandia National Labs, P.O. Box 969, MS 9951, Livermore, CA 94551, USA. clayton.springer@novartis.com
pubmed:publicationType
Journal Article, Research Support, U.S. Gov't, Non-P.H.S.