17444516

Source:http://linkedlifedata.com/resource/pubmed/id/17444516

Download in:

Switch to

Custom View

Named Graph Language Inference

Statements in which the resource exists as a subject.
Predicate	Object
rdf:type	pubmed:Citation
lifeskim:mentions	umls-concept:C0033684, umls-concept:C0205148, umls-concept:C0376284, umls-concept:C1522290, umls-concept:C1708943
pubmed:issue	2
pubmed:dateCreated	2007-6-26
pubmed:abstractText	Computational prediction of protein complex structures through docking offers a means to gain a mechanistic understanding of protein interactions that mediate biological processes. This is particularly important as the number of experimentally determined structures of isolated proteins exceeds the number of structures of complexes. A comprehensive docking procedure is described in which efficient sampling of conformations is achieved by matching surface normal vectors, fast filtering for shape complementarity, clustering by RMSD, and scoring the docked conformations using a supervised machine learning approach. Contacting residue pair frequencies, residue propensities, evolutionary conservation, and shape complementarity score for each docking conformation are used as input data to a Random Forest classifier. The performance of the Random Forest approach for selecting correctly docked conformations was assessed by cross-validation using a nonredundant benchmark set of X-ray structures for 93 heterodimer and 733 homodimer complexes. The single highest rank docking solution was the correct (near-native) structure for slightly more than one third of the complexes. Furthermore, the fraction of highly ranked correct structures was significantly higher than the overall fraction of correct structures, for almost all complexes. A detailed analysis of the difficult to predict complexes revealed that the majority of the homodimer cases were explained by incorrect oligomeric state annotation. Evolutionary conservation and shape complementarity score as well as both underrepresented and overrepresented residue types and residue pairs were found to make the largest contributions to the overall prediction accuracy. Finally, the method was also applied to docking unbound subunit structures from a previously published benchmark set.
pubmed:language	eng
pubmed:journal	http://linkedlifedata.com/resource/pubmed/journal/8700181
pubmed:citationSubset	IM
pubmed:chemical	http://linkedlifedata.com/resource/pubmed/chemical/Proteins
pubmed:status	MEDLINE
pubmed:month	Aug
pubmed:issn	1097-0134
pubmed:author	pubmed-author:BordnerAndrew JAJ, pubmed-author:GorinAndrey AAA
pubmed:copyrightInfo	(c) 2007 Wiley-Liss, Inc.
pubmed:issnType	Electronic
pubmed:day	1
pubmed:volume	68
pubmed:owner	NLM
pubmed:authorsComplete	Y
pubmed:pagination	488-502
pubmed:meshHeading	pubmed-meshheading:17444516-Artificial Intelligence, pubmed-meshheading:17444516-Dimerization, pubmed-meshheading:17444516-Models, Molecular, pubmed-meshheading:17444516-Models, Theoretical, pubmed-meshheading:17444516-Protein Binding, pubmed-meshheading:17444516-Protein Conformation, pubmed-meshheading:17444516-Proteins, pubmed-meshheading:17444516-Reproducibility of Results, pubmed-meshheading:17444516-Surface Properties
pubmed:year	2007
pubmed:articleTitle	Protein docking using surface matching and supervised machine learning.
pubmed:affiliation	Computer Science and Mathematics Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831-6173, USA. bordner@ornl.gov
pubmed:publicationType	Journal Article, Research Support, U.S. Gov't, Non-P.H.S.