Statements in which the resource exists as a subject.
PredicateObject
rdf:type
lifeskim:mentions
pubmed:issue
Database issue
pubmed:dateCreated
2007-1-4
pubmed:abstractText
Protein classification by machine learning algorithms is now widely used in structural and functional annotation of proteins. The Protein Classification Benchmark collection (http://hydra.icgeb.trieste.it/benchmark) was created in order to provide standard datasets on which the performance of machine learning methods can be compared. It is primarily meant for method developers and users interested in comparing methods under standardized conditions. The collection contains datasets of sequences and structures, and each set is subdivided into positive/negative, training/test sets in several ways. There is a total of 6405 classification tasks, 3297 on protein sequences, 3095 on protein structures and 10 on protein coding regions in DNA. Typical tasks include the classification of structural domains in the SCOP and CATH databases based on their sequences or structures, as well as various functional and taxonomic classification problems. In the case of hierarchical classification schemes, the classification tasks can be defined at various levels of the hierarchy (such as classes, folds, superfamilies, etc.). For each dataset there are distance matrices available that contain all vs. all comparison of the data, based on various sequence or structure comparison methods, as well as a set of classification performance measures computed with various classifier algorithms.
pubmed:commentsCorrections
http://linkedlifedata.com/resource/pubmed/commentcorrection/17142240-10786297, http://linkedlifedata.com/resource/pubmed/commentcorrection/17142240-10890390, http://linkedlifedata.com/resource/pubmed/commentcorrection/17142240-12784372, http://linkedlifedata.com/resource/pubmed/commentcorrection/17142240-12969510, http://linkedlifedata.com/resource/pubmed/commentcorrection/17142240-14681400, http://linkedlifedata.com/resource/pubmed/commentcorrection/17142240-14980014, http://linkedlifedata.com/resource/pubmed/commentcorrection/17142240-14988126, http://linkedlifedata.com/resource/pubmed/commentcorrection/17142240-15229883, http://linkedlifedata.com/resource/pubmed/commentcorrection/17142240-15333456, http://linkedlifedata.com/resource/pubmed/commentcorrection/17142240-15608188, http://linkedlifedata.com/resource/pubmed/commentcorrection/17142240-15804412, http://linkedlifedata.com/resource/pubmed/commentcorrection/17142240-15914542, http://linkedlifedata.com/resource/pubmed/commentcorrection/17142240-15981264, http://linkedlifedata.com/resource/pubmed/commentcorrection/17142240-16044462, http://linkedlifedata.com/resource/pubmed/commentcorrection/17142240-16317070, http://linkedlifedata.com/resource/pubmed/commentcorrection/17142240-16414044, http://linkedlifedata.com/resource/pubmed/commentcorrection/17142240-16455867, http://linkedlifedata.com/resource/pubmed/commentcorrection/17142240-16718863, http://linkedlifedata.com/resource/pubmed/commentcorrection/17142240-2231712, http://linkedlifedata.com/resource/pubmed/commentcorrection/17142240-5420325, http://linkedlifedata.com/resource/pubmed/commentcorrection/17142240-7265238
pubmed:language
eng
pubmed:journal
pubmed:citationSubset
IM
pubmed:chemical
pubmed:status
MEDLINE
pubmed:month
Jan
pubmed:issn
1362-4962
pubmed:author
pubmed:issnType
Electronic
pubmed:volume
35
pubmed:owner
NLM
pubmed:authorsComplete
Y
pubmed:pagination
D232-6
pubmed:dateRevised
2009-11-18
pubmed:meshHeading
pubmed:year
2007
pubmed:articleTitle
A Protein Classification Benchmark collection for machine learning.
pubmed:affiliation
Protein Structure and Bioinformatics Group, International Centre for Genetic Engineering and Biotechnology, Padriciano 99, 34012 Trieste, Italy.
pubmed:publicationType
Journal Article, Research Support, Non-U.S. Gov't