Source:http://linkedlifedata.com/resource/pubmed/id/16570873
Switch to
Predicate | Object |
---|---|
rdf:type | |
lifeskim:mentions | |
pubmed:issue |
1
|
pubmed:dateCreated |
2006-3-30
|
pubmed:abstractText |
Support vector machines (SVMs) have shown strong generalization ability in a number of application areas, including protein structure prediction. However, the poor comprehensibility hinders the success of the SVM for protein structure prediction. The explanation of how a decision made is important for accepting the machine learning technology, especially for applications such as bioinformatics. The reasonable interpretation is not only useful to guide the "wet experiments," but also the extracted rules are helpful to integrate computational intelligence with symbolic AI systems for advanced deduction. On the other hand, a decision tree has good comprehensibility. In this paper, a novel approach to rule generation for protein secondary structure prediction by integrating merits of both the SVM and decision tree is presented. This approach combines the SVM with decision tree into a new algorithm called SVM_ DT, which proceeds in three steps. This algorithm first trains an SVM. Then, a new training set is generated through careful selection from the output of the SVM. Finally, the obtained training set is used to train a decision tree learning system and to extract the corresponding rule sets. The results of the experiments of protein secondary structure prediction on RS126 data set show that the comprehensibility of SVM_DT is much better than that of the SVM. Moreover, the generalization ability of SVM_DT is better than that of C4.5 decision trees and is similar to that of the SVM. Hence, SVM_DT can be used not only for prediction, but also for guiding biological experiments.
|
pubmed:grant | |
pubmed:language |
eng
|
pubmed:journal | |
pubmed:citationSubset |
IM
|
pubmed:chemical | |
pubmed:status |
MEDLINE
|
pubmed:month |
Mar
|
pubmed:issn |
1536-1241
|
pubmed:author | |
pubmed:issnType |
Print
|
pubmed:volume |
5
|
pubmed:owner |
NLM
|
pubmed:authorsComplete |
Y
|
pubmed:pagination |
46-53
|
pubmed:dateRevised |
2007-11-14
|
pubmed:meshHeading |
pubmed-meshheading:16570873-Algorithms,
pubmed-meshheading:16570873-Artificial Intelligence,
pubmed-meshheading:16570873-Computer Simulation,
pubmed-meshheading:16570873-Decision Support Techniques,
pubmed-meshheading:16570873-Models, Chemical,
pubmed-meshheading:16570873-Models, Molecular,
pubmed-meshheading:16570873-Pattern Recognition, Automated,
pubmed-meshheading:16570873-Protein Structure, Secondary,
pubmed-meshheading:16570873-Proteins,
pubmed-meshheading:16570873-Sequence Analysis, Protein
|
pubmed:year |
2006
|
pubmed:articleTitle |
Rule generation for protein secondary structure prediction with support vector machines and decision tree.
|
pubmed:affiliation |
Computer Science and Engineering Department, Southeast University, Nanjing 210096, China. jieyuehe@seu.edu.cn
|
pubmed:publicationType |
Journal Article,
Research Support, U.S. Gov't, Non-P.H.S.,
Research Support, Non-U.S. Gov't,
Evaluation Studies,
Research Support, N.I.H., Extramural
|