Statements in which the resource exists as a subject.
Predicate	Object
rdf:type
lifeskim:mentions
pubmed:issue
5
pubmed:dateCreated
2009-9-16
pubmed:abstractText
The prediction of essential biological features based on a given protein sequence is a challenging task in computational biology. To limit the amount of in vitro verification, the prediction of essential biological activities gives the opportunity to detect so far unknown sequences with similar properties. Besides the application within the identification of proteins being involved in tumorigenesis, other functional classes of proteins can be predicted. The prediction accuracy depends on the selected machine learning approach and even more on the composition of the descriptor set used. A computational approach based on feedforward neural networks was applied for the prediction of small GTPases. Consequently, this was realized by taking secondary structure and hydrophobicity information as a preprocessing architecture and thus, as descriptors for the neural networks. We developed a neural network cluster, which consists of a filter network and four subfamily networks. The filter network was trained to identify small GTPases and the subfamily networks were trained to assign a small GTPase to one of the subfamilies. The accuracy of the prediction, whether a given sequence represents a small GTPase is very high (98.25%). The classifications of the subfamily networks yield comparable accuracy. The high prediction accuracy of the neural network cluster developed, gives the opportunity to suggest the use of hydrophobicity and secondary structure prediction in combination with a neural network cluster, as a promising method for the prediction of essential biological activities.
pubmed:language
eng
pubmed:journal
pubmed:citationSubset
IM
pubmed:chemical
pubmed:status
MEDLINE
pubmed:month
Oct
pubmed:issn
1533-0346
pubmed:author
pubmed:issnType
Print
pubmed:volume
8
pubmed:owner
NLM
pubmed:authorsComplete
Y
pubmed:pagination
333-41
pubmed:meshHeading
pubmed:year
2009
pubmed:articleTitle
A computational approach for the identification of small GTPases based on preprocessed amino acid sequences.
pubmed:affiliation
Department of Bioinformatics Center for Medical Biotechnology University of Duisburg-Essen Universitatsstr. 2, 45117 Essen, Germany. Dominik.Heider@uni-due.de
pubmed:publicationType
Journal Article