Statements in which the resource exists.
SubjectPredicateObjectContext
pubmed-article:16231961rdf:typepubmed:Citationlld:pubmed
pubmed-article:16231961lifeskim:mentionsumls-concept:C0575090lld:lifeskim
pubmed-article:16231961lifeskim:mentionsumls-concept:C0036576lld:lifeskim
pubmed-article:16231961lifeskim:mentionsumls-concept:C1519249lld:lifeskim
pubmed-article:16231961lifeskim:mentionsumls-concept:C0456387lld:lifeskim
pubmed-article:16231961lifeskim:mentionsumls-concept:C2348519lld:lifeskim
pubmed-article:16231961lifeskim:mentionsumls-concept:C1527118lld:lifeskim
pubmed-article:16231961pubmed:issue3lld:pubmed
pubmed-article:16231961pubmed:dateCreated2005-10-19lld:pubmed
pubmed-article:16231961pubmed:abstractTextWhen the standard approach to predict protein function by sequence homology fails, other alternative methods can be used that require only the amino acid sequence for predicting function. One such approach uses machine learning to predict protein function directly from amino acid sequence features. However, there are two issues to consider before successful functional prediction can take place: identifying discriminatory features, and overcoming the challenge of a large imbalance in the training data. We show that by applying feature subset selection followed by undersampling of the majority class, significantly better support vector machine (SVM) classifiers are generated compared with standard machine learning approaches. As well as revealing that the features selected could have the potential to advance our understanding of the relationship between sequence and function, we also show that undersampling to produce fully balanced data significantly improves performance. The best discriminating ability is achieved using SVMs together with feature selection and full undersampling; this approach strongly outperforms other competitive learning algorithms. We conclude that this combined approach can generate powerful machine learning classifiers for predicting protein function directly from sequence.lld:pubmed
pubmed-article:16231961pubmed:commentsCorrectionshttp://linkedlifedata.com/r...lld:pubmed
pubmed-article:16231961pubmed:languageenglld:pubmed
pubmed-article:16231961pubmed:journalhttp://linkedlifedata.com/r...lld:pubmed
pubmed-article:16231961pubmed:citationSubsetIMlld:pubmed
pubmed-article:16231961pubmed:chemicalhttp://linkedlifedata.com/r...lld:pubmed
pubmed-article:16231961pubmed:statusMEDLINElld:pubmed
pubmed-article:16231961pubmed:issn1175-5636lld:pubmed
pubmed-article:16231961pubmed:authorpubmed-author:BreitlingRain...lld:pubmed
pubmed-article:16231961pubmed:authorpubmed-author:GilbertDavidDlld:pubmed
pubmed-article:16231961pubmed:authorpubmed-author:Al-ShahibAliAlld:pubmed
pubmed-article:16231961pubmed:issnTypePrintlld:pubmed
pubmed-article:16231961pubmed:volume4lld:pubmed
pubmed-article:16231961pubmed:ownerNLMlld:pubmed
pubmed-article:16231961pubmed:authorsCompleteYlld:pubmed
pubmed-article:16231961pubmed:pagination195-203lld:pubmed
pubmed-article:16231961pubmed:meshHeadingpubmed-meshheading:16231961...lld:pubmed
pubmed-article:16231961pubmed:meshHeadingpubmed-meshheading:16231961...lld:pubmed
pubmed-article:16231961pubmed:meshHeadingpubmed-meshheading:16231961...lld:pubmed
pubmed-article:16231961pubmed:meshHeadingpubmed-meshheading:16231961...lld:pubmed
pubmed-article:16231961pubmed:meshHeadingpubmed-meshheading:16231961...lld:pubmed
pubmed-article:16231961pubmed:meshHeadingpubmed-meshheading:16231961...lld:pubmed
pubmed-article:16231961pubmed:meshHeadingpubmed-meshheading:16231961...lld:pubmed
pubmed-article:16231961pubmed:meshHeadingpubmed-meshheading:16231961...lld:pubmed
pubmed-article:16231961pubmed:meshHeadingpubmed-meshheading:16231961...lld:pubmed
pubmed-article:16231961pubmed:meshHeadingpubmed-meshheading:16231961...lld:pubmed
pubmed-article:16231961pubmed:meshHeadingpubmed-meshheading:16231961...lld:pubmed
pubmed-article:16231961pubmed:year2005lld:pubmed
pubmed-article:16231961pubmed:articleTitleFeature selection and the class imbalance problem in predicting protein function from sequence.lld:pubmed
pubmed-article:16231961pubmed:affiliationBioinformatics Research Centre, Department of Computing Science, University of Glasgow, Glasgow, UK. alshahib@dcs.gla.ac.uklld:pubmed
pubmed-article:16231961pubmed:publicationTypeJournal Articlelld:pubmed
pubmed-article:16231961pubmed:publicationTypeResearch Support, Non-U.S. Gov'tlld:pubmed
http://linkedlifedata.com/r...pubmed:referesTopubmed-article:16231961lld:pubmed
http://linkedlifedata.com/r...pubmed:referesTopubmed-article:16231961lld:pubmed
http://linkedlifedata.com/r...pubmed:referesTopubmed-article:16231961lld:pubmed
http://linkedlifedata.com/r...pubmed:referesTopubmed-article:16231961lld:pubmed
http://linkedlifedata.com/r...pubmed:referesTopubmed-article:16231961lld:pubmed
http://linkedlifedata.com/r...pubmed:referesTopubmed-article:16231961lld:pubmed