Statements in which the resource exists as a subject.
PredicateObject
rdf:type
lifeskim:mentions
pubmed:dateCreated
2006-5-1
pubmed:abstractText
We demonstrate a concept and implementation of a compendium for the classification of high-dimensional data from microarray gene expression profiles. A compendium is an interactive document that bundles primary data, statistical processing methods, figures, and derived data together with the textual documentation and conclusions. Interactivity allows the reader to modify and extend these components. We address the following questions: how much does the discriminatory power of a classifier depend on the choice of the algorithm that was used to identify it; what alternative classifiers could be used just as well; how robust is the result. The answers to these questions are essential prerequisites for validation and biological interpretation of the classifiers. We show how to use this approach by looking at these questions for a specific breast cancer microarray data set that first has been studied by Huang et al. (2003).
pubmed:language
eng
pubmed:journal
pubmed:status
PubMed-not-MEDLINE
pubmed:issn
1544-6115
pubmed:author
pubmed:issnType
Electronic
pubmed:volume
3
pubmed:owner
NLM
pubmed:authorsComplete
Y
pubmed:pagination
Article37
pubmed:dateRevised
2006-12-4
pubmed:year
2004
pubmed:articleTitle
A compendium to ensure computational reproducibility in high-dimensional classification tasks.
pubmed:affiliation
Division of Molecular Genome Analysis, German Cancer Research Centre. m.ruschhaupt@dkfz-heidelberg.de
pubmed:publicationType
Journal Article