pubmed:abstractText |
Large gene expression studies, such as those conducted using DNA arrays, often provide millions of different pieces of data. To address the problem of analyzing such data, we describe a statistical method, which we have called 'gene shaving'. The method identifies subsets of genes with coherent expression patterns and large variation across conditions. Gene shaving differs from hierarchical clustering and other widely used methods for analyzing gene expression studies in that genes may belong to more than one cluster, and the clustering may be supervised by an outcome measure. The technique can be 'unsupervised', that is, the genes and samples are treated as unlabeled, or partially or fully supervised by using known properties of the genes or samples to assist in finding meaningful groupings.
|
pubmed:affiliation |
Department of Statistics, Sequoia Hall, Stanford University, Stanford, CA 94305, USA. tibs@stat.stanford.edu
|
pubmed:publicationType |
Journal Article,
Research Support, U.S. Gov't, P.H.S.,
Research Support, U.S. Gov't, Non-P.H.S.
|