Statements in which the resource exists as a subject.
PredicateObject
rdf:type
lifeskim:mentions
pubmed:issue
4
pubmed:dateCreated
2003-4-2
pubmed:abstractText
Global analyses of RNA expression levels are useful for classifying genes and overall phenotypes. Often these classification problems are linked, and one wants to find "marker genes" that are differentially expressed in particular sets of "conditions." We have developed a method that simultaneously clusters genes and conditions, finding distinctive "checkerboard" patterns in matrices of gene expression data, if they exist. In a cancer context, these checkerboards correspond to genes that are markedly up- or downregulated in patients with particular types of tumors. Our method, spectral biclustering, is based on the observation that checkerboard structures in matrices of expression data can be found in eigenvectors corresponding to characteristic expression patterns across genes or conditions. In addition, these eigenvectors can be readily identified by commonly used linear algebra approaches, in particular the singular value decomposition (SVD), coupled with closely integrated normalization steps. We present a number of variants of the approach, depending on whether the normalization over genes and conditions is done independently or in a coupled fashion. We then apply spectral biclustering to a selection of publicly available cancer expression data sets, and examine the degree to which the approach is able to identify checkerboard structures. Furthermore, we compare the performance of our biclustering methods against a number of reasonable benchmarks (e.g., direct application of SVD or normalized cuts to raw data).
pubmed:grant
pubmed:language
eng
pubmed:journal
pubmed:citationSubset
IM
pubmed:status
MEDLINE
pubmed:month
Apr
pubmed:issn
1088-9051
pubmed:author
pubmed:issnType
Print
pubmed:volume
13
pubmed:owner
NLM
pubmed:authorsComplete
Y
pubmed:pagination
703-16
pubmed:dateRevised
2008-11-20
pubmed:meshHeading
pubmed-meshheading:12671006-Breast Neoplasms, pubmed-meshheading:12671006-Central Nervous System Neoplasms, pubmed-meshheading:12671006-Cerebellar Neoplasms, pubmed-meshheading:12671006-Databases, Genetic, pubmed-meshheading:12671006-Gene Expression Profiling, pubmed-meshheading:12671006-Gene Expression Regulation, Neoplastic, pubmed-meshheading:12671006-Genes, Neoplasm, pubmed-meshheading:12671006-Glioma, pubmed-meshheading:12671006-Humans, pubmed-meshheading:12671006-Leukemia, Lymphocytic, Chronic, B-Cell, pubmed-meshheading:12671006-Lymphoma, B-Cell, pubmed-meshheading:12671006-Lymphoma, Follicular, pubmed-meshheading:12671006-Lymphoma, Large B-Cell, Diffuse, pubmed-meshheading:12671006-Medulloblastoma, pubmed-meshheading:12671006-Neoplasms, Germ Cell and Embryonal, pubmed-meshheading:12671006-Oligonucleotide Array Sequence Analysis, pubmed-meshheading:12671006-Phenotype, pubmed-meshheading:12671006-Reference Values, pubmed-meshheading:12671006-Tumor Cells, Cultured
pubmed:year
2003
pubmed:articleTitle
Spectral biclustering of microarray data: coclustering genes and conditions.
pubmed:affiliation
Department of Genetics, Yale University, New Haven, Connecticut 06520, USA.
pubmed:publicationType
Journal Article, Research Support, U.S. Gov't, P.H.S., Research Support, Non-U.S. Gov't