17933851

Source:http://linkedlifedata.com/resource/pubmed/id/17933851

Download in:

Switch to

Custom View

Named Graph Language Inference

Statements in which the resource exists as a subject.
Predicate	Object
rdf:type	pubmed:Citation
lifeskim:mentions	umls-concept:C0012630, umls-concept:C0017337, umls-concept:C0332152, umls-concept:C0376554, umls-concept:C0598132, umls-concept:C1511726, umls-concept:C1709016
pubmed:issue	23
pubmed:dateCreated	2007-11-28
pubmed:abstractText	MOTIVATION: Discriminant analysis for high-dimensional and low-sample-sized data has become a hot research topic in bioinformatics, mainly motivated by its importance and challenge in applications to tumor classifications for high-dimensional microarray data. Two of the popular methods are the nearest shrunken centroids, also called predictive analysis of microarray (PAM), and shrunken centroids regularized discriminant analysis (SCRDA). Both methods are modifications to the classic linear discriminant analysis (LDA) in two aspects tailored to high-dimensional and low-sample-sized data: one is the regularization of the covariance matrix, and the other is variable selection through shrinkage. In spite of their usefulness, there are potential limitations with each method. The main concern is that both PAM and SCRDA are possibly too extreme: the covariance matrix in the former is restricted to be diagonal while in the latter there is barely any restriction. Based on the biology of gene functions and given the feature of the data, it may be beneficial to estimate the covariance matrix as an intermediate between the two; furthermore, more effective shrinkage schemes may be possible. RESULTS: We propose modified LDA methods to integrate biological knowledge of gene functions (or variable groups) into classification of microarray data. Instead of simply treating all the genes independently or imposing no restriction on the correlations among the genes, we group the genes according to their biological functions extracted from existing biological knowledge or data, and propose regularized covariance estimators that encourages between-group gene independence and within-group gene correlations while maintaining the flexibility of any general covariance structure. Furthermore, we propose a shrinkage scheme on groups of genes that tends to retain or remove a whole group of the genes altogether, in contrast to the standard shrinkage on individual genes. We show that one of the proposed methods performed better than PAM and SCRDA in a simulation study and several real data examples.
pubmed:grant	http://linkedlifedata.com/resource/pubmed/grant/HL65462
pubmed:language	eng
pubmed:journal	http://linkedlifedata.com/resource/pubmed/journal/9808944
pubmed:citationSubset	IM
pubmed:status	MEDLINE
pubmed:month	Dec
pubmed:issn	1367-4811
pubmed:author	pubmed-author:FermV HVH, pubmed-author:YinP HPH
pubmed:issnType	Electronic
pubmed:day	1
pubmed:volume	23
pubmed:owner	NLM
pubmed:authorsComplete	Y
pubmed:pagination	3170-7
pubmed:dateRevised	2009-11-4
pubmed:meshHeading	pubmed-meshheading:17933851-Algorithms, pubmed-meshheading:17933851-Artificial Intelligence, pubmed-meshheading:17933851-Data Interpretation, Statistical, pubmed-meshheading:17933851-Discriminant Analysis, pubmed-meshheading:17933851-Gene Expression Profiling, pubmed-meshheading:17933851-Multigene Family, pubmed-meshheading:17933851-Oligonucleotide Array Sequence Analysis, pubmed-meshheading:17933851-Pattern Recognition, Automated
pubmed:year	2007
pubmed:articleTitle	Incorporating prior knowledge of gene functional groups into regularized discriminant analysis of microarray data.
pubmed:affiliation	Division of Biostatistics, School of Public Health, University of Minnesota, A460 Mayo Building (MMC 303), Minneapolis, MN 55455-0378, USA.
pubmed:publicationType	Journal Article, Research Support, Non-U.S. Gov't, Research Support, N.I.H., Extramural