Statements in which the resource exists as a subject.
PredicateObject
rdf:type
lifeskim:mentions
pubmed:dateCreated
2008-2-21
pubmed:abstractText
To gain insights into the biological function and relevance of genes using serial analysis of gene expression (SAGE) transcription profiles, one essential method is to perform clustering analysis on genes. A successful clustering analysis depends on the use of effective distance or similarity measures. For this purpose, by considering the specific properties of SAGE technology, we modeled the SAGE data by Poisson statistics and developed two Poisson-based measures to assess similarity of gene expression profiles. By employing these two distances into a K-means clustering procedure, we further developed a software package to perform clustering analysis on SAGE data. The software implementing our Poisson-based algorithms can be downloaded from http://genome.dfci.harvard.edu/sager. Our algorithm is guaranteed to converge to a local maximum when Poisson likelihood-based measure is used. The results from simulation and experimental mouse retina data demonstrate that the Poisson-based distances are more appropriate and reliable for analyzing SAGE data compared to other commonly used distances or similarity measures.
pubmed:language
eng
pubmed:journal
pubmed:citationSubset
IM
pubmed:status
MEDLINE
pubmed:issn
1064-3745
pubmed:author
pubmed:issnType
Print
pubmed:volume
387
pubmed:owner
NLM
pubmed:authorsComplete
Y
pubmed:pagination
185-98
pubmed:meshHeading
pubmed:year
2008
pubmed:articleTitle
Clustering analysis of SAGE transcription profiles using a Poisson approach.
pubmed:affiliation
Department of Statistics, University of California at Berkeley, Berkeley, CA, USA.
pubmed:publicationType
Journal Article