12611631

Source:http://linkedlifedata.com/resource/pubmed/id/12611631

Download in:

Switch to

Custom View

Named Graph Language Inference

Statements in which the resource exists as a subject.
Predicate	Object
rdf:type	pubmed:Citation
lifeskim:mentions	umls-concept:C0162340, umls-concept:C0332285, umls-concept:C1511726, umls-concept:C1704332, umls-concept:C1709016
pubmed:issue	4
pubmed:dateCreated	2003-4-17
pubmed:abstractText	We wished to quantify the state-of-the-art of our understanding of clusters in microarray data. To do this we systematically compared the clusters produced on sets of microarray data using a representative set of clustering algorithms (hierarchical, k-means, and a modified version of QT_CLUST) with the annotation schemes MIPS, GeneOntology and GenProtEC. We assumed that if a cluster reflected known biology its members would share related ontological annotations. This assumption is the basis of "guilt-by-association" and is commonly used to assign the putative function of proteins. To statistically measure the relationship between cluster and annotation we developed a new predictive discriminatory measure. We found that the clusters found in microarray data do not in general agree with functional annotation classes. Although many statistically significant relationships can be found, the majority of clusters are not related to known biology (as described in annotation ontologies). This implies that use of guilt-by-association is not supported by annotation ontologies. Depending on the estimate of the amount of noise in the data, our results suggest that bioinformatics has only codified a small proportion of the biological knowledge required to understand microarray data.
pubmed:language	eng
pubmed:journal	http://linkedlifedata.com/resource/pubmed/journal/9815902
pubmed:citationSubset	IM
pubmed:chemical	http://linkedlifedata.com/resource/pubmed/chemical/Fungal Proteins, http://linkedlifedata.com/resource/pubmed/chemical/Proteome
pubmed:status	MEDLINE
pubmed:issn	1386-6338
pubmed:author	pubmed-author:ClareAmandaA, pubmed-author:KingRoss DRD
pubmed:issnType	Print
pubmed:volume	2
pubmed:owner	NLM
pubmed:authorsComplete	Y
pubmed:pagination	511-22
pubmed:dateRevised	2007-11-15
pubmed:meshHeading	pubmed-meshheading:12611631-Algorithms, pubmed-meshheading:12611631-Cluster Analysis, pubmed-meshheading:12611631-Fungal Proteins, pubmed-meshheading:12611631-Oligonucleotide Array Sequence Analysis, pubmed-meshheading:12611631-Open Reading Frames, pubmed-meshheading:12611631-Proteome, pubmed-meshheading:12611631-Software, pubmed-meshheading:12611631-Statistics as Topic
pubmed:year	2002
pubmed:articleTitle	How well do we understand the clusters found in microarray data?
pubmed:affiliation	Department of Computer Science, University of Wales Aberystwyth, Penglais, Aberystwyth SY23 3DB, UK. afc@aber.ac.uk
pubmed:publicationType	Journal Article, Research Support, Non-U.S. Gov't