10568749

Source: http://linkedlifedata.com/resource/pubmed/id/10568749

Statements in which the resource exists as a subject.
Predicate	Object
rdf:type	pubmed:Citation
lifeskim:mentions	umls-concept:C0009085, umls-concept:C1511726
pubmed:issue	11
pubmed:dateCreated	2000-1-27
pubmed:abstractText	Clustering is one of the main mathematical challenges in large-scale gene expression analysis. We describe a clustering procedure based on a sequential k-means algorithm with additional refinements that is able to handle high-throughput data on the order of hundreds of thousands of data items measured on hundreds of variables. The practical motivation for our algorithm is oligonucleotide fingerprinting, a method for simultaneous determination of expression level for every active gene of a specific tissue, although the algorithm can be applied as well to other large-scale projects like EST clustering and qualitative clustering of DNA-chip data. As a pairwise similarity measure between two p-dimensional data points, x and y, we introduce mutual information that can be interpreted as the amount of information about x in y, and vice versa. We show that for our purposes this measure is superior to commonly used metric distances, for example, Euclidean distance. We also introduce a modified version of mutual information as a novel method for validating clustering results when the true clustering is known. The performance of our algorithm with respect to experimental noise is shown by extensive simulation studies. The algorithm is tested on a subset of 2029 cDNA clones coming from 15 different genes from a cDNA library derived from human dendritic cells. Furthermore, the clustering of these 2029 cDNA clones is demonstrated when the entire set of 76,032 cDNA clones is processed.
pubmed:commentsCorrections	http://linkedlifedata.com/resource/pubmed/commentcorrection/10568749-10077610, http://linkedlifedata.com/resource/pubmed/commentcorrection/10568749-10359783, http://linkedlifedata.com/resource/pubmed/commentcorrection/10568749-10371154, http://linkedlifedata.com/resource/pubmed/commentcorrection/10568749-10391217, http://linkedlifedata.com/resource/pubmed/commentcorrection/10568749-10409423, http://linkedlifedata.com/resource/pubmed/commentcorrection/10568749-1781028, http://linkedlifedata.com/resource/pubmed/commentcorrection/10568749-2047873, http://linkedlifedata.com/resource/pubmed/commentcorrection/10568749-7665186, http://linkedlifedata.com/resource/pubmed/commentcorrection/10568749-8401585, http://linkedlifedata.com/resource/pubmed/commentcorrection/10568749-8426656, http://linkedlifedata.com/resource/pubmed/commentcorrection/10568749-8919692, http://linkedlifedata.com/resource/pubmed/commentcorrection/10568749-8921367, http://linkedlifedata.com/resource/pubmed/commentcorrection/10568749-9419376, http://linkedlifedata.com/resource/pubmed/commentcorrection/10568749-9547283, http://linkedlifedata.com/resource/pubmed/commentcorrection/10568749-9826759, http://linkedlifedata.com/resource/pubmed/commentcorrection/10568749-9843981
pubmed:language	eng
pubmed:journal	http://linkedlifedata.com/resource/pubmed/journal/9518021
pubmed:citationSubset	IM
pubmed:chemical	http://linkedlifedata.com/resource/pubmed/chemical/DNA, Complementary
pubmed:status	MEDLINE
pubmed:month	Nov
pubmed:issn	1088-9051
pubmed:author	pubmed-author:BullCC, pubmed-author:HerwigRR, pubmed-author:LehrachHH, pubmed-author:MüllerCC, pubmed-author:O'BrienJJ, pubmed-author:PoustkaA JAJ
pubmed:issnType	Print
pubmed:volume	9
pubmed:owner	NLM
pubmed:authorsComplete	Y
pubmed:pagination	1093-105
pubmed:dateRevised	2009-11-18
pubmed:meshHeading	pubmed-meshheading:10568749-Algorithms, pubmed-meshheading:10568749-Cells, Cultured, pubmed-meshheading:10568749-Cluster Analysis, pubmed-meshheading:10568749-Computational Biology, pubmed-meshheading:10568749-DNA, Complementary, pubmed-meshheading:10568749-DNA Fingerprinting, pubmed-meshheading:10568749-Expressed Sequence Tags, pubmed-meshheading:10568749-Gene Expression, pubmed-meshheading:10568749-Humans
pubmed:year	1999
pubmed:articleTitle	Large-scale clustering of cDNA-fingerprinting data.
pubmed:affiliation	Max-Planck Institut für Molekulare Genetik, Ihnestrasse 73, D-14195 Berlin, Germany. herwig@mpimg-berlin-dahlem.mpg.de
pubmed:publicationType	Journal Article, Comparative Study, Research Support, Non-U.S. Gov't
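
Illustrative note: the abstract describes mutual information as a pairwise similarity between two p-dimensional data points x and y. The sketch below is one plausible reading of that idea, not the authors' implementation. It assumes the fingerprint values have already been discretized (for example into 0/1 hybridization calls) and computes the standard empirical mutual information in bits. The function name `mutual_information` and the sample vectors are hypothetical.

```python
import math
from collections import Counter


def mutual_information(x, y):
    """Empirical mutual information (in bits) between two discrete vectors.

    Assumption: x and y are equal-length sequences of already-discretized
    values; the discretization step is not specified in this record.
    """
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")
    n = len(x)
    if n == 0:
        raise ValueError("x and y must not be empty")

    joint = Counter(zip(x, y))  # counts of co-occurring value pairs
    px = Counter(x)             # marginal counts for x
    py = Counter(y)             # marginal counts for y

    mi = 0.0
    for (a, b), c in joint.items():
        # p(a,b) * log2( p(a,b) / (p(a) * p(b)) ), written with raw counts
        mi += (c / n) * math.log2(c * n / (px[a] * py[b]))
    return mi


# Hypothetical fingerprints: identical vectors share all information,
# statistically independent ones share none.
same = mutual_information([0, 1, 0, 1], [0, 1, 0, 1])
indep = mutual_information([0, 0, 1, 1], [0, 1, 0, 1])
```

Unlike Euclidean distance, this quantity is a similarity (larger means more alike) and is symmetric in x and y, which matches the abstract's "information about x in y, and vice versa".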