Statements in which the resource exists as a subject.
Predicate | Object
rdf:type |
lifeskim:mentions |
pubmed:dateCreated | 2002-10-18
pubmed:abstractText | MOTIVATION: Clustering co-expressed genes usually requires the definition of 'distance' or 'similarity' between measured datasets, the most common choices being Pearson correlation or Euclidean distance. With the size of available datasets steadily increasing, it has become feasible to consider other, more general, definitions as well. One alternative, based on information theory, is the mutual information, providing a general measure of dependencies between variables. While the use of mutual information in cluster analysis and visualization of large-scale gene expression data has been suggested previously, the earlier studies did not focus on comparing different algorithms to estimate the mutual information from finite data. RESULTS: Here we describe and review several approaches to estimate the mutual information from finite datasets. Our findings show that the algorithms used so far may be quite substantially improved upon. In particular when dealing with small datasets, finite sample effects and other sources of potentially misleading results have to be taken into account.
pubmed:language | eng
pubmed:journal |
pubmed:citationSubset | IM
pubmed:status | MEDLINE
pubmed:issn | 1367-4803
pubmed:author |
pubmed:issnType | Print
pubmed:volume | 18 Suppl 2
pubmed:owner | NLM
pubmed:authorsComplete | Y
pubmed:pagination | S231-40
pubmed:dateRevised | 2006-11-15
pubmed:meshHeading |
pubmed:year | 2002
pubmed:articleTitle | The mutual information: detecting and evaluating dependencies between variables.
pubmed:affiliation | University Potsdam, Nonlinear Dynamics Group, Germany. steuer@agnld.uni-potsdam.de
pubmed:publicationType | Journal Article, Comparative Study, Evaluation Studies
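The abstract above concerns estimating the mutual information I(X;Y) from finite datasets. As a point of reference, the sketch below implements the naive histogram ("plug-in") estimator, the simple baseline that finite-sample-aware algorithms improve upon; the bin count and the test data are illustrative assumptions, not the authors' specific algorithms.

```python
import math
import random

def mutual_information(xs, ys, bins=10):
    """Naive histogram ("plug-in") estimate of I(X;Y) in nats.

    Bins both variables into `bins` equal-width cells and plugs the
    empirical cell frequencies into I = sum p(x,y) log[p(x,y)/(p(x)p(y))].
    """
    n = len(xs)
    xlo, xhi = min(xs), max(xs)
    ylo, yhi = min(ys), max(ys)

    def bin_index(v, lo, hi):
        # Map v to one of `bins` equal-width bins on [lo, hi].
        if v >= hi:
            return bins - 1
        return int((v - lo) / (hi - lo) * bins)

    # Joint histogram over (x-bin, y-bin) cells.
    joint = {}
    for x, y in zip(xs, ys):
        key = (bin_index(x, xlo, xhi), bin_index(y, ylo, yhi))
        joint[key] = joint.get(key, 0) + 1

    # Marginal counts, obtained by summing the joint counts.
    px, py = {}, {}
    for (i, j), c in joint.items():
        px[i] = px.get(i, 0) + c
        py[j] = py.get(j, 0) + c

    # Plug-in estimate; only non-empty cells contribute.
    mi = 0.0
    for (i, j), c in joint.items():
        mi += (c / n) * math.log(c * n / (px[i] * py[j]))
    return mi

random.seed(1)
x = [random.gauss(0, 1) for _ in range(5000)]
y_dep = [v + 0.5 * random.gauss(0, 1) for v in x]  # dependent on x
y_ind = [random.gauss(0, 1) for _ in range(5000)]  # independent of x

print(mutual_information(x, y_dep))  # substantially positive
print(mutual_information(x, y_ind))  # small but nonzero: finite-sample bias
```

Note the second estimate: even for independent variables the plug-in estimator returns a strictly positive value on finite data, which is exactly the kind of finite-sample effect the abstract warns must be taken into account for small datasets.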