11700594

Source:http://linkedlifedata.com/resource/pubmed/id/11700594

Download in:

Switch to

Custom View

Named Graph Language Inference

Statements in which the resource exists as a subject.
Predicate	Object
rdf:type	pubmed:Citation
lifeskim:mentions	umls-concept:C0017262, umls-concept:C0025663, umls-concept:C0026336, umls-concept:C0026339, umls-concept:C0220825, umls-concept:C0679201, umls-concept:C1511726, umls-concept:C1704332, umls-concept:C1709016
pubmed:dateCreated	2001-11-8
pubmed:abstractText	At present, there is a lack of a sound methodology to infer causal gene expression relationships on a genome wide basis. We address this first by examining the behaviour of some of the latest and fastest algorithms for tree and cluster analysis, particularly hierarchical methods popular in phylogenetics. Combined with these are two novel distances based on partial, rather than full, correlations. Theoretically, partial correlations should provide better evidence for regulatory genetic links than standard correlations. To compare the clusters obtained by many alternative methods we use tree consensus methods. To compare methods of analysis we used tree partition metrics followed by another level of clustering. These, and a tree fit metric, all suggest that the new distances give quite different trees than those usually obtained. In the second part we consider graphical modeling of the interactions of important genes of the cell cycle. Despite the models seeming to fit well on occasions, and despite the experimental error structure seeming close to multivariate normal, there are considerable problems to overcome. Latent variables, in this case important genes missing from the analysis, are inferred to have a strong effect on the partial correlations. Also, the data show clear evidence of sampling distributions conditional on the status of important cancer related genes, including TP53. Without full information on which genes are wild type the appropriate models cannot be fitted. These findings point to the need to include and distinguish not only all relevant genes but also all splice variants in the design phase of a microarray analysis. Failure to do so will induce problems similar to both latent variables and conditional distributions.
pubmed:language	eng
pubmed:journal	http://linkedlifedata.com/resource/pubmed/journal/9717234
pubmed:citationSubset	IM
pubmed:status	MEDLINE
pubmed:issn	0919-9454
pubmed:author	pubmed-author:KishinoHH, pubmed-author:WaddellP JPJ
pubmed:issnType	Print
pubmed:volume	11
pubmed:owner	NLM
pubmed:authorsComplete	Y
pubmed:pagination	129-40
pubmed:dateRevised	2006-11-15
pubmed:meshHeading	pubmed-meshheading:11700594-Algorithms, pubmed-meshheading:11700594-Cell Cycle, pubmed-meshheading:11700594-Cluster Analysis, pubmed-meshheading:11700594-Computational Biology, pubmed-meshheading:11700594-Gene Expression Profiling, pubmed-meshheading:11700594-Genes, p53, pubmed-meshheading:11700594-Humans, pubmed-meshheading:11700594-Models, Genetic, pubmed-meshheading:11700594-Multigene Family, pubmed-meshheading:11700594-Neoplasms, pubmed-meshheading:11700594-Oligonucleotide Array Sequence Analysis, pubmed-meshheading:11700594-Tumor Cells, Cultured
pubmed:year	2000
pubmed:articleTitle	Cluster inference methods and graphical models evaluated on NCI60 microarray gene expression data.
pubmed:affiliation	Chugai Research Institute for Molecular Medicine, 153-2 Nagai Niihari Ibaraki 300-4101, Japan. waddell@cimmed.com
pubmed:publicationType	Journal Article, Comparative Study, Research Support, Non-U.S. Gov't