Statements in which the resource exists as a subject.
PredicateObject
rdf:type
lifeskim:mentions
pubmed:dateCreated
2006-6-15
pubmed:abstractText
Genetic epidemiologists have taken the challenge to identify genetic polymorphisms involved in the development of diseases. Many have collected data on large numbers of genetic markers but are not familiar with available methods to assess their association with complex diseases. Statistical methods have been developed for analyzing the relation between large numbers of genetic and environmental predictors to disease or disease-related variables in genetic association studies. In this commentary we discuss logistic regression analysis, neural networks, including the parameter decreasing method (PDM) and genetic programming optimized neural networks (GPNN) and several non-parametric methods, which include the set association approach, combinatorial partitioning method (CPM), restricted partitioning method (RPM), multifactor dimensionality reduction (MDR) method and the random forests approach. The relative strengths and weaknesses of these methods are highlighted. Logistic regression and neural networks can handle only a limited number of predictor variables, depending on the number of observations in the dataset. Therefore, they are less useful than the non-parametric methods to approach association studies with large numbers of predictor variables. GPNN on the other hand may be a useful approach to select and model important predictors, but its performance to select the important effects in the presence of large numbers of predictors needs to be examined. Both the set association approach and random forests approach are able to handle a large number of predictors and are useful in reducing these predictors to a subset of predictors with an important contribution to disease. The combinatorial methods give more insight in combination patterns for sets of genetic and/or environmental predictor variables that may be related to the outcome variable. As the non-parametric methods have different strengths and weaknesses we conclude that to approach genetic association studies using the case-control design, the application of a combination of several methods, including the set association approach, MDR and the random forests approach, will likely be a useful strategy to find the important genes and interaction patterns involved in complex diseases.
pubmed:commentsCorrections
http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-11037322, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-11037327, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-11230170, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-11295826, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-11404819, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-11682119, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-11731502, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-12082592, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-12108579, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-12123488, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-12123491, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-12548676, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-12584123, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-12846935, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-12914569, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-12935345, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-12951571, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-14583441, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-14639704, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-14730379, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-15119966, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-15133310, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-15305330, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-15339344, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-15522460, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-15525222, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-15588316, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-15593090, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-15892116, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-16264434, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-16284379, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-16436204, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-8970487, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-9234406, http://linkedlifedata.com/resource/pubmed/commentcorrection/16630340-9433631
pubmed:language
eng
pubmed:journal
pubmed:status
PubMed-not-MEDLINE
pubmed:issn
1471-2156
pubmed:author
pubmed:issnType
Electronic
pubmed:volume
7
pubmed:owner
NLM
pubmed:authorsComplete
Y
pubmed:pagination
23
pubmed:dateRevised
2009-11-18
pubmed:year
2006
pubmed:articleTitle
The challenge for genetic epidemiologists: how to analyze large numbers of SNPs in relation to complex diseases.
pubmed:affiliation
Centre for Nutrition and Health, National Institute for Public Health and the Environment, PO Box 1 3720 BA Bilthoven, The Netherlands.
pubmed:publicationType
Editorial