Statements in which the resource exists as a subject.
PredicateObject
rdf:type
lifeskim:mentions
pubmed:issue
11
pubmed:dateCreated
1987-12-11
pubmed:abstractText
The effect on the correlation coefficient of discretizing data was investigated in two ways. First, the theoretical effect of dichotomizing data was calculated, and it was shown that the resulting correlation coefficient is considerably less than that between the underlying bivariate normally distributed variables. Second, computer simulations were performed of a model in which a continuous variable (measured with some error) gives rise to a counting variable through a mechanism in which the count is zero below a certain threshold value for the continuous variable and then increases linearly as the continuous variable increases. It was shown that the correlation coefficient between the observed values of the continuous and counting variables decreased as (a) the measurement error increased, (b) the slope of the relationship decreased, and (c) the number of counts decreased. It is concluded that caution is required when interpreting correlation coefficients when one or both of the variables consist of a few (say only four or five) discrete scores.
pubmed:language
eng
pubmed:journal
pubmed:citationSubset
D
pubmed:status
MEDLINE
pubmed:month
Nov
pubmed:issn
0022-0345
pubmed:author
pubmed:issnType
Print
pubmed:volume
65
pubmed:owner
NLM
pubmed:authorsComplete
Y
pubmed:pagination
1346-8
pubmed:dateRevised
2007-11-15
pubmed:meshHeading
pubmed:year
1986
pubmed:articleTitle
Interpreting the correlation coefficient when one of the variables is discrete.
pubmed:affiliation
Department of Medical Statistics, Dental School, University of Newcastle-upon-Tyne, England.
pubmed:publicationType
Journal Article