Royalty-Free photo

Correlation Coefficient as a Gateway to Skepticism

The correlation coefficient as a gateway to radical skepticism:
Suppose you calculate that two variables are moderately correlated. For instance, you find that self-reported happiness has a correlation r=0.32 with self-reported willpower, as I found in one of my studies.

What are the possible explanations for (or causes of) this?

  • A Causes B – Increasing A is a cause of increasing B but not the reverse. [e.g., more happiness causes more willpower]
  • (2) B Causes A – Increasing B is a cause of increasing A but not the reverse. [e.g., more willpower causes more happiness]
  • (3) A Causes B Causes A – Increasing A and B are both causes of increases of the other, leading to a positive feedback loop between them. [e.g., more willpower causes more happiness which causes more willpower which causes more happiness, etc.]
  • (4) X Causes A and B – There exists at least one other variable X (potentially many more than one) such that increases (or decreases) in X lead simultaneously to increases in both A and B. [e.g., living in a stable environment and having a supportive romantic partner are both situations that increase happiness and are also both situations that increase willpower]
  • (5) Non-linearity – A and B actually have a much stronger relationship than it appears (in fact they could be fully deterministically related) because correlation only captures the average extent to which A exceeding its mean coincides with B exceeding its mean, and can understate the strength of relationships that go both up and down or that go up in an inconsistent fashion (that is, it captures linear relationships, but can sometimes mask non-linear ones). [e.g., as willpower goes up happiness tends to go up as people become increasingly good at making good longer-term effective choices, but as willpower gets to extremely high levels, people start experiencing less happiness again because extreme levels of willpower are linked to reduced emotional responses of all kinds]
  • (6a) Noise – The correlation is actually much larger or smaller than it seems as a result of sampling error or noise (e.g., the sample size is too low to measure it with reliability, and so it is really more like 0.32 plus or minus 0.3 and could realistically be near zero) [e.g., with a sample size of n=30, a measured correlation of 0.32 has an insanely wide 95th percentile confidence interval of -0.045 to 0.609]
  • (6b) Too Many Tries – You found this correlation by looking at a long list of correlations that you calculated (and it stood out to you because it was one of the highest correlations), but there is actually not a significant correlation at all, it just looks that way because you tested so many hypotheses and each one had some chance of looking significant due to random fluctuations (so it’s unsurprising that one ended up being moderately large, and that’s the one you honed in on) [e.g., if you look at all pairs of correlations for 15 variables that leads to 105 distinct correlations, and with a sample size of 50 data points used to calculate each correlation, it is quite likely that you’d get a correlation of 0.32 or higher even if the true correlation between all pairs of variables was 0]
  • (7) Outliers – There is an extreme outlier in the data (e.g., a transcription error or data corruption or disgruntled survey respondent), and without it, the calculated correlation would decrease or increase dramatically. [e.g., with n=100 data points, a crazy outlier pair of 30 for A and 1.35 for B when the rest of your values are between 0 and 1 can make the calculated correlation come out to 0.32 when the correlation calculated without that one outlier is actually -0.12]
  • (8a) Selection Bias – Your data is a biased sample of the population that you actually care about due to unintentional selection effects in your sampling procedure, and so the 0.32 correlation, which you assumed applies to the broader population from which your data was drawn, is not actually representative of the population you are attempting to draw conclusions about. [e.g., you inadvertently surveyed only younger people, and in younger people, the correlation between willpower and happiness is positive, whereas in older people it is a negligible correlation, yet you’re trying to draw conclusions about the entire population which includes both young and old]
  • (8b) Correlation From Conditioning – The variables A and B are actually uncorrelated if you had looked at them in the full population of interest, but you only sampled them in some subpopulation out of convenience, and A and B are both correlated with how likely you are to end up in that subpopulation, creating an observed correlation in the subpopulation that doesn’t exist in the full population [e.g., happiness and willpower are actually uncorrelated, but you only collected data on people at camp, and whether or not a person decides to go to camp is strongly positively predicted by happiness and strongly negatively predicted by willpower; therefore campers are likely to have high happiness or low willpower (though not both) creating a correlation between happiness and willpower among campers that doesn’t exist among non-campers]
  • (9) The World Is Too Complex For Mere Mortals – Some mixture of (1), (2), (3), (4), (5) and (6a), (6b), (7), (8a), and (8b) are true.
  • (10) Wrong Problem – Of the literally infinite variety of formulas that capture the extent to which A and B vary together, the correlation (i.e., the average of the product of standard deviations from the mean) is not the formula you are truly interested in, what you actually care about is, say, the median of the product of absolute deviations from the median, but you calculated the ordinary correlation because that’s all that you could think of to do, or that’s the only relevant function the software you used had available.
  • (11) Undefined – A and B were generated from Cauchy distributions, and the true correlation between these two variables is mathematically undefined (much like how the Cauchy distribution has an undefined expected value), so your calculated value of 0.32 is a meaningless estimate of an undefined quantity.
  • (12) Bug – there was a bug in the correlation calculation or data loading code, or data cleaning code and your software just happened to output 0.32, but that’s not actually the correlation value or a value of interest at all.
  • (13) Wrong Calculation – You only thought you had run your correlation coefficient code, but you accidentally ran the wrong program (the one you wrote as a teen), and the number 0.32 was the fraction of baseball cards you had at that time that were in mint condition.
  • (14) Memory Mistake – Your faulty memory only makes you think the correlation between A and B is 0.32, when in fact, the correlation number you saw on your computer screen (a mere 60 seconds ago) was 0.23 (thanks be to the unreliable three pounds of jello between your ears).
  • (15) Dreaming – You are actually asleep right now (with your head slumped down on your keyword, and your knocked-over cup of tea dripping Earl Grey on the carpet), and you only dreamed that you calculated a 0.32 correlation, but when you wake up the correlation will turn out to be, say, -0.134 (or maybe you never collected the data in the first place and will get an F on your assignment).
  • (16) God – God, it turns out, predetermined everything, and is, therefore, the one and only cause, so is the true (and only) explanation for this correlation of 0.32.
  • (17) Solipsism – Metaphysical Solipsism turns out to be true, and the world has no independent existence outside of your mind, so in effect, you are the only cause, and therefore you are the cause of this experience of observation of 0.32 correlation (but what does it say about you that you would create this experience?)
  • (18) Many Worlds – The Many World’s hypothesis is true, and the laws of physics don’t forbid any possible values of this particular correlation, hence for any number r within the range -1 to 1, there is a you who witnessed that correlation r, but most you’s most likely witnessed a correlation close to 0.32 (as evidenced by the fact that you witnessed it and that you are unlikely to have witnessed an unlikely event).
  • (19) Simulation – The Simulation Hypothesis is true, you are part of a simulation, and the (simulated) world you live in was designed by conscious beings to have a 0.32 correlation between variables A and B (the true reason for which you will surely never understand, perhaps it has to do with getting published in a 5th dimension outer-reality academic journal on the niche topic of civilizational development in 4d space-time universes).
  • (20) Relativism – Truth and meaning are relative and culturally determined, and the definition and usage of correlation, as well as the ritualistic “scientific” process of calculating it, and the belief that it should be calculated at all, are merely an artifact of your time and place, which props up the power of the current ruling elite, yet is so deeply rooted in your culture that you fail to see any other possibility.
  • (21) Evil Demon – Descartes’ Evil Demon, as “clever and deceitful as he is powerful,” has been tricking you for the entirety of what you call your life, and all that you think you know of the external world is merely an illusion designed by this demon, including this godforsaken correlation.
  • (22) Boltzmann Brains – The universe is infinite (in space, or time) and hence contains an infinite number of brains produced by the brief chance alignment of particles into structures capable of consciousness, and given that only a finite number of brains could evolve on planets, you have a 100% chance of being one of these randomly coalescing “Boltzmann brains,” and you just happen to be one that momentarily believes there is such a thing as correlation and that you measured a 0.32 correlation between A and B (in a moment you will pop back out of existence – bye!)
  • (23) Contradiction – There is an as of yet undiscovered (but as the Incompleteness Theorem shows, impossible to rule out) contradiction in the axioms of mathematics as we know it, and a valid mathematical proof exists to show that A and B have any correlation including -1, Pi or 0.32.
  • Or perhaps I’m overthinking this, and just: “as A goes up, B tends to go up a bit, on average.”

  

Comments

Leave a Reply

Your email address will not be published. Required fields are marked *


  1. I loved this article. It seems to me that you didn’t exactly name one of the most likely possibilities, which I believe would have belonged under the Non-Linearity item. If we reframe the measurements of happiness and willpower as measurements of *behaviors* (specifically, social behaviors), and not as relatively accurate indications of inner subjective experience, then a self-reported subjective trait score is a measure of a “behavioral proclivity bias” in offering data about oneself in a survey situation.

    I believe the question then shifts towards something like: “How correlated are the behaviors of rating one’s willpower as high (or low) in surveys and rating one’s happiness as high (or low) in surveys?” And here, my intuition is that people who report higher willpower levels are motivated to perform behaviors which generally denote strength and confidence, such as reporting a high happiness score in a survey. And obviously, I believe the reverse will also apply.

    In my view, this is an obvious and major problem with self-rating surveys. It has always shocked me how research which leverages self-reporting seems to ignore the social psychological context. I see such surveys as being akin to Facebook or Instagram posts. They are asking us, in effect, to report to the world how we are doing. I believe sophisticated measures need to also be applied to counteract the natural tendency of promoting oneself to the world, or at least, to try to measure the extent to which a person is biased towards exaggerating positive qualities, or downplaying problems. One possible approach would be to videotape each person as they fill out the survey, then show it to one or more others, and then survey the observers in a clever way to determine how motivated the original subject might be to project a positive image, vs being completely candid and being as accurate as possible.