<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>social science &#8211; Spencer Greenberg</title>
	<atom:link href="https://www.spencergreenberg.com/tag/social-science/feed/" rel="self" type="application/rss+xml" />
	<link>https://www.spencergreenberg.com</link>
	<description></description>
	<lastBuildDate>Wed, 17 Dec 2025 01:00:27 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=7.0</generator>

<image>
	<url>https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2024/05/cropped-icon.png?fit=32%2C32&#038;ssl=1</url>
	<title>social science &#8211; Spencer Greenberg</title>
	<link>https://www.spencergreenberg.com</link>
	<width>32</width>
	<height>32</height>
</image> 
<site xmlns="com-wordpress:feed-additions:1">23753251</site>	<item>
		<title>Is IQ Legitimate or B.S.?</title>
		<link>https://www.spencergreenberg.com/2025/03/is-iq-legitimate-or-b-s/</link>
					<comments>https://www.spencergreenberg.com/2025/03/is-iq-legitimate-or-b-s/#comments</comments>
		
		<dc:creator><![CDATA[Spencer]]></dc:creator>
		<pubDate>Tue, 25 Mar 2025 23:12:21 +0000</pubDate>
				<category><![CDATA[Essays]]></category>
		<category><![CDATA[agreeableness]]></category>
		<category><![CDATA[Big Five]]></category>
		<category><![CDATA[cognitive task]]></category>
		<category><![CDATA[extraversion]]></category>
		<category><![CDATA[iq]]></category>
		<category><![CDATA[IQ score]]></category>
		<category><![CDATA[Legitimate]]></category>
		<category><![CDATA[neuroticism]]></category>
		<category><![CDATA[openness]]></category>
		<category><![CDATA[personality]]></category>
		<category><![CDATA[personality test]]></category>
		<category><![CDATA[predicition]]></category>
		<category><![CDATA[predictor]]></category>
		<category><![CDATA[social science]]></category>
		<category><![CDATA[test]]></category>
		<guid isPermaLink="false">https://www.spencergreenberg.com/?p=4365</guid>

					<description><![CDATA[Is the idea of IQ legit or total B.S.? With the replication crisis in social science, it&#8217;s worth asking this since a number of major psychology findings didn&#8217;t hold up under scrutiny. To find out, at Clearer Thinking, we ran a massive study. We tested thousands of people performing random subsets of 62 diverse cognitive [&#8230;]]]></description>
										<content:encoded><![CDATA[
<p class="wp-block-paragraph">Is the idea of IQ legit or total B.S.? With the replication crisis in social science, it&#8217;s worth asking this since a number of major psychology findings didn&#8217;t hold up under scrutiny.</p>



<p class="wp-block-paragraph">To find out, at Clearer Thinking, <a href="https://www.clearerthinking.org/post/what-s-really-true-about-intelligence-and-iq-we-empirically-tested-40-claims">we ran a massive study</a>.</p>



<p class="wp-block-paragraph">We tested thousands of people performing random subsets of 62 diverse cognitive tasks (vocab, math, logic, pattern recognition, reaction time, games, memorization, mental rotation, language learning, etc.)</p>



<p class="wp-block-paragraph">We successfully replicated a classic finding: performance on nearly all cognitive tasks correlates positively with performance on the other tasks—a phenomenon known as the &#8220;positive manifold,&#8221; foundational to IQ.</p>



<p class="wp-block-paragraph">IQ scores explained ~45% of variation across our diverse cognitive tasks, aligning with previous research. That&#8217;s very substantial for a single number (IQ score), but also far from capturing everything.</p>



<p class="wp-block-paragraph">Some of the remaining 55% variance is pure noise; the rest likely comes from task skill (developed through practice) and task-specific aptitudes (likely influenced by both genetics and childhood experiences).</p>



<p class="wp-block-paragraph">It&#8217;s also worth noting that while IQ is predictive of a diverse range of intelligence tasks, it doesn&#8217;t necessarily capture ALL that&#8217;s meant by intelligence. It&#8217;s unclear if it includes &#8220;street smarts,&#8221; social skills, or deep nature skills (like hunter-gatherers have).</p>



<p class="wp-block-paragraph">We confirmed IQ predicts interesting outcomes:</p>



<p class="wp-block-paragraph">• Actively open-minded thinking (r=0.43)</p>



<p class="wp-block-paragraph">• Household income (weakly, r=0.15)</p>



<p class="wp-block-paragraph">• Self-reported job performance (for lower IQ ranges only, r=0.48)</p>



<p class="wp-block-paragraph">• Celebrity worship (substantially negatively correlated with IQ, r=-0.42)</p>



<p class="wp-block-paragraph">• No link to happiness or life satisfaction.</p>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<p class="wp-block-paragraph">IQ captures something—but WHAT? There&#8217;s no consensus. Theories include that IQ is&#8230;</p>



<p class="wp-block-paragraph">• A single cognitive resource</p>



<p class="wp-block-paragraph">• Working memory + executive control</p>



<p class="wp-block-paragraph">• A measure of brain integration</p>



<p class="wp-block-paragraph">• The result of overlapping cognitive processes</p>



<p class="wp-block-paragraph">• The top of a hierarchy of abilities</p>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<p class="wp-block-paragraph">Guess what&#8217;s a BETTER predictor of major life outcomes than IQ?</p>



<p class="wp-block-paragraph">The Big Five personality traits.</p>



<p class="wp-block-paragraph">In our data, openness, conscientiousness, extraversion, agreeableness, and neuroticism (when all 5 are used together) outperformed IQ at predicting most outcomes.</p>



<p class="wp-block-paragraph">IQ clearly matters—it unavoidably jumps out from the data when testing people on a diverse range of cognitive tasks.</p>



<p class="wp-block-paragraph">And yet&#8230;</p>



<p class="wp-block-paragraph">IQ is far from being all that matters and is definitely not destiny.</p>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<p class="wp-block-paragraph">We used the data from our study to create a new cognitive assessment, which analyzes ability across 7 dimensions and aims to provide you with useful insights about your mind.</p>



<p class="wp-block-paragraph">If you&#8217;d like to take it, you can do so here:</p>



<p class="wp-block-paragraph"><a href="https://programs.clearerthinking.org/cognitive-test-intro.html">https://programs.clearerthinking.org/cognitive-test-intro.html</a></p>



<p class="wp-block-paragraph">(Proceeds support our mission.)</p>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<p class="wp-block-paragraph"><em>This piece was first written on March 25, 2025, and first appeared on my website on May 15, 2025.</em></p>



<p class="wp-block-paragraph"></p>
]]></content:encoded>
					
					<wfw:commentRss>https://www.spencergreenberg.com/2025/03/is-iq-legitimate-or-b-s/feed/</wfw:commentRss>
			<slash:comments>2</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">4365</post-id>	</item>
		<item>
		<title>The Crud Factor: what&#8217;s the average correlation between totally random human traits?</title>
		<link>https://www.spencergreenberg.com/2025/01/the-crud-factor/</link>
					<comments>https://www.spencergreenberg.com/2025/01/the-crud-factor/#respond</comments>
		
		<dc:creator><![CDATA[Admin]]></dc:creator>
		<pubDate>Fri, 24 Jan 2025 15:17:00 +0000</pubDate>
				<category><![CDATA[Essays]]></category>
		<category><![CDATA[crud factor]]></category>
		<category><![CDATA[effect sizes]]></category>
		<category><![CDATA[large datasets]]></category>
		<category><![CDATA[null hypothesis significance testing]]></category>
		<category><![CDATA[open data]]></category>
		<category><![CDATA[open materials]]></category>
		<category><![CDATA[psychology]]></category>
		<category><![CDATA[random]]></category>
		<category><![CDATA[random noise]]></category>
		<category><![CDATA[review]]></category>
		<category><![CDATA[social science]]></category>
		<category><![CDATA[spurious correlation]]></category>
		<guid isPermaLink="false">https://www.spencergreenberg.com/?p=4553</guid>

					<description><![CDATA[We gathered >1 million human correlations spanning personality, demographics, behaviors, and beliefs, which enabled us to investigate an interesting question: is everything correlated with everything? In other words, is the so-called &#8220;crud factor&#8221; real? Here&#8217;s our answer. On average, there is almost no correlation between human variables (see image). The average is only r=0.02, with [&#8230;]]]></description>
										<content:encoded><![CDATA[
<p class="wp-block-paragraph">We gathered >1 million human correlations spanning personality, demographics, behaviors, and beliefs, which enabled us to investigate an interesting question: is everything correlated with everything? In other words, is the so-called &#8220;crud factor&#8221; real? Here&#8217;s our answer.</p>



<p class="wp-block-paragraph">On average, there is almost no correlation between human variables (see image). The average is only r=0.02, with the median even smaller at r=0.01. But this is misleading because it allows positive correlations to be canceled out by negative ones.</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" fetchpriority="high" decoding="async" width="750" height="604" data-attachment-id="4558" data-permalink="https://www.spencergreenberg.com/2025/01/the-crud-factor/image-24/" data-orig-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-4.png?fit=1200%2C967&amp;ssl=1" data-orig-size="1200,967" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="image" data-image-description="" data-image-caption="" data-large-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-4.png?fit=750%2C604&amp;ssl=1" src="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-4.png?resize=750%2C604&#038;ssl=1" alt="" class="wp-image-4558" srcset="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-4.png?resize=1024%2C825&amp;ssl=1 1024w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-4.png?resize=300%2C242&amp;ssl=1 300w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-4.png?resize=768%2C619&amp;ssl=1 768w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-4.png?w=1200&amp;ssl=1 1200w" sizes="(max-width: 750px) 100vw, 750px" /></figure>



<p class="wp-block-paragraph">So let&#8217;s instead look at the distribution of the absolute values of the correlations. The average *magnitude* is r=0.1, with a median of 0.07. This shows how small a correlation of 0.1 is &#8211; it&#8217;s about the magnitude you&#8217;d get correlating two human variables picked at random!</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" decoding="async" width="750" height="597" data-attachment-id="4556" data-permalink="https://www.spencergreenberg.com/2025/01/the-crud-factor/image-22/" data-orig-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-2.png?fit=1200%2C955&amp;ssl=1" data-orig-size="1200,955" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="image" data-image-description="" data-image-caption="" data-large-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-2.png?fit=750%2C597&amp;ssl=1" src="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-2.png?resize=750%2C597&#038;ssl=1" alt="" class="wp-image-4556" srcset="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-2.png?resize=1024%2C815&amp;ssl=1 1024w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-2.png?resize=300%2C239&amp;ssl=1 300w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-2.png?resize=768%2C611&amp;ssl=1 768w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-2.png?w=1200&amp;ssl=1 1200w" sizes="(max-width: 750px) 100vw, 750px" /></figure>



<p class="wp-block-paragraph">We see similar results if we look only at positive correlations. The mean and median are still r=0.1 and r=0.07. So it seems that there *is* a &#8220;crud factor&#8221; &#8211; random human things really are correlated (on average)! But not very much, just slightly. </p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" decoding="async" width="750" height="604" data-attachment-id="4557" data-permalink="https://www.spencergreenberg.com/2025/01/the-crud-factor/image-23/" data-orig-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-3.png?fit=1200%2C967&amp;ssl=1" data-orig-size="1200,967" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="image" data-image-description="" data-image-caption="" data-large-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-3.png?fit=750%2C604&amp;ssl=1" src="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-3.png?resize=750%2C604&#038;ssl=1" alt="" class="wp-image-4557" srcset="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-3.png?resize=1024%2C825&amp;ssl=1 1024w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-3.png?resize=300%2C242&amp;ssl=1 300w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-3.png?resize=768%2C619&amp;ssl=1 768w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-3.png?w=1200&amp;ssl=1 1200w" sizes="(max-width: 750px) 100vw, 750px" /></figure>



<p class="wp-block-paragraph">Yet, while there&#8217;s some correlation on average, it&#8217;s also worth noting that the most *common* correlation is right around 0! So, the correlation mean (r=0.1), median (r=0.07), and mode (r≈0) are all a bit different.</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" loading="lazy" decoding="async" width="750" height="586" data-attachment-id="4559" data-permalink="https://www.spencergreenberg.com/2025/01/the-crud-factor/image-25/" data-orig-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-5.png?fit=1200%2C938&amp;ssl=1" data-orig-size="1200,938" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="image" data-image-description="" data-image-caption="" data-large-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-5.png?fit=750%2C586&amp;ssl=1" src="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-5.png?resize=750%2C586&#038;ssl=1" alt="" class="wp-image-4559" srcset="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-5.png?resize=1024%2C800&amp;ssl=1 1024w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-5.png?resize=300%2C235&amp;ssl=1 300w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-5.png?resize=768%2C600&amp;ssl=1 768w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-5.png?w=1200&amp;ssl=1 1200w" sizes="auto, (max-width: 750px) 100vw, 750px" /></figure>



<p class="wp-block-paragraph">Finally, note that different data sets could find different results depending on what they contain. But, to my knowledge, this is the most diverse and largest test of its kind by far.</p>



<p class="wp-block-paragraph">But wouldn&#8217;t we measure a non-zero correlation even if there were no true correlations, just due to random chance?&nbsp;</p>



<p class="wp-block-paragraph">The answer is yes, but if there were no true correlations, the average absolute value of correlation would be a lot smaller. I ran a simulation where I generated fake correlations (where the true correlation was zero, any measured correlation was just due to noise arising from a limited sample size). I did this using the real sample sizes from our actual database of correlations.&nbsp;</p>



<p class="wp-block-paragraph">The result is that we still do measure an absolute value of correlation above zero, as expected, but it&#8217;s very small. The median absolute value of correlation is just 0.02 and the mean is r=0.03. We get pretty much the same numbers if we take the median and mean of just the positive correlations as well (instead of the absolute values). So even though a small amount of the crud factor likely is attributable to random noise, we can see that it is substantially bigger than what we&#8217;d expect if it was <em>all</em> just due to random noise.</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" loading="lazy" decoding="async" width="750" height="559" data-attachment-id="4554" data-permalink="https://www.spencergreenberg.com/2025/01/the-crud-factor/image-20/" data-orig-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image.png?fit=1466%2C1093&amp;ssl=1" data-orig-size="1466,1093" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="image" data-image-description="" data-image-caption="" data-large-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image.png?fit=750%2C559&amp;ssl=1" src="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image.png?resize=750%2C559&#038;ssl=1" alt="" class="wp-image-4554" srcset="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image.png?resize=1024%2C763&amp;ssl=1 1024w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image.png?resize=300%2C224&amp;ssl=1 300w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image.png?resize=768%2C573&amp;ssl=1 768w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image.png?w=1466&amp;ssl=1 1466w" sizes="auto, (max-width: 750px) 100vw, 750px" /></figure>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" loading="lazy" decoding="async" width="750" height="595" data-attachment-id="4555" data-permalink="https://www.spencergreenberg.com/2025/01/the-crud-factor/image-21/" data-orig-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-1.png?fit=1377%2C1093&amp;ssl=1" data-orig-size="1377,1093" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="image" data-image-description="" data-image-caption="" data-large-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-1.png?fit=750%2C595&amp;ssl=1" src="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-1.png?resize=750%2C595&#038;ssl=1" alt="" class="wp-image-4555" srcset="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-1.png?resize=1024%2C813&amp;ssl=1 1024w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-1.png?resize=300%2C238&amp;ssl=1 300w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-1.png?resize=768%2C610&amp;ssl=1 768w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2025/10/image-1.png?w=1377&amp;ssl=1 1377w" sizes="auto, (max-width: 750px) 100vw, 750px" /></figure>



<p class="wp-block-paragraph">You can explore our 1 million correlations for free with our Personality Map tool: <a href="https://t.co/5FDd2eoNSy">https://</a><a href="http://personalitymap.io">personalitymap.io</a></p>



<p class="wp-block-paragraph">To learn more about the &#8220;crud factor&#8221; in psychology, check out this interesting paper:<br><a href="https://journals.sagepub.com/doi/pdf/10.1177/2515245920917961">Crud (Re)Defined.</a></p>



<p class="wp-block-paragraph"></p>
]]></content:encoded>
					
					<wfw:commentRss>https://www.spencergreenberg.com/2025/01/the-crud-factor/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">4553</post-id>	</item>
		<item>
		<title>Trusting the science</title>
		<link>https://www.spencergreenberg.com/2024/11/trusting-the-science/</link>
					<comments>https://www.spencergreenberg.com/2024/11/trusting-the-science/#comments</comments>
		
		<dc:creator><![CDATA[Admin]]></dc:creator>
		<pubDate>Wed, 20 Nov 2024 15:35:00 +0000</pubDate>
				<category><![CDATA[Essays]]></category>
		<category><![CDATA[antiintellectualism]]></category>
		<category><![CDATA[bias]]></category>
		<category><![CDATA[dichotomous thinking]]></category>
		<category><![CDATA[distrust]]></category>
		<category><![CDATA[fake]]></category>
		<category><![CDATA[fraud]]></category>
		<category><![CDATA[fraudulent science]]></category>
		<category><![CDATA[importance hacking]]></category>
		<category><![CDATA[motivated reasoning]]></category>
		<category><![CDATA[nuanced thinking]]></category>
		<category><![CDATA[p-hacking]]></category>
		<category><![CDATA[polarization]]></category>
		<category><![CDATA[pragmatism]]></category>
		<category><![CDATA[repliation crisis]]></category>
		<category><![CDATA[science]]></category>
		<category><![CDATA[social science]]></category>
		<category><![CDATA[variability]]></category>
		<guid isPermaLink="false">https://www.spencergreenberg.com/?p=4249</guid>

					<description><![CDATA[Is it a bad idea to broadly tell people to just &#8220;trust the science&#8221;? I think so. The reason stems from my thinking that all of the following are important and true (and too often overlooked) regarding science: 1) A lot of science is real AND valuable to society. 2) A lot of &#8220;science&#8221; is [&#8230;]]]></description>
										<content:encoded><![CDATA[
<p class="wp-block-paragraph">Is it a bad idea to broadly tell people to just &#8220;trust the science&#8221;? I think so.</p>



<p class="wp-block-paragraph">The reason stems from my thinking that all of the following are important and true (and too often overlooked) regarding science:</p>



<p class="wp-block-paragraph">1) A lot of science is real AND valuable to society.</p>



<p class="wp-block-paragraph">2) A lot of &#8220;science&#8221; is actually fake &#8211; see, for instance, a decent percentage of papers in psychology 15 years ago.</p>



<p class="wp-block-paragraph">3) &#8220;Science&#8221; (as an approach to knowledge discovery) is one of humanity&#8217;s greatest inventions &#8211; but in practice, it is reasonably often misapplied, or the process is distorted due to bad incentives or poor training. Unfortunately, not all fields of science have done a good job of being self-correcting either, so sometimes, fields go in bad directions for quite a while and need reform. There are different kinds of bad science:</p>



<p class="wp-block-paragraph">(i) Sometimes, science is &#8220;bad&#8221; because it uses unsound methods for figuring out the truth (such as when p-hacking is rampant).</p>



<p class="wp-block-paragraph">(ii) Sometimes it is &#8220;bad&#8221; because it overclaims (e.g., &#8220;Importance Hacking&#8221; where scientists claim they found something important/valuable when they didn&#8217;t actually demonstrate what they claim in their study. Or cases where science is used to &#8220;prove&#8221; questions that can&#8217;t be proven by science &#8211; such as which policy is better in a particular context when it&#8217;s actually a tradeoff between different values).</p>



<p class="wp-block-paragraph">(iii) Other times science is bad because it is biased (e.g., when people are only willing to run or publish studies that show X but not that show the opposite of X).</p>



<p class="wp-block-paragraph">(iv) And sometimes science is bad because it&#8217;s simply fraudulent.</p>



<p class="wp-block-paragraph">4) Promoting broad &#8220;trust the science&#8221; is misguided (and actually harmful) because a bunch of science is fake. If you tell people to always just &#8220;trust the science,&#8221; then you are going to cause them to be tricked by a bunch of bad science, or you are going to contribute to their disillusionment and loss of trust when they discover (correctly) that some of the science you&#8217;re saying is good is actually garbage.</p>



<p class="wp-block-paragraph">5) The &#8220;distrust all science&#8221; view is probably an even worse take than &#8220;trust the science.&#8221; If you distrust all science, you are likely to miss out on incredible things (such as highly effective treatments), and you set yourself up to fall for tons of things that don&#8217;t work (e.g., widely used unscientific treatments). Those who tell people to always just &#8220;trust the science&#8221; sometimes accidentally push people into the &#8220;distrust all science&#8221; view when those people realize that some of what they are being told to trust is crap.</p>



<p class="wp-block-paragraph">6) So, hard as it is, rather than promoting either &#8220;trust all science&#8221; or &#8220;distrust all science,&#8221; the course of action I believe in with regard to science education is to teach people that &#8220;Science&#8221; (as a method) is an incredibly powerful and useful invention, but that &#8220;science&#8221; (as actually practiced) is much like every other field: some of it is good, some of it is crap. There are good hairdressers and bad hairdressers, and there is good science and bad science (and unfortunately, some bad science ends up in the very top journals &#8211; while journals and peer review absolutely do block some bad science, they unfortunately still let through quite a lot of it).</p>



<p class="wp-block-paragraph">Since some science is well done, and some of it is poorly done, it&#8217;s very valuable to learn to tell the difference to make the best use of scientific results &#8211; both with regard to applying it in your own life and using it to form your beliefs about the world.</p>



<p class="wp-block-paragraph">If we pretend science is all good or all bad, we do a lot of harm. We need nuance to see through the bad stuff while maintaining the tremendous benefits.</p>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<p class="wp-block-paragraph"><em>This piece was first written on November 20, 2024, and first appeared on my website on January 14, 2025.</em></p>
]]></content:encoded>
					
					<wfw:commentRss>https://www.spencergreenberg.com/2024/11/trusting-the-science/feed/</wfw:commentRss>
			<slash:comments>1</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">4249</post-id>	</item>
		<item>
		<title>Importance Hacking: a major (yet rarely-discussed) problem in science</title>
		<link>https://www.spencergreenberg.com/2022/12/importance-hacking-a-major-yet-rarely-discussed-problem-in-science/</link>
					<comments>https://www.spencergreenberg.com/2022/12/importance-hacking-a-major-yet-rarely-discussed-problem-in-science/#comments</comments>
		
		<dc:creator><![CDATA[admin]]></dc:creator>
		<pubDate>Tue, 20 Dec 2022 01:45:00 +0000</pubDate>
				<category><![CDATA[Essays]]></category>
		<category><![CDATA[beauty hacking]]></category>
		<category><![CDATA[career incentives]]></category>
		<category><![CDATA[chance]]></category>
		<category><![CDATA[clarity]]></category>
		<category><![CDATA[culture of science]]></category>
		<category><![CDATA[fraud]]></category>
		<category><![CDATA[generalizability]]></category>
		<category><![CDATA[generalizability crisis]]></category>
		<category><![CDATA[hacking]]></category>
		<category><![CDATA[honesty]]></category>
		<category><![CDATA[importance hacking]]></category>
		<category><![CDATA[incentives]]></category>
		<category><![CDATA[integrity]]></category>
		<category><![CDATA[novelty hacking]]></category>
		<category><![CDATA[open science]]></category>
		<category><![CDATA[overclaiming]]></category>
		<category><![CDATA[p-hacking]]></category>
		<category><![CDATA[probability]]></category>
		<category><![CDATA[psychological science]]></category>
		<category><![CDATA[publish or perish]]></category>
		<category><![CDATA[reasoning processes]]></category>
		<category><![CDATA[replication crisis]]></category>
		<category><![CDATA[science]]></category>
		<category><![CDATA[social science]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[usefulness hacking]]></category>
		<category><![CDATA[veracity]]></category>
		<guid isPermaLink="false">https://www.spencergreenberg.com/?p=3057</guid>

					<description><![CDATA[I first published this post on the Clearer Thinking blog on December 19, 2022, and first cross-posted it to this site on January 21, 2023. You have probably heard the phrase &#8220;replication crisis.&#8221; It refers to the grim fact that, in a number of fields of science, when researchers attempt to replicate previously published studies, [&#8230;]]]></description>
										<content:encoded><![CDATA[
<p class="wp-block-paragraph"><em>I first published this post on the <a href="https://www.clearerthinking.org/post/importance-hacking-a-major-yet-rarely-discussed-problem-in-science">Clearer Thinking blog</a> on December 19, 2022, and first cross-posted it to this site on January 21, 2023.</em></p>



<p class="wp-block-paragraph" id="viewer-1d12a"></p>



<p class="wp-block-paragraph" id="viewer-104ln">You have probably heard the phrase &#8220;replication crisis.&#8221; It refers to the grim fact that, in a number of fields of science, when researchers attempt to replicate previously published studies, they fairly often don&#8217;t get the same results. The magnitude of the problem depends on the field, but in psychology, it seems that something like <a rel="noreferrer noopener" href="http://datacolada.org/47" target="_blank"><u>40% of studies in top journals</u></a> don&#8217;t replicate. We&#8217;ve been tackling this crisis with our new <a rel="noreferrer noopener" href="https://replications.clearerthinking.org/" target="_blank"><u><em>Transparent Replications</em></u></a> project, and this post explains one of our key ideas.</p>



<p class="wp-block-paragraph" id="viewer-2dn5g">Replication failures are sometimes simply due to bad luck, but more often, they are caused by p-hacking &#8211; the use of fishy statistical techniques that lead to statistically significant (but misleading or erroneous) results. As big a problem as p-hacking is, there is another substantial problem in science that gets talked about much less. Although certain subtypes of this problem have been named previously, to my knowledge, the problem itself has no name, so I&#8217;m giving it one: &#8220;Importance Hacking.&#8221;</p>



<p class="wp-block-paragraph" id="viewer-3hoev">Academics want to publish in the top journals in their field. To understand Importance Hacking, let&#8217;s consider a (slightly oversimplified) list of the three most commonly-discussed ways to get a paper published in top psychology journals:</p>



<ol class="wp-block-list">
<li><strong>Conduct valuable research</strong> &#8211; make a genuinely interesting or important discovery, or add something valuable to the state of scientific knowledge. This is, of course, what just about everyone wants to do, but it&#8217;s very, very hard!</li>



<li><strong>Commit fraud</strong> &#8211; for instance, by making up your data. Thankfully, very few people are willing to do this because it&#8217;s so unethical. So this is by far the least used approach.</li>



<li><strong>p-hack</strong> &#8211; use fishy statistics, HARKing (i.e., hypothesizing after the results are known), selective reporting, using hidden <a href="https://en.wikipedia.org/wiki/Researcher_degrees_of_freedom" target="_blank" rel="noreferrer noopener"><u>researcher degrees of freedom</u></a>, etc., in order to get a p&lt;0.05 result that is actually just a false positive. This is a major problem and the focus of the replication crisis. Of course, false positives can also come about without fault, due to bad luck.</li>
</ol>



<p class="wp-block-paragraph" id="viewer-5plkf">But here is a fourth way to get a paper published in a top journal: Importance Hacking.</p>



<p class="wp-block-paragraph" id="viewer-ctrs5">4. <strong>Importance Hack</strong> &#8211; get a result that is actually not interesting, not important, and not valuable, but write about it in such a way that reviewers are convinced it is interesting, important, and/or valuable, so that it gets published.</p>



<p class="wp-block-paragraph" id="viewer-f54g1">For research to be valuable to society (and, in an ideal world, publishable in top journals), it must be true AND interesting (or important, useful, etc.). Researchers sometimes p-hack their results to skirt around the &#8220;true&#8221; criterion (by generating interesting false positives). On the other hand, Importance Hacking is a method for skirting the &#8220;interesting&#8221; criterion.</p>



<p class="wp-block-paragraph" id="viewer-ft7mi">Importance Hacking is related to concepts like <em>hype</em> and <em>overselling</em>, though hype and overselling are far more general. Importance Hacking refers specifically to a phenomenon whereby research with little to no value gets published in top journals due to the use of strategies that lead reviewers to misinterpret the work. On the other hand, hype and overselling are used in many ways in many stages of research (including to make valuable research appear even more valuable).</p>



<p class="wp-block-paragraph" id="viewer-dd0l9">One way to understand importance hacking is by comparing it to p-hacking. P-hacking refers to a set of bad research practices that enable researchers to publish non-existent effects. In other words, p-hacking misleads paper reviewers into thinking that non-existent effects are real. Importance Hacking, on the other hand, encompasses a different set of bad research practices: those that lead paper reviewers to believe that real (i.e., existent) results that have little to no value actually have substantial value.</p>



<p class="wp-block-paragraph" id="viewer-2tioa">This diagram illustrates how I think Importance Hacking interferes with the pipeline of producing valuable research:</p>



<figure class="wp-block-image"><img data-recalc-dims="1" decoding="async" src="https://i0.wp.com/static.wixstatic.com/media/f4e552_e1a60b1c65514edf9fef562a77c5c4ba~mv2.jpg/v1/fill/w_1480%2Ch_904%2Cal_c%2Cq_85%2Cusm_0.66_1.00_0.01%2Cenc_auto/f4e552_e1a60b1c65514edf9fef562a77c5c4ba~mv2.jpg?w=750&#038;ssl=1" alt=""/></figure>



<p class="wp-block-paragraph" id="viewer-7u47q">There are a number of subtypes of Importance Hacking based on the method used to make a result appear interesting/important/valuable when it&#8217;s not. Here is how I subdivide them:</p>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<h2 class="wp-block-heading" id="viewer-brv18"></h2>



<h2 class="wp-block-heading" id="viewer-fh6np">Types of Importance Hacking</h2>



<p class="wp-block-paragraph" id="viewer-a5mla"><strong>1. Hacking Conclusions:</strong> make it seem like you showed some interesting thing X but actually show something else (X′) which sounds similar to X but is much less interesting/important. In these cases, researchers do not truly find what they imply they have found. This phenomenon is also closely connected with validity issues.</p>



<ul class="wp-block-list">
<li><em>Example 1: showing X is true in a simple video game but claiming that X is true in real life.</em></li>



<li><em>Example 2: showing A and B are correlated and claiming that A causes B (when really A and B are probably both caused by some third factor C, which makes the finding much less interesting).</em></li>



<li><em>Example 3: if a researcher claims to be measuring “aggression,” and couches all conclusions in these terms but is actually measuring milliliters of hot sauce that a person puts in someone else&#8217;s food. Their result about aggression will be valid only insofar as it is true that this is a valid measure of aggression.</em></li>



<li>Example 4: some types of hacking conclusions would fall under the terms &#8220;overclaiming&#8221; or &#8220;overgeneralizing;&#8221; Tal Yarkoni has a relevant paper called <a href="https://mzettersten.github.io/assets/pdf/ManyBabies_BBS_commentary.pdf" target="_blank" rel="noreferrer noopener"><em><u>The Generalizability Crisis</u></em></a><em>.</em></li>
</ul>



<p class="wp-block-paragraph" id="viewer-365fm"><strong>2. Hacking Novelty: </strong>refer to something in a way that makes it seem more novel or unintuitive than it is. Perhaps the result is already well known or is merely what just about everyone&#8217;s common sense would already tell them is true. In these cases, researchers really do find what they claim to have found, but what they found is not novel (despite them making it seem so). Hacking Novelty is also connected to the &#8220;Jingle-jangle&#8221; fallacy &#8211; where people can be led to believe two identical concepts are different because they have different names (or, more subtly, because they are operationalized somewhat differently).</p>



<ul class="wp-block-list">
<li><em>Example 1: showing something that is already well-known but giving it a new name that leads people to think it is something new. The concept of “grit” has received this criticism; some people claim it could turn out to be just another word for conscientiousness (or already known facets of conscientiousness) &#8211; though this question does not yet seem to be settled (different sides of this debate can be found in these papers: </em><a rel="noreferrer noopener" href="https://www.researchgate.net/publication/6290064_Grit_Perseverance_and_Passion_for_Long-Term_Goals" target="_blank"><em><u>1</u></em></a><em>, </em><a rel="noreferrer noopener" href="https://journals.sagepub.com/doi/pdf/10.1002/per.2171" target="_blank"><em><u>2</u></em></a><em>, </em><a rel="noreferrer noopener" href="https://drive.google.com/file/d/1NzMPCgZ_Ipbmzewgaj0dmopkfLq582NA/view" target="_blank"><em><u>3</u></em></a><em> and <u><a href="https://www.researchgate.net/publication/304032119_Much_Ado_About_Grit_A_Meta-Analytic_Synthesis_of_the_Grit_Literature">4</a></u>).</em></li>



<li><em>Example 2: showing that A and B are correlated, which seems surprising given how the constructs are named, but if you were to dig into how A and B were measured, it would be obvious they would be correlated.</em></li>



<li><em>Example 3: showing a common-sense result that almost everyone already would predict but making it seem like it&#8217;s not obvious (e.g., by giving it a fancy scientific name).</em></li>
</ul>



<p class="wp-block-paragraph" id="viewer-a209k"><strong>3. Hacking Usefulness: </strong>make a result seem useful or relevant to some important outcome when in fact, it&#8217;s useless and irrelevant. In these cases, researchers find what they claim to have found, but what they find is not useful (despite them making it sound useful).</p>



<ul class="wp-block-list">
<li><em>Example: focusing on statistical significance when the effect size is so small that the result is useless. Clinicians often distinguish between “statistical significance” and “clinical significance” to highlight the pitfalls of ignoring effect sizes when considering the importance of a finding.</em></li>
</ul>



<p class="wp-block-paragraph" id="viewer-etfss"><strong>4. Hacking Beauty: </strong>make a result seem clean and beautiful when in fact, it&#8217;s messy or hard to interpret. In these cases, researchers focus on certain details or results and tell a story around those, but they could have focused on other details or results that would have made the story less pretty, less clear-cut, or harder to make sense of. This is related to Giner-Sorolla’s 2012 paper <a href="https://journals.sagepub.com/doi/pdf/10.1177/1745691612457576" target="_blank" rel="noreferrer noopener"><em><u>Science or art: How aesthetic standards grease the way through the publication bottleneck but undermine science</u></em></a><em>. </em>Hacking beauty sometimes reduces to selective reporting of some kind (i.e., selective reporting of measures, analyses, or studies) or at least of selective focus on certain findings and not others. This becomes more difficult with pre-registration; if you have to report the results of planned analyses, there’s less room to make them look pretty (you could just <em>say</em> they’re pretty, but that seems like overclaiming)</p>



<ul class="wp-block-list">
<li><em>Example: emphasizing the parts of the result that tell a clean story while not including (or burying somewhere in the paper) the parts that contradict that story</em></li>
</ul>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<p class="wp-block-paragraph" id="viewer-56mr8">Science faces multiple challenges. Over the past decade, the <a rel="noreferrer noopener" href="https://en.wikipedia.org/wiki/Replication_crisis" target="_blank"><u>replication crisis</u></a> and subsequent <a rel="noreferrer noopener" href="https://en.wikipedia.org/wiki/Open_science" target="_blank"><u>open science movement</u></a> have greatly increased awareness of p-hacking as a problem. Measures have begun to be put in place to reduce p-hacking. Importance Hacking is another substantial problem, but it has received far less attention.</p>



<figure class="wp-block-image"><img data-recalc-dims="1" decoding="async" src="https://i0.wp.com/static.wixstatic.com/media/f4e552_94289803042f43d68a85e7c490b1fa1c~mv2.jpg/v1/fill/w_1480%2Ch_1110%2Cal_c%2Cq_85%2Cusm_0.66_1.00_0.01%2Cenc_auto/f4e552_94289803042f43d68a85e7c490b1fa1c~mv2.jpg?w=750&#038;ssl=1" alt=""/><figcaption class="wp-element-caption"><em>Digital art created using the A.I. DALL</em>·<em>E</em></figcaption></figure>



<p class="wp-block-paragraph" id="viewer-at41b"></p>



<p class="wp-block-paragraph" id="viewer-aqs8s">If a pipe is leaking from two holes and its pressure is kept fixed, then repairing one hole will result in the other one leaking faster. Similarly, as best practices increasingly become commonplace as a means to reduce p-hacking, so long as the career pressures to publish in top journals don&#8217;t let up, the occurrence of Importance Hacking may increase.</p>



<p class="wp-block-paragraph" id="viewer-3rjml">It&#8217;s time to start the conversation about how Importance Hacking can be addressed.</p>



<p class="wp-block-paragraph" id="viewer-agpq6">If you&#8217;re interested in learning more about Importance Hacking, you can listen to <a rel="noreferrer noopener" href="https://clearerthinkingpodcast.com/episode/122" target="_blank"><u>psychology professor Alexa Tullett and me discussing it on the Clearer Thinking podcast</u></a> (there, I refer to it as &#8220;Importance Laundering,&#8221; but I now think &#8220;Importance Hacking&#8221; is a better name) or me talking about it on the <a rel="noreferrer noopener" href="https://www.fourbeers.com/98" target="_blank"><u>Two Psychologists Four Beers podcast</u></a>. We also discuss my new project, <a rel="noreferrer noopener" href="https://replications.clearerthinking.org/" target="_blank"><u>Transparent Replications</u></a>, which conducts rapid replications of recently published psychology papers in top journals in an effort to shift incentives and create more reliable, replicable research. If you enjoyed this article, you may be interested in checking our <a rel="noreferrer noopener" href="https://replications.clearerthinking.org/replications/" target="_blank"><u>replication reports</u></a> and learning more <a rel="noreferrer noopener" href="https://replications.clearerthinking.org/about/" target="_blank"><u>about the project</u></a>.</p>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<p class="wp-block-paragraph" id="viewer-es1me"><em>Did you like this article? If so, you may like to explore the ClearerThinking Podcast, where I have fun, in-depth conversations with brilliant people about ideas that matter. </em><a rel="noreferrer noopener" href="https://clearerthinkingpodcast.com/" target="_blank"><em><u>Click here to see a full list of episodes</u></em></a><em>.</em></p>
]]></content:encoded>
					
					<wfw:commentRss>https://www.spencergreenberg.com/2022/12/importance-hacking-a-major-yet-rarely-discussed-problem-in-science/feed/</wfw:commentRss>
			<slash:comments>6</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">3057</post-id>	</item>
		<item>
		<title>How can we look at the same dataset and come to wildly different conclusions?</title>
		<link>https://www.spencergreenberg.com/2022/11/how-can-we-look-at-the-same-dataset-and-come-to-wildly-different-conclusions/</link>
					<comments>https://www.spencergreenberg.com/2022/11/how-can-we-look-at-the-same-dataset-and-come-to-wildly-different-conclusions/#respond</comments>
		
		<dc:creator><![CDATA[admin]]></dc:creator>
		<pubDate>Wed, 30 Nov 2022 14:30:00 +0000</pubDate>
				<category><![CDATA[Essays]]></category>
		<category><![CDATA[degrees of freedom]]></category>
		<category><![CDATA[hypothesis testing]]></category>
		<category><![CDATA[interpretation]]></category>
		<category><![CDATA[perspective]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[science]]></category>
		<category><![CDATA[social science]]></category>
		<category><![CDATA[statistical methods]]></category>
		<category><![CDATA[statistics]]></category>
		<guid isPermaLink="false">https://www.spencergreenberg.com/?p=3602</guid>

					<description><![CDATA[Recently, a study came out where 73 research teams independently analyzed the same data, all trying to test the same hypothesis. Seventy-one of the teams came up with numerical results across a total of 1,253 models. Across these 1,253 different ways of looking at the data, about 58% showed no effect, 17% showed a positive [&#8230;]]]></description>
										<content:encoded><![CDATA[
<p class="wp-block-paragraph">Recently, a study came out where 73 research teams independently analyzed the same data, all trying to test the same hypothesis. Seventy-one of the teams came up with numerical results across a total of 1,253 models. Across these 1,253 different ways of looking at the data, about 58% showed no effect, 17% showed a positive effect, and 25% showed a negative effect. But that&#8217;s not even the oddest part.&nbsp;</p>



<p class="wp-block-paragraph">The oddest part is that despite a heroic attempt to do so, the study authors failed to explain why the different research teams reached such different conclusions:</p>



<p class="wp-block-paragraph">&#8220;More than 95% of the total variance in numerical results remains unexplained even after qualitative coding of all identifiable decisions in each team&#8217;s workflow. This reveals a universe of uncertainty that remains hidden when considering a single study in isolation.&#8221;</p>



<p class="wp-block-paragraph">The hypothesis they were trying to test was whether greater immigration reduces support for social policies (such as for government-provided healthcare).</p>



<p class="wp-block-paragraph">The study included data from 31 countries at up to five time periods each.</p>



<p class="wp-block-paragraph">If all of the countries and time points were independent data points, that would be an effective sample size of at most 5*31 = 155.</p>



<p class="wp-block-paragraph">However, the data points from one country at different points in time are highly correlated.</p>



<p class="wp-block-paragraph">So, in practice, this might be equivalent to more like 75 (independent) data points.</p>



<p class="wp-block-paragraph">If the (effective) sample size was equivalent to only about n=75, the 95th percentile confidence interval on a correlation could be pretty large (e.g., it could be +- 0.20), suggesting that false negatives would be very common (unless the relationship in question is pretty strong).</p>



<p class="wp-block-paragraph">In view of this, I think it&#8217;s possible that this study was doomed from the start.</p>



<p class="wp-block-paragraph">Why? Unless we&#8217;d expect a reasonably strong effect, maybe there just wasn&#8217;t enough data to answer the question at hand.</p>



<p class="wp-block-paragraph">Some of the teams may have come to this conclusion as well. Of the 73 research teams involved in the study, one of them conducted preliminary measurement scaling tests and concluded that the hypothesis could not be reliably tested. Another team&#8217;s preregistered models failed to converge, and so that team also had no numerical results. Some teams reached more than one conclusion, and across the 89 team conclusions reached, 12 of those conclusions (13.5%) were that the hypothesis was not testable with the data provided.</p>



<p class="wp-block-paragraph">Then, of course, there is the possibility of confounding variables &#8211; other factors that could be linked to both immigration and inflation that make the true relationship seem bigger or smaller than it really is.</p>



<p class="wp-block-paragraph">Here&#8217;s a link to the paper if you&#8217;re interested: &#8220;<a target="_blank" href="https://www.pnas.org/doi/10.1073/pnas.2203150119" rel="noreferrer noopener">Observing many researchers using the same data and hypothesis reveals a hidden universe of uncertainty</a>.&#8221;</p>



<p class="wp-block-paragraph">A big thanks to Cameron Colby Thomson for pointing me to this study!</p>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<p class="wp-block-paragraph"><em>This piece was first written on November 30, 2022, and first appeared on this site on September 29, 2023.</em></p>
]]></content:encoded>
					
					<wfw:commentRss>https://www.spencergreenberg.com/2022/11/how-can-we-look-at-the-same-dataset-and-come-to-wildly-different-conclusions/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">3602</post-id>	</item>
		<item>
		<title>It can be shockingly hard just to understand three variables</title>
		<link>https://www.spencergreenberg.com/2021/04/it-can-be-shockingly-hard-just-to-understand-three-variables/</link>
					<comments>https://www.spencergreenberg.com/2021/04/it-can-be-shockingly-hard-just-to-understand-three-variables/#respond</comments>
		
		<dc:creator><![CDATA[admin]]></dc:creator>
		<pubDate>Tue, 20 Apr 2021 00:22:00 +0000</pubDate>
				<category><![CDATA[Essays]]></category>
		<category><![CDATA[causality]]></category>
		<category><![CDATA[confounding variables]]></category>
		<category><![CDATA[correlation]]></category>
		<category><![CDATA[diagram]]></category>
		<category><![CDATA[hidden confounder]]></category>
		<category><![CDATA[hypotheses]]></category>
		<category><![CDATA[mediation]]></category>
		<category><![CDATA[mistakes]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[science]]></category>
		<category><![CDATA[social science]]></category>
		<category><![CDATA[statistical inference]]></category>
		<category><![CDATA[statistics]]></category>
		<guid isPermaLink="false">https://www.spencergreenberg.com/?p=3729</guid>

					<description><![CDATA[In science (and when developing hypotheses more generally), it is very common to come across situations where a variable of interest (let’s call this the dependent variable, “Y”) is strongly correlated with at least two other variables (let’s call them “A” and “B”). Here are some examples:  In all these examples, we know that at [&#8230;]]]></description>
										<content:encoded><![CDATA[
<p class="wp-block-paragraph">In science (and when developing hypotheses more generally), it is very common to come across situations where a variable of interest (let’s call this the dependent variable, “Y”) is strongly correlated with at least two other variables (let’s call them “A” and “B”). Here are some examples: </p>



<ul class="wp-block-list">
<li>If you’re a psychology researcher investigating possible causes of depression (Y), you may have trouble disentangling the effects of poor sleep quality (A) and anxiety (B), both of which tend to be correlated with depression.</li>



<li>If you’re a health researcher investigating the causes of diabetes (Y), you may have trouble disentangling the effects of high carbohydrate intake (A) and obesity (B).</li>



<li>If you’re investigating the causes of high life satisfaction (Y), you may have trouble disentangling the effects of friendship quality (A) and mental well-being (B).</li>
</ul>



<p class="wp-block-paragraph">In all these examples, we know that at least two of the variables (A and B) are related to the main variable (Y), but the really tricky question is to figure out what all the possible causal relationships are between the three. For instance, does A cause B, which causes Y, does Y cause both A and B, or is there some other explanation?&nbsp;</p>



<p class="wp-block-paragraph">In the pdf below, I sketch out 45 possible explanations to consider in situations where there are two variables that both correlate with a third variable of interest.</p>



<p class="wp-block-paragraph">First of all, there are the types of causal relationships one often expects, where A and B both cause Y in simple ways (either directly or through each other):</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" loading="lazy" decoding="async" width="750" height="446" data-attachment-id="3739" data-permalink="https://www.spencergreenberg.com/2021/04/it-can-be-shockingly-hard-just-to-understand-three-variables/image-1-4/" data-orig-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-1.png?fit=2376%2C1412&amp;ssl=1" data-orig-size="2376,1412" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="image-1" data-image-description="" data-image-caption="" data-large-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-1.png?fit=750%2C446&amp;ssl=1" src="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-1.png?resize=750%2C446&#038;ssl=1" alt="" class="wp-image-3739" srcset="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-1.png?resize=1024%2C609&amp;ssl=1 1024w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-1.png?resize=300%2C178&amp;ssl=1 300w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-1.png?resize=768%2C456&amp;ssl=1 768w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-1.png?resize=1536%2C913&amp;ssl=1 1536w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-1.png?resize=2048%2C1217&amp;ssl=1 2048w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-1.png?w=2250&amp;ssl=1 2250w" sizes="auto, (max-width: 750px) 100vw, 750px" /></figure>



<p class="wp-block-paragraph">Even if A and B really do cause Y, they could be interconnected to each other in complex ways:</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" loading="lazy" decoding="async" width="750" height="247" data-attachment-id="3743" data-permalink="https://www.spencergreenberg.com/2021/04/it-can-be-shockingly-hard-just-to-understand-three-variables/image-4-4/" data-orig-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-4.png?fit=2658%2C876&amp;ssl=1" data-orig-size="2658,876" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="image-4" data-image-description="" data-image-caption="" data-large-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-4.png?fit=750%2C247&amp;ssl=1" src="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-4.png?resize=750%2C247&#038;ssl=1" alt="" class="wp-image-3743" srcset="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-4.png?resize=1024%2C337&amp;ssl=1 1024w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-4.png?resize=300%2C99&amp;ssl=1 300w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-4.png?resize=768%2C253&amp;ssl=1 768w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-4.png?resize=1536%2C506&amp;ssl=1 1536w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-4.png?resize=2048%2C675&amp;ssl=1 2048w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-4.png?w=2250&amp;ssl=1 2250w" sizes="auto, (max-width: 750px) 100vw, 750px" /></figure>



<p class="wp-block-paragraph">It also could be the case that only A or only B causes Y, with the other variable only appearing to cause Y due to a confounding effect:</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" loading="lazy" decoding="async" width="750" height="312" data-attachment-id="3740" data-permalink="https://www.spencergreenberg.com/2021/04/it-can-be-shockingly-hard-just-to-understand-three-variables/image-2-5/" data-orig-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-2.png?fit=2288%2C952&amp;ssl=1" data-orig-size="2288,952" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="image-2" data-image-description="" data-image-caption="" data-large-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-2.png?fit=750%2C312&amp;ssl=1" src="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-2.png?resize=750%2C312&#038;ssl=1" alt="" class="wp-image-3740" srcset="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-2.png?resize=1024%2C426&amp;ssl=1 1024w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-2.png?resize=300%2C125&amp;ssl=1 300w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-2.png?resize=768%2C320&amp;ssl=1 768w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-2.png?resize=1536%2C639&amp;ssl=1 1536w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-2.png?resize=2048%2C852&amp;ssl=1 2048w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-2.png?w=2250&amp;ssl=1 2250w" sizes="auto, (max-width: 750px) 100vw, 750px" /></figure>



<p class="wp-block-paragraph">It’s also possible that Y is actually one of the causes rather than merely being caused by A and B:</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" loading="lazy" decoding="async" width="750" height="716" data-attachment-id="3742" data-permalink="https://www.spencergreenberg.com/2021/04/it-can-be-shockingly-hard-just-to-understand-three-variables/image-3-4/" data-orig-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-3.png?fit=2372%2C2262&amp;ssl=1" data-orig-size="2372,2262" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="image-3" data-image-description="" data-image-caption="" data-large-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-3.png?fit=750%2C716&amp;ssl=1" src="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-3.png?resize=750%2C716&#038;ssl=1" alt="" class="wp-image-3742" srcset="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-3.png?resize=1024%2C977&amp;ssl=1 1024w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-3.png?resize=300%2C286&amp;ssl=1 300w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-3.png?resize=768%2C732&amp;ssl=1 768w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-3.png?resize=1536%2C1465&amp;ssl=1 1536w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-3.png?resize=2048%2C1953&amp;ssl=1 2048w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-3.png?w=2250&amp;ssl=1 2250w" sizes="auto, (max-width: 750px) 100vw, 750px" /></figure>



<p class="wp-block-paragraph">Then there are situations where there is a critical other variable (or set of variables – represented as a “?” below) that are integral to the causal structure:</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" loading="lazy" decoding="async" width="750" height="436" data-attachment-id="3744" data-permalink="https://www.spencergreenberg.com/2021/04/it-can-be-shockingly-hard-just-to-understand-three-variables/image-5-4/" data-orig-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-5.png?fit=2166%2C1258&amp;ssl=1" data-orig-size="2166,1258" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="image-5" data-image-description="" data-image-caption="" data-large-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-5.png?fit=750%2C436&amp;ssl=1" src="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-5.png?resize=750%2C436&#038;ssl=1" alt="" class="wp-image-3744" srcset="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-5.png?resize=1024%2C595&amp;ssl=1 1024w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-5.png?resize=300%2C174&amp;ssl=1 300w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-5.png?resize=768%2C446&amp;ssl=1 768w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-5.png?resize=1536%2C892&amp;ssl=1 1536w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-5.png?resize=2048%2C1189&amp;ssl=1 2048w" sizes="auto, (max-width: 750px) 100vw, 750px" /></figure>



<p class="wp-block-paragraph">Finally, there are situations where Y is caused by A or B (or both), but Y also causes A or B (or both), resulting in a cyclic relationship:</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" loading="lazy" decoding="async" width="662" height="1024" data-attachment-id="3745" data-permalink="https://www.spencergreenberg.com/2021/04/it-can-be-shockingly-hard-just-to-understand-three-variables/image-6-4/" data-orig-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-6.png?fit=1398%2C2162&amp;ssl=1" data-orig-size="1398,2162" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="image-6" data-image-description="" data-image-caption="" data-large-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-6.png?fit=662%2C1024&amp;ssl=1" src="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-6.png?resize=662%2C1024&#038;ssl=1" alt="" class="wp-image-3745" srcset="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-6.png?resize=662%2C1024&amp;ssl=1 662w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-6.png?resize=194%2C300&amp;ssl=1 194w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-6.png?resize=768%2C1188&amp;ssl=1 768w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-6.png?resize=993%2C1536&amp;ssl=1 993w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-6.png?resize=1324%2C2048&amp;ssl=1 1324w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-6.png?w=1398&amp;ssl=1 1398w" sizes="auto, (max-width: 662px) 100vw, 662px" /></figure>



<p class="wp-block-paragraph">Here&#8217;s a <a href="https://1231047546.rsc.cdn77.org/images/Causal_relationships/Cause%20diagrams%20for%20one%20outcome%20all%20possibilitities%20causal%20updated_3.pdf">link to my pdf</a> showing most of the possible relationships.</p>



<div data-wp-interactive="core/file" class="wp-block-file"><object data-wp-bind--hidden="!state.hasPdfPreview" hidden class="wp-block-file__embed" data="https://www.spencergreenberg.com/wp-content/uploads/2023/11/Cause-diagrams-for-one-outcome-all-possibilitities-causal-updated_3-1.pdf" type="application/pdf" style="width:100%;height:600px" aria-label="Embed of Cause-diagrams-for-one-outcome-all-possibilitities-causal-updated_3-1."></object><a id="wp-block-file--media-0b96b3a8-51af-460c-a940-f0e50b5f8a08" href="https://www.spencergreenberg.com/wp-content/uploads/2023/11/Cause-diagrams-for-one-outcome-all-possibilitities-causal-updated_3-1.pdf">Cause-diagrams-for-one-outcome-all-possibilitities-causal-updated_3-1</a><a href="https://www.spencergreenberg.com/wp-content/uploads/2023/11/Cause-diagrams-for-one-outcome-all-possibilitities-causal-updated_3-1.pdf" class="wp-block-file__button wp-element-button" download aria-describedby="wp-block-file--media-0b96b3a8-51af-460c-a940-f0e50b5f8a08">Download</a></div>



<p class="wp-block-paragraph">If you want to read about other challenges associated with untangling causality in the real world, you can read another post about this&nbsp;<a href="https://www.spencergreenberg.com/2023/09/three-reasons-to-be-cautious-when-reading-data-driven-explanations/" target="_blank" rel="noreferrer noopener">here</a>.</p>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<p class="wp-block-paragraph"><em>I first created this diagram on April 19, 2021. I made minor edits to the diagram and wrote this piece with assistance from Clare Harris. This piece first appeared on my <a href="https://www.spencergreenberg.com/all-essays/">website</a> on November 22, 2023.</em></p>
]]></content:encoded>
					
					<wfw:commentRss>https://www.spencergreenberg.com/2021/04/it-can-be-shockingly-hard-just-to-understand-three-variables/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">3729</post-id>	</item>
	</channel>
</rss>
