<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>hypotheses &#8211; Spencer Greenberg</title>
	<atom:link href="https://www.spencergreenberg.com/tag/hypotheses/feed/" rel="self" type="application/rss+xml" />
	<link>https://www.spencergreenberg.com</link>
	<description></description>
	<lastBuildDate>Thu, 24 Jul 2025 22:03:36 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.9.4</generator>

<image>
	<url>https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2024/05/cropped-icon.png?fit=32%2C32&#038;ssl=1</url>
	<title>hypotheses &#8211; Spencer Greenberg</title>
	<link>https://www.spencergreenberg.com</link>
	<width>32</width>
	<height>32</height>
</image> 
<site xmlns="com-wordpress:feed-additions:1">23753251</site>	<item>
		<title>It can be shockingly hard just to understand three variables</title>
		<link>https://www.spencergreenberg.com/2021/04/it-can-be-shockingly-hard-just-to-understand-three-variables/</link>
					<comments>https://www.spencergreenberg.com/2021/04/it-can-be-shockingly-hard-just-to-understand-three-variables/#respond</comments>
		
		<dc:creator><![CDATA[admin]]></dc:creator>
		<pubDate>Tue, 20 Apr 2021 00:22:00 +0000</pubDate>
				<category><![CDATA[Essays]]></category>
		<category><![CDATA[causality]]></category>
		<category><![CDATA[confounding variables]]></category>
		<category><![CDATA[correlation]]></category>
		<category><![CDATA[diagram]]></category>
		<category><![CDATA[hidden confounder]]></category>
		<category><![CDATA[hypotheses]]></category>
		<category><![CDATA[mediation]]></category>
		<category><![CDATA[mistakes]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[science]]></category>
		<category><![CDATA[social science]]></category>
		<category><![CDATA[statistical inference]]></category>
		<category><![CDATA[statistics]]></category>
		<guid isPermaLink="false">https://www.spencergreenberg.com/?p=3729</guid>

					<description><![CDATA[In science (and when developing hypotheses more generally), it is very common to come across situations where a variable of interest (let’s call this the dependent variable, “Y”) is strongly correlated with at least two other variables (let’s call them “A” and “B”). Here are some examples:  In all these examples, we know that at [&#8230;]]]></description>
										<content:encoded><![CDATA[
<p>In science (and when developing hypotheses more generally), it is very common to come across situations where a variable of interest (let’s call this the dependent variable, “Y”) is strongly correlated with at least two other variables (let’s call them “A” and “B”). Here are some examples: </p>



<ul class="wp-block-list">
<li>If you’re a psychology researcher investigating possible causes of depression (Y), you may have trouble disentangling the effects of poor sleep quality (A) and anxiety (B), both of which tend to be correlated with depression.</li>



<li>If you’re a health researcher investigating the causes of diabetes (Y), you may have trouble disentangling the effects of high carbohydrate intake (A) and obesity (B).</li>



<li>If you’re investigating the causes of high life satisfaction (Y), you may have trouble disentangling the effects of friendship quality (A) and mental well-being (B).</li>
</ul>



<p>In all these examples, we know that at least two of the variables (A and B) are related to the main variable (Y), but the really tricky question is to figure out what all the possible causal relationships are between the three. For instance, does A cause B, which causes Y, does Y cause both A and B, or is there some other explanation?&nbsp;</p>



<p>In the pdf below, I sketch out 45 possible explanations to consider in situations where there are two variables that both correlate with a third variable of interest.</p>



<p>First of all, there are the types of causal relationships one often expects, where A and B both cause Y in simple ways (either directly or through each other):</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" fetchpriority="high" decoding="async" width="750" height="446" data-attachment-id="3739" data-permalink="https://www.spencergreenberg.com/2021/04/it-can-be-shockingly-hard-just-to-understand-three-variables/image-1-4/" data-orig-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-1.png?fit=2376%2C1412&amp;ssl=1" data-orig-size="2376,1412" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="image-1" data-image-description="" data-image-caption="" data-large-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-1.png?fit=750%2C446&amp;ssl=1" src="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-1.png?resize=750%2C446&#038;ssl=1" alt="" class="wp-image-3739" srcset="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-1.png?resize=1024%2C609&amp;ssl=1 1024w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-1.png?resize=300%2C178&amp;ssl=1 300w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-1.png?resize=768%2C456&amp;ssl=1 768w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-1.png?resize=1536%2C913&amp;ssl=1 1536w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-1.png?resize=2048%2C1217&amp;ssl=1 2048w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-1.png?w=2250&amp;ssl=1 2250w" sizes="(max-width: 750px) 100vw, 750px" /></figure>



<p>Even if A and B really do cause Y, they could be interconnected to each other in complex ways:</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" decoding="async" width="750" height="247" data-attachment-id="3743" data-permalink="https://www.spencergreenberg.com/2021/04/it-can-be-shockingly-hard-just-to-understand-three-variables/image-4-4/" data-orig-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-4.png?fit=2658%2C876&amp;ssl=1" data-orig-size="2658,876" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="image-4" data-image-description="" data-image-caption="" data-large-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-4.png?fit=750%2C247&amp;ssl=1" src="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-4.png?resize=750%2C247&#038;ssl=1" alt="" class="wp-image-3743" srcset="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-4.png?resize=1024%2C337&amp;ssl=1 1024w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-4.png?resize=300%2C99&amp;ssl=1 300w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-4.png?resize=768%2C253&amp;ssl=1 768w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-4.png?resize=1536%2C506&amp;ssl=1 1536w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-4.png?resize=2048%2C675&amp;ssl=1 2048w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-4.png?w=2250&amp;ssl=1 2250w" sizes="(max-width: 750px) 100vw, 750px" /></figure>



<p>It also could be the case that only A or only B causes Y, with the other variable only appearing to cause Y due to a confounding effect:</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" decoding="async" width="750" height="312" data-attachment-id="3740" data-permalink="https://www.spencergreenberg.com/2021/04/it-can-be-shockingly-hard-just-to-understand-three-variables/image-2-5/" data-orig-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-2.png?fit=2288%2C952&amp;ssl=1" data-orig-size="2288,952" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="image-2" data-image-description="" data-image-caption="" data-large-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-2.png?fit=750%2C312&amp;ssl=1" src="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-2.png?resize=750%2C312&#038;ssl=1" alt="" class="wp-image-3740" srcset="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-2.png?resize=1024%2C426&amp;ssl=1 1024w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-2.png?resize=300%2C125&amp;ssl=1 300w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-2.png?resize=768%2C320&amp;ssl=1 768w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-2.png?resize=1536%2C639&amp;ssl=1 1536w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-2.png?resize=2048%2C852&amp;ssl=1 2048w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-2.png?w=2250&amp;ssl=1 2250w" sizes="(max-width: 750px) 100vw, 750px" /></figure>



<p>It’s also possible that Y is actually one of the causes rather than merely being caused by A and B:</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" loading="lazy" decoding="async" width="750" height="716" data-attachment-id="3742" data-permalink="https://www.spencergreenberg.com/2021/04/it-can-be-shockingly-hard-just-to-understand-three-variables/image-3-4/" data-orig-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-3.png?fit=2372%2C2262&amp;ssl=1" data-orig-size="2372,2262" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="image-3" data-image-description="" data-image-caption="" data-large-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-3.png?fit=750%2C716&amp;ssl=1" src="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-3.png?resize=750%2C716&#038;ssl=1" alt="" class="wp-image-3742" srcset="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-3.png?resize=1024%2C977&amp;ssl=1 1024w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-3.png?resize=300%2C286&amp;ssl=1 300w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-3.png?resize=768%2C732&amp;ssl=1 768w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-3.png?resize=1536%2C1465&amp;ssl=1 1536w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-3.png?resize=2048%2C1953&amp;ssl=1 2048w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-3.png?w=2250&amp;ssl=1 2250w" sizes="auto, (max-width: 750px) 100vw, 750px" /></figure>



<p>Then there are situations where there is a critical other variable (or set of variables – represented as a “?” below) that are integral to the causal structure:</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" loading="lazy" decoding="async" width="750" height="436" data-attachment-id="3744" data-permalink="https://www.spencergreenberg.com/2021/04/it-can-be-shockingly-hard-just-to-understand-three-variables/image-5-4/" data-orig-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-5.png?fit=2166%2C1258&amp;ssl=1" data-orig-size="2166,1258" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="image-5" data-image-description="" data-image-caption="" data-large-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-5.png?fit=750%2C436&amp;ssl=1" src="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-5.png?resize=750%2C436&#038;ssl=1" alt="" class="wp-image-3744" srcset="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-5.png?resize=1024%2C595&amp;ssl=1 1024w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-5.png?resize=300%2C174&amp;ssl=1 300w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-5.png?resize=768%2C446&amp;ssl=1 768w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-5.png?resize=1536%2C892&amp;ssl=1 1536w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-5.png?resize=2048%2C1189&amp;ssl=1 2048w" sizes="auto, (max-width: 750px) 100vw, 750px" /></figure>



<p>Finally, there are situations where Y is caused by A or B (or both), but Y also causes A or B (or both), resulting in a cyclic relationship:</p>



<figure class="wp-block-image size-large"><img data-recalc-dims="1" loading="lazy" decoding="async" width="662" height="1024" data-attachment-id="3745" data-permalink="https://www.spencergreenberg.com/2021/04/it-can-be-shockingly-hard-just-to-understand-three-variables/image-6-4/" data-orig-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-6.png?fit=1398%2C2162&amp;ssl=1" data-orig-size="1398,2162" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="image-6" data-image-description="" data-image-caption="" data-large-file="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-6.png?fit=662%2C1024&amp;ssl=1" src="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-6.png?resize=662%2C1024&#038;ssl=1" alt="" class="wp-image-3745" srcset="https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-6.png?resize=662%2C1024&amp;ssl=1 662w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-6.png?resize=194%2C300&amp;ssl=1 194w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-6.png?resize=768%2C1188&amp;ssl=1 768w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-6.png?resize=993%2C1536&amp;ssl=1 993w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-6.png?resize=1324%2C2048&amp;ssl=1 1324w, https://i0.wp.com/www.spencergreenberg.com/wp-content/uploads/2023/11/image-6.png?w=1398&amp;ssl=1 1398w" sizes="auto, (max-width: 662px) 100vw, 662px" /></figure>



<p>Here&#8217;s a <a href="https://1231047546.rsc.cdn77.org/images/Causal_relationships/Cause%20diagrams%20for%20one%20outcome%20all%20possibilitities%20causal%20updated_3.pdf">link to my pdf</a> showing most of the possible relationships.</p>



<div data-wp-interactive="core/file" class="wp-block-file"><object data-wp-bind--hidden="!state.hasPdfPreview" hidden class="wp-block-file__embed" data="https://www.spencergreenberg.com/wp-content/uploads/2023/11/Cause-diagrams-for-one-outcome-all-possibilitities-causal-updated_3-1.pdf" type="application/pdf" style="width:100%;height:600px" aria-label="Embed of Cause-diagrams-for-one-outcome-all-possibilitities-causal-updated_3-1."></object><a id="wp-block-file--media-0b96b3a8-51af-460c-a940-f0e50b5f8a08" href="https://www.spencergreenberg.com/wp-content/uploads/2023/11/Cause-diagrams-for-one-outcome-all-possibilitities-causal-updated_3-1.pdf">Cause-diagrams-for-one-outcome-all-possibilitities-causal-updated_3-1</a><a href="https://www.spencergreenberg.com/wp-content/uploads/2023/11/Cause-diagrams-for-one-outcome-all-possibilitities-causal-updated_3-1.pdf" class="wp-block-file__button wp-element-button" download aria-describedby="wp-block-file--media-0b96b3a8-51af-460c-a940-f0e50b5f8a08">Download</a></div>



<p>If you want to read about other challenges associated with untangling causality in the real world, you can read another post about this&nbsp;<a href="https://www.spencergreenberg.com/2023/09/three-reasons-to-be-cautious-when-reading-data-driven-explanations/" target="_blank" rel="noreferrer noopener">here</a>.</p>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<p><em>I first created this diagram on April 19, 2021. I made minor edits to the diagram and wrote this piece with assistance from Clare Harris. This piece first appeared on my <a href="https://www.spencergreenberg.com/all-essays/">website</a> on November 22, 2023.</em></p>
]]></content:encoded>
					
					<wfw:commentRss>https://www.spencergreenberg.com/2021/04/it-can-be-shockingly-hard-just-to-understand-three-variables/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">3729</post-id>	</item>
		<item>
		<title>Ways to develop new hypotheses about human psychology</title>
		<link>https://www.spencergreenberg.com/2017/11/ways-to-develop-new-hypotheses-about-human-psychology/</link>
					<comments>https://www.spencergreenberg.com/2017/11/ways-to-develop-new-hypotheses-about-human-psychology/#respond</comments>
		
		<dc:creator><![CDATA[Spencer]]></dc:creator>
		<pubDate>Thu, 16 Nov 2017 22:58:00 +0000</pubDate>
				<category><![CDATA[Essays]]></category>
		<category><![CDATA[approach]]></category>
		<category><![CDATA[approaches]]></category>
		<category><![CDATA[apps]]></category>
		<category><![CDATA[blogger]]></category>
		<category><![CDATA[experiment]]></category>
		<category><![CDATA[expert]]></category>
		<category><![CDATA[experts]]></category>
		<category><![CDATA[human]]></category>
		<category><![CDATA[human psychology]]></category>
		<category><![CDATA[hypotheses]]></category>
		<category><![CDATA[methods]]></category>
		<category><![CDATA[model]]></category>
		<category><![CDATA[models]]></category>
		<category><![CDATA[process]]></category>
		<category><![CDATA[products]]></category>
		<category><![CDATA[psychology]]></category>
		<category><![CDATA[studies]]></category>
		<category><![CDATA[technique]]></category>
		<category><![CDATA[theories]]></category>
		<category><![CDATA[tool]]></category>
		<category><![CDATA[tools]]></category>
		<guid isPermaLink="false">https://www.spencergreenberg.com/?p=4442</guid>

					<description><![CDATA[How would you go about building improved models of human psychology so that you can better help people? It might seem nearly impossible at first, but data about psychology is all around us, and there are numerous approaches you could take to discover new insights. Here are 24 different methods you could use to better [&#8230;]]]></description>
										<content:encoded><![CDATA[
<p>How would you go about building improved models of human psychology so that you can better help people? It might seem nearly impossible at first, but data about psychology is all around us, and there are numerous approaches you could take to discover new insights.</p>



<p>Here are 24 different methods you could use to better understand the way humans work. In each case, I use &#8220;trying to figure out new things about post-traumatic stress disorder (PTSD)&#8221; as an example. What techniques am I leaving out here?</p>



<ol class="wp-block-list">
<li>Papers: Reading academic studies (e.g., from social psychology, consumer psychology, cognitive psychology, neuroscience, etc.)</li>
</ol>



<p>Example: reading the newest academic papers on treatments for PTSD.</p>



<ol start="2" class="wp-block-list">
<li>Products: Carefully examining products and apps that have been successful (as well as, potentially, those that have failed) to understand what human needs they were or weren&#8217;t satisfying and why they succeeded or failed.</li>
</ol>



<p>Example: looking at what mental health apps related to PTSD are popular in the app stores.</p>



<ol start="3" class="wp-block-list">
<li>Outliers: Investigating extreme case studies of real-life events (e.g., case studies of people with extreme psychological conditions who may have been reported about in the news or true stories about what actually happened when people were placed into really extreme conditions).</li>
</ol>



<p>Example: reading about cases of extreme cults and whether or not each cult produced PTSD-like symptoms in its members.</p>



<ol start="4" class="wp-block-list">
<li>Society: Examining the large-scale trends and organization of societies, including potentially human cultures around the world (American culture, traditional cultures, etc.) and how they are similar or different, and which societal structures tend to be created and what the effects of these structures are, etc.</li>
</ol>



<p>Example: investigating which sort of societies PTSD appears to be much more prevalent in.</p>



<ol start="5" class="wp-block-list">
<li>Conversations: Talking to smart and insightful people about what they think is true of human psychology. You can also show them the models or theories you have so far to get their criticism and suggestions for improving them.</li>
</ol>



<p>Example: asking really smart (non-experts) who had PTSD what their current theories about PTSD are.</p>



<ol start="6" class="wp-block-list">
<li>Reading: Reading books, blog posts, media articles, etc., containing information from smart or insightful people about psychology (including potentially writing from life coaches, relationship experts, marketers, thoughtful bloggers, user experience designers, game makers, etc.)</li>
</ol>



<p>Example: seeing what theories bloggers have proposed regarding PTSD that many academics may not have considered.</p>



<ol start="7" class="wp-block-list">
<li>Experts: Reaching out to experts and talking to them to understand their models of psychology (e.g., psychologists, psychiatrists, game developers, etc.). You can also show them the models and theories you have so far to get their criticism and suggestions for improving them.</li>
</ol>



<p>Example: talking to world-class experts in PTSD.</p>



<ol start="8" class="wp-block-list">
<li>Textbooks: Reading academic textbooks about psychology, cognitive science, etc.</li>
</ol>



<p>Example: Linehan&#8217;s textbook on DBT and thinking about possible applications of DBT in PTSD treatment.</p>



<ol start="9" class="wp-block-list">
<li>Models: Looking at what models or frameworks of psychology others have created (e.g., in social psychology, theoretical cognitive neuroscience, gamification research, persuasion research, etc).</li>
</ol>



<p>Example: looking at the model of PTSD that is used in Cognitive Behavioral Therapy treatment.</p>



<ol start="10" class="wp-block-list">
<li>Intuition: Our own intuition for what&#8217;s true about humans (presumably based on our first-hand experience interacting with others, as well as our first-hand, ingrained experience of just being a human).</li>
</ol>



<p>Example: We may have the intuition that after experiencing a traumatic event, people will tend to start having frequent disruptive thoughts about that event, even when doing unrelated things.</p>



<ol start="11" class="wp-block-list">
<li>Interviews: talking to individuals or groups about their thought processes, feelings, behaviors, etc., to understand their individual psychology and then attempting to extrapolate that knowledge to humans more generally.</li>
</ol>



<p>Example: interviewing 20 people with PTSD about what their experience of it is like and what does or does not seem to help them.</p>



<ol start="12" class="wp-block-list">
<li>Application: attempting to put into practice a psychological theory (e.g., by coaching individuals using a psychological technique or by trying to build an app that implements it) and paying attention to the ways it seems to work or not work.</li>
</ol>



<p>Example: creating a tool for people with PTSD and making it publicly available, then monitoring how people respond to it.</p>



<ol start="13" class="wp-block-list">
<li>Anecdotes: psychologically surprising or interesting anecdotes that we&#8217;re confident actually happened (e.g., because they happened to us or were reported on by a trustworthy source) that may have a bearing on human psychology.</li>
</ol>



<p>Example: a story a friend told you about how they (believe) they fully cured their own extreme PTSD in a matter of days.</p>



<ol start="14" class="wp-block-list">
<li>Self-observation: careful (and honest) observation of our own thoughts/beliefs/behavior/emotions and internal processes.</li>
</ol>



<p>Example: observing the mental processes in your own mind that seem to occur after a very upsetting event has happened.</p>



<ol start="15" class="wp-block-list">
<li>Self-experimentation: simply trying techniques and paying close attention to what effects they appear to have and what it feels like to apply them.</li>
</ol>



<p>Example: try the technique of writing down for a week every upsetting thought that you notice yourself having, as well as writing down everything you observe or learn about the experience of doing this.</p>



<ol start="16" class="wp-block-list">
<li>Deduction: combining propositions we already believe to produce new propositions we didn&#8217;t realize before that may be true.</li>
</ol>



<p>Example: knowing that women with PTSD are much more likely to have associated depression than men with PTSD, and knowing that depression is a strong risk factor for attempting suicide, we might predict that women with PTSD are more likely to attempt suicide than men with PTSD (even though they may not be more likely than men to actually commit suicide).</p>



<ol start="17" class="wp-block-list">
<li>Experiments: Conduct your own online randomized controlled trials, surveys, longitudinal studies, or fMRI studies, etc.</li>
</ol>



<p>Example: looking at which variables that don&#8217;t seem obviously PTSD related are, in fact, strong predictors of whether someone has PTSD.</p>



<ol start="18" class="wp-block-list">
<li>Statistics: looking up known statistics about a phenomenon.</li>
</ol>



<p>Example: looking up whether younger people or older people are known to be more likely to get PTSD in any given 12-month period.</p>



<ol start="19" class="wp-block-list">
<li>Data: finding existing data sets (e.g., large longitudinal surveys or government data sets) and running your own statistical analyses to test hypotheses.</li>
</ol>



<p>Example: measuring whether it is true that after someone gets a pet, they are more likely to recover from PTSD than similar people who do not get a pet. Running studies can also be a good way to search for new hypotheses. For instance, examining which of many variables in the data are most associated with a rapid recovery from PTSD.</p>



<ol start="20" class="wp-block-list">
<li>Literature: looking at portrayals of psychology in literature, art, myths, stories, philosophical writings, etc.</li>
</ol>



<p>Example: What hypotheses do Kafka&#8217;s novels give us about the nature of trauma?</p>



<ol start="21" class="wp-block-list">
<li>Crowdsourcing: Request suggestions for hypotheses on a social media platform like Facebook or X or on a question-answer site like Quora.</li>
</ol>



<p>Example: posting to Facebook asking people to suggest theories for why PTSD sometimes does and sometimes doesn&#8217;t happen after extreme trauma.</p>



<ol start="22" class="wp-block-list">
<li>Social experiments: bringing a group of people together under a new set of chosen social rules or guidelines and observing what happens.</li>
</ol>



<p>Example: getting a small group of people together with the idea that it will be a time to discuss traumas that members of the group have experienced.</p>



<ol start="23" class="wp-block-list">
<li>Evolution: considering what possible evolutionary function different psychological phenomena might have.</li>
</ol>



<p>Example: consider ways that aspects of PTSD responses might be evolutionarily adaptive.</p>



<ol start="24" class="wp-block-list">
<li>Practitioners: examine how people &#8220;in the field&#8221; who benefit from being able to understand this aspect of psychology do their work.</li>
</ol>



<p>For example, at veterans hospitals where rates of PTSD are high, investigate what techniques and approaches the therapists have developed to work successfully with their patients.</p>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<p><em>This piece was first written on November 16, 2017, and first appeared on my website on July 22, 2025.</em></p>



<p></p>
]]></content:encoded>
					
					<wfw:commentRss>https://www.spencergreenberg.com/2017/11/ways-to-develop-new-hypotheses-about-human-psychology/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">4442</post-id>	</item>
		<item>
		<title>Testing Too Many Hypotheses</title>
		<link>https://www.spencergreenberg.com/2011/10/testing-too-many-hypotheses/</link>
					<comments>https://www.spencergreenberg.com/2011/10/testing-too-many-hypotheses/#comments</comments>
		
		<dc:creator><![CDATA[Spencer]]></dc:creator>
		<pubDate>Mon, 10 Oct 2011 17:16:40 +0000</pubDate>
				<category><![CDATA[Essays]]></category>
		<category><![CDATA[experiments]]></category>
		<category><![CDATA[hypotheses]]></category>
		<category><![CDATA[hypothesis test]]></category>
		<category><![CDATA[p-values]]></category>
		<category><![CDATA[probability]]></category>
		<category><![CDATA[science]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[test]]></category>
		<guid isPermaLink="false">http://www.spencergreenberg.com/?p=249</guid>

					<description><![CDATA[For each dataset, there is a limit to what we can use that dataset to test. Using the standard p-value based methods of science, the more hypotheses we check against the data, the more likely it will be that some of these checks give inaccurate conclusions. And this presents a big problem for the way [&#8230;]]]></description>
										<content:encoded><![CDATA[<p>For each dataset, there is a limit to what we can use that dataset to test. Using the standard <a href="http://en.wikipedia.org/wiki/P-value">p-value based methods</a> of science, the more hypotheses we check against the data, the more likely it will be that some of these checks give inaccurate conclusions. And this presents a big problem for the way science is practiced.</p>
<p>Let&#8217;s take an example to illustrate the principle. Suppose that you have information about 1000 people selected at random from the U.S. adult population. Your dataset includes these people&#8217;s heights, weights, ages, shoe sizes, and so forth. Now, if your goal is to know the mean height of all people in America, you can produce an estimate of this quantity by averaging the heights of the 1000 people you have information about. Despite the fact that your sample contains just 1000 people, rather than the full set of 230,000,000 or so American adults of interest, your estimate will, with high probability, be within a couple of inches of the total population mean height. This is due to the fact that the 1000 people were sampled at random (so we shouldn&#8217;t expect our sample to differ from the entire population in a systematic way) and because the standard deviation of heights is not very large (if there were tremendous outliers in the data, such as 500 foot tall giants, we would need more samples to get an accurate estimate). This idea is made precise by the central limit theorem. It tells us how likely the true entire population mean is to fall different distances from our sample estimate, and says that the error of our estimate decreases like one over the square root of the size of our sample.</p>
<p>The same technique could work to approximate the mean weight of adult Americans, or age, or shoe size, or number of children. And in each case, the estimate would, with high probability, be quite accurate. We could even, if we liked, estimate all these quantities simultaneously if we collected all of this information about each of our 1000 people. But the more quantities we estimate, the greater the chance that at least one estimate is quite inaccurate. Since each estimate has some chance of being bad, if we make a sufficiently large number of estimates we should expect to get unlucky at some point and end up with one or more bad ones. So, if we aren&#8217;t just estimating mean height, but rather the mean of 50 different traits, we cannot claim that all 50 of these estimate are likely to be good. We should expect that some of them will be inaccurate, though we don&#8217;t know which ones.</p>
<p>This is where problems arise. Suppose that you are a researcher who is trying to find interesting differences between, say, southerners and northerners in the United States. Your dataset of 1000 adults contains 500 people from each group. What do you do? Well, it might seem reasonable to go ahead and compute the mean value of many different traits, and look at how these means differ between the two groups, to see if you can find any large differences that seem interesting. For instance, you may compute the average salary of each group, and see if they deviate from each other by a large enough amount to be deemed statistically significant. If they don&#8217;t, you can try another trait like IQ, or number of children, and repeat the process. If you try enough different traits, hopefully you&#8217;ll eventually find an intriguingly large difference between the groups.</p>
<p>The trouble is, we know that if you estimate a large number of quantities, some of them will be inaccurate, and so some of the apparent differences between your two groups may just be due to these inaccuracies. If you test enough traits, you will eventually find differences between the populations that look significant, even though it is just the result of chance.</p>
<p>In fact, even if northerners and southerners had no systematic differences between them, there would still be apparent differences that arose just from the particular sample of 1000 people you happened to have data on. For example, in your dataset, it just might happen that the northerners have lower numbers of children than southerners, even if this isn&#8217;t true for the underlying populations of all northerners and southerners. If you were to publish this finding, without making mention of the number of hypotheses you tested before finding it, it may seem that you had produced a meaningful result. In fact, the assessment of this result should take into account the number of hypotheses (e.g. northerners have smaller shoe sizes than southerners, northerners have greater salaries than southerners, etc.) that you tested before you discovered this one (and the <a href="http://en.wikipedia.org/wiki/Multiple_comparisons">p-values can be modified to include this information</a>). The most significant seeming deviation between the groups found after testing 100 different hypotheses is very likely greatly inflated by chance. Whereas if you had only tested a small number of hypotheses against your data, and found a strong result, this would likely be a meaningful finding.</p>
<p>As a general rule, the greater the number of data points you have, the larger the number of quantities you can accurately estimate from your dataset. On a set of just 10 points, you may not even be able to get an accurate estimate of the mean value of a single trait (unless the trait had very slow standard deviation). Whereas on a dataset of a billion points, you probably could estimate dozens of quantities accurately.</p>
<p>Unfortunately, when you&#8217;re reading a paper, there is no way to tell how many hypotheses the researcher tested on his dataset unless he chooses to publish it. And there is a strong incentive to obscure this information. If a researcher releases the fact that he tested 20 hypotheses before finding 1 which was statistically significant, readers may discredit the result, or reviewers may reject it for publication. And if the researcher spent a lot of time and money collecting his dataset, it would feel like a waste to give up on the data just because his first five hypotheses tested on it don&#8217;t pan out. It might take a lot of restraint to not just keep testing hypothesis after hypothesis until he finds something publishable.</p>
<p>But even if researchers were excessively careful, that wouldn&#8217;t fully resolve the problem. When a hypothesis is confirmed by a dataset, we must consider whether it is truly a confirmation of the hypothesis being tested, or a result of the fact that 20 researchers tested 20 false hypotheses, and this one of the 20 happened to seem true by chance. That is, if enough hypotheses are tested over all, we may find a large number of false hypotheses among them that just happen to seem true.</p>
<p>What makes this problem more pernicious is that when a hypothesis fails to pan out, the result is often not published. This is due to the fact that hypothesis disconfirmations (e.g. &#8220;no association was found between cabbage eating and longevity&#8221;) are generally less interesting and harder to publish than confirmations (e.g. &#8220;an association was found between cabbage eating and longevity&#8221;). But since most new hypotheses in science turn out to be false, we should expect the number of negative results to be very large (except in situations where previously well validated results are being confirmed). Hence, the number of published test results will be much less than the number of total tests conducted, with test failures substantially underreported. So there is no good way to tell how many times a hypotheses failed to be confirmed by tests before one researcher finally ran one that seemed to confirm it. And if a very large number of false hypotheses are tested, but mostly just the ones that turn out to look true are published, you could end up with a field&#8217;s journals being flooded with false but seemingly verified hypotheses. In exploratory fields where almost all hypotheses are false, and where disconfirmations of a hypothesis are almost never published, you might even get into a situation where <a href="http://www.plosmedicine.org/article/info:doi/10.1371/journal.pmed.0020124">most published research findings are false</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://www.spencergreenberg.com/2011/10/testing-too-many-hypotheses/feed/</wfw:commentRss>
			<slash:comments>6</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">249</post-id>	</item>
	</channel>
</rss>
