<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1742-7622-5-2</ui>
   <ji>1742-7622</ji>
   <fm>
		<dochead>Analytic perspective</dochead>
		<bibl>
			<title>
				<p>Simpson's Paradox, Lord's Paradox, and Suppression Effects are the same phenomenon &#8211; the reversal paradox</p>
			</title>
			<aug>
				<au id="A1" ca="yes">
					<snm>Tu</snm>
					<fnm>Yu-Kang</fnm>
					<insr iid="I1"/>
					<insr iid="I2"/>
					<email>y.k.tu@leeds.ac.uk</email>
				</au>
				<au id="A2">
					<snm>Gunnell</snm>
					<fnm>David</fnm>
					<insr iid="I3"/>
					<email>d.j.gunnell@bristol.ac.uk</email>
				</au>
				<au id="A3">
					<snm>Gilthorpe</snm>
					<mi>S</mi>
					<fnm>Mark</fnm>
					<insr iid="I1"/>
					<email>m.s.gilthorpe@leeds.ac.uk</email>
				</au>
			</aug>
			<insg>
				<ins id="I1">
					<p>Biostatistics Unit, Centre for Epidemiology &amp; Biostatistics, University of Leeds, 30/32 Hyde Terrace, Leeds, LS2 9JT, UK</p>
				</ins>
				<ins id="I2">
					<p>Leeds Dental Institute, University of Leeds, Clarendon Road, Leeds, LS2 9LU, UK</p>
				</ins>
				<ins id="I3">
					<p>Department of Social Medicine, University of Bristol, Bristol, BS8 2PR, UK</p>
				</ins>
			</insg>
			<source>Emerging Themes in Epidemiology</source>
			<issn>1742-7622</issn>
			<pubdate>2008</pubdate>
			<volume>5</volume>
			<issue>1</issue>
			<fpage>2</fpage>
			<url>http://www.ete-online.com/content/5/1/2</url>
			<xrefbib>
				<pubidlist>
					<pubid idtype="pmpid">18211676</pubid>
					<pubid idtype="doi">10.1186/1742-7622-5-2</pubid>
				</pubidlist>
			</xrefbib>
		</bibl>
		<history>
			<rec>
				<date>
					<day>02</day>
					<month>4</month>
					<year>2007</year>
				</date>
			</rec>
			<acc>
				<date>
					<day>22</day>
					<month>1</month>
					<year>2008</year>
				</date>
			</acc>
			<pub>
				<date>
					<day>22</day>
					<month>1</month>
					<year>2008</year>
				</date>
			</pub>
		</history>
		<cpyrt>
			<year>2008</year>
			<collab>Tu et al; licensee BioMed Central Ltd.</collab>
			<note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
		</cpyrt>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<p>This article discusses three statistical paradoxes that pervade epidemiological research: <it>Simpson's paradox</it>, <it>Lord's paradox</it>, and <it>suppression</it>. These paradoxes have important implications for the interpretation of evidence from observational studies. This article uses hypothetical scenarios to illustrate how the three paradoxes are different manifestations of one phenomenon &#8211; the <it>reversal paradox </it>&#8211; depending on whether the outcome and explanatory variables are categorical, continuous or a combination of both; this renders the issues and remedies for any one to be similar for all three. Although the three statistical paradoxes occur in different types of variables, they share the same characteristic: the association between two variables can be reversed, diminished, or enhanced when another variable is statistically controlled for. Understanding the concepts and theory behind these paradoxes provides insights into some controversial or contradictory research findings. These paradoxes show that prior knowledge and underlying causal theory play an important role in the statistical modelling of epidemiological data, where incorrect use of statistical models might produce consistent, replicable, yet erroneous results.</p>
			</sec>
		</abs>
	</fm>
   <bdy>
		<sec>
			<st>
				<p>Introduction</p>
			</st>
			<p>This article discusses three statistical paradoxes that pervade epidemiological research: <it>Simpson's paradox</it>, <it>Lord's paradox</it>, and <it>suppression</it>. These paradoxes are not just tantalising puzzles of purely academic interest; potentially, they have serious implications for the interpretation of evidence from observational studies. Scenarios which are associated with and can be explained by these paradoxes are discussed. A concise explanation of these paradoxes and an historical overview is also provided. Simulated data based upon the foetal origins of adult diseases hypothesis <abbrgrp>
					<abbr bid="B1">1</abbr>
					<abbr bid="B2">2</abbr>
				</abbrgrp> are used to illustrate how the three paradoxes are different manifestations of one phenomenon &#8211; <it>the reversal paradox </it>&#8211; depending on whether the outcome and explanatory variables are categorical, continuous or a combination of both; this renders the issues and remedies for any one to be similar for all three. All statistical analyses were performed within SPSS 15.0 (SPSS Inc, Chicago, USA).</p>
		</sec>
		<sec>
			<st>
				<p>Foetal origins hypothesis</p>
			</st>
			<p>The 'foetal origins of adult disease' hypothesis (FOAD), which has evolved into the 'developmental origins of health and disease' (DOHaD) hypothesis <abbrgrp>
					<abbr bid="B1">1</abbr>
					<abbr bid="B2">2</abbr>
				</abbrgrp>, was proposed to explain the associations observed between low birth weight and a range of diseases in later life. These associations have been interpreted as evidence that growth retardation <it>in utero </it>has adverse long-term effects on the development of vital organ systems which predispose the individuals to a range of metabolic and related disorders in later life. Nevertheless, although an inverse association between birth weight and disease in later life was found in some studies, this relationship was only established in many studies after the current body size variables such as body mass index (BMI), body weight and/or body height were adjusted for in the regression analysis. As body sizes may be in the causal pathway from birth weight to health outcomes in later life, the justification of this adjustment of current body sizes has been questioned recently <abbrgrp>
					<abbr bid="B3">3</abbr>
					<abbr bid="B4">4</abbr>
					<abbr bid="B5">5</abbr>
					<abbr bid="B6">6</abbr>
					<abbr bid="B7">7</abbr>
					<abbr bid="B8">8</abbr>
				</abbrgrp>.</p>
			<p>Using the inverse relationship between birth weight and systolic blood pressure in later life as an example, Figure <figr fid="F1">1</figr> shows the directed acyclic graphs <abbrgrp>
					<abbr bid="B9">9</abbr>
					<abbr bid="B10">10</abbr>
					<abbr bid="B11">11</abbr>
				</abbrgrp> for the possible relationships between the three observed variables: birth weight, current body weight and systolic blood pressure. In Figure <figr fid="F1">1a</figr>, current body weight is on the causal pathway from birth weight to systolic blood pressure, so current body weight is not a genuine confounder and should not be adjusted for. In Figure <figr fid="F1">1b</figr>, there is no relationship between birth weight and current body weight, and therefore the latter is not a confounder for the relationship between birth weight and blood pressure either. However, this model cannot explain the observed positive correlations between birth weight and current body weight in many epidemiological studies. In Figure <figr fid="F1">1c</figr>, current body weight is a confounder because it is ancestor to both birth weight and blood pressure in the directed acyclic graph <abbrgrp>
					<abbr bid="B9">9</abbr>
					<abbr bid="B10">10</abbr>
					<abbr bid="B11">11</abbr>
				</abbrgrp>. Obviously, this scenario is implausible in reality because current body weight cannot affect birth weight. In Figure <figr fid="F1">1d</figr>, the observed positive correlation between birth weight and current body weight is due to an unobserved confounder, UC, which affects both birth weight and current body weight. Also, there is no path from birth weight and current body weight <abbrgrp>
					<abbr bid="B7">7</abbr>
				</abbrgrp>, i.e. if UC could be identified and measured, birth weight and current body weight would be independent, conditional on UC <abbrgrp>
					<abbr bid="B12">12</abbr>
				</abbrgrp>. More complex causal diagrams for the three variables are possible by incorporating more unobserved variables in the model. However, the four scenarios in Figure <figr fid="F1">1</figr> are sufficient for our discussion in this study, so we do not pursue them further.</p>
			<fig id="F1">
				<title>
					<p>Figure 1</p>
				</title>
				<caption>
					<p>Causal models expressed as directed acyclic graphs for possible relationships between the three observed variables: birth weight (<it>BW</it>), current body weight (<it>CW</it>) and systolic blood pressure (<it>BP</it>)</p>
				</caption>
				<text>
					<p>Causal models expressed as directed acyclic graphs for possible relationships between the three observed variables: birth weight (<it>BW</it>), current body weight (<it>CW</it>) and systolic blood pressure (<it>BP</it>). <it>UC </it>is an unobserved variable that affects both <it>BW </it>and <it>CW</it>. In Figure 1d, there is a back-door path from <it>BP </it>to <it>BW </it>via <it>CW </it>and <it>UC</it>, so the association between <it>BP </it>and <it>BW </it>is therefore biased. The adjustment of <it>CW </it>can block the backdoor path from <it>BP </it>to <it>BW </it>via <it>UC</it>.</p>
				</text>
				<graphic file="1742-7622-5-2-1"/>
			</fig>
			<p>Figure <figr fid="F1">1a,c</figr> and <figr fid="F1">1d</figr> all explain the observed correlation structure amongst birth weight, current body weight and blood pressure equally well, and it is not possible to judge which one is true based upon the observed data. For example, researchers may argue current body weight is a genuine confounder in Figure <figr fid="F1">1d</figr> and therefore should be adjusted for <abbrgrp>
					<abbr bid="B7">7</abbr>
				</abbrgrp>. This can only be confirmed when the unobserved confounder (be parental, genetic, or environmental factors) is identified <it>and </it>the conditional independence between birth weight and current weight is satisfied.</p>
			<p>Nevertheless, the adjustment of current body weight in the statistical analysis will change the estimated relationship between birth weight and blood pressure, as the adjusted relationship is a conditional relationship. Differences between the unadjusted and adjusted (i.e. unconditional and conditional) relationships frequently cause confusion in the interpretations of statistical analyses and they also give rise to three statistical paradoxes, which we shall explain in the next section.</p>
		</sec>
		<sec>
			<st>
				<p>Simpson's Paradox</p>
			</st>
			<p>Simpson's paradox <abbrgrp>
					<abbr bid="B13">13</abbr>
				</abbrgrp>, or Yule's paradox <abbrgrp>
					<abbr bid="B14">14</abbr>
				</abbrgrp>, is a well known statistical phenomenon. It is observed when the relationship between two categorical variables is reversed after a third variable is introduced to the analysis of their association, or alternatively where the relationship between two variables differs within subgroups compared to that observed for the aggregated data. Although first discussed by Karl Pearson in 1899 <abbrgrp>
					<abbr bid="B15">15</abbr>
				</abbrgrp>, it is George Udny Yule, once Pearson's assistant, who provides a detailed assessment of this problem in 1903 <abbrgrp>
					<abbr bid="B14">14</abbr>
				</abbrgrp>.</p>
			<sec>
				<st>
					<p>A numerical example</p>
				</st>
				<p>Table <tblr tid="T1">1</tblr> provides a summary of a hypothetical survey of 1000 adult males in England based on data simulated using values derived from the literature <abbrgrp>
						<abbr bid="B16">16</abbr>
					</abbrgrp> and surveys conducted by the UK Department of Health <abbrgrp>
						<abbr bid="B17">17</abbr>
					</abbrgrp>. Data are simulated such that the three variables systolic blood pressure (<it>BP</it>), birth weight (<it>BW</it>), and current body weight (<it>CW</it>) are positively correlated: the correlation between <it>BP </it>and birth weight (<it>r</it>
					<sub>
						<it>BW</it>-<it>BP</it>
					</sub>) is weak (0.11); whereas the correlations between birth weight and current weight (<it>r</it>
					<sub>
						<it>BW</it>-<it>CW</it>
					</sub>) and between current weight and <it>BP </it>(<it>r</it>
					<sub>
						<it>CW</it>-<it>BP</it>
					</sub>) are reasonably strong (0.52 and 0.50, respectively).</p>
				<tbl id="T1">
					<title>
						<p>Table 1</p>
					</title>
					<caption>
						<p>Summary of the analysis of simulated systolic blood pressure, birth weight and current body weight data for 1000 adult males</p>
					</caption>
					<tblbdy cols="6">
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>N</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>Minimum</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>Maximum</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>Mean</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>Standard Deviation</b>
								</p>
							</c>
						</r>
						<r>
							<c cspan="6">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<b>Current weight (kg)</b>
								</p>
							</c>
							<c ca="center">
								<p>1000</p>
							</c>
							<c ca="right">
								<p>38.02</p>
							</c>
							<c ca="right">
								<p>127.08</p>
							</c>
							<c ca="right">
								<p>82.69</p>
							</c>
							<c ca="right">
								<p>14.61</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<b>Systolic BP (mmHg)</b>
								</p>
							</c>
							<c ca="center">
								<p>1000</p>
							</c>
							<c ca="right">
								<p>89.36</p>
							</c>
							<c ca="right">
								<p>168.88</p>
							</c>
							<c ca="right">
								<p>129.78</p>
							</c>
							<c ca="right">
								<p>11.14</p>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<b>Birth weight (kg)</b>
								</p>
							</c>
							<c ca="center">
								<p>1000</p>
							</c>
							<c ca="right">
								<p>1.37</p>
							</c>
							<c ca="right">
								<p>5.42</p>
							</c>
							<c ca="right">
								<p>3.51</p>
							</c>
							<c ca="right">
								<p>0.63</p>
							</c>
						</r>
					</tblbdy>
				</tbl>
				<p>Suppose the research question is to investigate whether or not there is an association between low birth weight and high blood pressure in later life. In this hypothetical study, low birth weight is defined as birth weight lower than the population mean (i.e. &lt; 3.5 Kg), and high blood pressure is defined as systolic <it>BP </it>greater than the mean value (i.e. > 135 mmHg). The results are summarized in Table <tblr tid="T2">2</tblr>. It is noted that the probability of developing high blood pressure is 0.272 for subjects with low birth weight and 0.362 for subjects with high birth weight. This indicates that low birth weight has a protective effect of developing high blood pressure. However, when these subjects are stratified according to their current weight (> 90 Kg vs. &lt; = 90 Kg), the risk of developing high blood pressure is consistently higher amongst subjects with low birth weight compared to those with high birth weight. It seems to be quite counter-intuitive that low birth weight has an adverse effect on blood pressure for both subgroups of current weight, yet a protective effect on the groups as a whole.</p>
				<tbl id="T2">
					<title>
						<p>Table 2</p>
					</title>
					<caption>
						<p>Numbers and Percentages of subjects with high blood pressure (> 135 mmHg) according to their birth weight and current body weight</p>
					</caption>
					<tblbdy cols="5">
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>Normal BP</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>High BP</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>Total</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>Percentage of subjects with high <it>BP</it>
									</b>
								</p>
							</c>
						</r>
						<r>
							<c cspan="5">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="right">
								<p>
									<b>
										<it>Overall:</it>
									</b>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="right">
								<p>
									<b>Low birth weight</b>
								</p>
							</c>
							<c ca="center">
								<p>354</p>
							</c>
							<c ca="center">
								<p>132</p>
							</c>
							<c ca="center">
								<p>486</p>
							</c>
							<c ca="center">
								<p>27.2%</p>
							</c>
						</r>
						<r>
							<c ca="right">
								<p>
									<b>High birth weight</b>
								</p>
							</c>
							<c ca="center">
								<p>328</p>
							</c>
							<c ca="center">
								<p>186</p>
							</c>
							<c ca="center">
								<p>514</p>
							</c>
							<c ca="center">
								<p>36.2%</p>
							</c>
						</r>
						<r>
							<c ca="right">
								<p>
									<b>Total</b>
								</p>
							</c>
							<c ca="center">
								<p>682</p>
							</c>
							<c ca="center">
								<p>318</p>
							</c>
							<c ca="center">
								<p>1,000</p>
							</c>
							<c ca="center">
								<p>31.8%</p>
							</c>
						</r>
						<r>
							<c cspan="5">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="right">
								<p>
									<b>Current weight &lt; = 90 Kg</b>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="right">
								<p>
									<b>Low birth weight</b>
								</p>
							</c>
							<c ca="center">
								<p>329</p>
							</c>
							<c ca="center">
								<p>99</p>
							</c>
							<c ca="center">
								<p>428</p>
							</c>
							<c ca="center">
								<p>23.1%</p>
							</c>
						</r>
						<r>
							<c ca="right">
								<p>
									<b>High birth weight</b>
								</p>
							</c>
							<c ca="center">
								<p>221</p>
							</c>
							<c ca="center">
								<p>55</p>
							</c>
							<c ca="center">
								<p>276</p>
							</c>
							<c ca="center">
								<p>19.9%</p>
							</c>
						</r>
						<r>
							<c ca="right">
								<p>
									<b>Total</b>
								</p>
							</c>
							<c ca="center">
								<p>550</p>
							</c>
							<c ca="center">
								<p>154</p>
							</c>
							<c ca="center">
								<p>704</p>
							</c>
							<c ca="center">
								<p>21.9%</p>
							</c>
						</r>
						<r>
							<c cspan="5">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="right">
								<p>
									<b>Current weight > 90 Kg</b>
								</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="right">
								<p>
									<b>Low birth weight</b>
								</p>
							</c>
							<c ca="center">
								<p>25</p>
							</c>
							<c ca="center">
								<p>33</p>
							</c>
							<c ca="center">
								<p>58</p>
							</c>
							<c ca="center">
								<p>56.9%</p>
							</c>
						</r>
						<r>
							<c ca="right">
								<p>
									<b>High birth weight</b>
								</p>
							</c>
							<c ca="center">
								<p>107</p>
							</c>
							<c ca="center">
								<p>131</p>
							</c>
							<c ca="center">
								<p>238</p>
							</c>
							<c ca="center">
								<p>55.0%</p>
							</c>
						</r>
						<r>
							<c ca="right">
								<p>
									<b>Total</b>
								</p>
							</c>
							<c ca="center">
								<p>132</p>
							</c>
							<c ca="center">
								<p>164</p>
							</c>
							<c ca="center">
								<p>296</p>
							</c>
							<c ca="center">
								<p>55.4%</p>
							</c>
						</r>
					</tblbdy>
				</tbl>
			</sec>
			<sec>
				<st>
					<p>Interpretation</p>
				</st>
				<p>In this scenario, there are substantial differences in the numbers of subjects with low birth weight between the two subgroups of current weight, because lower birth weight babies on average are smaller in adulthood. Therefore, the overall relation between low birth weight and high blood pressure is a sum of weighted relations between the two variables in each subgroup. A graphical representation of this paradox, first proposed by Paik <abbrgrp>
						<abbr bid="B18">18</abbr>
					</abbrgrp>, is given in Figure <figr fid="F2">2</figr>. Due to a greater influence of the lower risk of developing high blood pressure in the subjects with low birth weight and lower current weight, the adverse relation is reversed in the whole-group analysis (solid line in Figure <figr fid="F2">2</figr>). Note that, in the following two scenarios, the adjustment for current weight will not change the relationship between birth weight and <it>BP </it>
					<abbrgrp>
						<abbr bid="B12">12</abbr>
					</abbrgrp>, if: (a) there is no difference in the percentages of subjects with high current weight between the two subgroups of birth weight (i.e. no correlation between birth weight and current weight); or (b) there is no association between <it>CWb </it>and <it>BP </it>in the subgroups stratified by <it>BWb </it>(i.e. the association between <it>BP </it>and current weight is entirely caused by the association between birth weight and <it>BP</it>). The problem is whether the relation between low birth weigh and high blood pressure in the whole group provides an answer to the intended research question, or whether the relation in the two subgroups does this. In other words, should <it>CWb </it>be considered a confounder and hence adjusted for in the statistical models?</p>
				<fig id="F2">
					<title>
						<p>Figure 2</p>
					</title>
					<caption>
						<p>Graphical representation of Simpson's paradox</p>
					</caption>
					<text>
						<p>Graphical representation of Simpson's paradox. The two circles on the top of the panel represent subjects with lower (0) and higher birth weight (1), respectively, in the subgroup with current weight > 90 Kg. The two circles on the bottom of the panel represent subjects with lower (0) and higher birth weight (1) in the subgroup with current weight &#8804; 90 Kg. The area of the circles is proportional to the sample size of the subgroups they represent. The two dotted lines show that subjects with lower birth weight have a higher risk of developing high blood pressure in each current weight subgroups. However, as a whole group, subjects with lower birth weight have a lower risk of developing high blood pressure (the black solid line).</p>
					</text>
					<graphic file="1742-7622-5-2-2"/>
				</fig>
				<p>In statistical language, adjustment for current body weight represents a <it>conditional </it>relationship; the relationship between birth weight and blood pressure is <it>conditional </it>on current body weight. Although there are substantial differences in the numbers of subjects with low birth weight between the two subgroups of current weight, the adjustment for <it>CWb </it>indicates that <it>if </it>all subjects had the same level of current body weight, subjects with low birth weight would have a greater risk of developing high blood pressure, i.e. the adjustment of <it>CWb </it>erases the greater influence of subjects with low birth weight and lower current weight on the association between birth weight and blood pressure, as people born smaller in general grow into a smaller adults.</p>
				<p>Simpson's paradox has broad implications for epidemiological research since it indicates that making causal inference from any non-randomised study (e.g. cohort studies, case-control studies) can be difficult, because, whilst it is possible to control for the differences between cases and controls, there will always be the possibility that an unobserved and therefore unadjusted confounder might attenuate the association (or even reverse its direction) between exposure and outcome, due to the difference in the mean values or the distribution of confounders between the case or control group. Nevertheless, whether or not there is any unobserved (and therefore unadjusted) confounder may not always be an issue of debate, because in most epidemiologic studies, the important confounders are generally known. The controversy in making causal inference arises in situations where the adjusted variable may not be a genuine confounder <abbrgrp>
						<abbr bid="B6">6</abbr>
						<abbr bid="B7">7</abbr>
						<abbr bid="B19">19</abbr>
						<abbr bid="B20">20</abbr>
					</abbrgrp>. Within epidemiology, Simpson's paradox is closely linked to the concepts of confounding <abbrgrp>
						<abbr bid="B9">9</abbr>
					</abbrgrp> and incollapsibility <abbrgrp>
						<abbr bid="B10">10</abbr>
					</abbrgrp>.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Lord's Paradox</p>
			</st>
			<p>Lord's paradox was named after two short articles in the psychology literature by Frederick M Lord regarding the use of analysis of covariance (ANCOVA) within non-experimental studies <abbrgrp>
					<abbr bid="B21">21</abbr>
					<abbr bid="B22">22</abbr>
				</abbrgrp>. In contrast to Simpson's paradox, little discussion of Lord's paradox can be found in the statistical and epidemiological literature <abbrgrp>
					<abbr bid="B23">23</abbr>
				</abbrgrp>, though social scientists have shown a great interest in this phenomenon <abbrgrp>
					<abbr bid="B24">24</abbr>
					<abbr bid="B25">25</abbr>
					<abbr bid="B26">26</abbr>
					<abbr bid="B27">27</abbr>
					<abbr bid="B28">28</abbr>
				</abbrgrp>. Lord's paradox refers to the relationship between a continuous outcome and a categorical exposure being reversed when an additional continuous covariate is introduced to the analysis. One specific example is that the additional covariate is a measure made at baseline within a longitudinal study, where the outcome is the same variable measured some time later (e.g. following an intervention). Therefore, the aim is to measure change in the outcome by adjusting for the baseline measurements, and the categorical covariate might be the exposure/control groups &#8211; this is the familiar design for ANCOVA. This controversy was first discussed in 1910 between Karl Pearson and Arthur C Pigou when they debated the role of parental alcoholism and its impact on the performance of children <abbrgrp>
					<abbr bid="B29">29</abbr>
				</abbrgrp>.</p>
			<sec>
				<st>
					<p>A numerical example</p>
				</st>
				<p>Considering the previous numerical example for Simpson's paradox, we examine current body weight (<it>CW</it>) and blood pressure (<it>BP</it>) as continuous variables, retaining birth weight as a binary (<it>BWb</it>). The two-sample <it>t</it>-test shows that, on average, the blood pressure of subjects with higher birth weight is 2.49 mmHg (95%CI: 1.12, 3.87) <it>greater </it>than those with lower birth weight. However, using ANCOVA (i.e. linear regression with a (categorical) group-allocating variable and with the adjustment of a continuous confounding variable), adjusting for current weight as a covariate, the blood pressure of subjects with higher birth weight becomes 2.94 mmHg (95%CI: 1.12, 3.87) <it>lower </it>than those with lower birth weight.</p>
			</sec>
			<sec>
				<st>
					<p>Interpretation</p>
				</st>
				<p>Differences in the results of the two analyses are due to adjustment in the second analysis for current body weight (<it>CW</it>). As current weight is positively associated with both <it>BP </it>and <it>BWb</it>, it is expected that the relation between <it>BP </it>and <it>BWb </it>will change when current weight is adjusted for. In randomised controlled trials, mean values of the adjusted baseline covariate are expected to be approximately equal across treatment and control groups since, assuming randomisation has been achieved, baseline variation should be <it>within </it>groups rather than <it>between </it>groups), i.e. there is no correlation between the group variable and adjusted covariate (i.e. in our numerical example, no correlation between <it>BWb </it>and current weight). In such circumstances it is well known that using ANCOVA achieves the same estimated treatment difference across groups as found by the <it>t</it>-test, though the former will generally have greater power <abbrgrp>
						<abbr bid="B30">30</abbr>
						<abbr bid="B31">31</abbr>
					</abbrgrp>. Recall our previous discussion of two scenarios in the section on Simpson's paradox, where the adjustment for <it>CWb </it>will not change the relationship between <it>BWb </it>and <it>BP</it>. Randomised controlled trials may thus be seen as a special case of scenario (a) where there is no difference in the mean current weight between the two sub-groups of birth weight.</p>
				<p>Figure <figr fid="F3">3</figr> is a three-dimensional representation of the associations amongst the three variables. Although the solid black line shows that subjects with higher birth weight (coded as 1) have on average a greater blood pressure than those with lower birth weight (coded as 0), the various horizontal red lines with a negative slope indicate that at each level of current weight, subjects with higher birth weight have a lower mean blood pressure than those with lower birth weight.</p>
				<fig id="F3">
					<title>
						<p>Figure 3</p>
					</title>
					<caption>
						<p>A 3-dimensional scatter plot for the numerical example in Lord's paradox</p>
					</caption>
					<text>
						<p>A 3-dimensional scatter plot for the numerical example in Lord's paradox. The solid line shows that the mean blood pressure of subjects with higher birth weight (<it>BWb </it>= 1) is greater than those with lower birth weight (<it>BWb </it>= 0). However, at each level of current weight, the mean blood pressure of subjects with higher birth weight is lower than those with lower birth weight (the horizontal red lines).</p>
					</text>
					<graphic file="1742-7622-5-2-3"/>
				</fig>
				<p>In statistical language, results from the regression analyses are <it>conditional </it>on both birth weight groups having equal mean current weight in later life, and if true there would be a benefit from low birth weight in terms of blood pressure. However, since the two groups have a different mean current weight in later life, results from the regression analysis need to be interpreted with caution. In Simpson's paradox, the discussion surrounds the differences in results between unconditional and conditional risk/probability, and in Lord's paradox, discussion is around the differences in results between unconditional and conditional means.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Suppression</p>
			</st>
			<p>Of the three paradoxes, suppression effects within multiple regression are probably the least recognised amongst clinical and epidemiological researchers, though the suppression phenomenon has been extensively discussed by statisticians <abbrgrp>
					<abbr bid="B32">32</abbr>
					<abbr bid="B33">33</abbr>
					<abbr bid="B34">34</abbr>
				</abbrgrp> and methodologists from the social sciences <abbrgrp>
					<abbr bid="B35">35</abbr>
					<abbr bid="B36">36</abbr>
				</abbrgrp>. The classical definition of suppression is that a potential covariate that is unrelated to the outcome variable (i.e. has a bivariate correlation of zero) increases the overall model fit within regression (as assessed by <it>R</it>
				<sup>2</sup>, for instance) when this covariate is added to the model. This seems counter-intuitive and needs some explanation.</p>
			<p>Suppose <it>y </it>is the outcome variable, and <it>x</it>
				<sub>1</sub>and <it>x</it>
				<sub>2 </sub>are two covariates (i.e. 'explanatory' variables). Denote the bivariate Pearson correlation between <it>y </it>and <it>x</it>
				<sub>1 </sub>as <it>r</it>
				<sub>
					<it>y</it>1</sub>; the correlation between <it>y </it>and <it>x</it>
				<sub>2 </sub>as <it>r</it>
				<sub>
					<it>y</it>2</sub>; and the correlation between <it>x</it>
				<sub>1 </sub>and <it>x</it>
				<sub>2 </sub>as <it>r</it>
				<sub>12</sub>. Within multiple regression, where <it>y </it>= <it>b</it>
				<sub>0 </sub>+ <it>b</it>
				<sub>1 </sub>
				<it>x</it>
				<sub>1 </sub>+ <it>b</it>
				<sub>2 </sub>
				<it>x</it>
				<sub>2</sub>, the standardized partial regression coefficients of <it>b</it>
				<sub>1 </sub>and <it>b</it>
				<sub>2 </sub>for <it>x</it>
				<sub>1 </sub>(<it>&#946; </it>
				<sub>1</sub>) and <it>x</it>
				<sub>2 </sub>(<it>&#946; </it>
				<sub>2</sub>), respectively, are given by <abbrgrp>
					<abbr bid="B37">37</abbr>
				</abbrgrp>:</p>
			<p>
				<display-formula id="M1">
					<m:math name="1742-7622-5-2-i1" xmlns:m="http://www.w3.org/1998/Math/MathML">
						<m:semantics>
							<m:mrow>
								<m:mtable>
									<m:mtr>
										<m:mtd>
											<m:mrow>
												<m:msub>
													<m:mi>&#946;</m:mi>
													<m:mn>1</m:mn>
												</m:msub>
												<m:mo>=</m:mo>
												<m:mfrac>
													<m:mrow>
														<m:msub>
															<m:mi>r</m:mi>
															<m:mrow>
																<m:mi>y</m:mi>
																<m:mn>1</m:mn>
															</m:mrow>
														</m:msub>
														<m:mo>&#8722;</m:mo>
														<m:msub>
															<m:mi>r</m:mi>
															<m:mrow>
																<m:mn>12</m:mn>
															</m:mrow>
														</m:msub>
														<m:msub>
															<m:mi>r</m:mi>
															<m:mrow>
																<m:mi>y</m:mi>
																<m:mn>2</m:mn>
															</m:mrow>
														</m:msub>
													</m:mrow>
													<m:mrow>
														<m:mn>1</m:mn>
														<m:mo>&#8722;</m:mo>
														<m:msubsup>
															<m:mi>r</m:mi>
															<m:mrow>
																<m:mn>12</m:mn>
															</m:mrow>
															<m:mn>2</m:mn>
														</m:msubsup>
													</m:mrow>
												</m:mfrac>
											</m:mrow>
										</m:mtd>
										<m:mtd>
											<m:mo>&amp;</m:mo>
										</m:mtd>
										<m:mtd>
											<m:mrow>
												<m:msub>
													<m:mi>&#946;</m:mi>
													<m:mn>2</m:mn>
												</m:msub>
												<m:mo>=</m:mo>
												<m:mfrac>
													<m:mrow>
														<m:msub>
															<m:mi>r</m:mi>
															<m:mrow>
																<m:mi>y</m:mi>
																<m:mn>2</m:mn>
															</m:mrow>
														</m:msub>
														<m:mo>&#8722;</m:mo>
														<m:msub>
															<m:mi>r</m:mi>
															<m:mrow>
																<m:mn>12</m:mn>
															</m:mrow>
														</m:msub>
														<m:msub>
															<m:mi>r</m:mi>
															<m:mrow>
																<m:mi>y</m:mi>
																<m:mn>1</m:mn>
															</m:mrow>
														</m:msub>
													</m:mrow>
													<m:mrow>
														<m:mn>1</m:mn>
														<m:mo>&#8722;</m:mo>
														<m:msubsup>
															<m:mi>r</m:mi>
															<m:mrow>
																<m:mn>12</m:mn>
															</m:mrow>
															<m:mn>2</m:mn>
														</m:msubsup>
													</m:mrow>
												</m:mfrac>
											</m:mrow>
										</m:mtd>
									</m:mtr>
								</m:mtable>
								<m:mo>.</m:mo>
							</m:mrow>
							<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaqbaeqabeWaaaqaaGGaciab=j7aInaaBaaaleaacqaIXaqmaeqaaOGaeyypa0tcfa4aaSaaaeaacqWGYbGCdaWgaaqaaiabdMha5jabigdaXaqabaGaeyOeI0IaemOCai3aaSbaaeaacqaIXaqmcqaIYaGmaeqaaiabdkhaYnaaBaaabaGaemyEaKNaeGOmaidabeaaaeaacqaIXaqmcqGHsislcqWGYbGCdaqhaaqaaiabigdaXiabikdaYaqaaiabikdaYaaaaaaakeaacqGGMaGjaeaacqWFYoGydaWgaaWcbaGaeGOmaidabeaakiabg2da9KqbaoaalaaabaGaemOCai3aaSbaaeaacqWG5bqEcqaIYaGmaeqaaiabgkHiTiabdkhaYnaaBaaabaGaeGymaeJaeGOmaidabeaacqWGYbGCdaWgaaqaaiabdMha5jabigdaXaqabaaabaGaeGymaeJaeyOeI0IaemOCai3aa0baaeaacqaIXaqmcqaIYaGmaeaacqaIYaGmaaaaaaaakiabc6caUaaa@5BF8@</m:annotation>
						</m:semantics>
					</m:math>
				</display-formula>
			</p>
			<p>Now suppose that <it>y </it>is adult blood pressure (<it>BP</it>), <it>x</it>
				<sub>1 </sub>birth weight (<it>BW</it>), and <it>x</it>
				<sub>2 </sub>adult current weight (<it>CW</it>). Many studies have shown the bivariate correlation (<it>r</it>
				<sub>
					<it>y</it>1</sub>) between <it>BP </it>(<it>y</it>) and birth weight (<it>x</it>
				<sub>1</sub>) to be negative though weak <abbrgrp>
					<abbr bid="B38">38</abbr>
					<abbr bid="B39">39</abbr>
				</abbrgrp>, whilst others show this to be positive <abbrgrp>
					<abbr bid="B40">40</abbr>
				</abbrgrp>; for illustrative purposes only, assume that <it>r</it>
				<sub>
					<it>y</it>1 </sub>is zero. Many studies show the bivariate correlation (<it>r</it>
				<sub>
					<it>y</it>2</sub>) between <it>BP </it>(<it>y</it>) and current weight (<it>x</it>
				<sub>2</sub>) to be positive <abbrgrp>
					<abbr bid="B41">41</abbr>
				</abbrgrp>. When <it>BP </it>is regressed on birth weight and current weight, the model fit assessed by <it>R</it>
				<sup>2 </sup>becomes <abbrgrp>
					<abbr bid="B37">37</abbr>
				</abbrgrp>:</p>
			<p>
				<display-formula id="M2">
					<it>R</it>
					<sup>2 </sup>= <it>r</it>
					<sub>
						<it>y</it>1 </sub>* <it>&#946; </it>
					<sub>1 </sub>+ <it>r</it>
					<sub>
						<it>y</it>2 </sub>* <it>&#946; </it>
					<sub>2</sub>.</display-formula>
			</p>
			<p>Since <it>r</it>
				<sub>
					<it>y</it>1 </sub>is equal to zero, equation (2) becomes:</p>
			<p>
				<display-formula id="M3">
					<m:math name="1742-7622-5-2-i2" xmlns:m="http://www.w3.org/1998/Math/MathML">
						<m:semantics>
							<m:mrow>
								<m:msup>
									<m:mi>R</m:mi>
									<m:mn>2</m:mn>
								</m:msup>
								<m:mo>=</m:mo>
								<m:msub>
									<m:mi>r</m:mi>
									<m:mrow>
										<m:mi>y</m:mi>
										<m:mn>2</m:mn>
									</m:mrow>
								</m:msub>
								<m:mo>*</m:mo>
								<m:msub>
									<m:mi>&#946;</m:mi>
									<m:mn>2</m:mn>
								</m:msub>
								<m:mo>=</m:mo>
								<m:mfrac>
									<m:mrow>
										<m:msubsup>
											<m:mi>r</m:mi>
											<m:mrow>
												<m:mi>y</m:mi>
												<m:mn>2</m:mn>
											</m:mrow>
											<m:mn>2</m:mn>
										</m:msubsup>
									</m:mrow>
									<m:mrow>
										<m:mn>1</m:mn>
										<m:mo>&#8722;</m:mo>
										<m:msubsup>
											<m:mi>r</m:mi>
											<m:mrow>
												<m:mn>12</m:mn>
											</m:mrow>
											<m:mn>2</m:mn>
										</m:msubsup>
									</m:mrow>
								</m:mfrac>
								<m:mo>.</m:mo>
							</m:mrow>
							<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemOuai1aaWbaaSqabeaacqaIYaGmaaGccqGH9aqpcqWGYbGCdaWgaaWcbaGaemyEaKNaeGOmaidabeaakiabcQcaQGGaciab=j7aInaaBaaaleaacqaIYaGmaeqaaOGaeyypa0tcfa4aaSaaaeaacqWGYbGCdaqhaaqaaiabdMha5jabikdaYaqaaiabikdaYaaaaeaacqaIXaqmcqGHsislcqWGYbGCdaqhaaqaaiabigdaXiabikdaYaqaaiabikdaYaaaaaGccqGGUaGlaaa@44F9@</m:annotation>
						</m:semantics>
					</m:math>
				</display-formula>
			</p>
			<p>Since <inline-formula>
					<m:math name="1742-7622-5-2-i3" xmlns:m="http://www.w3.org/1998/Math/MathML">
						<m:semantics>
							<m:mrow>
								<m:mn>1</m:mn>
								<m:mo>&#8722;</m:mo>
								<m:msubsup>
									<m:mi>r</m:mi>
									<m:mrow>
										<m:mn>12</m:mn>
									</m:mrow>
									<m:mn>2</m:mn>
								</m:msubsup>
							</m:mrow>
							<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaeGymaeJaeyOeI0IaemOCai3aa0baaSqaaiabigdaXiabikdaYaqaaiabikdaYaaaaaa@321E@</m:annotation>
						</m:semantics>
					</m:math>
				</inline-formula> will always be smaller than 1, <inline-formula>
					<m:math name="1742-7622-5-2-i4" xmlns:m="http://www.w3.org/1998/Math/MathML">
						<m:semantics>
							<m:mrow>
								<m:msup>
									<m:mi>R</m:mi>
									<m:mn>2</m:mn>
								</m:msup>
								<m:mo>=</m:mo>
								<m:mfrac>
									<m:mrow>
										<m:msubsup>
											<m:mi>r</m:mi>
											<m:mrow>
												<m:mi>y</m:mi>
												<m:mn>2</m:mn>
											</m:mrow>
											<m:mn>2</m:mn>
										</m:msubsup>
									</m:mrow>
									<m:mrow>
										<m:mn>1</m:mn>
										<m:mo>&#8722;</m:mo>
										<m:msubsup>
											<m:mi>r</m:mi>
											<m:mrow>
												<m:mn>12</m:mn>
											</m:mrow>
											<m:mn>2</m:mn>
										</m:msubsup>
									</m:mrow>
								</m:mfrac>
							</m:mrow>
							<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemOuai1aaWbaaSqabeaacqaIYaGmaaGccqGH9aqpjuaGdaWcaaqaaiabdkhaYnaaDaaabaGaemyEaKNaeGOmaidabaGaeGOmaidaaaqaaiabigdaXiabgkHiTiabdkhaYnaaDaaabaGaeGymaeJaeGOmaidabaGaeGOmaidaaaaaaaa@3AFB@</m:annotation>
						</m:semantics>
					</m:math>
				</inline-formula> will always be greater than <inline-formula>
					<m:math name="1742-7622-5-2-i5" xmlns:m="http://www.w3.org/1998/Math/MathML">
						<m:semantics>
							<m:mrow>
								<m:msubsup>
									<m:mi>r</m:mi>
									<m:mrow>
										<m:mi>y</m:mi>
										<m:mn>2</m:mn>
									</m:mrow>
									<m:mn>2</m:mn>
								</m:msubsup>
							</m:mrow>
							<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemOCai3aa0baaSqaaiabdMha5jabikdaYaqaaiabikdaYaaaaaa@30CC@</m:annotation>
						</m:semantics>
					</m:math>
				</inline-formula>. By including <it>x</it>
				<sub>1 </sub>in the regression model, more variance of <it>y </it>is 'explained', i.e. the predictability of the model increases. However, this seems counterintuitive, since the zero bivariate correlation between <it>y </it>and <it>x</it>
				<sub>1 </sub>(<it>r</it>
				<sub>
					<it>y</it>1 </sub>= 0) indicates that no more variance in <it>y </it>can be explained by <it>x</it>
				<sub>1</sub>. So where does the additional 'explained variance' in <it>y </it>come from when <it>x</it>
				<sub>1 </sub>is entered in the regression model? The answer is that the additional explained variance in <it>y </it>comes from <it>x</it>
				<sub>2</sub>.</p>
			<p>Although <it>x</it>
				<sub>1 </sub>is not correlated with <it>y</it>, it is positively correlated with <it>x</it>
				<sub>2</sub>, which in turn is positively correlated with <it>y</it>. When <it>x</it>
				<sub>1 </sub>is entered in the model, it 'suppresses' the part of <it>x</it>
				<sub>2 </sub>that is uncorrelated with <it>y</it>, thereby increasing overall predictability. In other words, the role of <it>x</it>
				<sub>1 </sub>in the model is to suppress (reduce) the noise (the uncorrelated component of <it>x</it>
				<sub>2</sub>) within the correlation between <it>y </it>and <it>x</it>
				<sub>2</sub>, as though any uncertainty in <it>x</it>
				<sub>2 </sub>'predicting' <it>y </it>is 'explained' by <it>x</it>
				<sub>1</sub>.</p>
			<p>It is not only <it>R</it>
				<sup>2 </sup>that is increased; the coefficient for <it>x</it>
				<sub>2</sub>, <inline-formula>
					<m:math name="1742-7622-5-2-i6" xmlns:m="http://www.w3.org/1998/Math/MathML">
						<m:semantics>
							<m:mrow>
								<m:msub>
									<m:mi>&#946;</m:mi>
									<m:mn>2</m:mn>
								</m:msub>
								<m:mo>=</m:mo>
								<m:mfrac>
									<m:mrow>
										<m:msub>
											<m:mi>r</m:mi>
											<m:mrow>
												<m:mi>y</m:mi>
												<m:mn>2</m:mn>
											</m:mrow>
										</m:msub>
									</m:mrow>
									<m:mrow>
										<m:mn>1</m:mn>
										<m:mo>&#8722;</m:mo>
										<m:msubsup>
											<m:mi>r</m:mi>
											<m:mrow>
												<m:mn>12</m:mn>
											</m:mrow>
											<m:mn>2</m:mn>
										</m:msubsup>
									</m:mrow>
								</m:mfrac>
							</m:mrow>
							<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGae8NSdi2aaSbaaSqaaiabikdaYaqabaGccqGH9aqpjuaGdaWcaaqaaiabdkhaYnaaBaaabaGaemyEaKNaeGOmaidabeaaaeaacqaIXaqmcqGHsislcqWGYbGCdaqhaaqaaiabigdaXiabikdaYaqaaiabikdaYaaaaaaaaa@3A82@</m:annotation>
						</m:semantics>
					</m:math>
				</inline-formula>, becomes greater than <it>r</it>
				<sub>
					<it>y</it>2</sub>. Furthermore, although <it>r</it>
				<sub>
					<it>y</it>1 </sub>is equal to zero, <it>&#946;</it>
				<sub>1 </sub>is not zero and becomes negative: <inline-formula>
					<m:math name="1742-7622-5-2-i7" xmlns:m="http://www.w3.org/1998/Math/MathML">
						<m:semantics>
							<m:mrow>
								<m:msub>
									<m:mi>&#946;</m:mi>
									<m:mn>1</m:mn>
								</m:msub>
								<m:mo>=</m:mo>
								<m:mfrac>
									<m:mrow>
										<m:mo>&#8722;</m:mo>
										<m:msub>
											<m:mi>r</m:mi>
											<m:mrow>
												<m:mn>12</m:mn>
											</m:mrow>
										</m:msub>
										<m:msub>
											<m:mi>r</m:mi>
											<m:mrow>
												<m:mi>y</m:mi>
												<m:mn>2</m:mn>
											</m:mrow>
										</m:msub>
									</m:mrow>
									<m:mrow>
										<m:mn>1</m:mn>
										<m:mo>&#8722;</m:mo>
										<m:msubsup>
											<m:mi>r</m:mi>
											<m:mrow>
												<m:mn>12</m:mn>
											</m:mrow>
											<m:mn>2</m:mn>
										</m:msubsup>
									</m:mrow>
								</m:mfrac>
							</m:mrow>
							<m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaacciGae8NSdi2aaSbaaSqaaiabigdaXaqabaGccqGH9aqpjuaGdaWcaaqaaiabgkHiTiabdkhaYnaaBaaabaGaeGymaeJaeGOmaidabeaacqWGYbGCdaWgaaqaaiabdMha5jabikdaYaqabaaabaGaeGymaeJaeyOeI0IaemOCai3aa0baaeaacqaIXaqmcqaIYaGmaeaacqaIYaGmaaaaaaaa@3EDD@</m:annotation>
						</m:semantics>
					</m:math>
				</inline-formula>. In general, the greater the positive correlation between <it>x</it>
				<sub>1 </sub>and <it>x</it>
				<sub>2</sub>, the greater the absolute value of <it>&#946;</it>
				<sub>1 </sub>and <it>&#946;</it>
				<sub>2</sub>. However, having <it>r</it>
				<sub>
					<it>y</it>1 </sub>equal zero (or being negative) is not necessary to observe suppression; <it>r</it>
				<sub>
					<it>y</it>1 </sub>may be positive and <it>x</it>
				<sub>1 </sub>may still be a suppressor <abbrgrp>
					<abbr bid="B35">35</abbr>
				</abbrgrp>.</p>
			<p>It was Paul Horst, in 1941, who first explored this curious phenomenon within educational research <abbrgrp>
					<abbr bid="B42">42</abbr>
				</abbrgrp>, and in the last few decades, many statisticians have been interested in this topic <abbrgrp>
					<abbr bid="B33">33</abbr>
					<abbr bid="B34">34</abbr>
					<abbr bid="B35">35</abbr>
				</abbrgrp>. There are still very few discussions within the clinical and epidemiological literature regarding the impact of suppression (i.e. the impact on the changes in the regression coefficients and <it>R</it>
				<sup>2</sup>) on the interpretation of non-randomised studies whilst making statistical adjustment for covariates within regression <abbrgrp>
					<abbr bid="B12">12</abbr>
					<abbr bid="B43">43</abbr>
				</abbrgrp>.</p>
			<sec>
				<st>
					<p>A numerical example</p>
				</st>
				<p>Considering the previous numerical examples for Simpson's paradox and Lord's paradox, all three variables are now treated as continuous. Simple regression shows a positive association between <it>BP </it>and birth weight: the regression coefficient for birth weight is 1.861 mmHg/Kg (95% CI: 0.770, 2.953). Simple regression also reveals a positive association between <it>BP </it>and current weight: the regression coefficient for current weight is 0.382 (95% CI = 0.341, 0.423) mmHg/Kg. Following the practice of many previous studies, <it>BP </it>is regressed on birth weight and current weight simultaneously and the partial regression coefficients for birth weight and current weight are -3.708 (95% CI = -4.794, -2.622) and 0.465 (95% CI = 0.418, 0.512) mmHg/Kg respectively, and both are highly statistically significant (Table <tblr tid="T3">3</tblr>). Thus, after adjusting for current weight, birth weight has a significant inverse association with <it>BP</it>, suggesting that hypertension is associated with lower birth weight.</p>
				<tbl id="T3">
					<title>
						<p>Table 3</p>
					</title>
					<caption>
						<p>Simple and multiple regression models for simulated hypothetical data on birth weight (<it>BW</it>), blood pressure (<it>BP</it>), and current body weight (<it>CW</it>); the dependent variable in all three models is <it>BP</it>.</p>
					</caption>
					<tblbdy cols="6">
						<r>
							<c ca="center">
								<p>
									<b>Model</b>
								</p>
							</c>
							<c cspan="2" ca="center">
								<p>
									<b>Regression Coefficients (Standard Errors)</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>Standardised Coefficients</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<it>P</it>-values</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>
										<it>R<sup>2</sup>
										</it>
									</b>
								</p>
							</c>
						</r>
						<r>
							<c cspan="6">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<b>1</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>Intercept</b>
								</p>
							</c>
							<c ca="center">
								<p>123.258 (1.981)</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>(&lt; 0.001) <sup>&#8224;</sup>
								</p>
							</c>
							<c ca="center">
								<p>0.011</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>Birth weight</b>
								</p>
							</c>
							<c ca="center">
								<p>1.861 (0.556)</p>
							</c>
							<c ca="center">
								<p>-0.105</p>
							</c>
							<c ca="center">
								<p>0.001</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<b>2</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>Intercept</b>
								</p>
							</c>
							<c ca="center">
								<p>98.173 (1.755)</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>(&lt; 0.001)<sup>&#8224;</sup>
								</p>
							</c>
							<c ca="center">
								<p>0.251</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>Current weight</b>
								</p>
							</c>
							<c ca="center">
								<p>0.382 (0.021)</p>
							</c>
							<c ca="center">
								<p>0.501</p>
							</c>
							<c ca="center">
								<p>&lt; 0.001</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c ca="center">
								<p>
									<b>3</b>
								</p>
							</c>
							<c ca="center">
								<p>
									<b>Intercept</b>
								</p>
							</c>
							<c ca="center">
								<p>104.330 (1.948)</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>(&lt; 0.001)<sup>&#8224;</sup>
								</p>
							</c>
							<c ca="center">
								<p>0.283</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>Birth weight</b>
								</p>
							</c>
							<c ca="center">
								<p>-3.708 (0.553)</p>
							</c>
							<c ca="center">
								<p>-0.210</p>
							</c>
							<c ca="center">
								<p>&lt; 0.001</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>
									<b>Current Weight</b>
								</p>
							</c>
							<c ca="center">
								<p>0.465 (0.024)</p>
							</c>
							<c ca="center">
								<p>0.610</p>
							</c>
							<c ca="center">
								<p>&lt; 0.001</p>
							</c>
							<c>
								<p/>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>
							<sup>&#8224; </sup>It is irrelevant to formally test the intercept for statistical significance in this instance.</p>
					</tblfn>
				</tbl>
				<p>It is noteworthy that not only the association of birth weight with <it>BP </it>is reversed (coefficients change from 1.861 to -3.708 mmHg/Kg), but that the impact of current weight also increases from 0.382 to 0.465 mmHg/Kg. The <it>R</it>
					<sup>2 </sup>for multiple regression is 0.283, which is greater than the sum of the squared correlations for birth weight ((0.105)<sup>2 </sup>= 0.011) and current weight ((0.501)<sup>2 </sup>= 0.251), i.e. 0.262. Therefore, the explained variance of <it>BP </it>is greater than the sum of the explained variances for the two simple regression models.</p>
				<p>Figure <figr fid="F4">4</figr> is a three-dimensional representation of the associations amongst the three continuous variables. Although the solid black line shows that birth weight has a positive association with blood pressure, the various horizontal red lines with a negative slope indicate that at each level of current weight, birth weight has an inverse relationship with blood pressure.</p>
				<fig id="F4">
					<title>
						<p>Figure 4</p>
					</title>
					<caption>
						<p>A 3-dimensional scatter plot for the numerical example of suppression</p>
					</caption>
					<text>
						<p>A 3-dimensional scatter plot for the numerical example of suppression. The solid black line shows that the marginal relation between <it>BP </it>and birth weight is positive. However, the conditional relation between <it>BP </it>and birth weight (conditional on current weight) is reversed (the horizontal red lines).</p>
					</text>
					<graphic file="1742-7622-5-2-4"/>
				</fig>
			</sec>
			<sec>
				<st>
					<p>Interpretation</p>
				</st>
				<p>In the hypothetical foetal origins example, the strength of association between <it>BP </it>and birth weight differs considerably between simple regression and multiple regression. Which model genuinely reflects their true causal relationship depends on whether or not current weight should be adjusted for; whether or not current weight is a confounder for the relationship between <it>BP </it>and birth weight, which depends upon biological and clinical knowledge, not <it>ad hoc </it>statistical analyses and changes in the estimated effects <abbrgrp>
						<abbr bid="B11">11</abbr>
					</abbrgrp>. The question is whether or not it is also biologically and clinically feasible to isolate the independent effect of birth weight on <it>BP </it>by removing the impact of current weight on <it>BP </it>
					<abbrgrp>
						<abbr bid="B3">3</abbr>
						<abbr bid="B5">5</abbr>
						<abbr bid="B6">6</abbr>
						<abbr bid="B7">7</abbr>
						<abbr bid="B44">44</abbr>
					</abbrgrp>. In other words, changes in the regression coefficient for birth weight caused by current weight being adjusted for in multiple regression is irrelevant to whether or not current weight is viewed to be a confounder. The definition of confounding depends upon the <it>a priori </it>causal model assumed by the investigator <abbrgrp>
						<abbr bid="B8">8</abbr>
						<abbr bid="B11">11</abbr>
					</abbrgrp>, which then dictates which statistical model is adopted.</p>
				<p>In statistical language, results from adjustment for current weight are <it>conditional </it>on all babies growing to the same size in adulthood. In Simpson's paradox, the 'paradox' is due to differences in the results between unconditional and conditional risk/probability, and in Lord's paradox, it is due to differences in the results between unconditional and conditional means. In suppression, the paradox is due to differences in the results between the marginal (i.e. unconditional) <it>BP-</it>birth weight relation and the <it>BP-</it>birth weight relation conditional on current weight.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Discussion</p>
			</st>
			<p>The reversal paradox is often used as the generic name for <it>Simpson's paradox</it>, <it>Lord's Paradox</it>, and <it>suppression </it>(see Table <tblr tid="T4">4</tblr>). Whilst the original definition and naming of the reversal paradox was derived from the notion that the direction of a relationship between two variables might be reversed after a third variable is introduced, this nevertheless may generalise to scenarios where the relationship between two variables is enhanced, not reduced or reversed, after the third variable is introduced (as with many studies on the foetal origins hypothesis).</p>
			<tbl id="T4">
				<title>
					<p>Table 4</p>
				</title>
				<caption>
					<p>Comparison of Simpson's paradox, Lord's Paradox, and suppression</p>
				</caption>
				<tblbdy cols="4">
					<r>
						<c ca="center">
							<p>
								<b>Type of Reversal Paradox</b>
							</p>
						</c>
						<c ca="center">
							<p>
								<b>Outcome (<it>illustrated example</it>)</b>
							</p>
						</c>
						<c ca="center">
							<p>
								<b>Exposure (<it>illustrated example</it>)</b>
							</p>
						</c>
						<c ca="center">
							<p>
								<b>Covariate/'Confounder' (<it>illustrated example</it>)</b>
							</p>
						</c>
					</r>
					<r>
						<c cspan="4">
							<hr/>
						</c>
					</r>
					<r>
						<c ca="center">
							<p>
								<b>Simpson's Paradox</b>
							</p>
						</c>
						<c ca="center">
							<p>Categorical (<it>hypertension</it>)</p>
						</c>
						<c ca="center">
							<p>Categorical (<it>birth weight: high vs. low</it>)</p>
						</c>
						<c ca="center">
							<p>Categorical (<it>current weight: high vs. low</it>)</p>
						</c>
					</r>
					<r>
						<c ca="center">
							<p>
								<b>Lord's paradox</b>
							</p>
						</c>
						<c ca="center">
							<p>Continuous (<it>blood pressure</it>)</p>
						</c>
						<c ca="center">
							<p>Categorical (<it>birth weight: high vs. low</it>)</p>
						</c>
						<c ca="center">
							<p>Continuous (<it>current weight</it>)</p>
						</c>
					</r>
					<r>
						<c ca="center">
							<p>
								<b>Suppression</b>
							</p>
						</c>
						<c ca="center">
							<p>Continuous (<it>blood pressure</it>)</p>
						</c>
						<c ca="center">
							<p>Continuous (<it>birth weight</it>)</p>
						</c>
						<c ca="center">
							<p>Continuous (<it>current weight</it>)</p>
						</c>
					</r>
				</tblbdy>
			</tbl>
			<p>In non-randomised studies, the reversal paradox can often occur due to 'controlling' for what is typically termed a confounder, even though a clear definition of what is meant by 'confounder' is rarely provided (contingent on understanding its role in the biological/clinical process being modelled). Differences in the strength or even direction of any association between outcome and exposure might give rise to contradictory interpretations regarding potential <it>causal </it>relationships. Furthermore, it is very difficult, if not impossible, to compare results across studies where many varied attempts are made to control for different confounders, especially in the absence of any consistent reasoning given for the choice of confounders. In some situations, statistical adjustment might introduce bias rather than eliminate it <abbrgrp>
					<abbr bid="B45">45</abbr>
				</abbrgrp>.</p>
			<p>It might be suggested that the adjustment of current weight in our foetal origins example can be viewed as estimations of direct and indirect effects, such as those in path analysis or structural equation modelling. Recall Figure <figr fid="F1">1a</figr>, the path from birth weight to BP is to estimate the direct effect of birth weight &#8594; BP, and then the path from birth weight &#8594; current weight &#8594; BP is to estimate the indirect effect. For instance, in the model 3 of Table <tblr tid="T3">3</tblr>, the regression coefficient for birth weight, -3.708, is the direct effect, and the indirect effect is derived from 0.465 (the regression coefficient for current weight in model 3) multiplied by 11.976 (the simple regression coefficient for birth weight when current weight is regressed on birth weight) = 5.569. The total effect is therefore -3.708 + 5.569 = 1.861, which is the simple regression coefficient for birth weight in the model 1 of Table <tblr tid="T3">3</tblr>. Our reservation with interpreting the results from model 3 as the partition of the total effect into direct and indirect effect is that many variables, such as current height and current BMI, can be put in between birth weight and BP, and it can be claimed that there is more than one indirect effect. Furthermore, any body size measured after birth, for example, body weight at year one, year two etc, can be adjusted for in the model and presumably used to estimate the indirect effects and direct effect. Whilst the total effect of birth weight on BP is not affected by the numbers of intermediate body size variables in the model, the estimation of 'direct' effect differs when different intermediate variables are adjusted for. Unless there is experimental evidence to support the notion that there are indeed different paths of direct and indirect effects from birth weight to BP, we are cautious of using such terminology to label the results from multiple regression, as with model 3. In other words, to determine whether the unconditional or conditional relationship reflects the true physiological relationship between birth weight and blood pressure, experiments in which birth weight and current weight can be manipulated are required in order to estimate the impact of birth weight on blood pressure.</p>
			<p>Although the three statistical paradoxes occur in different types of variables, they share the same characteristic: the association between two variables can be reversed, diminished, or enhanced when another variable is statistically controlled for. Understanding the concepts and theory behind these paradoxes will provide insights into some of the controversial or contradictory results from previous research. Prior knowledge and theory play an important role in the statistical modelling of non-randomised data. Incorrect use of statistical models might produce consistent, replicable, yet erroneous results.</p>
		</sec>
		<sec>
			<st>
				<p>Competing interests</p>
			</st>
			<p>The author(s) declare that they have no competing interests.</p>
		</sec>
	</bdy>
   <bm>
		<ack>
			<sec>
				<st>
					<p>Acknowledgements</p>
				</st>
				<p>We are very grateful for the constructive comments of two reviewers. One reviewer brought to our attention of the excellent paper by Cox and Wermuth <abbrgrp>
						<abbr bid="B9">9</abbr>
					</abbrgrp>. YKT conceived the ideas of this study and wrote the first draft. DG and MSG contributed to the discussion of these ideas and writing of the final draft.</p>
			</sec>
		</ack>
		<refgrp>
			<bibl id="B1">
				<title>
					<p>Fetal origins of coronary heart disease</p>
				</title>
				<aug>
					<au>
						<snm>Barker</snm>
						<fnm>DJ</fnm>
					</au>
				</aug>
				<source>BMJ</source>
				<pubdate>1995</pubdate>
				<volume>311</volume>
				<fpage>171</fpage>
				<lpage>4</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">7613432</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B2">
				<title>
					<p>Fetal origins of adult disease: strength of effects and biological basis</p>
				</title>
				<aug>
					<au>
						<snm>Barker</snm>
						<fnm>DJ</fnm>
					</au>
					<au>
						<snm>Eriksson</snm>
						<fnm>JG</fnm>
					</au>
					<au>
						<snm>Forsen</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Osmond</snm>
						<fnm>C</fnm>
					</au>
				</aug>
				<source>Int J Epidemiol</source>
				<pubdate>2002</pubdate>
				<volume>31</volume>
				<fpage>1235</fpage>
				<lpage>9</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/ije/31.6.1235</pubid>
						<pubid idtype="pmpid" link="fulltext">12540728</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B3">
				<title>
					<p>Early nutritional origins of hypertension: a hypothesis still lacking support</p>
				</title>
				<aug>
					<au>
						<snm>Paneth</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Ahmed</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Stein</snm>
						<fnm>AD</fnm>
					</au>
				</aug>
				<source>Journal of Hypertensio</source>
				<pubdate>1996</pubdate>
				<volume>14</volume>
				<issue>5</issue>
				<fpage>S121</fpage>
				<lpage>S129</lpage>
			</bibl>
			<bibl id="B4">
				<title>
					<p>Fetal origins of adult disease-the hypothesis revisited</p>
				</title>
				<aug>
					<au>
						<snm>Lucas</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Fewtrell</snm>
						<fnm>MS</fnm>
					</au>
					<au>
						<snm>Cole</snm>
						<fnm>TJ</fnm>
					</au>
				</aug>
				<source>BMJ</source>
				<pubdate>1999</pubdate>
				<volume>319</volume>
				<fpage>245</fpage>
				<lpage>9</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1116334</pubid>
						<pubid idtype="pmpid" link="fulltext">10417093</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B5">
				<title>
					<p>Unravelling the fetal origins hypothesis: is there really an inverse association between birth weight and subsequent blood pressure?</p>
				</title>
				<aug>
					<au>
						<snm>Huxley</snm>
						<fnm>RR</fnm>
					</au>
					<au>
						<snm>Neil</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Collins</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Lancet</source>
				<pubdate>2002</pubdate>
				<volume>360</volume>
				<fpage>659</fpage>
				<lpage>65</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0140-6736(02)09834-3</pubid>
						<pubid idtype="pmpid" link="fulltext">12241871</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B6">
				<title>
					<p>Why evidence for the fetal origins of adult disease might be a statistical artifact: the "reversal paradox" for the relation between birth weight and blood pressure in later life</p>
				</title>
				<aug>
					<au>
						<snm>Tu</snm>
						<fnm>YK</fnm>
					</au>
					<au>
						<snm>West</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Ellison</snm>
						<fnm>GTH</fnm>
					</au>
					<au>
						<snm>Gilthorpe</snm>
						<fnm>MS</fnm>
					</au>
				</aug>
				<source>Am J Epidemiol</source>
				<pubdate>2005</pubdate>
				<volume>161</volume>
				<fpage>27</fpage>
				<lpage>32</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/aje/kwi002</pubid>
						<pubid idtype="pmpid" link="fulltext">15615910</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B7">
				<title>
					<p>Invited commentary: Barker meets Simpson</p>
				</title>
				<aug>
					<au>
						<snm>Weinberg</snm>
						<fnm>CR</fnm>
					</au>
				</aug>
				<source>Am J Epidemiol</source>
				<pubdate>2005</pubdate>
				<volume>161</volume>
				<fpage>33</fpage>
				<lpage>5</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/aje/kwi003</pubid>
						<pubid idtype="pmpid" link="fulltext">15615911</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B8">
				<title>
					<p>Statistical issues in life course epidemiology</p>
				</title>
				<aug>
					<au>
						<snm>De Stavola</snm>
						<fnm>BL</fnm>
					</au>
					<au>
						<snm>Nitsch</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>dos Santos Silva</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>McCormack</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Hardy</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Mann</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Cole</snm>
						<fnm>TJ</fnm>
					</au>
					<au>
						<snm>Morton</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Leon</snm>
						<fnm>DA</fnm>
					</au>
				</aug>
				<source>Am J Epidemiol</source>
				<pubdate>2006</pubdate>
				<volume>163</volume>
				<fpage>84</fpage>
				<lpage>96</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/aje/kwj003</pubid>
						<pubid idtype="pmpid" link="fulltext">16306313</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B9">
				<title>
					<p>Causality: Models, Reasoning, and Inference</p>
				</title>
				<aug>
					<au>
						<snm>Pearl</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<publisher>Cambridge: Cambridge University Press</publisher>
				<pubdate>2000</pubdate>
			</bibl>
			<bibl id="B10">
				<title>
					<p>Confounding and collapsibility in causal inference</p>
				</title>
				<aug>
					<au>
						<snm>Greenland</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Robins</snm>
						<fnm>JM</fnm>
					</au>
					<au>
						<snm>Pearl</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Stat Sci</source>
				<pubdate>1999</pubdate>
				<volume>14</volume>
				<fpage>29</fpage>
				<lpage>46</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1214/ss/1009211805</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B11">
				<title>
					<p>Statistics for Epidemiology</p>
				</title>
				<aug>
					<au>
						<snm>Jewell</snm>
						<fnm>NP</fnm>
					</au>
				</aug>
				<publisher>London: Chapman &amp; Hall</publisher>
				<pubdate>2004</pubdate>
			</bibl>
			<bibl id="B12">
				<title>
					<p>A general condition for avoiding effect reversal after marginalisation</p>
				</title>
				<aug>
					<au>
						<snm>Cox</snm>
						<fnm>DR</fnm>
					</au>
					<au>
						<snm>Wermuth</snm>
						<fnm>N</fnm>
					</au>
				</aug>
				<source>J R Statist Soc B</source>
				<pubdate>2003</pubdate>
				<volume>65</volume>
				<fpage>937</fpage>
				<lpage>941</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1111/1467-9868.00424</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B13">
				<title>
					<p>The interpretation of interaction in contingency tables</p>
				</title>
				<aug>
					<au>
						<snm>Simpson</snm>
						<fnm>EH</fnm>
					</au>
				</aug>
				<source>J R Stat Soc Ser B</source>
				<pubdate>1951</pubdate>
				<volume>13</volume>
				<fpage>238</fpage>
				<lpage>41</lpage>
			</bibl>
			<bibl id="B14">
				<title>
					<p>Notes on the theory of association of attributes in statistics</p>
				</title>
				<aug>
					<au>
						<snm>Yule</snm>
						<fnm>GU</fnm>
					</au>
				</aug>
				<source>Biometrika</source>
				<pubdate>1903</pubdate>
				<volume>2</volume>
				<fpage>121</fpage>
				<lpage>34</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1093/biomet/2.2.121</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B15">
				<title>
					<p>Mathematical contributions to the theory of evolution: VI &#8211; Genetic (reproductive) selection: Inheritance of fertility in man, and of fecundity in thoroughbred racehorses</p>
				</title>
				<aug>
					<au>
						<snm>Pearson</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Lee</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Bramley-Moore</snm>
						<fnm>L</fnm>
					</au>
				</aug>
				<source>Philos Trans R Soc Lond A</source>
				<pubdate>1899</pubdate>
				<volume>192</volume>
				<fpage>257</fpage>
				<lpage>330</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1098/rsta.1899.0006</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B16">
				<title>
					<p>Intergenerational influences affecting birth outcome. II. Preterm delivery and gestational age in the children of the 1958 British birth cohort</p>
				</title>
				<aug>
					<au>
						<snm>Hennessy</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Alberman</snm>
						<fnm>E</fnm>
					</au>
				</aug>
				<source>Paediatr Perinat Epidemiol</source>
				<pubdate>1998</pubdate>
				<volume>12</volume>
				<issue>1</issue>
				<fpage>61</fpage>
				<lpage>75</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1046/j.1365-3016.1998.0120s1061.x</pubid>
						<pubid idtype="pmpid" link="fulltext">9690274</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B17">
				<title>
					<p>Website of the Department of Health, United Kingdom</p>
				</title>
				<url>http://www.doh.gov.uk</url>
			</bibl>
			<bibl id="B18">
				<title>
					<p>A graphical representation of a three-way contingency table: Simpson's paradox and correlation</p>
				</title>
				<aug>
					<au>
						<snm>Paik</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Am Stat</source>
				<pubdate>1985</pubdate>
				<volume>39</volume>
				<fpage>53</fpage>
				<lpage>54</lpage>
				<xrefbib>
					<pubid idtype="doi">10.2307/2683907</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B19">
				<title>
					<p>The "birth weight" paradox uncovered?</p>
				</title>
				<aug>
					<au>
						<snm>Hernandez-Diaz</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Schisterman</snm>
						<fnm>EF</fnm>
					</au>
					<au>
						<snm>Hernan</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Am J Epidemiol</source>
				<pubdate>2006</pubdate>
				<volume>164</volume>
				<fpage>1115</fpage>
				<lpage>1120</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/aje/kwj275</pubid>
						<pubid idtype="pmpid" link="fulltext">16931543</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B20">
				<title>
					<p>Invited Commentary: The perils of birth weight &#8211; a lesson from directed acyclic graphs</p>
				</title>
				<aug>
					<au>
						<snm>Wilcox</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>Am J Epidemiol</source>
				<pubdate>2006</pubdate>
				<volume>164</volume>
				<fpage>1121</fpage>
				<lpage>1123</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/aje/kwj276</pubid>
						<pubid idtype="pmpid" link="fulltext">16931545</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B21">
				<title>
					<p>A paradox in the interpretation of group comparisons</p>
				</title>
				<aug>
					<au>
						<snm>Lord</snm>
						<fnm>FM</fnm>
					</au>
				</aug>
				<source>Psychol Bull</source>
				<pubdate>1967</pubdate>
				<volume>68</volume>
				<fpage>304</fpage>
				<lpage>5</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1037/h0025105</pubid>
						<pubid idtype="pmpid">6062585</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B22">
				<title>
					<p>Statistical adjustments when comparing preexisting groups</p>
				</title>
				<aug>
					<au>
						<snm>Lord</snm>
						<fnm>FM</fnm>
					</au>
				</aug>
				<source>Psychol Bull</source>
				<pubdate>1969</pubdate>
				<volume>72</volume>
				<fpage>337</fpage>
				<lpage>8</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1037/h0028108</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B23">
				<title>
					<p>When is baseline adjustment useful in analysis of change? An example with education and cognitive change</p>
				</title>
				<aug>
					<au>
						<snm>Glymour</snm>
						<fnm>MM</fnm>
					</au>
					<au>
						<snm>Weuve</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Berkman</snm>
						<fnm>LF</fnm>
					</au>
					<au>
						<snm>Kawachi</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Robins</snm>
						<fnm>JM</fnm>
					</au>
				</aug>
				<source>Am J Epidemiol</source>
				<pubdate>2005</pubdate>
				<volume>162</volume>
				<fpage>267</fpage>
				<lpage>278</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/aje/kwi187</pubid>
						<pubid idtype="pmpid" link="fulltext">15987729</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B24">
				<title>
					<p>Deconstructuring statistical questions</p>
				</title>
				<aug>
					<au>
						<snm>Hand</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>J R Stat Soc Ser A Stat Soc</source>
				<pubdate>1994</pubdate>
				<volume>157</volume>
				<fpage>317</fpage>
				<lpage>56</lpage>
				<xrefbib>
					<pubid idtype="doi">10.2307/2983526</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B25">
				<title>
					<p>A primer on regression artefact</p>
				</title>
				<aug>
					<au>
						<snm>Campbell</snm>
						<fnm>DT</fnm>
					</au>
					<au>
						<snm>Kenny</snm>
						<fnm>DA</fnm>
					</au>
				</aug>
				<publisher>Guildford: The Guilford Press</publisher>
				<pubdate>1999</pubdate>
			</bibl>
			<bibl id="B26">
				<title>
					<p>Regression artifacts and other customs of dubious desert</p>
				</title>
				<aug>
					<au>
						<snm>Mohr</snm>
						<fnm>LB</fnm>
					</au>
				</aug>
				<source>Eval Program Plann</source>
				<pubdate>2000</pubdate>
				<volume>23</volume>
				<fpage>397</fpage>
				<lpage>409</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1016/S0149-7189(00)00029-X</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B27">
				<title>
					<p>Regression facts and artifacts</p>
				</title>
				<aug>
					<au>
						<snm>Reichardt</snm>
						<fnm>CS</fnm>
					</au>
				</aug>
				<source>Eval Program Plann</source>
				<pubdate>2000</pubdate>
				<volume>23</volume>
				<fpage>411</fpage>
				<lpage>4</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1016/S0149-7189(00)00030-6</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B28">
				<title>
					<p>Adjusting for differential base rates: Lord's paradox again</p>
				</title>
				<aug>
					<au>
						<snm>Wainer</snm>
						<fnm>H</fnm>
					</au>
				</aug>
				<source>Psychol Bull</source>
				<pubdate>1991</pubdate>
				<volume>109</volume>
				<fpage>147</fpage>
				<lpage>51</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1037/0033-2909.109.1.147</pubid>
						<pubid idtype="pmpid" link="fulltext">2006226</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B29">
				<title>
					<p>Statistics on the Table</p>
				</title>
				<aug>
					<au>
						<snm>Stigler</snm>
						<fnm>SM</fnm>
					</au>
				</aug>
				<publisher>Cambridge, Massachusetts: Harvard University Press</publisher>
				<pubdate>1999</pubdate>
			</bibl>
			<bibl id="B30">
				<title>
					<p>Analysing controlled trials with baseline and follow up measurements</p>
				</title>
				<aug>
					<au>
						<snm>Vickers</snm>
						<fnm>AJ</fnm>
					</au>
					<au>
						<snm>Altman</snm>
						<fnm>DG</fnm>
					</au>
				</aug>
				<source>BMJ</source>
				<pubdate>2001</pubdate>
				<volume>323</volume>
				<fpage>1123</fpage>
				<lpage>4</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1121605</pubid>
						<pubid idtype="pmpid" link="fulltext">11701584</pubid>
						<pubid idtype="doi">10.1136/bmj.323.7321.1123</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B31">
				<title>
					<p>Statistical power for analyses of changes in randomized controlled trials</p>
				</title>
				<aug>
					<au>
						<snm>Tu</snm>
						<fnm>YK</fnm>
					</au>
					<au>
						<snm>Blance</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Clerehugh</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Gilthorpe</snm>
						<fnm>MS</fnm>
					</au>
				</aug>
				<source>J Dent Res</source>
				<pubdate>2005</pubdate>
				<volume>84</volume>
				<fpage>283</fpage>
				<lpage>287</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">15723872</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B32">
				<title>
					<p>Suppression and enhancement in bivariate regression</p>
				</title>
				<aug>
					<au>
						<snm>Lewis</snm>
						<fnm>JW</fnm>
					</au>
					<au>
						<snm>Escobar</snm>
						<fnm>LA</fnm>
					</au>
				</aug>
				<source>Statistician</source>
				<pubdate>1986</pubdate>
				<volume>35</volume>
				<fpage>17</fpage>
				<lpage>26</lpage>
				<xrefbib>
					<pubid idtype="doi">10.2307/2988294</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B33">
				<title>
					<p>A quirk in multiple regression: the whole regression can be greater than the sum of its parts</p>
				</title>
				<aug>
					<au>
						<snm>Bertrand</snm>
						<fnm>PV</fnm>
					</au>
					<au>
						<snm>Holder</snm>
						<fnm>RL</fnm>
					</au>
				</aug>
				<source>Statistician</source>
				<pubdate>1988</pubdate>
				<volume>37</volume>
				<fpage>371</fpage>
				<lpage>4</lpage>
				<xrefbib>
					<pubid idtype="doi">10.2307/2348761</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B34">
				<title>
					<p>The relationship among sums of squares, correlation coefficients and suppression</p>
				</title>
				<aug>
					<au>
						<snm>Sharpe</snm>
						<fnm>NR</fnm>
					</au>
					<au>
						<snm>Roberts</snm>
						<fnm>RA</fnm>
					</au>
				</aug>
				<source>Am Stat</source>
				<pubdate>1997</pubdate>
				<volume>51</volume>
				<fpage>46</fpage>
				<lpage>48</lpage>
				<xrefbib>
					<pubid idtype="doi">10.2307/2684693</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B35">
				<title>
					<p>Graphical views of suppression and multicollinearity in multiple linear regression</p>
				</title>
				<aug>
					<au>
						<snm>Friedman</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Wall</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>Am Stat</source>
				<pubdate>2005</pubdate>
				<fpage>127</fpage>
				<lpage>136</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1198/000313005X41337</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B36">
				<title>
					<p>Applied multiple regression/correlation analysis for the behavioural sciences</p>
				</title>
				<aug>
					<au>
						<snm>Cohen</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Cohen</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<publisher>London: LEA</publisher>
				<pubdate>1983</pubdate>
			</bibl>
			<bibl id="B37">
				<title>
					<p>Multiple regression in behavioral research: Explanation and prediction</p>
				</title>
				<aug>
					<au>
						<snm>Pedhazur</snm>
						<fnm>EJ</fnm>
					</au>
				</aug>
				<publisher>Fort Worth: Harcourt</publisher>
				<pubdate>1997</pubdate>
			</bibl>
			<bibl id="B38">
				<title>
					<p>Blood pressure and birth weight in the first year university student aged 18&#8211;25</p>
				</title>
				<aug>
					<au>
						<snm>Stocks</snm>
						<fnm>NP</fnm>
					</au>
					<au>
						<snm>Davey Smith</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>Public Health</source>
				<pubdate>1999</pubdate>
				<volume>113</volume>
				<fpage>273</fpage>
				<lpage>7</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0033-3506(99)00179-1</pubid>
						<pubid idtype="pmpid">10637518</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B39">
				<title>
					<p>Birth size, growth, and blood pressure between the ages of 7 and 26 years: failure to support the fetal origins hypothesis</p>
				</title>
				<aug>
					<au>
						<snm>Williams</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Poulton</snm>
						<fnm>R</fnm>
					</au>
				</aug>
				<source>Am J Epidemiol</source>
				<pubdate>2002</pubdate>
				<volume>155</volume>
				<fpage>849</fpage>
				<lpage>52</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/aje/155.9.849</pubid>
						<pubid idtype="pmpid" link="fulltext">11978589</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B40">
				<title>
					<p>Blood pressure in relation to birth weight in twins and singleton controls matched for gestational age</p>
				</title>
				<aug>
					<au>
						<snm>McNeill</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Tuya</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Campbell</snm>
						<fnm>DM</fnm>
					</au>
					<au>
						<snm>Haggarty</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Smith</snm>
						<fnm>WCS</fnm>
					</au>
					<au>
						<snm>Masson</snm>
						<fnm>LF</fnm>
					</au>
					<au>
						<snm>Cumming</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Broom</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Haites</snm>
						<fnm>N</fnm>
					</au>
				</aug>
				<source>Am J Epidemiol</source>
				<pubdate>2003</pubdate>
				<volume>158</volume>
				<fpage>150</fpage>
				<lpage>5</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/aje/kwg130</pubid>
						<pubid idtype="pmpid" link="fulltext">12851228</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B41">
				<title>
					<p>What is the effect of adjusting for more than one measure of current body size on the relation between birth weight and blood pressure?</p>
				</title>
				<aug>
					<au>
						<snm>Tu</snm>
						<fnm>YK</fnm>
					</au>
					<au>
						<snm>Gilthorpe</snm>
						<fnm>MS</fnm>
					</au>
					<au>
						<snm>TH Ellison</snm>
						<fnm>GTH</fnm>
					</au>
				</aug>
				<source>J Hum Hypertens</source>
				<pubdate>2006</pubdate>
				<volume>20</volume>
				<fpage>646</fpage>
				<lpage>657</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/sj.jhh.1002044</pubid>
						<pubid idtype="pmpid" link="fulltext">16691183</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B42">
				<title>
					<p>The role of prediction variables which are independent of the criterion</p>
				</title>
				<aug>
					<au>
						<snm>Horst</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>The Prediction of Personal Adjustment</source>
				<publisher>New York: Social Science Research Council</publisher>
				<editor>Horst P</editor>
				<pubdate>1941</pubdate>
				<fpage>431</fpage>
				<lpage>6</lpage>
			</bibl>
			<bibl id="B43">
				<title>
					<p>Equivalence of the mediation, confounding and suppression effect</p>
				</title>
				<aug>
					<au>
						<snm>MacKinnon</snm>
						<fnm>DP</fnm>
					</au>
					<au>
						<snm>Krull</snm>
						<fnm>JL</fnm>
					</au>
					<au>
						<snm>Lockwood</snm>
						<fnm>CM</fnm>
					</au>
				</aug>
				<source>Prev Sci</source>
				<pubdate>2000</pubdate>
				<volume>1</volume>
				<fpage>173</fpage>
				<lpage>81</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1023/A:1026595011371</pubid>
						<pubid idtype="pmpid" link="fulltext">11523746</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B44">
				<title>
					<p>Growth, current size and the role of the 'reversal paradox' in the foetal origins of adult disease: an illustration using vector geometry</p>
				</title>
				<aug>
					<au>
						<snm>Tu</snm>
						<fnm>YK</fnm>
					</au>
					<au>
						<snm>Ellison</snm>
						<fnm>GTH</fnm>
					</au>
					<au>
						<snm>Gilthorpe</snm>
						<fnm>MS</fnm>
					</au>
				</aug>
				<source>Epidemiol Perspect Innov</source>
				<pubdate>2006</pubdate>
				<volume>3</volume>
				<fpage>9</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">1557505</pubid>
						<pubid idtype="pmpid" link="fulltext">16884533</pubid>
						<pubid idtype="doi">10.1186/1742-5573-3-9</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B45">
				<title>
					<p>The scandal of poor epidemiological research</p>
				</title>
				<aug>
					<au>
						<snm>Von Elm</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Egger</snm>
						<fnm>M</fnm>
					</au>
				</aug>
				<source>BMJ</source>
				<pubdate>2004</pubdate>
				<volume>329</volume>
				<fpage>868</fpage>
				<lpage>9</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">523097</pubid>
						<pubid idtype="pmpid" link="fulltext">15485939</pubid>
						<pubid idtype="doi">10.1136/bmj.329.7471.868</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
		</refgrp>
	</bm>
</art>
