<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1742-7622-5-5</ui>
   <ji>1742-7622</ji>
   <fm>
      <dochead>Commentary</dochead>
      <bibl>
         <title>
            <p>The role of causal reasoning in understanding Simpson's paradox, Lord's paradox, and the suppression effect: covariate selection in the analysis of observational studies</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Arah</snm>
               <mi>A</mi>
               <fnm>Onyebuchi</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>o.a.arah@amc.uva.nl</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Social Medicine, Academic Medical Center, University of Amsterdam, PO Box 22700, 1100 DE Amsterdam, The Netherlands</p>
            </ins>
            <ins id="I2">
               <p>Department of Epidemiology, University of California, Los Angeles (UCLA), School of Public Health, Los Angeles, CA 90095-1772, USA</p>
            </ins>
         </insg>
         <source>Emerging Themes in Epidemiology</source>
         <issn>1742-7622</issn>
         <pubdate>2008</pubdate>
         <volume>5</volume>
         <issue>1</issue>
         <fpage>5</fpage>
         <url>http://www.ete-online.com/content/5/1/5</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18302750</pubid>
               <pubid idtype="doi">10.1186/1742-7622-5-5</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>06</day>
               <month>2</month>
               <year>2008</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>26</day>
               <month>2</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>26</day>
               <month>2</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Arah; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <p>Tu <it>et al </it>present an analysis of the equivalence of three paradoxes, namely, Simpson's, Lord's, and the suppression phenomena. They conclude that all three simply reiterate the occurrence of a change in the association of any two variables when a third variable is statistically controlled for. This is not surprising because reversal or change in magnitude is common in conditional analysis. At the heart of the phenomenon of change in magnitude, with or without reversal of effect estimate, is the question of which to use: the unadjusted (combined table) or adjusted (sub-table) estimate. Hence, Simpson's paradox and related phenomena are a problem of covariate selection and adjustment (when to adjust or not) in the causal analysis of non-experimental data. It cannot be overemphasized that although these paradoxes reveal the perils of using statistical criteria to guide causal analysis, they hold neither the explanations of the phenomenon they depict nor the pointers on how to avoid them. The explanations and solutions lie in causal reasoning which relies on background knowledge, not statistical criteria.</p>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Commentary</p>
         </st>
         <p>Simpson's paradox, Lord's paradox, and the suppression effect are examples of the perils of the statistical interpretation of a real but complex world. By rearing their heads intermittently in the literature they remind us about the inadequacy of statistical criteria for causal analysis. Those who believe in letting the data speak for themselves are in for a disappointment.</p>
         <p>Tu <it>et al </it>present an analysis of the equivalence of three paradoxes, concluding that all three simply reiterate the unsurprising change in the association of any two variables when a third variable is statistically controlled for <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. I call this unsurprising because reversal or change in magnitude is common in conditional analysis. To avoid either, we must avoid conditional analysis altogether. What is it about Simpson's and Lord's paradoxes or the suppression effect, beyond their pointing out the obvious, that attracts the intermittent and sometimes alarmist interests seen in the literature? Why are they paradoxes? A paradox is a seemingly absurd or self-contradictory statement or proposition that may in fact be true <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. What is so self-contradictory about the Simpson's, Lord's, and suppression phenomena that may turn out to be true? After reading the paper by Tu <it>et al </it>one still gets the uneasy feeling that the paradoxes are anything but surprising, that the statistical phenomenon they purport to represent are in fact causal in nature, requiring a causal language not a statistical one, and that the problem can be resolved only with causal reasoning. So, why bother with the statistics of these paradoxes, much less their equivalence, in the first instance if both the correct language and resolution lie elsewhere? Although we are given a glimpse of the appropriate tools (such as the implied causal calculus of directed acyclic graphs <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>), we must look beyond the authors' paper for satisfactory answers.</p>
         <p>At the heart of the phenomenon of change in magnitude, with or without reversal of effect estimate, is the question of which to use: the unadjusted (combined table) or adjusted (sub-table) estimate. Simpson's and Lord's paradoxes generate shock when change in direction or magnitude (or both) of an estimate is observed while we are thinking causes; we then start wondering which estimate is the correct one. Suppression effect in addition frets about model fit to assess the correctness of unadjusted versus adjusted estimates. In each case, the researcher is looking at statistics to tell her what she may not admit to: which estimate must she accord causal interpretation in what causal world? In other words, these paradoxes arise in the context of covariate selection, especially when looking to select variables for adequate control of confounding in causal analysis <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. Causal diagrams and their related causal calculus have emerged as a mathematically rigorous approach to depicting causal relations among variables, making underlying causal assumptions transparent, and guiding the selection of a sufficient set of covariates for consistent effect estimation <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>. In all causal diagrams or directed acyclic graphs (DAGs), it is important to note that the missing arrows are very important: they connote what the researcher believes to be the absence of causal effects encoded by those missing arrows. The knowledge needed to draw these DAGs and to guide subsequent analysis resides outside the study data. Hence, there can be no causal inference without background knowledge <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B7">7</abbr></abbrgrp>. To appreciate the causal approach to the paradoxes, consider the three-variable model of birth weight (BW), current weight (CW), and blood pressure (BP) used by the authors and seen in the life course epidemiology literature. In addition to the directed graphs presented by the authors, I have added a few additional, non-trivial, and non-redundant (although by no means exhaustive) graphs that could have generated their observed correlations (Figures <figr fid="F1">1</figr> to <figr fid="F7">7</figr>).</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Directed acyclic graph (DAG) showing that birth weight (<it>BW</it>) has a direct effect as well as an indirect effect via current weight (<it>CW</it>) on blood pressure (<it>BP</it>)</p>
            </caption>
            <text>
               <p>
                  <b>Directed acyclic graph (DAG) showing that birth weight (<it>BW</it>) has a direct effect as well as an indirect effect via current weight (<it>CW</it>) on blood pressure (<it>BP</it>).</b>
               </p>
            </text>
            <graphic file="1742-7622-5-5-1"/>
         </fig>
         <fig id="F2">
            <title>
               <p>Figure 2</p>
            </title>
            <caption>
               <p>DAG showing a scenario where birth weight (<it>BW</it>) has a causal effect on and shares a common cause &#8211; current weight (<it>CW</it>) &#8211; with blood pressure (<it>BP</it>)</p>
            </caption>
            <text>
               <p><b>DAG showing a scenario where birth weight (<it>BW</it>) has a causal effect on and shares a common cause &#8211; current weight (<it>CW</it>) &#8211; with blood pressure (<it>BP</it>).</b> That is, the relationship between <it>BW </it>and <it>BP </it>is confounded by <it>CW</it>.</p>
            </text>
            <graphic file="1742-7622-5-5-2"/>
         </fig>
         <fig id="F3">
            <title>
               <p>Figure 3</p>
            </title>
            <caption>
               <p>DAG showing where, in addition to its effect on blood pressure (<it>BP</it>), birth weight (<it>BW</it>) shares an unmeasured common cause (<it>U</it>) with current weight (<it>CW</it>) which also has an effect on <it>BP</it></p>
            </caption>
            <text>
               <p>
                  <b>DAG showing where, in addition to its effect on blood pressure (<it>BP</it>), birth weight (<it>BW</it>) shares an unmeasured common cause (<it>U</it>) with current weight (<it>CW</it>) which also has an effect on <it>BP</it>.</b>
               </p>
            </text>
            <graphic file="1742-7622-5-5-3"/>
         </fig>
         <fig id="F4">
            <title>
               <p>Figure 4</p>
            </title>
            <caption>
               <p>DAG showing where birth weight (<it>BW</it>) has a direct effect on blood pressure (<it>BP</it>) and shares an unmeasured common cause (<it>U</it><sub>1</sub>) with current weight (<it>CW</it>) which itself has another unmeasured common cause (<it>U</it><sub>2</sub>) with <it>BP</it></p>
            </caption>
            <text>
               <p>
                  <b>DAG showing where birth weight (<it>BW</it>) has a direct effect on blood pressure (<it>BP</it>) and shares an unmeasured common cause (<it>U</it><sub>1</sub>) with current weight (<it>CW</it>) which itself has another unmeasured common cause (<it>U</it><sub>2</sub>) with <it>BP</it>.</b>
               </p>
            </text>
            <graphic file="1742-7622-5-5-4"/>
         </fig>
         <fig id="F5">
            <title>
               <p>Figure 5</p>
            </title>
            <caption>
               <p>DAG depicting each variable pair &#8211; <it>BW</it>/<it>CW</it>, <it>CW</it>/<it>BP</it>, or <it>BW</it>/<it>BP </it>&#8211; as being connected only by an unmeasured common cause (<it>U</it><sub>1</sub>, <it>U</it><sub>2 </sub>or <it>U</it><sub>3 </sub>respectively)</p>
            </caption>
            <text>
               <p>
                  <b>DAG depicting each variable pair &#8211; <it>BW</it>/<it>CW</it>, <it>CW</it>/<it>BP</it>, or <it>BW</it>/<it>BP </it>&#8211; as being connected only by an unmeasured common cause (<it>U</it><sub>1</sub>, <it>U</it><sub>2 </sub>or <it>U</it><sub>3 </sub>respectively).</b>
               </p>
            </text>
            <graphic file="1742-7622-5-5-5"/>
         </fig>
         <fig id="F6">
            <title>
               <p>Figure 6</p>
            </title>
            <caption>
               <p>DAG where, like in Figure 2, the effect of birth weight (<it>BW</it>) on blood pressure (<it>BP</it>) is confounded by current weight (<it>CW</it>) which itself has an unmeasured common cause (<it>U</it>) with <it>BP</it></p>
            </caption>
            <text>
               <p>
                  <b>DAG where, like in Figure 2, the effect of birth weight (<it>BW</it>) on blood pressure (<it>BP</it>) is confounded by current weight (<it>CW</it>) which itself has an unmeasured common cause (<it>U</it>) with <it>BP</it>.</b>
               </p>
            </text>
            <graphic file="1742-7622-5-5-6"/>
         </fig>
         <fig id="F7">
            <title>
               <p>Figure 7</p>
            </title>
            <caption>
               <p>DAG depicting a modification of Figure 1 where <it>CW </it>has a common cause (<it>U</it>) with <it>BP</it></p>
            </caption>
            <text>
               <p><b>DAG depicting a modification of Figure 1 where <it>CW </it>has a common cause (<it>U</it>) with <it>BP</it>.</b> In the directed path <it>BW</it>&#8594;<it>CW</it>&#8592;<it>U</it>&#8594;<it>BP</it>, <it>CW </it>acts as a collider if left uncontrolled for in the analysis of the effect of <it>BW </it>on <it>BP</it>.</p>
            </text>
            <graphic file="1742-7622-5-5-7"/>
         </fig>
         <p>Now, suppose there are no other unmeasured covariates given the DAGs in Figures <figr fid="F1">1</figr> to <figr fid="F7">7</figr>. If Figure <figr fid="F1">1</figr> is the true state of affairs, to estimate the <it>total </it>effect of BW on BP, the unadjusted analysis will suffice. If, however, Figure <figr fid="F2">2</figr> or <figr fid="F3">3</figr> applies, then the adjusted (that is conditional on CW) is needed to estimate the total effect of BW on BP. This is because conditioning on CW will block the back-door BW to BP: BW&#8592;CW&#8594;BP in Figure <figr fid="F2">2</figr> or BW&#8592;U&#8594;CW&#8594;BP in Figure <figr fid="F3">3</figr>. The reader could by now have doubts about the correctness of Figure <figr fid="F2">2</figr> where a later observation CW is a confounder of the effect of BW on BP since one could argue that, by occurring after BW, CW could not be seen as a common cause of both BW and BP. (See Hern&#225;n <it>et al </it><abbrgrp><abbr bid="B8">8</abbr></abbrgrp> for an accessible defence of the structural approach to confounding and selection bias using DAGs.) Nonetheless, while temporality <it>seemingly </it>excludes CW as a confounder in Figure <figr fid="F2">2</figr>, it does not exclude CW from ever being part of a confounding path as seen in Figure <figr fid="F3">3</figr>. Both BW and CW are more likely to be the result of a common cause (U), possibly genetic. Based on background knowledge and common sense, Figure <figr fid="F3">3</figr> is more plausible than <figr fid="F2">2</figr>. Therefore, temporality cannot be used to judge whether a variable is a confounder, part of a sufficient subset of covariates needed to block a backdoor, or not <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>.</p>
         <p>Figure <figr fid="F4">4</figr> presents a scenario where the unadjusted effect of BW on BP is the correct estimate since CW is a collider (that is, without conditioning, it already acts as a blocker) in the DAG depicting two unobserved common causes of BW and CW and of CW and BP. This scenario is closely related to that in Figure <figr fid="F5">5</figr> where BW has no effect on, but shares an unobserved common cause (U<sub>3</sub>) with, BP. In all scenarios, our choice of which (unadjusted or adjusted) estimate to use is not based on the magnitude or direction of the estimate but on the governing causal relations. Put this way, Simpson's paradox becomes a problem of covariate adjustment (when to adjust or not) in the causal analysis of non-experimental or observational data. The paradox arises due to the causal interpretation of the observation that the proportion of a given level of BW is evidence for making an educated guess of the proportion of a given BP level in an observed sample <it>if </it>the status of the third related covariate CW is unknown <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. What we really want to answer is "Does BW cause BP?", <it>not </it>"Does observing BW allow us to predict BP?".</p>
         <p>As Pearl has noted <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, people think "causes", not proportions (the thing that drives the paradox in Simpson's paradox); "reversal" is possible in the calculus of proportions but impossible in the calculus of causes. Put in Pearl's causal language, the invariance of causal interpretation that is wrongly used to interpret evidence of reversal in proportions in Simpson's paradox is as follows:</p>
         <p>
            <display-formula id="M1">Pr(BP=high | <it>do</it>{BW=high}, high CW) &lt; Pr(BP=high | <it>do</it>{BW=low}, high CW)</display-formula>
         </p>
         <p>
            <display-formula id="M2">Pr(BP=high | <it>do</it>{BW=high}, low CW) &lt; Pr(BP=high | <it>do</it>{BW=low}, low CW)</display-formula>
         </p>
         <p>where, according to our causal intuition, the combined or unadjusted analysis should be:</p>
         <p>
            <display-formula id="M3">Pr(BP=high | <it>do</it>{BW=high}) &lt; Pr(BP=high | <it>do</it>{BW=low})</display-formula>
         </p>
         <p>The inequalities in (1), (2) and (3) reflect the "sure thing principle" which when applied to Tu <it>et al</it>'s paper would then go as follows: an action <it>do</it>{BW} which decreases the probability of the event BP in each CW subpopulation must also decrease the probability of BP in the whole population, provided that the action <it>do</it>{BW} does not change the distribution of the CW subpopulations. See Pearl <abbrgrp><abbr bid="B5">5</abbr></abbrgrp> for a formal proof, although the sure thing principle follows naturally from the semantics of actions as modifiers of mechanisms, as embodied by the <it>do</it>(&#183;) operator. What is numerically observed in Simpson's paradox, however, is</p>
         <p>
            <display-formula id="M4">Pr(BP=high | BW=high) > Pr(BP=high | BW=low)</display-formula>
         </p>
         <p>which goes against our causal intuition or inclination to think "causes". If the DAG represented in Figure <figr fid="F3">3</figr> &#8211; or, for the sake of argument, Figure <figr fid="F2">2</figr> &#8211; applies, then we must consult the conditional analysis represented by inequalities 1 and 2, not the observed unconditional analysis in inequality 4. In this context, inequality 4 can only be seen as an evidence of BP that BW provides in the absence of information on CW, not as a statement of the causal effect of BW on BP which is what inequality 3 captures <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. That is, Simpson's paradox arises because once CW is unknown to us, and we observe, for instance, that the proportion of {BW=high} is higher than that of {BW=low}, we have evidence for predicting (as in inequality 4) that the observable proportion of {BP=high} would also be higher than that of {BP=low} in the non-experimental data, but we cannot take this observation to imply that {BW=high} causes {BP=high} which goes against our causal knowledge that <it>doing</it>{BW=low} causes {BP=high} as depicted in inequality 3. Hence, prediction does not imply aetiology. The former tends to deal with usually transitory proportions whereas the latter deals with invariant causal relations.</p>
         <p>A further illustration of the futility of the continued statistical discussion of the paradoxes is captured in the discussion of the suppression effect: how an unrelated covariate (CW) "increases the overall model fit ...assessed by <it>R</it><sup>2</sup>..." <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Tu <it>et al </it>should not be surprised that suppression is little known in epidemiology because epidemiologists do not and should not use the squared multiple-correlation-coefficient <it>R</it><sup>2 </sup>as a measure of goodness-of-fit. As Tu <it>et al </it>algebraically admit, <it>R</it><sup>2 </sup>is only an indication of the proportion of the variance in BP or outcome that is attributable to the variation in the fitted mean of BP <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. It is known that the expected value of <it>R</it><sup>2 </sup>can increase as more and even unrelated variables are added to the model thus making it a useless criterion for guiding covariate selection <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>.</p>
         <p>Furthermore, Tu <it>et al </it>make a passing mention of direct effect versus indirect effect (as might be the case in the consideration of adjustment in Figure <figr fid="F1">1</figr>). This is, of course, beyond the scope of their paper and, therefore, my commentary. I refer the curious reader to the important work on the complex issues involved in the estimation of direct effect <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B5">5</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>. Suffice it to say that, in common situations where total effect estimation is possible, direct effect may be unidentifiable. For instance, although all effects of BW on BP can still be consistently estimated even in a scenario where there is an additional unobserved common cause (U) of CW and BP as in Figure <figr fid="F6">6</figr> (modified from Figure <figr fid="F2">2</figr>), the direct effect of BW on BP cannot be identified without measuring U in Figure <figr fid="F7">7</figr> which is a similar modification of Figure <figr fid="F1">1</figr>. Like Pearl <abbrgrp><abbr bid="B5">5</abbr></abbrgrp> and Holland and Rubin <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, I take these paradoxes to be related to causal concepts which are, thus, best understood in the context of causal analysis.</p>
         <p>In conclusion, it cannot be overemphasized that although Simpson's and related paradoxes reveal the perils of using statistical criteria to guide causal analysis, they hold neither the explanations of the phenomenon they purport to depict nor the pointers on how to avoid them. The explanations and solutions lie in causal reasoning which relies on background knowledge, not statistical criteria. It is high time we stopped treating misinterpreted signs and symptoms ('paradoxes'), and got on with the business of handling the disease ('causality'). We should rightly turn our attention to the perennial problem of covariate selection for causal analysis using non-experimental data.</p>
      </sec>
      <sec>
         <st>
            <p>Competing interests</p>
         </st>
         <p>OAA is an associate faculty editor of the journal <it>Emerging Themes in Epidemiology </it>(ETE).</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This work was supported by a Rubicon fellowship (grant number 825.06.026) awarded by the Board of the Council for Earth and Life Sciences (ALW), at the Netherlands Organisation for Scientific Research (NWO). The author thanks Timothy Hallett, and ETE's editorial board and associate editors for their insightful comments. This paper represents author's own opinions, but not those of ETE or other relevant affiliations.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Simpson's paradox, Lord's paradox, and suppression effects are the same phenomenon &#8211; the reversal paradox</p>
            </title>
            <aug>
               <au>
                  <snm>Tu</snm>
                  <fnm>Y-K</fnm>
               </au>
               <au>
                  <snm>Gunnell</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Gilthorpe</snm>
                  <fnm>MS</fnm>
               </au>
            </aug>
            <source>Emerg Themes Epidemiol</source>
            <pubdate>2008</pubdate>
            <volume>5</volume>
            <fpage>2</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2254615</pubid>
                  <pubid idtype="pmpid" link="fulltext">18211676</pubid>
                  <pubid idtype="doi">10.1186/1742-7622-5-2</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <aug>
               <au>
                  <cnm>Oxford University</cnm>
               </au>
            </aug>
            <source>Oxford Dictionary, Thesaurus, and Wordpower Guide</source>
            <publisher>Oxford: Oxford University Press</publisher>
            <pubdate>2001</pubdate>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Causal diagrams for empirical research</p>
            </title>
            <aug>
               <au>
                  <snm>Pearl</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Biometrika</source>
            <pubdate>1995</pubdate>
            <volume>82</volume>
            <fpage>669</fpage>
            <lpage>710</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/biomet/82.4.669</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Causal diagrams for epidemiologic research</p>
            </title>
            <aug>
               <au>
                  <snm>Greenland</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Pearl</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Robins</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Epidemiol</source>
            <pubdate>1999</pubdate>
            <volume>10</volume>
            <issue>1</issue>
            <fpage>37</fpage>
            <lpage>48</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1097/00001648-199901000-00008</pubid>
                  <pubid idtype="pmpid" link="fulltext">9888278</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <aug>
               <au>
                  <snm>Pearl</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Causality. Models, Reasoning and Inference</source>
            <publisher>Cambridge: Cambridge University Press</publisher>
            <pubdate>2000</pubdate>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Causal inference in health sciences</p>
            </title>
            <aug>
               <au>
                  <snm>Pearl</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Health Serv Outcomes Res Methodol</source>
            <pubdate>2001</pubdate>
            <volume>2</volume>
            <fpage>189</fpage>
            <lpage>220</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1023/A:1020315127304</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Data, design, and background knowledge in etiologic inference</p>
            </title>
            <aug>
               <au>
                  <snm>Robins</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Epidemiol</source>
            <pubdate>2001</pubdate>
            <volume>12</volume>
            <issue>3</issue>
            <fpage>313</fpage>
            <lpage>320</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1097/00001648-200105000-00011</pubid>
                  <pubid idtype="pmpid" link="fulltext">11338312</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>A structural approach to selection bias</p>
            </title>
            <aug>
               <au>
                  <snm>Hernan</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Hernandez-Diaz</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Robins</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Epidemiol</source>
            <pubdate>2004</pubdate>
            <volume>15</volume>
            <issue>5</issue>
            <fpage>615</fpage>
            <lpage>625</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1097/01.ede.0000135174.63482.43</pubid>
                  <pubid idtype="pmpid" link="fulltext">15308962</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <aug>
               <au>
                  <snm>Rothman</snm>
                  <fnm>KJ</fnm>
               </au>
               <au>
                  <snm>Greenland</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lash</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <cnm>(eds)</cnm>
               </au>
            </aug>
            <source>Modern Epidemiology</source>
            <publisher>Philadelphia: Lippincott</publisher>
            <edition>3</edition>
            <pubdate>2008</pubdate>
            <inpress/>
         </bibl>
         <bibl id="B10">
            <aug>
               <au>
                  <snm>Altman</snm>
                  <fnm>DG</fnm>
               </au>
            </aug>
            <source>Practical Statistics for Medical Research</source>
            <publisher>Boca Raton, FL: Chapman &amp; Hall</publisher>
            <pubdate>1991</pubdate>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Identifiability and exchangeability for direct and indirect effects</p>
            </title>
            <aug>
               <au>
                  <snm>Robins</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Greenland</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Epidemiol</source>
            <pubdate>1992</pubdate>
            <volume>3</volume>
            <issue>2</issue>
            <fpage>143</fpage>
            <lpage>155</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1097/00001648-199203000-00013</pubid>
                  <pubid idtype="pmpid">1576220</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Direct and indirect effects</p>
            </title>
            <aug>
               <au>
                  <snm>Pearl</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Proceedings of the Seventeenth Conference on Uncertainty in Artificial Intelligence</source>
            <publisher>San Francisco: Morgan Kaufmann</publisher>
            <pubdate>2001</pubdate>
            <fpage>411</fpage>
            <lpage>420</lpage>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Fallibility in estimating direct effects</p>
            </title>
            <aug>
               <au>
                  <snm>Cole</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Hernan</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Int J Epidemiol</source>
            <pubdate>2002</pubdate>
            <volume>31</volume>
            <fpage>163</fpage>
            <lpage>165</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/ije/31.1.163</pubid>
                  <pubid idtype="pmpid" link="fulltext">11914314</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Estimation of direct causal effects</p>
            </title>
            <aug>
               <au>
                  <snm>Petersen</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Sinisi</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>van der Laan</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>Epidemiol</source>
            <pubdate>2006</pubdate>
            <volume>17</volume>
            <fpage>276</fpage>
            <lpage>284</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1097/01.ede.0000208475.99429.2d</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>On Lord's paradox</p>
            </title>
            <aug>
               <au>
                  <snm>Holland</snm>
                  <fnm>PW</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>DB</fnm>
               </au>
            </aug>
            <source>Principles of Modern Psychological Measurement</source>
            <publisher>Hillsdale, NJ: Lawrence Erlbaum Associates</publisher>
            <editor>Wainer H, Messick S</editor>
            <pubdate>1982</pubdate>
            <fpage>3</fpage>
            <lpage>25</lpage>
         </bibl>
      </refgrp>
   </bm>
</art>
