Causal Inference

Una introducción a la inferencia causal

Bart Ortiz from GeNeura research group

Based on Judea Pearl paper

Introducción

Introducción

  • what is the efficacy of a given drug in a given population?
  • Whether data can prove an employer guilty of hiring discrimination?
  • What fraction of past crimes could have been avoided by a given policy?
  • What was the cause of death of a givenindividual, in a specific incident?
These are causal questions because they require some knowledge of the data-generating process; they cannot be computed from the data alone, nor from the distributions that govern the data.

Differences between association and causation

Standard statistical analysis

Assess parameters of a distribution from samples drawn of that distribution.

With the help of such parameters, one can infer associations among variables, estimate beliefs or probabilities of past and future events, as well as update those probabilities in light of new evidence or new measurements.
  • correlation
  • regression
  • dependence
  • conditional independence
  • likelihood
  • marginalization

Causal analysis

Its aim is to infer not only beliefs or probabilities under static conditions, but also the dynamics of beliefs under changing conditions.
  • randomization
  • influence
  • effect
  • confounding
  • spurious correlation
  • faithfulness/stability
  • intervention
  • explanation

Structural Causal Model (SCM)

  • structural equation models (SEM)
  • the potential outcome framework
  • the graphical models

Structural Equations Models

Let be $X$ a disease variable and $Y$ a certain symptom of the disease. $$ y = \beta x+u_y $$

Where $x$ is the severity of disease and $y$ is the severity of the symptom.

Structural Equations Models

But this is symetrical and limited. So we changed it: \begin{aligned} x &=f_{X}\left(u_{X}\right) \\ y &=f_{Y}\left(y, u_{X}\right) \end{aligned}

Structural Equations Models

And we can overcome nonlinearity: \begin{aligned} z &=f_{Z}\left(u_{Z}\right) \\ x &=f_{X}\left(z, u_{X}\right) \\ y &=f_{Y}\left(x, u_{Y}\right) \end{aligned}

Structural Equations Models

And model interventions: \begin{aligned} z &=f_{Z}\left(u_{Z}\right) \\ x &=x_0 \\ y &=f_{Y}\left(x, u_{Y}\right) \end{aligned}

Structural Equations Models

  • D-separation
  • Causal Sufficiency
  • Causal feedback

Structural Equations Models

Any probability distribution induced by an acyclic, causally sufficient SCM M can be factorized as: $$p_{\mathcal{M}}\left(X_{1}, \ldots, X_{N}\right)=\prod_{i=1}^{N} p_{\mathcal{M}}\left(X_{i} | \mathbf{X}_{\mathrm{pa}(i)}\right)$$ And, after and intervention: $$p_{\mathcal{M}_{\xi_{l}}}\left(X_{1}, \ldots, X_{N} | \operatorname{do}\left(\mathbf{X}_{l}=\xi_{l}\right)\right)=$$ $$\prod_{i=1 \atop i \notin l}^{N} p_{\mathcal{M}}\left(X_{i} | \mathbf{X}_{\mathrm{pa}(i)}\right) \prod_{i \in I} \mathbf{1}_{\left[X_{i}=\xi_{i}\right]}$$

Causal infrence and ML

Diagrams from inFERENCe webpage

Causal infrence and ML

Causal infrence and ML

Causal infrence and ML

Conclusiones

  • Fundamental area to develop an accurate description of reality
  • Now we have mathematical formulation for this
  • Is not a question of whether you work on deep learning or causal inference
““while probabilities encode our beliefs about a static world, causality tells us whether and how probabilities change when the world changes, be it by intervention or by act of imagination.””
Judea Pearl (2018). The Book of Why: The New Science of Cause and Effect

THE END

- Dudas?

- Source code & documentation

Some related links

Original paper

More on inference and cool causal diagrams

Do-calculus

Popular introduction