Lecture 11: Theories of Change
A public policy program contains four elements:
\[\text{Inputs} \rightarrow \text{Activities} \rightarrow \text{Outputs} \rightarrow \text{Outcomes}\]
Inputs: budget, municipal training facilities
Activities: teacher training, textbook development
Outputs: 10,000 teachers trained, 1M textbooks delivered
Outcomes: students improve on final math exams
Experimental Data
Observational Data
Directed Acyclic Graphs (DAGs):
Four steps:
Example — factors affecting life expectancy:
Group related variables:
Simplified: GDP, Education, Public services, Urbanization, Conflict
| Type | Role of \(Z\) | Problem? |
|---|---|---|
| Confounding | \(Z\) causes both \(X\) and \(Y\) | Yes |
| Collision | \(X\) and \(Y\) both cause \(Z\) | Yes |
| Mediation | \(X \rightarrow Z \rightarrow Y\) | Helpful |
Run four regressions:
We write the causal effect as:
\[E(Y \mid X = x)\]
“Expected value of \(Y\), given that \(X\) takes value \(x\)”
Examples:
In RCTs, we control who gets treated — confounders vanish
\[P(Y \mid X = x) \neq P(Y \mid X)\]
In a treatment vs. control setting:
\[\Delta = E(Y \mid X = 1) - E(Y \mid X = 0)\]
We can never observe both potential outcomes for one person:
\[\Delta_i = Y^1_i - \text{ ?}\]
| Patient | Treatment | Outcome 1 | Outcome 2 |
|---|---|---|---|
| 1 | 1 | \(Y_1\) (Treated) | \(Y_1\) (Untreated) |
| 2 | 0 | \(Y_2\) (Treated) | \(Y_2\) (Untreated) |
| 3 | 1 | \(Y_3\) (Treated) | \(Y_3\) (Untreated) |
| 4 | 1 | \(Y_4\) (Treated) | \(Y_4\) (Untreated) |
| 5 | 0 | \(Y_5\) (Treated) | \(Y_5\) (Untreated) |
| Patient | Treatment | Outcome 1 | Outcome 2 |
|---|---|---|---|
| 1 | 1 | \(Y_1\) (Treated) | ??? |
| 2 | 0 | ??? | \(Y_2\) (Untreated) |
| 3 | 1 | \(Y_3\) (Treated) | ??? |
| 4 | 1 | \(Y_4\) (Treated) | ??? |
| 5 | 0 | ??? | \(Y_5\) (Untreated) |
We compute group averages from what we observe:
Average Treatment on the Treated (ATT):
\[ATT = \bar{Y}_T \mid X = 1\]
Average Treatment on the Untreated (ATU):
\[ATU = \bar{Y}_U \mid X = 0\]
Combine ATT and ATU:
\[ATE = ATT - ATU\]
\[ATE = (ATT - ATU) + \text{Selection Bias}\]
Effect of graduating from Bocconi on monthly income?
| Student | Bocconi | Income |
|---|---|---|
| 1 | 1 | 2,400 |
| 2 | 0 | 2,000 |
| 3 | 1 | 2,500 |
| 4 | 1 | 2,800 |
| 5 | 0 | 2,100 |
| 6 | 0 | 2,000 |
| 7 | 0 | 2,200 |
\[ATT = \frac{2400 + 2500 + 2800}{3} \approx 2567\]
\[ATU = \frac{2000 + 2100 + 2000 + 2200}{4} = 2075\]
\[ATE \approx 2567 - 2075 = 492\]
Why might there be selection bias?
Randomization makes treatment independent of potential outcomes:
\[X \perp\!\!\!\perp Y^1, Y^0\]
Internal validity: findings are reliable for the sample
External validity: findings generalize to the population
Popescu (JCU) Statistical Analysis Lecture 11: Theories of Change