Paper Review - IVs and Mendelian Randomization

Summary #

Detailed Notes #

3 IV Requirements #

IV must have a direct influence on the treatment
IV must not covary with the unmeasured confounding that impacts the outcome
IV must not have a direct influence on the outcome

Relevant considerations for deciding whether to do an IV analysis #

Is there any unmeasured confounding?
- If not, is the answer that we can just use a normal regression?
When is unmeasured confounding especially likely?
- If someone’s trying to help the patient, it’s likely there are some unrecorded features the helper took into account.
- Ex: Doctor doesn’t record that the patient smokes or eats french fries for lunch every day.
What about if the IV assumptions are plausible but don’t definitely hold?
- Going back to the NICU example, maybe people in this geography are exposed to pollution.
- Ensembling IVs and finding results pointing in the same direction in all cases provides better evidence than controlling for one measured confounder in each study (because of unknown unknowns).
when different IVs are used, each IV identifies the average treatment effect for a different subgroup, so that we would only expect that findings from the different IVs would agree in direction if the average treatment effects for the different subgroups have the same direction
Strength of the IVs
- Weak IVs -> high variance treatment effect estimates even with large #s of participants

Examples of common IVs #

Ex. 1: Random Encouragement Trials #

Doing an experiment to measure impact of smoking during pregnancy on birthweight
Staff person encourages subset of mothers who smoke to stop smoking using pamphlets, guidance, etc.
IV is whether the mother received encouragement
Criteria check:
1. In theory, influences the treatment (whether the mother smokes)
2. Not sure what the unmeasured confounding is here…
3. Can have direct effect on the outcome via affecting other similar decisions (drinking alcohol, other healthy behaviors)

Ex. 2: Distance to Specialty Care Provider #

Say you want to understand how going to a specialist doctor vs. a general practitioner affects outcomes
You can use the fact that people will tend to go to closer doctors and select distance to each as an IV
Best for conditions that surprise you, since the more time you have to plan the less distance will impact your decision
But so many confounders… Maybe you can measure them?
- Socioeconomic background’s the obvious one, but at least it’s known
Criteria check:
1. Influences the treatment, especially for urgent care
2. Unmeasured confounding seems like a big risk

Ex. 3: Preference-Based IVs #

Take advantage of variation in medical practice patterns between groups
Ex: surgery vs. endovascular therapy for aneurysms using hospital as the IV

Ex. 4: Genes as IVs (Mendelian Randomization) #

This is the reason we’re interested in this
Ex:

Another example is that [32] studied the effect of maternal smoking on orofacial clefts in babies using genetic variants that increase the probability that a mother smokes as IVs.
Genes have to satisfy additional criteria to work as IVs
- Genes can’t be associated with population stratification
  - Risks violating criteria 2
  - Suppose we want use ALDH2 (Asian glow gene) to study the effect of alcohol consumption on high blood pressure
  - This gene is highly correlated with Asian ancestry so only works as an IV if we’ve fully measured the influence of ancestry
  - Q: What does “measure” mean in this context? Do we need to have enough subjects of Asian ancestry to actually regress on this attribute?
- No genetic linkage to other causal genes
  - Risks violating criteria 2
  - Genes close together on parental chromosomes tend to get inherited together
  - If subject inherits two genes that are causally associated with the outcome we want to measure but only use one as an IV, then we have unmeasured confounding
- No direct effect through “pleiotropy”
  - Risks violating criteria 1
  - Pleiotropy is where a gene has multiple functions
Additional papers on Mendelian Randomization:
- Mendelian randomization as an instrumental variable approach to causal inference
- Mendelian randomization: using genes as instruments for making causal inferences in epidemiology.

Formal IV Assumptions and Estimation for Binary IVs and Treatments #

Setting #

Notation note: bolded characters are vectors
\( N \): # of subjects
\( \mathbf{z} \): \( N \)-dimensional vector of IV assignments
- \( z_i \in \{0, 1\} \) for all \( i \in [N] \)
\( \mathbf{D^z} \in {0,1}^N \): \( N \)-dimensional vector of treatment assignments under IV assignment \( \mathbf{z} \)
- Denote single treatment assignment with \( \mathbf{d} \)
\( \mathbf{Y^{(z, d)}} \): \( N \)-dimensional vector of potential outcomes under IV assignment \( \mathbf{z} \) and treatment assignment \( \mathbf{d} \)
- \( Y^{(\mathbf{z, d})}_i \) is the outcome subject \( i \) would have under IV assignment \( \mathbf{z} \) and treatment assignment \( \mathbf{d} \)
- Assume we can write \( \mathbb{E}[Y] \) to denote expected value of outcome for a randomly sampled subject
\( \mathbf{X} \in \mathbb{R}^{N \times \dots} \): observed covariates for subjects

5 Formal IV Assumptions #

Stable Unit Treatment Value Assumption
Criteria 1
Criteria 2
Criteria 3
Monotonicity Note to self: Are these easier to encode in causal DAGs?

Stable Unit Treatment Value Assumption (SUTVA) #

Formal statement: \[ \mathbf{z}_i = \mathbf{z}’_i \rightarrow D^{\mathbf{z}}_i = D^{\mathbf{z’}}_i \land \mathbf{d}_i = \mathbf{d}’_i \rightarrow \mathbf{Y}^{(\mathbf{z, d})}_i = \mathbf{Y}^{(\mathbf{z’, d’})}_i \]
Plain English: if the IV assignments are the same, then the treatment assignments have to be the same. And if the treatment assignments are the same, then the potential outcomes have to be the same.

Formal Versions of Three Criteria #

IV positive correlated with treatment: \[ \mathbb{E}[D^1 \mid \mathbf{X}] > \mathbb{E}[D^0 \mid \mathbf{X}] \]
- Note: this seems like a weird way to write this…
IV independent of unmeasured confounders: IV conditionally independent of \( \mathbf{D} \)s, \( \mathbf{Y} \)s on observed covariates
Exclusion restriction: \( Y^{(\mathbf{d, z})} = Y^d \)

These alone allow effect bounds to be determined but not point identified

Compliance classes and point identification #

Always-takers, compliers, defiers, never-takers (self-explanatory)
How do you know if someone’s a complier? Estimate them from other factors, presumably the covariates?
What do these classes mean in the context of genes as IVs?
- Always-takers are people who do the thing regardless of whether they have the gene?
- Compliers are people who do the thing if they have the gene
- Defiers are people who only do the thing if they don’t have the gene
- Never-takers are people who never do the thing

Approach 1 to enabling point identification: assume monotonicity #

Monotonicity means there are no defiers
- Automatically satisfied if only subjects who receive the IV can get the treatment (ex: random encouragement)
  - Only compliers and never-takers because always-takers can’t exist either since not everyone has opportunity to take the treatment
- However, this only tells us info about compliers if our outcomes are binary
Average effect for compliers and always takers is identifiable in this approach (compliance average causal effect, CACE / LATE): \( \mathbb{E}(Y^1_i - Y^0_i \mid C_i = \text{complier}) \)

Approach 2 to enabling point identification: assume effects same across compliance classes #

Strongest version: assume expectation of difference between \( Y^0 \) and \( Y^1 \) uniform across compliance classes
In other words, treatment effects same amongst always takers and compliers conditional on the covariates

IV Estimation #

Horribly typeset proof that CACE is identified as described in approach 1
Can determine CASE using a 2-stage least squares (2SLS) process
- Regress \( \mathbf{D} \) on \( \mathbf{Z} \) to get \( \hat{\mathbb{E}}(\mathbf{D} \mid \mathbf{Z}) \) (hat denotes sample mean)
- Then regress \( Y \) on \( \hat{\mathbb{E}}(\mathbf{D} \mid \mathbf{Z}) \) to get an estimate of the CACE
All of above assumes no observed covariates, but that’s not very realistic
When we have observed covariates, we regress on \( \mathbf{Z} \) and \( \mathbf{X} \) (observed covariates)
- Can guarantee estimate consistency if the effect of \( \mathbf{X} \) on the outcomes for compliers is linear and CACE given \( \mathbf{X} \) is the same for all \( \mathbf{X} \)
- Impt for our stuff - what sort of covariates are going to be involved if we’re trying to do the pioneer TF thing?
Bias-variance trade-off: if IV is valid, asymptotically unbiased (multiple regression is not). If IV is not perfectly valid, 2SLS is asymptotically biased too but has better variance properties?
- Look at this paper, which develops a tool for deciding when to use 2SLS vs. multiple regression

Case Studies #

Ex. 2: Treatment in a randomized trial with nonadherence #

Intention to treat (ITT) effect: effect of being assigned treatment vs. not
Nonadherence means ITT does not equal effect of receiving treatment vs. control
If nonadherence pattern differs between sample and actual target pop., we need to know the effect of receiving the treatment
Ex.: offer kids Vitamin A to see if mortality goes down
- Some kids didn’t take the pills (still factored into ITT estimate)
- Notably different from actual real-world intervention of fortifying food
Ex.: want to know failure rate of contraception if youactually use it
What if we just estimate \(\hat{\mathbb{E}}(Y \mid Z=1, D=1) - \hat{\mathbf{E}}(Y \mid Z=0, D=0)\) instead?
- Can be systematically biased if defiance correlated with some other trait like having a worse diet
- Instead, treatement assignment can be the IV
Why?

the denominator of (4) is an estimate of the proportion of compliers.
- Isn’t the estimate of proportion of compliers really \[ \frac{1}{2}(\hat{\mathbb{E}}[D_i \mid Z_i = 1] + (1-\hat{\mathbb{E}}[D_i \mid Z_i = 1])) \]