Paper Review - IVs and Mendelian Randomization
Summary #
Detailed Notes #
3 IV Requirements #
- IV must have a direct influence on the treatment
- IV must not covary with the unmeasured confounding that impacts the outcome
- IV must not have a direct influence on the outcome
Relevant considerations for deciding whether to do an IV analysis #
-
Is there any unmeasured confounding?
- If not, is the answer that we can just use a normal regression?
-
When is unmeasured confounding especially likely?
- If someone’s trying to help the patient, it’s likely there are some unrecorded features the helper took into account.
- Ex: Doctor doesn’t record that the patient smokes or eats french fries for lunch every day.
-
What about if the IV assumptions are plausible but don’t definitely hold?
- Going back to the NICU example, maybe people in this geography are exposed to pollution.
- Ensembling IVs and finding results pointing in the same direction in all cases provides better evidence than controlling for one measured confounder in each study (because of unknown unknowns).
when different IVs are used, each IV identifies the average treatment effect for a different subgroup, so that we would only expect that findings from the different IVs would agree in direction if the average treatment effects for the different subgroups have the same direction
-
Strength of the IVs
- Weak IVs -> high variance treatment effect estimates even with large #s of participants
Examples of common IVs #
Ex. 1: Random Encouragement Trials #
- Doing an experiment to measure impact of smoking during pregnancy on birthweight
- Staff person encourages subset of mothers who smoke to stop smoking using pamphlets, guidance, etc.
- IV is whether the mother received encouragement
- Criteria check:
- In theory, influences the treatment (whether the mother smokes)
- Not sure what the unmeasured confounding is here…
- Can have direct effect on the outcome via affecting other similar decisions (drinking alcohol, other healthy behaviors)
Ex. 2: Distance to Specialty Care Provider #
- Say you want to understand how going to a specialist doctor vs. a general practitioner affects outcomes
- You can use the fact that people will tend to go to closer doctors and select distance to each as an IV
- Best for conditions that surprise you, since the more time you have to plan the less distance will impact your decision
- But so many confounders… Maybe you can measure them?
- Socioeconomic background’s the obvious one, but at least it’s known
- Criteria check:
- Influences the treatment, especially for urgent care
- Unmeasured confounding seems like a big risk
Ex. 3: Preference-Based IVs #
- Take advantage of variation in medical practice patterns between groups
- Ex: surgery vs. endovascular therapy for aneurysms using hospital as the IV
Ex. 4: Genes as IVs (Mendelian Randomization) #
-
This is the reason we’re interested in this
-
Ex:
Another example is that [32] studied the effect of maternal smoking on orofacial clefts in babies using genetic variants that increase the probability that a mother smokes as IVs.
-
Genes have to satisfy additional criteria to work as IVs
- Genes can’t be associated with population stratification
- Risks violating criteria 2
- Suppose we want use ALDH2 (Asian glow gene) to study the effect of alcohol consumption on high blood pressure
- This gene is highly correlated with Asian ancestry so only works as an IV if we’ve fully measured the influence of ancestry
- Q: What does “measure” mean in this context? Do we need to have enough subjects of Asian ancestry to actually regress on this attribute?
- No genetic linkage to other causal genes
- Risks violating criteria 2
- Genes close together on parental chromosomes tend to get inherited together
- If subject inherits two genes that are causally associated with the outcome we want to measure but only use one as an IV, then we have unmeasured confounding
- No direct effect through “pleiotropy”
- Risks violating criteria 1
- Pleiotropy is where a gene has multiple functions
- Genes can’t be associated with population stratification
-
Additional papers on Mendelian Randomization:
Formal IV Assumptions and Estimation for Binary IVs and Treatments #
Setting #
- Notation note: bolded characters are vectors
- \( N \): # of subjects
- \( \mathbf{z} \): \( N \)-dimensional vector of IV assignments
- \( z_i \in \{0, 1\} \) for all \( i \in [N] \)
- \( \mathbf{D^z} \in {0,1}^N \): \( N \)-dimensional vector of treatment assignments under IV assignment \( \mathbf{z} \)
- Denote single treatment assignment with \( \mathbf{d} \)
- \( \mathbf{Y^{(z, d)}} \): \( N \)-dimensional vector of potential outcomes under IV assignment \( \mathbf{z} \) and treatment assignment \( \mathbf{d} \)
- \( Y^{(\mathbf{z, d})}_i \) is the outcome subject \( i \) would have under IV assignment \( \mathbf{z} \) and treatment assignment \( \mathbf{d} \)
- Assume we can write \( \mathbb{E}[Y] \) to denote expected value of outcome for a randomly sampled subject
- \( \mathbf{X} \in \mathbb{R}^{N \times \dots} \): observed covariates for subjects
5 Formal IV Assumptions #
- Stable Unit Treatment Value Assumption
- Criteria 1
- Criteria 2
- Criteria 3
- Monotonicity Note to self: Are these easier to encode in causal DAGs?
Stable Unit Treatment Value Assumption (SUTVA) #
- Formal statement: \[ \mathbf{z}_i = \mathbf{z}’_i \rightarrow D^{\mathbf{z}}_i = D^{\mathbf{z’}}_i \land \mathbf{d}_i = \mathbf{d}’_i \rightarrow \mathbf{Y}^{(\mathbf{z, d})}_i = \mathbf{Y}^{(\mathbf{z’, d’})}_i \]
- Plain English: if the IV assignments are the same, then the treatment assignments have to be the same. And if the treatment assignments are the same, then the potential outcomes have to be the same.
Formal Versions of Three Criteria #
- IV positive correlated with treatment:
\[ \mathbb{E}[D^1 \mid \mathbf{X}] > \mathbb{E}[D^0 \mid \mathbf{X}] \]
- Note: this seems like a weird way to write this…
- IV independent of unmeasured confounders: IV conditionally independent of \( \mathbf{D} \)s, \( \mathbf{Y} \)s on observed covariates
- Exclusion restriction: \( Y^{(\mathbf{d, z})} = Y^d \)
- These alone allow effect bounds to be determined but not point identified
Compliance classes and point identification #
- Always-takers, compliers, defiers, never-takers (self-explanatory)
- How do you know if someone’s a complier? Estimate them from other factors, presumably the covariates?
- What do these classes mean in the context of genes as IVs?
- Always-takers are people who do the thing regardless of whether they have the gene?
- Compliers are people who do the thing if they have the gene
- Defiers are people who only do the thing if they don’t have the gene
- Never-takers are people who never do the thing
Approach 1 to enabling point identification: assume monotonicity #
- Monotonicity means there are no defiers
- Automatically satisfied if only subjects who receive the IV can get the treatment (ex: random encouragement)
- Only compliers and never-takers because always-takers can’t exist either since not everyone has opportunity to take the treatment
- However, this only tells us info about compliers if our outcomes are binary
- Automatically satisfied if only subjects who receive the IV can get the treatment (ex: random encouragement)
- Average effect for compliers and always takers is identifiable in this approach (compliance average causal effect, CACE / LATE): \( \mathbb{E}(Y^1_i - Y^0_i \mid C_i = \text{complier}) \)
Approach 2 to enabling point identification: assume effects same across compliance classes #
- Strongest version: assume expectation of difference between \( Y^0 \) and \( Y^1 \) uniform across compliance classes
- In other words, treatment effects same amongst always takers and compliers conditional on the covariates
IV Estimation #
- Horribly typeset proof that CACE is identified as described in approach 1
- Can determine CASE using a 2-stage least squares (2SLS) process
- Regress \( \mathbf{D} \) on \( \mathbf{Z} \) to get \( \hat{\mathbb{E}}(\mathbf{D} \mid \mathbf{Z}) \) (hat denotes sample mean)
- Then regress \( Y \) on \( \hat{\mathbb{E}}(\mathbf{D} \mid \mathbf{Z}) \) to get an estimate of the CACE
- All of above assumes no observed covariates, but that’s not very realistic
- When we have observed covariates, we regress on \( \mathbf{Z} \) and \( \mathbf{X} \) (observed covariates)
- Can guarantee estimate consistency if the effect of \( \mathbf{X} \) on the outcomes for compliers is linear and CACE given \( \mathbf{X} \) is the same for all \( \mathbf{X} \)
- Impt for our stuff - what sort of covariates are going to be involved if we’re trying to do the pioneer TF thing?
- Bias-variance trade-off: if IV is valid, asymptotically unbiased (multiple regression is not). If IV is not perfectly valid, 2SLS is asymptotically biased too but has better variance properties?
- Look at this paper, which develops a tool for deciding when to use 2SLS vs. multiple regression
Case Studies #
Ex. 2: Treatment in a randomized trial with nonadherence #
- Intention to treat (ITT) effect: effect of being assigned treatment vs. not
- Nonadherence means ITT does not equal effect of receiving treatment vs. control
- If nonadherence pattern differs between sample and actual target pop., we need to know the effect of receiving the treatment
- Ex.: offer kids Vitamin A to see if mortality goes down
- Some kids didn’t take the pills (still factored into ITT estimate)
- Notably different from actual real-world intervention of fortifying food
- Ex.: want to know failure rate of contraception if youactually use it
- What if we just estimate \(\hat{\mathbb{E}}(Y \mid Z=1, D=1) - \hat{\mathbf{E}}(Y \mid Z=0, D=0)\) instead?
- Can be systematically biased if defiance correlated with some other trait like having a worse diet
- Instead, treatement assignment can be the IV
- Why?
the denominator of (4) is an estimate of the proportion of compliers.
- Isn’t the estimate of proportion of compliers really \[ \frac{1}{2}(\hat{\mathbb{E}}[D_i \mid Z_i = 1] + (1-\hat{\mathbb{E}}[D_i \mid Z_i = 1])) \]