Twin Studies
The Basic Models
Suppose we're interested in how important genes or familial environment are in determining people's heights. For simplicity, suppose (1) each chromosome has a "height score" (2) person's height is a weighted sum of (a) all their chromosomes' "height scores", (b) their familial environment "height score", and (c) random noise. These are the assumptions that underlie the ACE model ACE model, which is, as far as I can tell, the most commonly used model in analyzing twin study data.
Computing the estimates of the amount of variance in height due to genes, common environment, and unshared environment is straightforward in this model if you have the correlation of monozygotic (identical) twins' heights and the correlation between dizygotic (fraternal) twins' heights.
To compute the genetic contribution, just take the difference between the two correlations and double it. To get the common environment contribution, just take the dizygotic twin correlation and subtract half the genetic contribution. The unshared environment is just whatever variance is left over, which includes things like measurement error and luck.
The other common model is the ADE model ADE model.
Because ACE ignores genetic dominance effects, its biased against genetic explanations and in favor of common environment ones. Because ADE ignores common environment, it's biased the other way. In my experience, ACE is far more common in the literature. There is a model that includes both common environment and genetic dominance effects: the ACDE model, but its typically not used because it is difficult/impossible to estimate all four parameters using a sample of only monozygotic and dizygotic twins.
Measurement Error
Suppose you're trying to estimate the heritability of height. Each person has some platonic height, but your dataset, sadly, only has measured height. Measured height can differ from platonic height due to a number of factors: rounding, human error, height's tendency to shrink throughout the day, etc.
Presumably, your genes affect your platonic height but not the sources of error that end up mediating measured height. That is, the only way genes affect your measured height is through your platonic height.
When we are computing the monozygotic and dizygotic correlations, we can adjust for this source of measurement error. Here's how we do so:
Let $r$ be the correlation between true height and measured height. Note, since true height is impossible to observed directly, we can't observe this correlation directly either.
To get around this issue, we can measure someone's height twice, the correlation between the two measured heights should be $r^2$. Hence, the square root of the "test-retest" correlation should be $r$.
If we want to get the true $r_{MZ}$, it turns out we can just divide it by $r^2$. Ditto for $r_{DZ}$. Hence, the heritability estimate becomes
$$ 2 \left( \frac{r_{MZ}}{r^2} - \frac{r_{DZ}}{r^2} \right) $$Or just
$$ \frac{1}{r^2} 2 \left( r_{MZ} - r_{DZ} \right) $$The hawk-eyed reader will note that this just means our adjusted estimate of heritability is the naive estimate of heritability divided by $r^2$.
Lest you think I'm a crank, here's people with credentials talking about the same issue Frisell, and here is a study where this kind of thing played a major issue Riemann.
Assortative Mating
Assortative mating will cause naive twin studies to over-estimate the effects of shared environment and underestimate the effects of additive genes. Sadly, the vast majority of twin studies ignore this issue.
How strong is this effect?
Suppose there are no common environment effect. If we know (a) the true heritability and (b) the observed phenotype correlation between the mom and dad, we can compute the expected conclusion of a naive twin study that assumes no assortative mating. We can do so using some simple code.Running this code with different parameters yields this table:
h2 | Father-Mother Correlation | ||||||||||
0.00 | 0.10 | 0.20 | 0.30 | 0.40 | 0.50 | 0.60 | 0.70 | 0.80 | 0.90 | 1.00 | |
0% | 0.002 | 0.002 | 0.002 | 0.002 | 0.002 | 0.002 | 0.002 | 0.002 | 0.002 | 0.002 | 0.002 |
10% | 0.102 | 0.102 | 0.101 | 0.101 | 0.101 | 0.100 | 0.100 | 0.099 | 0.099 | 0.098 | 0.098 |
20% | 0.203 | 0.201 | 0.199 | 0.197 | 0.195 | 0.193 | 0.192 | 0.190 | 0.188 | 0.186 | 0.184 |
30% | 0.303 | 0.298 | 0.294 | 0.290 | 0.286 | 0.282 | 0.278 | 0.274 | 0.271 | 0.267 | 0.263 |
40% | 0.402 | 0.395 | 0.387 | 0.380 | 0.373 | 0.367 | 0.360 | 0.354 | 0.348 | 0.342 | 0.335 |
50% | 0.502 | 0.490 | 0.479 | 0.468 | 0.457 | 0.447 | 0.437 | 0.428 | 0.419 | 0.411 | 0.402 |
60% | 0.602 | 0.585 | 0.568 | 0.553 | 0.538 | 0.524 | 0.511 | 0.498 | 0.486 | 0.475 | 0.463 |
70% | 0.702 | 0.678 | 0.656 | 0.636 | 0.616 | 0.598 | 0.581 | 0.564 | 0.549 | 0.534 | 0.520 |
80% | 0.801 | 0.771 | 0.742 | 0.716 | 0.691 | 0.668 | 0.647 | 0.627 | 0.608 | 0.590 | 0.572 |
90% | 0.901 | 0.862 | 0.827 | 0.794 | 0.764 | 0.736 | 0.710 | 0.686 | 0.663 | 0.642 | 0.621 |
100% | 0.999 | 0.952 | 0.909 | 0.869 | 0.833 | 0.800 | 0.769 | 0.741 | 0.715 | 0.690 | 0.667 |
Note: Even though I used a sample of two million hypothetical twins, there is still some sampling error, which is why (for instance) the first row is close to zero but doesn't equal it precisely. This is likewise why the first column is close to the actual heritability but not exactly the same.
We can use this table to adjust twin studies for assortative mating. For instance, if a twin study finds a heritability of 72% but there is a correlation of 0.3 between fathers and mothers for that trait, the true heritability is probably a smidge above 80%.
Age
In general,
One place this shows up is educational attainment, where the years of education obtained has a large common environment component - probably because it is the result largely of events and decisions that occur in childhood, when parents still have large influence over their children. It notably contrasts with standardized high school test scores, which generally have approximately zero common environment component, and with IQ, which is probably the trait most famous for having a heritability that shifts with age.
As an aside, I am forced to wonder whether this has anything to do with the fadeout of educational interventions over time.
Other Methodological Shortcomings
In addition to the issues with using ACE or ADE, twin studies typically suffer from other methodological shortcomings
- Twin studies often use questionnaires, often not verifying the reported information including whether the pair of people in question are even twins
. Misclassifying people a fraternal twin as identical (or vice-versa) biases estimates against genes and towards common environment. The unreliability of self-reported data is rando and is similar to the measurement error discussed above and will generally bias estimates of genes and shared environment downwards - the exception is if the unreliability is non-random (e.g. twins are likely to misreport in similar ways). - Twin studies frequently draw on (non-random) convenience samples.
- When twin studies use siblings rather than (or in addition to) dizygotic twins, this causes birth-order effects to be interpreted as genetic effects, biasing heritability estimates upwards and common environment estimates downwards.
- If monozygotic twins are treated more similarly than dizygotic twins (or siblings if they're used), this biases common and unshared environment downwards while biasing genetic estimates upwards.
- Maybe twins make choices to be more (or less) similar. This would bias estimates of unshared environment upwards. Alternatively, maybe this force is stronger in identical twins than fraternal twins, which would bias estimates of heritability upwards. This hypothesis is difficult to evaluate, but here are some related studies Fortuna Mark.
To the literature's credit, publication bias seems generally minor, with one major exception: the choice of ACE or ADE. Generally, studies start with an ACE model and, if this results in a negative estimate for common environment, change to an ADE model. This use of choice causes a positive bias for dominance genetic effects, a positive bias for common environment effects, and a negative bias for additive genetic effects - especially in small sample sizes Meta-analysis of the heritability of human traits based on fifty years of twin studies (see also the supplement).- TODO: gene-environment interactions Kaprio Avinun
- TODO Grotzinger
todo Røysamb
Appendix: Assortative Mating II
If a study is assuming zero common environment, then it is common to estimate heritability as just $h^2 = r_{MZ}$. In this case, assortative mating will bias heritability estimates upwards, and the corrections needed to account for it are different:
h2 | Father-Mother Correlation | ||||||||||
0.00 | 0.10 | 0.20 | 0.30 | 0.40 | 0.50 | 0.60 | 0.70 | 0.80 | 0.90 | 1.00 | |
0% | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 | 0.001 |
10% | 0.100 | 0.104 | 0.109 | 0.113 | 0.117 | 0.122 | 0.126 | 0.130 | 0.134 | 0.139 | 0.143 |
20% | 0.199 | 0.207 | 0.215 | 0.223 | 0.230 | 0.237 | 0.245 | 0.252 | 0.259 | 0.266 | 0.273 |
30% | 0.299 | 0.309 | 0.320 | 0.329 | 0.339 | 0.348 | 0.357 | 0.366 | 0.375 | 0.383 | 0.391 |
40% | 0.399 | 0.411 | 0.422 | 0.433 | 0.444 | 0.454 | 0.464 | 0.473 | 0.482 | 0.491 | 0.500 |
50% | 0.499 | 0.511 | 0.523 | 0.534 | 0.545 | 0.555 | 0.565 | 0.574 | 0.583 | 0.592 | 0.600 |
60% | 0.599 | 0.611 | 0.622 | 0.632 | 0.642 | 0.652 | 0.661 | 0.669 | 0.677 | 0.685 | 0.692 |
70% | 0.699 | 0.710 | 0.719 | 0.728 | 0.736 | 0.744 | 0.752 | 0.759 | 0.765 | 0.772 | 0.778 |
80% | 0.800 | 0.807 | 0.814 | 0.821 | 0.827 | 0.833 | 0.838 | 0.844 | 0.848 | 0.853 | 0.857 |
90% | 0.900 | 0.904 | 0.908 | 0.912 | 0.915 | 0.918 | 0.921 | 0.924 | 0.926 | 0.929 | 0.931 |
100% | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |