epistemology twins

Twin Studies

The Basic Models

Suppose we're interested in how important genes or familial environment are in determining people's heights. For simplicity, suppose (1) each chromosome has a "height score" (2) person's height is a weighted sum of (a) all their chromosomes' "height scores", (b) their familial environment "height score", and (c) random noise. These are the assumptions that underlie the ACE model ACE model, which is, as far as I can tell, the most commonly used model in analyzing twin study data.

Computing the estimates of the amount of variance in height due to genes, common environment, and unshared environment is straightforward in this model if you have the correlation of monozygotic (identical) twins' heights and the correlation between dizygotic (fraternal) twins' heights.

To compute the genetic contribution, just take the difference between the two correlations and double it. To get the common environment contribution, just take the dizygotic twin correlation and subtract half the genetic contribution. The unshared environment is just whatever variance is left over, which includes things like measurement error and luck.

The other common model is the ADE model ADE model. The downside of the ADE model is that it requires the assumption that there is zero common environment. The upside is that it relaxes the assumption that all genetic effects are linear by allowing for dominance effects.

Because ACE ignores genetic dominance effects, its biased against genetic explanations and in favor of common environment ones. Because ADE ignores common environment, it's biased the other way. In my experience, ACE is far more common in the literature. There is a model that includes both common environment and genetic dominance effects: the ACDE model, but its typically not used because it is difficult/impossible to estimate all four parameters using a sample of only monozygotic and dizygotic twins.

Measurement Error

Suppose you're trying to estimate the heritability of height. Each person has some platonic height, but your dataset, sadly, only has measured height. Measured height can differ from platonic height due to a number of factors: rounding, human error, height's tendency to shrink throughout the day, etc.

Presumably, your genes affect your platonic height but not the sources of error that end up mediating measured height. That is, the only way genes affect your measured height is through your platonic height.

When we are computing the monozygotic and dizygotic correlations, we can adjust for this source of measurement error. Here's how we do so:

Let $r$ be the correlation between true height and measured height. Note, since true height is impossible to observed directly, we can't observe this correlation directly either.

To get around this issue, we can measure someone's height twice, the correlation between the two measured heights should be $r^2$. Hence, the square root of the "test-retest" correlation should be $r$.

If we want to get the true $r_{MZ}$, it turns out we can just divide it by $r^2$. Ditto for $r_{DZ}$. Hence, the heritability estimate becomes

$$ 2 \left( \frac{r_{MZ}}{r^2} - \frac{r_{DZ}}{r^2} \right) $$

Or just

$$ \frac{1}{r^2} 2 \left( r_{MZ} - r_{DZ} \right) $$

The hawk-eyed reader will note that this just means our adjusted estimate of heritability is the naive estimate of heritability divided by $r^2$.

Lest you think I'm a crank, here's people with credentials talking about the same issue Frisell, and here is a study where this kind of thing played a major issue Riemann.

Assortative Mating

Assortative mating will cause naive twin studies to over-estimate the effects of shared environment and underestimate the effects of additive genes. Sadly, the vast majority of twin studies ignore this issue.

How strong is this effect?

Suppose there are no common environment effect. If we know (a) the true heritability and (b) the observed phenotype correlation between the mom and dad, we can compute the expected conclusion of a naive twin study that assumes no assortative mating. We can do so using some simple code.

Running this code with different parameters yields this table:

h²	Father-Mother Correlation
	0.00	0.10	0.20	0.30	0.40	0.50	0.60	0.70	0.80	0.90	1.00
0%	0.002	0.002	0.002	0.002	0.002	0.002	0.002	0.002	0.002	0.002	0.002
10%	0.102	0.102	0.101	0.101	0.101	0.100	0.100	0.099	0.099	0.098	0.098
20%	0.203	0.201	0.199	0.197	0.195	0.193	0.192	0.190	0.188	0.186	0.184
30%	0.303	0.298	0.294	0.290	0.286	0.282	0.278	0.274	0.271	0.267	0.263
40%	0.402	0.395	0.387	0.380	0.373	0.367	0.360	0.354	0.348	0.342	0.335
50%	0.502	0.490	0.479	0.468	0.457	0.447	0.437	0.428	0.419	0.411	0.402
60%	0.602	0.585	0.568	0.553	0.538	0.524	0.511	0.498	0.486	0.475	0.463
70%	0.702	0.678	0.656	0.636	0.616	0.598	0.581	0.564	0.549	0.534	0.520
80%	0.801	0.771	0.742	0.716	0.691	0.668	0.647	0.627	0.608	0.590	0.572
90%	0.901	0.862	0.827	0.794	0.764	0.736	0.710	0.686	0.663	0.642	0.621
100%	0.999	0.952	0.909	0.869	0.833	0.800	0.769	0.741	0.715	0.690	0.667

Note: Even though I used a sample of two million hypothetical twins, there is still some sampling error, which is why (for instance) the first row is close to zero but doesn't equal it precisely. This is likewise why the first column is close to the actual heritability but not exactly the same.

We can use this table to adjust twin studies for assortative mating. For instance, if a twin study finds a heritability of 72% but there is a correlation of 0.3 between fathers and mothers for that trait, the true heritability is probably a smidge above 80%.

Age

In general, the heritability of psychiatric traits increase with age Bergen and the common environment of traits decrease with age Meta-analysis of the heritability of human traits based on fifty years of twin studies . For this reason, estimates derived from studies of, say, young children should generally not be used to make conclusions about adults (and vice versa).

One place this shows up is educational attainment, where the years of education obtained has a large common environment component - probably because it is the result largely of events and decisions that occur in childhood, when parents still have large influence over their children. It notably contrasts with standardized high school test scores, which generally have approximately zero common environment component, and with IQ, which is probably the trait most famous for having a heritability that shifts with age.

As an aside, I am forced to wonder whether this has anything to do with the fadeout of educational interventions over time.

Other Methodological Shortcomings

In addition to the issues with using ACE or ADE, twin studies typically suffer from other methodological shortcomings

Twin studies often use questionnaires, often not verifying the reported information including whether the pair of people in question are even twins . Misclassifying people a fraternal twin as identical (or vice-versa) biases estimates against genes and towards common environment. The unreliability of self-reported data is rando and is similar to the measurement error discussed above and will generally bias estimates of genes and shared environment downwards - the exception is if the unreliability is non-random (e.g. twins are likely to misreport in similar ways).

Twin studies frequently draw on (non-random) convenience samples.
When twin studies use siblings rather than (or in addition to) dizygotic twins, this causes birth-order effects to be interpreted as genetic effects, biasing heritability estimates upwards and common environment estimates downwards.
If monozygotic twins are treated more similarly than dizygotic twins (or siblings if they're used), this biases common and unshared environment downwards while biasing genetic estimates upwards.
Maybe twins make choices to be more (or less) similar. This would bias estimates of unshared environment upwards. Alternatively, maybe this force is stronger in identical twins than fraternal twins, which would bias estimates of heritability upwards. This hypothesis is difficult to evaluate, but here are some related studies Fortuna Mark.
To the literature's credit, publication bias seems generally minor, with one major exception: the choice of ACE or ADE. Generally, studies start with an ACE model and, if this results in a negative estimate for common environment, change to an ADE model. This use of choice causes a positive bias for dominance genetic effects, a positive bias for common environment effects, and a negative bias for additive genetic effects - especially in small sample sizes Meta-analysis of the heritability of human traits based on fifty years of twin studies (see also the supplement).
TODO: gene-environment interactions Kaprio Avinun
TODO Grotzinger

Despite the widespread belief that smaller populations should have less additive genetic variance, the empirical evidence is scant Wood.

todo Røysamb

Appendix: Assortative Mating II

If a study is assuming zero common environment, then it is common to estimate heritability as just $h^2 = r_{MZ}$. In this case, assortative mating will bias heritability estimates upwards, and the corrections needed to account for it are different:

h²	Father-Mother Correlation
	0.00	0.10	0.20	0.30	0.40	0.50	0.60	0.70	0.80	0.90	1.00
0%	0.001	0.001	0.001	0.001	0.001	0.001	0.001	0.001	0.001	0.001	0.001
10%	0.100	0.104	0.109	0.113	0.117	0.122	0.126	0.130	0.134	0.139	0.143
20%	0.199	0.207	0.215	0.223	0.230	0.237	0.245	0.252	0.259	0.266	0.273
30%	0.299	0.309	0.320	0.329	0.339	0.348	0.357	0.366	0.375	0.383	0.391
40%	0.399	0.411	0.422	0.433	0.444	0.454	0.464	0.473	0.482	0.491	0.500
50%	0.499	0.511	0.523	0.534	0.545	0.555	0.565	0.574	0.583	0.592	0.600
60%	0.599	0.611	0.622	0.632	0.642	0.652	0.661	0.669	0.677	0.685	0.692
70%	0.699	0.710	0.719	0.728	0.736	0.744	0.752	0.759	0.765	0.772	0.778
80%	0.800	0.807	0.814	0.821	0.827	0.833	0.838	0.844	0.848	0.853	0.857
90%	0.900	0.904	0.908	0.912	0.915	0.918	0.921	0.924	0.926	0.929	0.931
100%	1.000	1.000	1.000	1.000	1.000	1.000	1.000	1.000	1.000	1.000	1.000

Elks, C. E., Den Hoed, M., Zhao, J. H., Sharp, S. J., Wareham, N. J., Loos, R. J., & Ong, K. K. (2012). Variability in the heritability of body mass index: a systematic review and meta-regression. Frontiers in endocrinology, 3, 29. https://doi.org/10.3389/fendo.2012.00029 Wikipedia contributors. (2021, March 30). ACE model. In Wikipedia, The Free Encyclopedia. Retrieved 23:22, October 10, 2021, from https://en.wikipedia.org/w/index.php?title=ACE_model&oldid=1015154064 Wikipedia contributors. (2018, December 26). ADE model. In Wikipedia, The Free Encyclopedia. Retrieved 23:23, October 10, 2021, from https://en.wikipedia.org/w/index.php?title=ADE_model&oldid=875467243 Ozaki, K., Toyoda, H., Iwama, N., Kubo, S., & Ando, J. (2011). Using non-normal SEM to resolve the ACDE model in the classical twin design. Behavior Genetics, 41(2), 329-339. https://doi.org/10.1007/s10519-010-9386-5 Silver, I. A., Nedelec, J. L., Segal, N. L., & Lonergan, H. (2021). Heteropaternal Siblings Misclassified as Dizygotic Twins: A Potential Biasing Factor for Heritability Estimates?. Behavior Genetics, 51(2), 137-143. Fortuna, K., Goldner, I., & Knafo, A. (2010). Twin relationships: A comparison across monozygotic twins, dizygotic twins, and nontwin siblings in early childhood. Family Science, 1(3-4), 205-211. https://doi.org/10.1080/19424620.2010.569367 Mark, K. M., Pike, A., Latham, R. M., & Oliver, B. R. (2017). Using twins to better understand sibling relationships. Behavior genetics, 47(2), 202-214. https://doi.org/10.1007/s10519-016-9825-z Polderman, T. J., Benyamin, B., De Leeuw, C. A., Sullivan, P. F., Van Bochoven, A., Visscher, P. M., & Posthuma, D. (2015). Meta-analysis of the heritability of human traits based on fifty years of twin studies. Nature genetics, 47(7), 702-709. https://doi.org/10.1038/ng.3285 Polderman, T. J., Benyamin, B., De Leeuw, C. A., Sullivan, P. F., Van Bochoven, A., Visscher, P. M., & Posthuma, D. (2015). Supplementary Figures 1–12, Supplemen tary Note and Supplementary Tables 1–19, 22–24 and 26–31. Nature genetics, 47(7), 702-709. https://static-content.springer.com/esm/art%3A10.1038%2Fng.3285/MediaObjects/41588_2015_BFng3285_MOESM4_ESM.pdf Bergen, S. E., Gardner, C. O., & Kendler, K. S. (2007). Age-related changes in heritability of behavioral phenotypes over adolescence and young adulthood: a meta-analysis. Twin Research and Human Genetics, 10(3), 423-433. https://doi.org/10.1375/twin.10.3.423 Wood, J. L., Yates, M. C., & Fraser, D. J. (2016). Are heritability and selection related to population size in nature? Meta‐analysis and conservation implications. Evolutionary applications, 9(5), 640-657. https://doi.org/10.1111/eva.12375 Røysamb, E., & Tambs, K. (2016). The beauty, logic and limitations of twin studies. https://fhi.brage.unit.no/fhi-xmlui/bitstream/handle/11250/2431737/Roysamb_2016_The.pdf?sequence=2 Kaprio, J. (2012). Twins and the mystery of missing heritability: the contribution of gene–environment interactions. Journal of internal medicine, 272(5), 440-448. https://doi.org/10.1111/j.1365-2796.2012.02587.x Grotzinger, A. D., de la Fuente, J., Nivard, M. G., & Tucker-Drob, E. M. (2021). Pervasive Downward Bias in Estimates of Liability Scale Heritability in GWAS Meta-Analysis: A Simple Solution. medRxiv. https://doi.org/10.1101/2021.09.22.21263909 Avinun, R., & Knafo, A. (2014). Parenting as a reaction evoked by children’s genotype: A meta-analysis of children-as-twins studies. Personality and Social Psychology Review, 18(1), 87-102. https://doi.org/10.1177/1088868313498308 Frisell, T., Öberg, S., Kuja-Halkola, R., & Sjölander, A. (2012). Sibling comparison designs: Bios from non-shored confounders and measurement error. Epidemiology, 713-720. https://doi.org/10.1097/EDE.0b013e31825fa230 Riemann, R., & Kandler, C. (2010). Construct validation using multitrait‐multimethod‐twin data: The case of a general factor of personality. European Journal of Personality, 24(3), 258-277. https://doi.org/10.1002/per.760