Heritability of g
[ Deprecated: see here for most recent version. ]
From Haworth:
In 34 twin studies with a total of 4672 pairs of MZ twins, the average MZ correlation is 0.86 Familial studies of intelligence, indicating that identical twins are nearly as similar as the same person tested twice (test–retest correlation for g is about 0.90 The g Factor: The Science of Mental Ability). In contrast, the average DZ correlation is 0.60 in 41 studies, with a total of 5546 pairs of DZ twins. Heritability, the genetic effect-size indicator, can be estimated by doubling the difference between the MZ and DZ correlations because MZ twins are twice as similar genetically as DZ twins Behavioral Genetics. This heritability estimate of 52% is similar to that in the results from family and adoption studies. Moreover, meta-analyses of all of the studies yield heritability estimates of about 50%, indicating that about half of the total variance in g can be accounted for by genetic differences between individuals. Chipuer Devlin Loehlin
However, this simple conclusion masks possible developmental differences. The dozens of twin studies of g vary widely in the age of their samples, and several reviews have noted a tendency for heritability to increase with age
...
Age Category MZ DZ Childhood 0.74 (0.71-0.77)
n=10890.53 (0.49-0.57)
n=1591Adolescence 0.73 (0.70-0.74)
n=22220.46 (0.43-0.49)
n=2712Young Adulthood 0.82 (0.80-0.83)
n=14980.48 (0.44-0.51)
n=1577Combined Sample 0.76 (0.75-0.77)
n=48090.49 (0.47-0.51)
n=5880
I'm going to focus on young adulthood.
While I think this paper is a good survey of the literature, I have a few issues with its analysis. The biggest is that they conflate measured intelligence with actual intelligence. Specifically:
-
The "0.90" test-retest correlation cited comes from The g Factor: The Science of Mental Ability, where the test-retest correlation is mentioned twice:
The reliability coefficients for multiitem tests of more complex mental processes, such as measured by typical IQ tests, are generally about .90 to .95. This is higher than the reliability of people’s height and weight measured in a doctor’s office!
For intervals of less than one year, the test-retest correlations are generally above .90.
This is just a sloppy citation, but it impacts the next point...
-
The authors elide an important definitional/philosophical problem. The fact the test-retest correlation for g is only 0.90 is a result of conflating two different definitions of g: a score on a test to measure g and the platonic ideal of g itself. The latter must, by definition, have a test-retest validity of 1. This suggests the correlation between MZ twin's gs is actually more like 0.89 and that much of the "unshared environment" is, in fact, simply due to measurement error.
-
But, in fact, the problem is worse than that: this still assumes IQ tests are unbiased measure of g - e.g. it assumes your score on the Wechsler is just g plus some random noise. This is probably not the case. This can most easily be seen in the subtests: it doesn't matter how many questions you add to your math subtest, at some point the reason it doesn't perfectly measure g is that g is not just math. Each test has a test-specific factor that isn't g, and unless you take literally infinitely many tests these factors will introduce additional error that can't be estimated via simple retesting.
For instance Raven's matrices and the WAIS have a correlation of about 0.84 Paul, 0.76 (when corrected for range-restriction) O'Leary, or 0.74 McLaurin, much lower than the ~0.925 you'd expect based on simple test-retest correlations.
For this reason, it is entirely possible that the correlation between identical twins actual intelligence (as opposed to measured intelligence) is indistinguishable from 1.
The sceptic will retort that maybe the test-specific factors are even more heritable than g, which would mean traditional analysis of twin studies will overstate g's heritability. To this there are two defenses: (a) even after only accounting for the within-test measurement error, estimates of g's h^2 rise to above ~0.89, which makes it one of the most heritable traits to be measured; it is, a priori, unlikely that the test-specific traits are even more heritable, and (b)
more g-loaded tests tend to be more heritable Kan Pedersen Nijenhuis , which means test-specific traits must tend to be less heritable than g.
All the above points suggest the true correlation for MZ intelligence is approximately 1, 22% higher than the measured correlation. The same arguments apply to the DZ correlation: placing the true correlation at about r~0.59 rather than the measured r~0.49. Naively, this DZ correlation would suggest that common environment is still important, even in young adulthood. This brings us to assortative mating.
As the authors, themselves, note: they did not account for assortative mating, which they admit is probably a problem. The correlation between spouse IQs is typically reported to be in the
Moreover, most IQ studies assume non-zero common environment and zero genetic dominance effects: that is they use the ACE model rather than the ADE model. This means they are biased against finding genetic effects. We don't need to appeal to this bias, but it gives yet more evidence that traditional estimations are biased against genetic explanations.
In other words, I think bulk of the evidence suggests that true intelligence is 100% heritable, and that belief to the contrary stems from imperfect measurement and common ACE model assumptions.
That being said, in the name of full disclosure, here is all the evidence I've found against this conclusion:
Some experts believe common environment can have significant effects on intelligence in adulthood Kaplan. - Even within identical twins, IQ test scores correlate with income - albeit, they explain only about a quarter the variance Sandewall. This suggests that some of the facets of unshared environment that affect IQ scores also affect ability to earn money, which means such factors can't be mere measurement error. In my opinion, the most parsimonious explanation is that some test-specific factors help on both (imperfect) IQ tests and with earning money.