
In light of my recent posts about skin color, I thought this was an appropriate time to use this old story (which according to google remains popular) as an exemplar of Mendelian genetics beyond the single locus, that is, an exploration of variation on traits generated by a confluence of loci. The recent evidence suggests that about 5 loci are responsible for about 90% of the difference of average effect in regards to phenotype (i.e., skin reflectance) for skin color between populations (these are loci of “large effect”). This does not mean that these loci are as relevant for within population differences (or between sex differences within a population), as Europeans and Africans are often “fixed” for those genes in alternative alleles (so when you see variation within Norwegians or Nigerians for skin color, that might be due to very different genetic dynamics which only come into play when the background is controlled for). Keep in mind that this exposition is focused on between population diferences, crudely, what makes us black or white (or, in my case, a rich and sensual brown which is magnetic to the female sex).**
OK, so, assume 5 genes control skin color. Each of the genes is independent and equivalent in effect. For simplicity, assume that each gene gives a dosage of “10 units” of melanin (skin color seems to covary with the density and size of melanosomes). The genes are additive across loci, and between alleles. In other words, there is no interaction between the genes, and, there are no dominance effect so that heterozygotes are exactly in between homozygotes in regards to phenotype for a given locus. The alleles for all the genes come in two variants, On or Off, equivalent to loss of function and functional alleles. Any one On allele contributes exactly 10 units, and Off contributes 0 units, of melanin. So, if an individual is homozygous Off for all loci they express 0 units and are white. Similarly, if an individual is homozygous On for all loci they express 100 total units and are black. How do we get 100?
There are 5 loci, and, because humans are diploid, they carry two copies of each gene (two alleles at each locus). So:
5 loci * 10 units per locus * 2 alleles per locus = 100 melanin units.


But what about when the mixed race parents scramble their genes together? This is where it gets tricky. We can model this basically as a binomial process. A parent contributes 5 alleles. In the case above the parent has a 1/2 chance of either contributing an On or an Off for a given slot. You have the number of slots being n, and the probability of an On as the probability, p.
So, n = 5, and p = 0.5, e(x) = np, the expectation of contribution of melanin units is 25 units from one parent. Since they are both genetically the same, you get 25 from both parents for a cummulative total of 50 units. In other words, just like the parents you expect the average child to be about 50 units of melanin expression. But what about the black and white babies? How did they come about? The expected value has a variance around its expectation. v(x) = npq for the binomial distribution, or, 100*0.5*0.5 or 25 units of squared variance. That means that the standard deviation is 5 units of melanin about the expectation, the standard deviation being a measure of dispersion about the central tendency of this probability distribution.
In regards to the black and white twins, for the black twin the probability that a given slot is On is 0.5. From the father, there are 5 slots, and the mother there are 5 slots. So, 0.55 ~0.03125. You have to have this happen on the other 5 slots from the other parent as well, so multiply out the independent probabilities (or, just do 0.510, same result) and you get 0.000977 probability that a twin will be “black.” The chance that a twin is going to be white is symmetrical, so multiple the two together, and you get a 1 out of 1,048,576 chance of one twin being black and the other one white. The only quibble I would have is that you have two configurations, twin a could be white or black, and the same for the other twin, so I’d double the probability here as there are two combinations, so 1 out of 500,000. In contrast, there are many ways to arrange the alleles in the 10 slots to obtain 50 melanin units, explaining why this is the expectation or modal value. Here is the probability distribution for the function that models the outcome based on the genetic architecture that I outlined above incrementing up 10 units of melanin from 0 to 100:
| 0.1% at 0 units 1.0% at 10 units 4.4% at 20 units 12% at 30 units 21% at 40 units 25% at 50 units 21% at 60 units 12% at 70 units 4.4% at 80 units 1.0% at 90 units 0.1% at 100 units | ![]() |

* If the parents’ own black parent is Caribbean there is almost certainly some white ancestry from that side as well, though the expectation is closer to 5-10% than 20% as is the norm among African Americans.
** Though I’m religiously monogamous.


Comments are closed.