
First, and foremost, the genetic architecture of human pigmentation variation is characterized by the reality that most of the variation is due to a handful of loci. In other words, skin color is not monogenic Mendelian, but neither is it highly polygenic in the same fashion as height or IQ, where variation is distributed across so many loci that alleles have nearly an infinitesimal effect size. The small sample sizes and simple methodologies of aught era genomics were sufficient to capture the relatively large effect variants segregating in many populations. A second major aspect to pigmentation genomics is that the pathways seem strikingly conserved across vertebrates. That means that pelage color research could inform human genetics, and vice versa.Some of the most interesting confirmations of the power of loss of function mutations in humans occurred by inducing a similar change in zebrafish! One inference that I think one might take away from this is that ancient human populations likely exhibited variation due to polymorphism around the same set of loci as modern humans.
But, and there’s a big but, is that though the set of loci which are responsible for pigmentation variation across human populations are familiar, finite, and well characterized, the particular mutations responsible within a given locus varies quite a bit. Because derived mutations which result in reduced pigmentation are mostly loss of function all you need to do is “break” the functionality in some manner. Therefore, you might target a regulatory element, or, the exonic sequence itself, but the possibilities are rather numerous. Heather Norton’s publication from 2007, Genetic Evidence for the Convergent Evolution of Light Skin in Europeans and East Asians, is still rather relevant. For various reasons the pigmentation of Europeans has been well elucidated. That means that to a great extent the variation in West and South Eurasians more generally (and North Africans) is well understand because most of the same variants seem to be at play. The big lacunae, as pointed out by Norton et al., concerns East Asians. This is a population which is light-skinned, but lacking in the typical set of European “light” alleles.

But this is a recent development. From what I am to understand historically the peoples of Northeast Asia did refer to themselves as white in contrast to the browner people of Southeast Asia (in an analogous fashion, the people of West Asia as far east as Afghanistan consider themselves white, in contrast to the black people of South Asia). Additionally, when Europeans first encountered Northeast Asians in large numbers in the 16th century they observed that physically the people of nations such as Japan and Korea were white in color. Only with total domination of the globe by Europeans in the 19th century did the identification of white and European become such as that Northeast Asians were classed among the “colored” peoples (the appellation “yellow” was taken up by early 20th century East Asian intellectuals). But both quantitative empirical evidence and simple visual inspection can remind us that many Northeast Asians are as light in complexion as many Europeans, albeit never as pale as many Northern Europeans.
A new paper in Molecular Biology and Evolution, A genetic mechanism for convergent skin lightening during recent human evolution, goes a major step toward pinpointing what is going on in a functional sense in relation to East Asians. In fact they’re doing what occurred ten years ago for Europeans. First, they’re finding the variant through GWAS, and second, they are confirming through molecular methods and animal models that the variant of interest is actually the causal mechanism. And, they are also attempting to establish a temporal narrative by adducing signatures of selection.


All that being said, the general stylized facts are in alignment. The derived allele is common on the eastern coastal region of Eurasia, and nearly absent in Africa, Europe, and West and South Asia. But a curious aspect to me is that in the 1000 Genomes data the allele is nearly as absent in the Bangladeshi samples as it is in other South Asians. In contrast, the derived variant of EDAR, which is diagnostic of East Asian or Amerindian ancestry, is present at 5% frequency in Bangladeshis, about what you would expect assuming the attested levels of gene flow from an East Asian population. While the authors in the above study found that the effect of the allele is additive, it is curious that in the 1000 Genomes there is no variation across Japanese, North and South Chinese, and Vietnamese. The implication is that the average between group differences across these populations has to be due to variation on other loci. The indigenous Dai people in fact had the highest frequency of the derived allele in the 1000 Genomes.

These details matter because the authors estimated how deep the selection sweeps around this locus must be in terms of time. Using two methods they arrive at a figure between 10 and 15 thousand years (one method is closer to 10, another to 15). That implies that selection began before the Holocene. The interpretation the authors put on these results is that the northern East Asian groups experienced selection as they migrated up from Southeast Asia during the Pleistocene, with the Austro-Asiatic groups being basal and reflecting the ancestral state. The problem, as I suggest above, is that the Austro-Asiatic populations are a compound of genuinely basal groups (their minority ancestry) to the Northeast Asians, and a population to which other Northeast Asians further north may be basal!
One thing Eight thousand years of natural selection in Europe tells us using ancient DNA that a history of admixture is important to understanding the specific dynamics of selection. Though the haplotype based methods were roughly correct, they did not exhibit the granularity necessary to make fine-grained inferences, and did not totally predict what the empirical ancient DNA is telling us about allele frequencies across time. For example, earlier attempts to infer the selection sweep which resulted in high frequencies of SLC45A2 in Europe arrived at a figure a bit north of ~10,000 years. But it seems that a great deal of selection on this locus has been occurring more recently than 5,000 years.
And on a final note, I would point out that the intermediate frequencies of the derived allele in much of East Asia are suggestive to me that the genuine target of selection here is not skin color, but a dominant trait. The fact that the derived allele is nearly absent in Bangladeshis indicates that either the sweep up in frequency is very recent, so that not all East Asian populations experienced it, or, more likely to my mind, there is constraining selection on the trait which is the genuine target of interest in other genetic backgrounds. To decrypt what I’m saying, the derived allele is probably useful in East Asia, but entails some cost. South Asians may already have another allele which gains the same function, and so the cost resulted in purification of the derived allele in Bangladeshis (who are ~10% derived from a group very similar to the Dai).
As should be clear, this paper has some confusions. But it’s a taste of things to come. There are many Chinese who are interested in the genomics of their region, and ancient DNA should begin to unveil the past in the next few years.

Comments are closed.