

The relatively low frequency in Guangxi is to be expected. This province was Sinicized only recently. As in, the last 500 years. And it still retains a huge ethnic minority population, and many of the Han in the province likely have that ancestry. But the question still arises: why do the Han have such a high frequency of rs17822931?
Here’s a plot of frequencies:
But the ALFRED database has more details. Sardinians, Somalis, Ethiopian Jews, and Dani from the New Guinea highlands all have very low proportions or none of the derived variant. The Ethiopian Jews are about ~40% West Eurasian, due to Middle Eastern agriculturalist ancestry. Groups like the Masai also have Middle Eastern agriculturalist ancestry. I think the low frequencies of the derived variant in the Middle East are due to migration from eastern Eurasia in the relatively recent past. The frequencies of the derived variant in Europe probably came with the Ancestral North Eurasian ancestry of the steppe people. In South and Southeast Asia the frequencies are indicative of balancing selection, even if there is no such selection, while in the New World world the derived variant is at low, but appreciable frequencies.
As I mentioned in an earlier post, a 40,000 year old Siberian had the derived variant (heterozgote). I suspect the Basal Eurasians did not.
