When I wrote the Pleistocene was humanity’s Hyborian age, I meant humanity. For contingent reasons the new genetic sciences of ancient DNA have elucidated the history of northwest Eurasia first. But prior to the Great Divergence Europe was not quite so exceptional. In fact the historian Victor Lieberman wrote Strange Parallels, his macrohistory of Eurasia, to highlight just how similar the trajectories of Western Europe and mainland Southeast Asia were up until the early modern era, when the West distanced itself from the rest. In short, European prehistory updates our priors for the prehistory of us all.
For various reasons having to do with professional responsibilities I look at TreeMix plots quite often. Like PCA TreeMix is great for exploratory data analysis. You throw a bunch of populations in there, and it searches a bunch of parameters which can fit the model. But often the results are weird.
They’re not weird because they’re “wrong.” They’re weird because we’ll forcing data to give us answers, and the model pops out something which is reasonable with the conditions imposed on it. And often we just don’t have the big picture. Statistical inference was indicating strange connections between Native Americans and Europeans for the past decade…but it took ancient DNA from Siberia to resolve the mystery. Europeans and Amerindians exhibit ancestry from a shared common population. In Europe this ancestry is relatively recent, on the order of the past ~4 to 5 thousand years. Statistical genetic inference can tell us our model is missing something, but it can not always specify clearly just exactly what we’re missing.
The image above from a TreeMix plot is hard to make out; click it. But what it will show you are two things which are strange:
1) Gene flow from between the East African (mostly HapMap Masai for what it’s worth) node and Mbuti (HGDP) to the Papuans (HGDP).
2) Gene flow from near the East African node to the point which defines the whole East Eurasian, Amerindian, and Oceanian, nodes.
I would laugh this off, but I see it all the time in TreeMix. I know I’m not the only one. I have no explanation for it. It’s obviously not recent admixture. Rather, there are affinities between populations which we just don’t have a good model for. Knowing what we know about ancient Europe it is mostly likely that these gene flow edges which seem inexplicable reflect prehistoric events which make sense only in the context of population patterns which have been totally obscured over the last 10,000 years. Ancient DNA from China will probably shed a great deal of light on these topics. I predict that the Chinese will exhibit the same discontinuity with their Paleolithic ancestors that modern Europeans do, and the affinities between East Eurasians and some Africans in these TreeMix plots probably is a shadow of a “ghost population” which has been absorbed in Eurasia, and may have contributed to some of the ancestry of a group which migrated back to Africa.
Notes I set TreeMix to check for covariance across blocks of 1000 SNPs. I had 215,000 total markers in the data set (very high quality ones). I rooted it with Mbuti, set 5 migration edges, and ran it 10 times. They all looked the same. Most of the populations are pooled from public sources.