I have mentioned before that the 1000 Genomes Chinese are heterogenous. Many of the ones sampled in Beijing are North Chinese. But there is structure within the South Chinese samples as well. The PCA above shows it. I’ve pruned some of the data for clarity (it’s probably a cline really, with cut-offs and breaks happening because of variation in population density)


The Miao/Hmong samples from the HGDP are very similar to the South China cluster in admixture analysis (and less Dai than the South China 2 cluster). This is not surprising, as the Miao/Hmong are relatively recent migrants into Southeast Asia from China.


