I got a question about endogamy and Bangladeshis on of my other weblogs, as well as their relatedness to western (e.g., Iranian) and eastern (e.g., Southeast Asian) populations. Instead of talking, what do the data say? Most of you have probably seen me write about this before, but I think it might be useful to post again for Google (or Quora, as Quora seems to like my blog posts as references).
The 1000 Genomes project collected samples a whole lot of Bangladeshis in Dhaka. The figure at the top shows that the Bangladeshis overwhelmingly form a relatively tight cluster that is strongly shifted toward East Asians. There is one exception: about five individuals, several of which were collected right after each other (their sample IDs are sequential) who show almost no East Asian shift.
Since people asking me about this, and I’m running the South Asian Genotype Project, I thought I would post two non-PCA visualizations of how various South Asian groups relate to each other (along with a few outgroups).
The radial plot above is a neighbor-joining tree visualized from pairwise Fst statistics (basically a proxy for genetic distance).
I also used Treemix to generate a plot. You see the similar patterns as the one above, though the two methods are different. Treemix tests a bunch of models and sees how the data fit those models. The visualization of Fst is just a way of representing the summary statistic.
I added 5 migration edges to the plot to the right. Not sure if they add anything, but you can see that some of the nodes move around because they are so mixed.