Allen Ancient DNA Resource in PLINK format

The Reich lab released a bunch of data in January 2021. Someone emailed me about the format. I had converted the lab's earlier release to PLINK (PEDIGREE) format, and the emailer wondered if I could do the same for this release. I did. Remember that the “FAMILY ID” is the population as identified from the lab's annotation files.

Here are the files:

v44.3_1240K_public.bed
v44.3_1240K_public.bim
v44.3_1240K_public.fam

v44.3_HO_public.bed
v44.3_HO_public.bim
v44.3_HO_public.fam
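
Since the family ID carries the population label, a quick way to see what is in a release is to tally the first column of the .fam file. Here is a minimal Python sketch, assuming the standard six-column, whitespace-separated .fam layout (family ID, individual ID, father, mother, sex, phenotype). The function name and the example rows are made up for illustration and are not taken from the real files.

```python
from collections import Counter


def count_populations(fam_lines):
    """Count individuals per family ID (column 1) in PLINK .fam lines."""
    counts = Counter()
    for line in fam_lines:
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        counts[fields[0]] += 1  # column 1 holds the population label
    return counts


# Hypothetical rows in .fam layout, for illustration only.
example = [
    "PopA ind1 0 0 1 -9",
    "PopA ind2 0 0 2 -9",
    "PopB ind3 0 0 0 -9",
]

for population, n in count_populations(example).most_common():
    print(population, n)
```

To run it on a real file, pass an open file handle instead of the list, for example `count_populations(open("v44.3_1240K_public.fam"))`.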

Unleash the data kraken!


The Reich lab has done a mitzvah and released a huge merged dataset of their modern and ancient populations in a big tarball. Actually, there are two files. One has more individuals, with 600,000 SNPs (it includes the “Human Origins Array” data), and the other has 1,200,000 SNPs but fewer individuals. Both are in EIGENSTRAT format.

For the convenience of readers who are more comfortable in PLINK/PEDIGREE format, I’ve converted them, and replaced the family ID column with population labels. The links take you to a zip file that has the three files of the binary format (.bed, .bim, and .fam).
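
To make the family ID replacement concrete, here is a rough Python sketch of the relabeling step. It is an illustration, not necessarily how the files above were produced. It assumes the usual three-column EIGENSTRAT .ind layout (sample ID, sex as M/F/U, population label) and writes .fam-style rows with the population in the family ID column. The function name, the sample rows, and the choice of -9 for a missing phenotype are assumptions. The genotypes themselves still need a proper converter (EIGENSOFT's convertf is the usual tool).

```python
# PLINK sex codes: 1 = male, 2 = female, 0 = unknown.
SEX_CODES = {"M": "1", "F": "2"}


def ind_to_fam(ind_lines):
    """Turn EIGENSTRAT .ind rows into PLINK .fam rows keyed by population."""
    fam_rows = []
    for line in ind_lines:
        fields = line.split()
        if len(fields) < 3:
            continue  # skip blank or malformed rows
        sample_id, sex, population = fields[:3]
        fam_rows.append(" ".join([
            population,               # family ID: the population label
            sample_id,                # individual ID
            "0",                      # father unknown
            "0",                      # mother unknown
            SEX_CODES.get(sex, "0"),  # anything other than M/F becomes unknown
            "-9",                     # phenotype missing (assumed default)
        ]))
    return fam_rows


# Hypothetical .ind rows, for illustration only.
example = ["ind1 M PopA", "ind2 F PopB", "ind3 U PopB"]
for row in ind_to_fam(example):
    print(row)
```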