Why chimpanzees can donate blood in movies

There is a high likelihood that you know of which ABO blood group you belong to. I am A. My daughter is A. My father is B. My mother is A. I have siblings who are A, O, B, and AB. The inheritance is roughly Mendelian, with O being “recessive” to A and B (which are co-dominant with each other, ergo, AB). It is also generally common knowledge that O is a “universal donor,” while A and B can only give to individuals within their respective blood group and AB.

Because ABO was easy to assay it was one of the earliest Mendelian markers utilized in human genetics. In the first half of the 20th century while some anthropologists were measuring skulls, others were mapping out the frequency of A, B, and O. Today with much more robust genetic methods ABO has lost its old luster as a genetic marker, especially since there is a strong suspicion that the variants are strongly shaped by natural selection. This makes them only marginally useful for systematics, which rely upon loci which are honest mirrors of demographic history.

Read More

One girl, one exome

Interesting story in The San Jose Mercury News, Open-source science helps San Carlos father’s genetic quest:

“We used materials that are public, freely available,” said Rienhoff, a physician and scientist, as Beatrice frolicked nearby. “And everything we’ve learned we’ve put back out there, in the public domain. It’s for the patient’s good, and the public good.”

Born with small, weak muscles, long feet and curled fingers, Beatrice confounded all the experts.

No one else in her family had such a syndrome. In fact, apparently no one else in the world did either.

Rienhoff — a biotech consultant trained in math, medicine and genetics at Harvard, Johns Hopkins and the Fred Hutchinson Cancer Research Center in Seattle — launched a search.

He combed the publicly available medical literature, researching diseases, while jotting down each new clue or theory. Because her ailment is so rare, he knew no big labs or advocacy groups would be interested.

Read More

GEDmatch needs funds!

I noticed today that GEDmatch is trying to raise funds to cover the cost of their web services. What are those services? Basically if you get raw data back from direct-to-consumer genotyping firms GEDmatch allows you to run further analytics. You can do some of these yourself…but most people aren’t going to know how to convert their files into pedigree format and use PLINK. As genotype data becomes more and more common there’ll be more need for analytic services like GEDmatch, whether for profit or for fun.

One can imagine a near future where much of the work can be offloaded to desktop applications (e.g., Promethease does this for traits and diseases). But the problem is that there is greater returns to the analysis when you aggregate the source data into huge agglomerations, and if people are doing their own analyses on their own systems that’s not going to happen (the GWAS information in Promethease uses aggregated information implicitly with the studies they rely on). This is why GEDmatch and openSNP are important.

In any case, if you have used GEDmatch and wish to give, this would be a good time.

Dogs, domesticated before agriculture

The domestication of the dog is a complex and unresolved topic. But at this point I am convinced that this is one domestication event which well predates agriculture. To some extent this is common sense. There are tentative archaeological finds of domestic dogs in the New World almost immediately after widespread human habitation of the Western hemisphere, >10,000 years ago. More concretely domestic dog DNA has been retrieved from ~9,250 year old coprolites in Texas. The distinctiveness of the New World dogs is well attested genetically. Eskimo dogs for example are nested in a well diverged clade with “ancient dogs” (e.g., Basenji), indicating their early separation from the main Eurasian stock. Additionally, from talking to a dog geneticist I am to understand that the Eskimo dogs themselves are likely new arrivals, and superseded older dog lineages in the far north.

Read More

The arcane art of ancient admixture


I have mentioned the PLoS Genetics paper, The Date of Interbreeding between Neandertals and Modern Humans, before because a version of it was put up on arXiv. The final paper has a few additions. For example, it mentions the generally panned (at least in the circles I run in) PNAS paper which suggested that ancient population structure could produce the same patterns which were earlier used to infer admixture with Neandertals (the authors also point to Yang et al. as a support for the proposition of admixture rather than structure). The primary result, dating the admixture between Neandertals and anatomically modern humans ~40-80,000 years before the present, is reiterated.

An interesting aspect is that their method is to utilize linkage disequilibrium (LD) decay. It’s interesting because tens of thousands of years is a hell of a long time to be able to detect an admixture event via LD! In particular because there’s likely a palimpsest effect where there are intervening admixtures and other assorted demographic events (e.g., bottlenecks and selective sweeps can also generate LD). So how’d they do it? Basically the authors figured out a way to ascertain which pairs of  SNPs may have introgressed from Neandertals by comparing the frequency in modern humans to Neandertals at those given SNPs (in particular, by looking at variants at low frequency in Africans and derived in Neandertals). A major technical problem here is the “genetic map” which allows one to assess what the nature of recombination over time is going to be which breaks apart the associations which are the hallmark of LD is not particular precise enough to robustly allow them to make the inferences that they want.

Read More

All original GSS analyses in charts are Creative Commons

Every now and then I get emails/inquiries about using my simple “quick & dirty” charts. I always give permission, and in fact I never complain when other (usually more popular!) weblogs use them. Even on the very rare occasions where attribution is not given (I suspect this is usually an oversight in haste). This is fair enough, as I regularly use figures and tables from scientific papers in my blog posts. I believe this is all reasonably under “fair use.” But I’ve decided to explicitly assert that the charts I produce are under the Creatives Commons license. This is not to force people to attribute. Even if someone uses my chart and does not attribute I won’t really sue or anything like that (unless they’re somehow miraculously making millions off them). Rather, it’s a nudge to those who would use the charts, and should also make it so that people don’t have to contact me directly via email or Twitter. If you want to use the chart or data, permissions is given implicitly.

Buddy can you spare a selective sweep

The Pith: Natural selection comes in different flavors in its genetic constituents. Some of those constituents are more elusive than others. That makes “reading the label” a non-trivial activity.

As you may know when you look at patterns of variation in the genome of a given organism you can make various inferences from the nature of these patterns. But the power of those inferences is conditional on the details of the real demographic and evolutionary histories, as well as the assumptions made about the models one which is testing. When delving into the domain of population genomics some of the concepts and models may seem abstruse, but the reality is that such details are the stuff of which evolution is built. A new paper in PLoS Genetics may seem excessively esoteric and theoretical, but it speaks to very important processes which shape the evolutionary trajectory of a given population. The paper is titled Distinguishing between Selective Sweeps from Standing Variation and from a De Novo Mutation. Here’s the author summary:

Considerable effort has been devoted to detecting genes that are under natural selection, and hundreds of such genes have been identified in previous studies. Here, we present a method for extending these studies by inferring parameters, such as selection coefficients and the time when a selected variant arose. Of particular interest is the question whether the selective pressure was already present when the selected variant was first introduced into a population. In this case, the variant would be selected right after it originated in the population, a process we call selection from a de novo mutation. We contrast this with selection from standing variation, where the selected variant predates the selective pressure. We present a method to distinguish these two scenarios, test its accuracy, and apply it to seven human genes. We find three genes, ADH1B, EDAR, and LCT, that were presumably selected from a de novo mutation and two other genes, ASPM and PSCA, which we infer to be under selection from standing variation.

The dynamic which they refer to seems to be a reframing of the conundrum of detecting hard sweeps vs. soft sweeps. In the former you case have a new mutation, so its frequency is ~1/(2N). It is quickly subject to natural selection (though stochastic processes dominate at low frequencies, so probability of extinction is high), and adaptation drives the allele to fixation (or nearly to fixation). In the latter scenario you have a great deal of extant genetic variation, present in numerous different allelic variants. A novel selection pressure reshapes the frequency landscape, but you can not ascribe the genetic shift to only one allele. It is no surprise that the former is easier to model and detect than the latter. Much of the evolutionary genomics of the 2000s focused on hard sweeps from de novo mutations because they were low hanging fruit. The methods had reasonable power to detect them (as well as many false positives!). But of late many are suspecting that hard sweeps are not the full story, and that much of evolutionary genetic process may be characterized by a combination of hard sweeps, soft sweeps (from standing variation), various forms of negative selection, not to mention the plethora of possibilities which abound in the domain of balancing selection.

Many of the details of the paper may seem overly technical and opaque (and to be fair, I will say here that the figures are somewhat difficult to decrypt, though the subject is not one that lends itself to general clarity), but the major finding is straightforward, and illustrated in figure 4 (I’ve added labels):

Read More

Newsweek formed me (sort of)

You know that Newsweek is ending its print edition. This was long in the coming. What I find interesting is that apparently its circulation peaked in 1991, at 3.3 million. It declined to 3.1 million in 2007, and literally cratered over the past 5 years to 1.5 million! Thinking back to my own past I remember my interest in Newsweek‘s cartoons, as well as the in-class discussion triggered by the latest issue of Time. There was a time when these were relevant publications. But that ended in 1995, with the rise of the internet. For years the weeklies still maintained the illusion of relevance, but I think they were living on borrowed time. People went through the motions because they always had. After all, the cover of Time was important, everyone knew it. Until no one did. The collapse in circulation is just a reflection of the fact that this emperor had long ago shed its clothes.

The general social survey: information is not free

Last week the GSS was down. I was very sad. The SDA team explains the situation:

Part of the popularity of our demonstration archive is that it is free for end users. We are happy to provide this service. It is a valuable resource for the academic community and it also publicizes the value of our SDA software. However, the flip side of providing this free service is that it does not generate any income to offset the cost of providing the infrastructure required. We receive no funding from GSS for hosting their datasets — which is often a surprise to our users. Almost all of our income comes from the fees provided by licensing the SDA software to other data archives (like ICPSR and IPUMS), and virtually all of that income goes to support the programming and technical support that we provide them. We obviously need some additional sources of revenue.

Yes, a young boy was forced from school for carrying cystic fibrosis alleles

Update II: This comment sums up the pertinent issues.

Update: Please see comments below. This may be an infectious disease story, and not a genetic one.

When a reader sent me an email about the story, I assumed it was a rather sophisticated hoax. The short of it is that an 11 year old boy, Colman Chadam, has been pulled out of his school in Palo Alto because he carries alleles for cystic fibrosis, though he is asymptomatic (i.e., he has never manifested any symptoms of c.f.) administrators are worried that he might pose a risk to some students who do show progression of c.f. (bacterial infections can spread from child to child). As you probably know about 1 out of 30 people of European descent is a carrier for one of the many thousands of mutant variants for cystic fibrosis. The details of Colman Chadam’s results are not totally clear. Is he just a carrier? Or does he have two copies of cystic fibrosis, but somehow they differ enough that they can functionally complement each other?

We don’t know. But we do know that the lawyer from the school, Lenore Silverman, has stated that “The district is not willing to risk a potentially life-threatening illness among kids.” The risk here is non-existent. Not to be creepy, but does the school require that people with various venereal diseases also avoid the premises? With the small, but non-trivial, frequency of sexual abuse at school they too post a illness risk to the kids. In fact, more than Colman Chadam.

You can read everything at The San Francisco Chronicle. Perhaps you want to contact some of the staff at Jordan Middle School in Palo Alto. These people should be ashamed of themselves. I know we live in a litigious society, and being in education isn’t the easiest job, but we deserve better than this!