Selection in humans, a follow-up

Share on FacebookShare on Google+Email this to someoneTweet about this on Twitter

To follow up on a previous post on determining the fraction of the human genome under selection (as well as Coffee Mug’s post on non-coding RNAs), here’s a paper which claims that, in raw numbers, more non-coding DNA has been under selection in humans than coding DNA. But still, using their method, based on the distribution of insertions and deletions in the genome, there’s not a whole lot of the genome under selection– something like 0.3%. However:

For four reasons, [this estimate] may be a considerable under-estimate. First, the method necessarily only exploits orthologous regions that retain sufficient resemblance to allow their accurate alignment. Lineage-specific or orthologous segments whose sequences have diverged greatly as a consequence of positive selection are thus not able to be aligned and are not counted towards the genomic total. For example, it has been shown that sequences unalignable between human and mouse often contain structural RNA elements. Secondly, sequences are often not included if they have recently gained function, owing to their sequence divergence being intermediate between those of neutral and constrained sequences. Thirdly, the method misses adaptive sequence within which selection has not acted heterogeneously, but instead has driven both beneficial indels and substitutions to fixation. Finally, it also overlooks positively selected sites, or short regions, that are scattered among a majority of constrained bases.

This is also interesting, given results from Bruce Lahn’s lab showing accelerated evolution of nervous system genes in humans:

More recently, we exploited this signature of selection upon indels to conduct a genome-wide scan for positive selection on small functional intronic elements. We find such elements to be especially abundant in the introns of genes that are expressed in the brain (Lunter and Ponting, submitted for publication)

3 Comments

  1. That is an interesting paper. Thanks for the link.

  2. Interesting ideas. Generally speaking, how can we expect accelerated evolution of a single gene within a lineage to relate to phylogenetic patterns of variation? Specifically, what is the relationship between Lahn’s work and the “human accelerated regions” work of David Haussler et al?

  3. 1. if a region is evolving neutrally, one would expect evolution along a single lineage to occur at about the same rate as throughout the rest of the phylogeny. under positive selection, the evolution along a particular branch should be faster.  
     
    2. lahn’s work focused on genes. the human accelerated region pointed out by haussler et al is non-coding. the research is related in that it looks to find regions which are accelerated along the human lineage, in particular molecules involved in neurodevelopment.

a