MBE Advance Access published online on February 24, 2007
Molecular Biology and Evolution, doi:10.1093/molbev/msm035
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Research Article |
The Majority of Recent Short DNA Insertions in the Human Genome are Tandem Duplications
Max-Planck-Institute for Molecular Genetics, Ihnestr. 63-73, 14195 Berlin, Germany
corresponding author: Philipp W. Messer, Max-Planck-Institute for Molecular Genetics, Ihnestr. 63-73, 14195 Berlin, Germany, tel: 30-8413-1161, fax: 30-8413-1152, email: philipp.messer{at}molgen.mpg.de
Received for publication December 7, 2006. Revision received February 13, 2007. Accepted for publication February 20, 2007.
Nucleotide substitutions, insertions, and deletions constitute the principal molecular mechanisms generating genetic variation on small length scales. In contrast to substitutions, the nature of short DNA insertions and deletions (indels) is far less understood. With the recent availability of whole genome multiple alignments between human and other primates detailed investigations on indel characteristics and origin have come within reach. Here we show that the majority of short (1-100 bp) DNA insertions in the human lineage are tandem duplications of directly adjacent sequence segments with conserved polarity. Indels in microsatellites comprise only a small fraction. The underlying molecular processes generating indels do not necessarily rely on the presence of preexisting duplicates, as would be expected for unequal crossing over, as well as replication slippage. Instead, our findings point towards a non recombination-mediated mechanism preferentially occurring in the male germline. Surprisingly, non-frameshifting tandem duplications and deletions in coding regions still occur at approximately 50% of their genomic background rates. As is already well established in the context of gene and segmental duplications, our results demonstrate that duplications are also likely to constitute the predominant process for rapid generation of new genetic material and function on smaller scales.
Key Words: insertions deletions human evolution duplications