Skip Navigation



MBE Advance Access published online on June 1, 2007

Molecular Biology and Evolution, doi:10.1093/molbev/msm108
This Article
Right arrow Advance Access manuscript (PDF) Freely available
Right arrow Supplementary Material
Right arrowOA All Versions of this Article:
24/8/1792    most recent
msm108v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Google Scholar
Right arrow Articles by Hernandez, R. D.
Right arrow Articles by Bustamante, C. D.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Hernandez, R. D.
Right arrow Articles by Bustamante, C. D.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2007 The Authors.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (
http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.


Research Article

Context Dependence, Ancestral Misidentification, and Spurious Signatures of Natural Selection

Ryan D. Hernandez*, Scott H. Williamson* and Carlos D. Bustamante*

* Biological Statistics and Computational Biology, Cornell University, Ithaca, New York 14850 USA

Corresponding Author: Carlos D. Bustamante, Department of Biological Statistics and Computational Biology, 101 Biotechnology Building, Cornell University, Ithaca, NY 14850, phone (607)255-1640, fax (607)255-4698, cdb28{at}cornell.edu

Received for publication August 16, 2006. Revision received March 15, 2007. Revision received May 7, 2007. Accepted for publication May 11, 2007.

Population genetic analyses often use polymorphism data from one species, and orthologous genomic sequences from closely related outgroup species. These outgroup sequences are frequently used to identify ancestral alleles at segregating sites and to compare the patterns of polymorphism and divergence. Inherent in such studies is the assumption of parsimony, which posits that the ancestral state of each single nucleotide polymorphism (SNP) is the allele that matches the orthologous site in the outgroup sequence, and that all nucleotide substitutions between species have been observed. This study tests the effect of violating the parsimony assumption when mutation rates vary across sites and over time. Using a context-dependent mutation model that accounts for elevated mutation rates at CpG dinucleotides, increased propensity for transitional versus transversional mutations, as well as other directional and contextual mutation biases estimated along the human lineage, we show (using both simulations and a theoretical model) that enough unobserved substitutions could have occurred since the divergence of human and chimpanzee to cause many statistical tests to spuriously reject neutrality. Moreover, using both the chimpanzee and rhesus macaque genomes to parsimoniously identify ancestral states causes a large fraction of the data to be removed while not completely alleviating problem. By constructing a novel model of the context-dependent mutation process, we can correct polymorphism data for the effect of ancestral misidentification using a single outgroup.

Key Words: context-dependence • single nucleotide polymorphism • site-frequency spectrum • ancestral misidentification


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
GeneticsHome page
K. E. Lohmueller, C. D. Bustamante, and A. G. Clark
Methods for Human Demographic Inference Using Haplotype Patterns From Genomewide Single-Nucleotide Polymorphism Data
Genetics, May 1, 2009; 182(1): 217 - 231.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
R. D. Hernandez
A flexible forward simulator for populations subject to selection and demography
Bioinformatics, December 1, 2008; 24(23): 2786 - 2787.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
M. M. Desai and J. B. Plotkin
The Polymorphism Frequency Spectrum of Finitely Many Sites Under Selection
Genetics, December 1, 2008; 180(4): 2175 - 2191.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
J. P. Foxe, V.-u.-N. Dar, H. Zheng, M. Nordborg, B. S. Gaut, and S. I. Wright
Selection on Amino Acid Substitutions in Arabidopsis
Mol. Biol. Evol., July 1, 2008; 25(7): 1375 - 1383.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
S. Casillas, A. Barbadilla, and C. M. Bergman
Purifying Selection Maintains Highly Conserved Noncoding Sequences in Drosophila
Mol. Biol. Evol., October 1, 2007; 24(10): 2222 - 2234.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
R. D. Hernandez, S. H. Williamson, L. Zhu, and C. D. Bustamante
Context-Dependent Mutation Rates May Cause Spurious Signatures of a Fixation Bias Favoring Higher GC-Content in Humans
Mol. Biol. Evol., October 1, 2007; 24(10): 2196 - 2202.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.