MBE Advance Access published online on March 5, 2007
Molecular Biology and Evolution, doi:10.1093/molbev/msm042
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
RESEARCH ARTICLE |
Multiple Hypothesis Testing to Detect Lineages Under Positive Selection that Affects Only a Few Sites
Department of Biology, and Centre for Mathematics and Physics in the Life Sciences and Experimental Biology (CoMPLEX), University College London, London, UK
* Address for correspondence: Ziheng Yang, Department of Biologys, University College London, Darwin Building, Gower Street, London WC1E 6BT, UK. Tel: +44 (20) 7679 4379, Fax: +44 (20) 7679 7096. Email: z.yang{at}ucl.ac.uk
Received for publication January 26, 2007. Accepted for publication February 26, 2007.
Detection of positive Darwinian selection has become ever more important with the rapid growth of genomic data sets. Recent branch-site models of codon substitution account for variation of selective pressure over branches on the tree and across sites in the sequence, and provide a means to detect short episodes of molecular adaptation affecting just a few sites. In likelihood ratio tests based on such models, the branches to be tested for positive selection have to be specified a priori. In the absence of a biological hypothesis to designate so-called foreground branches, one may test many branches, but a correction for multiple testing becomes necessary. In this paper, we employ computer simulation to evaluate the performance of six multiple-test correction procedures when the branch-site models are used to test every branch on the phylogeny for positive selection. Four of the methods control the family-wise error rates (FWER) while the other two control the false discovery rate (FDR). We found that all correction procedures achieved acceptable FWER except for extremely divergent sequences and serious model violations, when the test may become unreliable. The power of the test to detect positive selection is influenced by the strength of selection and the sequence divergence, with the highest power observed at intermediate divergences. The four correction procedures that control the FWER had similar power. We recommend Rom's procedure for its slightly higher power, but the simple Bonferroni correction is useable as well. The two correction procedures that control the FDR had slightly more power but also higher FWER. We demonstrate the multiple-test procedures by analyzing gene sequences from the extracellular domain of the cluster of differentiation 2 gene (CD2) from 10 mammalian species. Both our simulation and real data analysis suggest that the multiple-test procedures are useful when multiple branches have to be tested on the same data set.
Key Words: Multiple hypothesis testing family-wise error rate (FWER) false discovery rate (FDR) positive selection branch-site model molecular adaptation
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
S. Fang, C.-T. Ting, C.-R. Lee, K.-H. Chu, C.-C. Wang, and S.-C. Tsaur Molecular Evolution and Functional Diversification of Fatty Acid Desaturases after Recurrent Gene Duplication in Drosophila Mol. Biol. Evol., July 1, 2009; 26(7): 1447 - 1456. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Schneider, A. Souvorov, N. Sabath, G. Landan, G. H. Gonnet, and D. Graur Estimates of Positive Darwinian Selection Are Inflated by Errors in Sequencing, Annotation, and Alignment Gen Biol Evol, June 22, 2009; 2009(0): 114 - 118. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Miyake, N. Takebayashi, and D. E. Wolf Possible Diversifying Selection in the Imprinted Gene, MEDEA, in Arabidopsis Mol. Biol. Evol., April 1, 2009; 26(4): 843 - 857. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Anisimova and C. Kosiol Investigating Protein-Coding Sequence Evolution with Probabilistic Codon Substitution Models Mol. Biol. Evol., February 1, 2009; 26(2): 255 - 271. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Delport, K. Scheffler, and C. Seoighe Models of coding sequence evolution Brief Bioinform, January 1, 2009; 10(1): 97 - 109. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Proux, R. A. Studer, S. Moretti, and M. Robinson-Rechavi Selectome: a database of positive selection Nucleic Acids Res., January 1, 2009; 37(suppl_1): D404 - D407. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Ali and R. Meier Positive Selection in ASPM Is Correlated with Cerebral Cortex Evolution across Primates but Not with Whole-Brain Size Mol. Biol. Evol., November 1, 2008; 25(11): 2247 - 2250. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. A. Studer, S. Penel, L. Duret, and M. Robinson-Rechavi Pervasive positive selection on duplicated and nonduplicated vertebrate protein coding genes Genome Res., September 1, 2008; 18(9): 1393 - 1402. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Yang PAML 4: Phylogenetic Analysis by Maximum Likelihood Mol. Biol. Evol., August 1, 2007; 24(8): 1586 - 1591. [Abstract] [Full Text] [PDF] |
||||




