MBE Advance Access originally published online on March 14, 2006
Molecular Biology and Evolution 2006 23(6):1101-1103; doi:10.1093/molbev/msk002
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Letter |
Adaptive Protein Evolution and Regulatory Divergence in Drosophila


* Department of Ecology and Evolutionary Biology, University of Arizona;
Department of Plant Sciences, University of Arizona; and
Department of Computer Science, University of Arizona
E-mail: jgood{at}email.arizona.edu.
| Abstract |
|---|
|
|
|---|
Two recent studies demonstrated a positive correlation between divergence in gene expression and protein sequence in Drosophila. This correlation could be driven by positive selection or variation in functional constraint. To distinguish between these alternatives, we compared patterns of molecular evolution for 1,862 genes with two previously reported estimates of expression divergence in Drosophila. We found a slight negative trend (nonsignificant) between positive selection on protein sequence and divergence in expression levels between Drosophila melanogaster and Drosophila simulans. Conversely, shifts in expression patterns during Drosophila development showed a positive association with adaptive protein evolution, though as before the relationship was weak and not significant. Overall, we found no strong evidence for an increase in the incidence of positive selection on protein-coding regions in genes with divergent expression in Drosophila, suggesting that the previously reported positive association between protein and regulatory divergence primarily reflects variation in functional constraint.
Key Words: gene expression positive selection protein evolution
| Introduction |
|---|
|
|
|---|
Although changes in gene regulation and in protein structure have been conventionally treated as independent modes of evolution (King and Wilson 1975
A positive correlation between rates of protein sequence evolution and expression divergence could be driven by positive selection on genes causing both protein structure and regulation to evolve rapidly (Nuzhdin et al. 2004
). An alternative explanation for a coupling between regulatory and structural divergence is variation in purifying selection: genes with less functionally constrained protein sequence may also be less constrained in expression (Castillo-Davis, Hartl, and Achaz 2004
; Lemos et al. 2005
). To distinguish between these alternative hypotheses, we examined the extent to which divergence in gene expression is associated with positive selection on protein sequences in Drosophila. Using genomic data from five closely related species of Drosophila (D. melanogaster, D. simulans, Drosophila yakuba, Drosophila erecta, and Drosophila ananassae), we used a maximum likelihood framework to calculate rates of protein evolution for 1,862 genes and tested for evidence of positive selection (Yang et al. 2000
). We compared these data with two different estimates of divergence in gene expression: (1) divergence in the level of expression between D. melanogaster and D. simulans (Ranz et al. 2003
) and (2) divergence in the pattern of expression during metamorphosis in D. melanogaster, D. simulans, and D. yakuba (Rifkin, Kim, and White 2003
).
We found that genes with divergent expression levels between D. melanogaster and D. simulans have rates of protein evolution (dN/dS) that are similar to genes with no expression divergence (table 1; all comparisons Wilcoxon P > 0.10). However, in all comparisons, genes with divergent expression levels do have higher median dN/dS levels than genes with similar levels of expression, consistent with previous reports of a weak positive correlation between expression divergence and protein evolution (Nuzhdin et al. 2004
; Lemos et al. 2005
). Interestingly, we found that female-biased and male-biased genes evolve at similar rates, in contrast to a previous report that male-biased genes show a fourfold increase in dN/dS (Zhang, Hambuch, and Parsch 2004
).
|
We tested for a positive association between divergence in gene expression level and positive selection on protein sequence and found a slightly higher incidence of positive selection (4.6% under selection) in genes with similar expression levels between D. melanogaster and D. simulans than in genes with divergent expression (3.2% under selection; fig. 1). A negative association was also observed when considering only male-biased genes (6.8% vs. 5.6%) and only female-biased genes (4.0% vs. 3.6%). None of these differences was significant using a Fisher's Exact test; however, in each case, the trend was in the opposite direction of what would be expected if adaptive protein evolution drives the positive relationship between the rate of protein evolution and expression divergence.
|
We also examined expression divergence during metamorphosis in four D. melanogaster lines, one D. simulans, and one D. yakuba (Rifkin, Kim, and White 2003
The lack of a strong relationship between structural and regulatory evolution in our study may reflect noise introduced by combining neutral and adaptive expression divergence. For example, it is possible that positive selection for expression divergence could be most frequent in genes with less functionally constrained protein sequence. Such a scenario could generate a positive correlation between expression and protein sequence divergence but is difficult to evaluate given the current data. The analysis by Rifkin, Kim, and White (2003
and updates) of expression patterns during metamorphosis represents one of the first attempts to classify evolutionary modes of regulatory divergence using genome-level data (see also Khaitovich, Pääbo, and Weiss 2005
) but is not a formal test of positive selection on expression. We combined genes identified to be neutrally evolving with those identified as undergoing lineage-specific selection to form our set of genes with divergent expression during metamorphosis (fig. 1). However, we found no difference in the rate of protein sequence evolution (Wilcoxon P > 0.10) or the frequency of positive selection on protein-coding regions between genes whose expression patterns were classified by Rifkin, Kim, and White (2003)
as evolving due to lineage-specific selection and genes for which neutrality could not be rejected (5.9% vs. 6.3%, fig. S1, see Supplementary Material online).
Overall, these data suggest that positive selection on protein sequence is not strongly coupled with divergence in gene expression in Drosophila (fig. 1). Thus, the correlation between dN and expression divergence previously documented by Nuzhdin et al. (2004)
and Lemos et al. (2005)
is more likely due to differences in purifying selection than in adaptive evolution on both protein structure and gene regulation.
| Methods |
|---|
|
|
|---|
All predicted protein and nucleotide sequences for D. melanogaster were downloaded from the Ensembl database (http://www.ensembl.org/index.html). Orthologous sequences from D. simulans, D. yakuba, D. erecta, and D. ananassae were identified from the Eisen Group Annotations Release 1.0 (February 18, 2005; Venky Iyer, Daniel Pollard, and Michael Eisen) based on a combination of direct pairwise comparisons with D. melanogaster and computational gene prediction. Orthologous amino acid sequences were aligned with ClustalW v1.8 using default parameters (Thompson, Higgins, and Gibson 1994
All analyses on the rate of protein evolution among taxa and tests of positive selection were conducted using the codeml program in the PAML package v3.14 (Yang 1997
). Pairwise estimates of the number of nonsynonymous substitutions per nonsynonymous site (dN) and the number of synonymous substitutions per synonymous site (dS) were calculated using maximum likelihood (Goldman and Yang 1994
). To test for evidence of positive selection in each gene, we fit sequence data from D. melanogaster, D. simulans, D. yakuba, D. erecta, and D. ananassae to two models allowing evolutionary rates (dN/dS) to vary across codon sites (models M7 and M8; Yang et al. 2000
). M7 assumes that dN/dS values vary across codon sites but are functionally constrained between 0 and 1, while M8 allows for an additional category of sites under positive selection (dN/dS > 1). Genes were assumed to be under positive selection if the data fit M8 better with a likelihood ratio test (P < 0.01). We required the dN/dS ratio for positively selected sites in model M8 to be greater than 1.1 (
> 1.1) and the proportion of positively selected sites to exceed 1%. Less conservative criteria produced similar results.
We defined expression divergence according to previously published results. Divergent transcripts between D. melanogaster and D. simulans were defined as genes with nonoverlapping 95% confidence intervals in expression level between species estimated using a Bayesian approach (Ranz et al. 2003
). Rifkin, Kim, and White (2003)
surveyed gene expression in D. melanogaster (four lines), D. simulans, and D. yakuba at two developmental time points. In their experiment, expression divergence was classified into three evolutionary categories based on patterns of variation among the six lineages (see Rifkin, Kim, and White 2003
for details). Briefly, genes were assumed to be evolving due to stabilizing selection if they showed little or no variation among the six lineages (i.e., genes not divergent in expression in our study). Genes showing significant variation among lineages (i.e., genes divergent in expression) were classified as evolving due to lineage-specific selection if they showed insignificant variation within D. melanogaster or neutral if the patterns were consistent with a mutation-drift model. We used an updated version of these data available from the authors at http://genome.med.yale.edu/Comparative/. Estimates of protein evolution and summarized expression data for all 1,862 genes are given in table S1 (see Supplementary Material online).
| Supplementary Materials |
|---|
|
|
|---|
Supplementary fig. S1 and table S1 are available at Molecular Biology and Evolution online (http://www.mbe.oxfordjournals.org/).
| Acknowledgements |
|---|
|
|
|---|
We thank V. Iyer, D. Pollard, and M. Eisen for making their genome annotations publicly available. We are grateful to H. Ochman and the Integrative Graduate Education and Research Traineeship (IGERT) fellows for insightful discussion during the development of this project. The manuscript was improved by comments from M. Dean, H. Ochman, M. Nachman, and two anonymous reviewers. This research was conducted as part of the University of Arizona National Science Foundation IGERT grant Genomics Initiative (DGE-0114420).
| Footnotes |
|---|
Marcy Uyenoyama, Associate Editor
| References |
|---|
|
|
|---|
Berry, A., and M. Kreitman. 1993. Molecular analysis of an allozyme cline: alcohol dehydrogenase in Drosophila melanogaster on the east coast of North America. Genetics 134:869893.[Abstract]
Castillo-Davis, C. I., D. L. Hartl, and G. Achaz. 2004. cis-Regulatory and protein evolution in orthologous and duplicate genes. Genome Res. 14:15301536.
Gilad, Y., S. A. Rifkin, P. Bertone, M. Gerstein, and K. P. White. 2005. Multi-species microarrays reveal the effect of sequence divergence on gene expression profiles. Genome Res. 15:674680.
Goldman, N., and Z. Yang. 1994. A codon-based model of nucleotide substitution for protein-coding DNA sequences. Mol. Biol. Evol. 11:725736.[Abstract]
Khaitovich, P., S. Pääbo, and G. Weiss. 2005. Toward a neutral evolutionary model of gene expression. Genetics 170:929939.
King, M. C., and A. C. Wilson. 1975. Evolution at two levels in humans and chimpanzees. Science 188:107116.
Laurie-Ahlberg, C. C. 1985. Genetic variation affecting the expression of enzyme-coding genes in Drosophila: an evolutionary perspective. Isozymes Curr. Top. Biol. Med. Res. 12:3388.[ISI][Medline]
Lemos, B., B. R. Bettencourt, C. D. Meiklejohn, and D. L. Hartl. 2005. Evolution of proteins and gene expression levels are coupled in Drosophila and are independently associated with mRNA abundance, protein length, and number of protein-protein interactions. Mol. Biol. Evol. 22:13451354.
Nuzhdin, S. V., M. L. Wayne, K. L. Harmon, and L. M. McIntyre. 2004. Common patterns of evolution of gene expression level and protein sequence in Drosophila. Mol. Biol. Evol. 21:13081317.
Odgers, W. A., M. J. Healy, and J. G. Oakeshott. 1995. Nucleotide polymorphism in the 5'-promoter region of esterase-6 in Drosophila melanogaster and its relationship to enzyme activity variation. Genetics 141:215222.[Abstract]
Ranz, J. M., C. I. Castillo-Davis, C. D. Meiklejohn, and D. L. Hartl. 2003. Sex-dependent gene expression and evolution of the Drosophila transcriptome. Science 300:17421745.
Rifkin, S. A., J. Kim, and K. P. White. 2003. Evolution of gene expression in the Drosophila melanogaster subgroup. Nat. Genet. 33:138144.[CrossRef][ISI][Medline]
Thompson, J. D., D. G. Higgins, and T. J. Gibson. 1994. CLUSTALW: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22:46734680.
Wernersson, R., and A. G. Pedersen. 2003. RevTransconstructing alignments of coding DNA from aligned amino acid sequences. Nucleic Acids Res. 31:35373539.
Yang, Z. 1997. PAML: a program package for phylogenetic analysis by maximum likelihood. Comput. Appl. Biosci. 13:555556.
Yang, Z. H., R. Nielsen, N. Goldman, and A. M. K. Pedersen. 2000. Codon substitution models for heterogeneous selection pressure at amino acid sites. Genetics 155:431449.
Zhang, Z., T. M. Hambuch, and J. Parsch. 2004. Molecular evolution of sex-biased genes in Drosophila. Mol. Biol. Evol. 21:21302139.
![]()
CiteULike
Connotea
Del.icio.us What's this?
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
