MBE Advance Access originally published online on June 2, 2006
Molecular Biology and Evolution 2006 23(8):1465-1479; doi:10.1093/molbev/msl025
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Review |
Duplication and Divergence of 2 Distinct Pancreatic Ribonuclease Genes in Leaf-Eating African and Asian Colobine Monkeys



* Department of Biological Sciences, University at Albany, State University of New York;
Forensic Science Laboratory, Department of Public Safety, Meriden, Connecticut;
Canada's Michael Smith Genome Science Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada; and
Pathology Department, Columbia University
E-mail: john.schienman{at}uconn.edu.
| Abstract |
|---|
|
|
|---|
Unique among primates, the colobine monkeys have adapted to a predominantly leaf-eating diet by evolving a foregut that utilizes bacterial fermentation to breakdown and absorb nutrients from such a food source. It has been hypothesized that pancreatic ribonuclease (pRNase) has been recruited to perform a role as a digestive enzyme in foregut fermenters, such as artiodactyl ruminants and the colobines. We present molecular analyses of 23 pRNase gene sequences generated from 8 primate taxa, including 2 African and 2 Asian colobine species. The pRNase gene is single copy in all noncolobine primate species assayed but has duplicated more than once in both the African and Asian colobine monkeys. Phylogenetic reconstructions show that the pRNase-coding and noncoding regions are under different evolutionary constraints, with high levels of concerted evolution among gene duplicates occurring predominantly in the noncoding regions. Our data suggest that 2 functionally distinct pRNases have been selected for in the colobine monkeys, with one group adapting to the role of a digestive enzyme by evolving at an increased rate with loss of positive charge, namely arginine residues. Conclusions relating our data to general hypotheses of evolution following gene duplication are discussed.
Key Words: positive selection RNASE A foregut fermentation concerted evolution
| Introduction |
|---|
|
|
|---|
Although conservation of orthologous gene sequences between divergent species suggests that there is an important, conserved function of its protein product (negative selection), rapid evolution of orthologous or paralogous sequences on certain lineages can suggest selection of an equally important, novel function in adaptation to a new environment (positive selection). The case for positive Darwinian selection becomes stronger when distantly related species share that new environment and convergently share similar changes in the orthologous sequence, whereas more closely related lineages share neither. A classic example of this is seen when comparing lysozyme c (stomach lysozyme) between foregut-fermenting and nonforegut-fermenting mammals (Stewart and Wilson 1987
Another enzyme that appears to have been recruited to a new function in foregut-fermenting mammals is pancreatic ribonuclease (pRNase) (Barnard 1969
). RNase 1, more commonly known as pRNase (due to its high levels in ruminant pancreas), appears to serve a role as a digestive enzyme, but its role in the guts of nonforegut-fermenting mammals is unknown. In the ruminants, bacterial RNAs must be digested in the small intestine to release the phosphorus and nitrogen necessary for the dietary needs of the host animal as well as recycling to the microbial fauna in the foregut (Barnard 1969
). Artiodactyl ruminants have as much as 151,200 times the amount of pRNase (ug/g) in their pancreas relative to most other nonforegut-fermenting mammals (Barnard 1969
; Beintema et al. 1973
). Similarly, langur monkey has an increase of approximately 60-fold of pRNase relative to human (Beintema 1990
). Through isolation and sequencing of the pRNase protein directly from langur pancreas, Beintema (1990)
discovered that the langur enzyme has less positive charge, one less potential glycosylation site, and much less glycosylation relative to the human sequence. These findings supported the hypothesis that species with ruminant-like digestion would have less positive charge and less attached carbohydrate (Beintema et al. 1976
; Libonati et al. 1976
; Beintema et al. 1988
) and suggested that pRNase had been recruited in colobine monkeys to perform a similar digestive function to that in artiodactyl ruminants (Beintema 1990
).
This enzyme is a member of the RNase A superfamily, a group of homologous proteins that have been isolated from many vertebrate taxa including, mammals, birds, reptiles, and amphibians (Barnard 1969
; Beintema et al. 1973
). All presumably evolved from a common ancestral gene but have diverged and acquired novel functions. For example, RNases 2 and 3, also known as eosinophil-derived neurotoxin (EDN) and eosinophil cationic protein (ECP), are present in the large specific granules of eosinophilic leukocytes (Snyder and Gleich 1997
) and are transcribed from genes that are thought to be derived from a single ancestral gene that duplicated on the Old World monkey lineage after the divergence of the Old World monkeys from the New World monkeys (Rosenberg et al. 1995
). EDN has a high RNase activity, nonphysiological neurotoxicity, and probably functions as an antiviral agent by destroying genomic RNA of retroviruses (Domachowske and Rosenberg 1997
). In contrast, ECP has very low RNase activity but is toxic to various pathogens by making pores in their cellular membranes (Rosenberg and Dyer 1995
). Similarly, bovine RNase 1 (pRNase) has duplicated twice in some artiodactyl ruminants to generate 2 novel proteins, bovine seminal and brain ribonucleases (D'Alessio et al. 1972
; Suzuki et al. 1987
; Watanabe et al. 1988
; D'Alessio et al. 1991
; Sasso et al. 1991
; Breukelman et al. 1993
). Interestingly, seminal ribonuclease forms a protein dimer and is not bound nor inhibited by ribonuclease inhibitor protein in this form (Murthy and Sirdeshmukh 1992
), although the function of either the seminal or brain RNase is unknown. Another, more extreme, example of disparate function in this superfamily is the comparison of mammalian RNase 5 (angiogenin), which is involved in tumorogenesis, with amphibian onconase, which has specific antitumor activity. Clearly, duplication, divergence, and acquiring novel function is recurrent in the RNase A superfamily.
Evolution of foregut fermentation in artiodactyl ruminants probably occurred between 55 and 60 MYA. In these species, levels of stomach lysozyme and pRNase are much increased in comparison with their nonforegut relatives such as horse and pig (fig. 1A). Colobine monkeys have also evolved foregut fermentation and, similarly, have increased levels of these 2 enzymes in their guts when compared with the other primate groups (fig. 1). However, this adaptation in colobine monkeys has occurred relatively recently (ca. 15 MYA) and is thus a model system to detect positive selection in enzymes that have been recruited to serve a role in this system (fig. 1B) (Stewart and Wilson 1987
; Stewart et al. 1987
). There are 2 main groups of colobine monkeys (the African and the Asian), which diverged approximately 1011 MYA. For this study, 2 species from each continent, representing deep lineages, were chosen for pRNase sequencing. The black-and-white colobus and red colobus species from Africa diverged approximately 89 MYA. Similarly, the langur and proboscis monkey species from Asia diverged at approximately that same time.
|
For several years, our lab has been tracing the evolutionary history of pRNase in the primates (Lee 1993
| Materials and Methods |
|---|
|
|
|---|
Tissues, DNA Sources, and DNA Isolation
Tissues for DNA extractions used in this study were from a variety of sources (animal's ID number or name included if applicable): orangutan (Pongo pygmaeus), YN93-312, liver, Yerkes Regional Primate Research Center, H. McClure; baboon (Papio hamadryas), 821222, liver, Washington Regional PRC; patas monkey (Erythrocebus patas), R229, testicle, New York University, T. Disotell; Guyanese squirrel monkey (Saimiri sciureus sciureus), 4530, liver, University of South Alabama, K. Gibson; black-and-white colobus (Colobus guerza), Flat-top, liver, Houston Zoological Gardens, B. Lester; red colobus (Piliocolobus badius), liver, Louisiana State University, P. Marx and P. Telfer; Hanuman langur (Semnopithicus entellus), Coast, heart, University of California, Berkeley, P. Dolhinow; proboscis monkey (Nasalis larvatus), 861205, liver, Bronx Zoo, G. Amato. The tissues from all species, except red colobus, were from captive animals in the United States; red colobus tissue was imported by P. Marx under his CITES and other permits. The amount of red colobus tissue we obtained was insufficient for preparation of DNA for blot analysis. Human DNA was from an unidentified Caucasian male. Genomic DNA was isolated either using a phenol/chloroform extraction and ethanol precipitation protocol (Strauss 1998
DNA Sequencing
In order to generate DNA sequence for the pRNase gene from a variety of primate species, the strategy of polymerase chain reaction (PCR) from genomic DNA isolated from individuals for these species using primers designed based on the human pRNase gene was employed. This project was initiated before the Human Genome was publicaly available; thus, to obtain the human DNA sequence upstream of the pRNase gene, a Human GenomeWalker Kit (Clontech, Mountain View, CA, cat # K1803-1) was used according to the manufacturer's instructions. To obtain these sequences from the other species that have a single copy of the pRNase gene, an upstream oligo primer, RH43 (5'-GCTGAAACAATCTGAATACAGAGC-3'), was designed and used with RH27 or RH28 to produce amplicons of approximately 2.5 kb; these amplicons were sequenced directly. To obtain gene-specific sequences for direct sequencing from the colobine species, oligo primer, RH43, was used with downstream, gene-specific oligo primers for each unique pRNase sequence that was generated.
PCR amplicons were generated from 1 µg of genomic DNA of a single individual using the oligonucleotide primers JS1 (5'-GATGAGCCATTTGAGCAC-3') or JS2 (5'-GTTCTTTTTCTAAGAGGGCTC-3') and RH28 (5'-GCCAGAAACAGTACCTAG-3') or RH27 (5'-GCTTCAGAAATCAGGTGTTTGCA-3') in a 50-µl reaction containing 1x PCR Buffer II (50 mM KCl, 10 mM Tris-HCl, pH 8.3), 3 or 4 mM MgCl2, 200 µM deoxynucleoside triphosphates, 0.2 µM each primer, and 12 units of Taq Polymerase (Roche Diagnostics, Branchburg, NJ). Cycling conditions were 95 °C for 1 min, 60 °C for 1 min, then 72 °C for 1 min, and 20 s for 30 cycles. Amplicons were sequenced directly using primers JS1, JS2, MA2, NewN, NewC, RH27, and/or RH28 and a BigDye Terminator DNA Sequencing Kit (Version 1) according to the manufacturer's instructions (Applied BioSystems, Foster City, CA, cat # 4303149). For sequencing from species that generated multiple gene sequences, the PCR amplicons were cloned using a TOPO TA Cloning Kit (Invitrogen, Carlsbad, CA, cat # K4500-01), and plasmid DNA was isolated using a QIAprep Spin Miniprep Kit (Qiagen Inc., cat # 27106). Sequence was generated from 20 to 30 clones and analyzed by creating a multiple sequence alignment using the software SeqEd version 1.0.3 (Applied Biosystems). Based on the clone sequences from a given individual, primers were designed with 3' ends specific to individual genes or alleles using the differences observed between gene sequences. These gene/allele-specific primers were used to generate amplicons from genomic DNA and to directly sequence these amplicons to determine if sequences that were rare among the clones were real or PCR artifacts. The sequences of all oligo primers used for PCR and/or complete double-stranded coverage sequencing, and for which gene sequences they were designed, are available on request. The 23-pRNase sequences generated and analyzed for this study can be found in the GenBank database by the accession numbers DQ494857DQ494879.
Genomic Blot Analyses
Purified genomic DNAs were quantitated by comparison with a known amount of human genomic DNA (Clontech) after electrophoresis in an ethidium bromidestained 0.7% agarose gel. Approximately 10 µg of DNA was digested with restriction enzymes KpnI, and/or PstI according to manufacturer's instructions (Fermentas, Hanover, MD), and electrophoresed in a 0.55% agarose gel at 3.5 V/cm for 20 h. The DNA in the gel was blotted onto a Zeta-Probe GT blotting membrane (Bio-Rad Laboratories, Hercules, CA). The membrane was hybridized with either a 5'- or 3'-pRNase probe (see below) and was washed according to the manufacturer's instructions (with HCl depurination).
For the 5'-pRNase probe, the primers JS1 and JS11 were used to PCR amplify a 569-bp fragment of the pRNase gene (see fig. 2A) from black-and-white colobus (see below). Similarly, for the 3'-pRNase probe, the primers RH24 and RH27 were used to amplify a 323-bp fragment (see fig. 2A). These fragments were gel purified, diluted 1,000-fold, and reamplified twice. Each fragment was labeled with [
-32P] deoxycytidine triphosphate and deoxyadenosine triphosphate using a High Prime DNA Labeling Kit (Roche Diagnostics).
|
Sequence Alignments and Phylogenetic Analyses
DNA sequences were aligned using ClustalX freeware (Thompson et al. 1997
|
| Results and Discussion |
|---|
|
|
|---|
Determination of Gene Copy Number by Genomic Blot Hybridization
As an alternative method to determine pRNase gene copy number in the colobines relative to noncolobine primates, several genomic blots were generated and hybridized with probes specific to various regions of the gene (fig. 2 and data not shown). Figure 2A presents a schematic of the primate pRNase gene structure, which consists of 2 exons separated by an intron ranging from 702 to 709 bp (depending on species and allele or paralogue). The total gene length, from transcriptional start site to the polyadenylation signal, consists of approximately 1500 bp. The 5' untranslated region (UTR) is split between exons 1 and 2. The second exon also contains the entire coding region for the signal peptide and the mature protein, as well as the 3' UTR. Genomic blots were created from DNAs isolated from the same individuals used to generate the pRNase gene sequences for the various species. Prior to electrophoresis and blotting, the DNA had been digested with 2 enzymes in single and double enzyme digestion combinations. See figure 2A for the approximate location of the cutting sites for the restriction enzymes KpnI and PstI within the pRNase gene sequences. Several identical blotted membranes were made and then hybridized with either a probe specific for the 5'-coding and intron portion of the gene ("5' probe," in fig. 2), the 3'-coding portion ("3'probe"), or the intron portion only ("intron probe," data not shown). Figure 2B shows the blot and hybridization results for the orangutan, black-and-white colobus, and Hanuman langur individuals using the 5' probe (left) and the 3' probe (right).
The pRNase genomic blots from orangutan (fig. 2B), as well as those from human and baboon (data not shown), show a single hybridizing fragment for both the 5'- and 3' probes for DNA digested with these enzymes. This confirms our PCR sequencing results and previous blot experiments using human and mouse genomic DNAs done by others (Breukelman et al. 1993
) that suggest there is a single gene in most mammals, with the notable exception of the artiodactyl ruminants. The presence of a processed pseudogene in the Asian colobines is suggested by the blot data because there are fragments that hybridize to the 5'- and 3' probes but not the intron probe (* in fig. 2B; plus data not shown). The blot data for the digestions of these colobine individuals are consistent with the minimum number of paralogous pRNase genes as determined by PCR sequencing, with the presence of 2 hybridizing fragments in the black-and-white colobus and 3 in the Hanuman langur. The KpnI/PstI double digestion of the langur DNA shows 4 hybridized fragments. The 2 middle-sized fragments do not hybridize to the intron probe and most likely represent the 2 pseudogene alleles, Langur-
a and
b (see gene name designations in next section). For these 2 alleles, the first KpnI site located downstream of the gene's 3' end must be a slightly different base pair distance from the upstream PstI site. Proboscis monkey genomic DNA had a pattern of hybridization similar to that of langur (not shown). Not enough genomic DNA was available for preparation of blots for the red colobus individual, so a comparison of blot hybridization patterns between red colobus and the other colobine species could not be made.
Sequencing of Pancreatic Ribonuclease Genes
To determine pRNase gene sequences as well as gene copy number, one primer pair, JS1 and RH27, was used to generate PCR products for direct sequencing from DNA of a single individual for all species tested. Direct sequencing was performed with these primers, as well as with primers JS2 and MA2. Direct sequencing from the human, orangutan, and baboon PCR products produced a single, clean gene sequence (data not shown). However, the 4 colobine species (black-and-white colobus, red colobus, Hanuman langur, and proboscis monkey) all produced "dirty" sequence; that is, more than one nucleotide peak was observed at multiple base positions throughout the sequencing electropherograms, which is suggestive of more than one allele or pRNase gene sequence being present in that individual.
To determine each unique pRNase gene sequence, PCR products from each of these 4 species were cloned and between 20 and 30 clones were sequenced from each. Additionally, a second primer pair, NewN and NewC, was used to generate PCR products that were subsequently cloned and sequenced. When analyzing the aligned sequence data from all the clones (data not shown), a point mutation contained in one clone only was considered to be an error generated either by the Taq DNA polymerase (Roche) during the PCR or the bacterial cloning process and not considered a real gene or allele sequence. In addition, there existed sequences in singular clones that appeared to be a mosaic or hybrid of 2 of the other unique sequences represented by multiple clones. There are at least 2 ways that mosaic or hybrid sequences could be generated as artifacts during PCR when multiple homologous gene sequences are amplified by a single set of primers. One way is through crossover PCR (Meyerhans et al. 1990
). A second way is through mismatch annealing of a complete Watson strand of one gene sequence with a complete Crick strand of another, followed by unbiased mismatch repair in the Escherichia coli host on transformation. To determine which cloned sequences might be such PCR artifacts, specific primers were designed to regions of closely spaced nucleotide differences in the sequences represented most frequently among the initial clones. PCR products were then generated again from genomic DNA using these specific primers and directly sequenced. If multiple products were present in direct sequencing, then these products were cloned and sequenced. This process was repeated until single "clean" gene sequence could be obtained directly from PCR products generated from genomic DNA.
From the above procedures, 3 unique pRNase gene sequences were produced from a single black-and-white colobus individual and 5 unique sequences each were produced from single individuals of red colobus, Hanuman langur, and proboscis monkey. The letter and number designations of each sequence name were decided based on several criteria. First, the amino acid that is present at 3 specific positions in the translated pRNase sequences of this study can define 2 mutually exclusive groups (positions 1, 6, and 39 of the mature protein). For one of these groups, 2 of these 3 amino acid residues match the human sequence that is known to be expressed in relatively low or moderate levels in numerous human tissues, including pancreas (numerous = N type). Second is the knowledge that one of the langur gene sequences, which is part of the second group defined above (i.e., does not match the human sequence when translated), corresponds to the single ribonuclease protein sequenced from an individual langur pancreas (Beintema 1990
). This along with our reverse transcriptasePCR experiments on the black-and-white colobus sequences (data not shown) suggest that this second group of colobine sequences are the gene sequences expressed predominantly in colobine pancreas (predominantly pancreatic = P type). Third is the fact that 2 of the gene sequences in langur and one in proboscis monkey are processed, retroposed pseudogenes (pseudogene =
type, see below). Lastly is the assumption of a simple model of gene duplication of pRNase in the colobine species (See "Evolutionary Tree Inference section" and fig. 3). The number of distinct gene sequences generated from single individuals and the genomic blot hybridization results suggest a minimum of 2 pRNase genes in black-and-white colobus and a minimum of 3 pRNase genes in the other 3 colobine species studied here. The 3 gene sequences for black-and-white colobus were thus named B&WCol-Pa, B&WCol-Pb, and B&WCol-N. The 5 gene sequences of red colobus were named RedCol-P1a, RedCol-P1b, RedCol-P2a, RedCol-P2b, and RedCol-N. The 2 Asian species (langur and proboscis) share a gene duplication event not found in the African colobus monkeys. As mentioned above, langur and proboscis monkey share unique gene sequences in that these sequences appear to be processed, retroposed pseudogenes. They lack the intronic sequence found in other pRNase genes, have the remnants of a polyA tail, and are flanked by direct repeats. The 5 gene sequences of Hanuman langur and proboscis monkey were thus named Langur-P, Langur-Na, Langur-Nb, Langur-
a, and Langur-
b and Proboscis-Pa, Proboscis-Pb, Proboscis-Na, Proboscis-Nb, and Proboscis-
, respectively.
|
Evolutionary Tree Inferences of the Primate pRNase Gene Sequences
The species chosen for this study have a well-established phylogeny with the black-and-white colobus and red colobus species belonging to an African colobine clade, and the Hanuman langur and proboscis monkey species belong to an Asian colobine clade. All 4 species represent deep lineages in their respective clades. This known species tree allowed us to subject the pRNase sequences to phylogeny-based analyses to help infer the orthologous and paralogous relationships between the genes. Under ideal circumstances, orthologous gene trees should recreate the species tree. Ideal circumstances include higher levels of phylogenetic signal than noise (e.g., homoplasy) and lack of gene conversion or unequal crossover exchange between paralogues. The expectation would be the following order of relatedness of the colobine gene sequences. Allelic sequences should be the most similar, followed by orthologues within the African or Asian colobines, then orthologues between the African and Asian colobines, and finally paralogues that are the result of gene duplication that occurred before the species divergence.
Two trees depicting inferred phylogenetic relationships between the gene sequences are presented in figure 3. The maximum likelihood tree was generated by using approaches available in PAUP*, see figure legend (Swofford 1998
). The second tree represents our "proposed biological" tree based on the known phylogenetic relationships of the colobines and the reasoning used to name the different gene sequences discussed earlier. Trees were also generated using parsimony and distance methods (data not shown). The computationally derived trees have different topologies relative to each other, as well as to the proposed biological tree. The discrepancies between these trees become less of a concern when considering the fact that the supporting bootstrap values for many of the clades are quite low, in many cases well below 50%, suggesting that the ideal circumstances discussed above are not being met. Possible reasons for this will be discussed below.
To test if any of these trees were significantly different from each other with respect to the sequence data, a ShimodairaHasegawa test (one tailed) with 1,000 bootstrap replicates under full optimization was performed on 6 tree topologies. In addition to the 2 tree topologies shown in figure 3, trees generated by parsimony, distance, distance using a maximum likelihood to generate DNA distances, and Bayesian maximum likelihood methods (not shown) were also tested. Parameters for the base frequencies, substitution rate matrix, proportion of invariable sites, and gamma shape of the frequency of substitution at variable sites were estimated for each tree. In testing the null hypothesis that the trees are not significantly different statistically, none of the trees were found to be significantly better than the others. The tree topology with the best log-likelihood score was the one generated by maximum likelihood (fig. 3A) using full estimation of the parameters listed above. The differences in log-likelihood scores from this tree ranged from 1.01 (P = 0.723) for the Bayesian maximum likelihood tree to 7.62 (P = 0.283) for the distance tree using maximum likelihood for DNA distance (significance when P < 0.05). The proposed biological tree had a difference in log-likelihood of 4.91 (P = 0.461) from the best tree and thus cannot be ruled out by this test.
For any of the computationally derived trees to be a true representation of the evolutionary history of these colobine pRNase gene sequences, a specific number of gene duplications and deletions are required at nodes throughout each tree (e.g., fig. 3A). The phylogeny generated by parsimony requires that 5 independent duplications and 5 independent deletions of the genes from the various genomes would have had to occur in the colobine lineage. Likewise, the distance- and maximum likelihoodbased phylogenies require 7 and 5 gene duplications, respectively and 10 and 5 gene deletions, respectively. In contrast, the proposed biological phylogeny requires only 3 gene duplications and no gene deletion events. Thus, the proposed biological tree is the most parsimonious with respect to gene duplications and deletions and is consistent with the genomic blots. Lack of high bootstrap support could be in part due to the short time intervals of some nodes (24 Myr). Other reasons include gene conversion, concerted evolution, and selection on coding regions.
Nonindependent Evolution of Colobine pRNase Gene Duplicates
There are several possible reasons for these low support values and the different tree topologies generated. First, gene conversion could have occurred between pRNase sequences making a gene sequence in one species appear as a mosaic of paralogous and orthologous sequences relative to another species' sequence. Supporting evidence for such gene conversion occurs when examining the B&WCol-Pa allele and B&WCol-N gene sequence; these genes share a single nucleotide deletion 5 bp before the normal stop codon that is not shared with the B&WCol-Pb allele. This substitution creates a premature stop codon and a resulting protein that would be 2 amino acids shorter than the other primate ribonucleases. If the Pa and N sequences from this individual are not alleles of the same genetic locus, it is very unlikely that they would have independently deleted the same critical nucleotide. But, by pairwise nucleotide difference, the Pa and Pb gene sequences are 3-fold more similar to each other than to the N sequence (0.9% vs. 2.83.2%, respectively), suggesting that the Pa and Pb sequences are most likely alleles (or the result of a more recent duplication event), and if so, the Pa and N sequences cannot be allelic. Alternatively, unequal crossover or exchange could be occurring if the gene duplicates are linked relatively closely on a chromosome. Unequal alignment of sister or homologous chromatids during meiosis, followed by a crossover event and fixation, could delete and reduplicate different gene sequences in the different species. This could create the situation where certain pRNase orthologues may no longer exist in every colobine species. Of course, these 2 possibilities are not mutually exclusive and could have both occurred in each species.
If a gene duplication event occurs before speciation and the duplicates are evolving independently and at relatively equal rates, then the orthologues between species should be more similar at the nucleotide level than paralogues within a species. Additionally, if multiple species share a duplicated gene, it suggests that the duplication event preceded the speciation event. Contrary to this, if biased gene conversion and/or unequal crossover events are occurring between the paralogues within species, then these duplicates are not evolving independently. In the extreme case of high levels of recombination, the duplicates within a species are more similar to each other than to their orthologues in the other species. This phenomenon is known as "concerted evolution" (Zimmer et al. 1980
; Coen et al. 1982
). In our data, the level of pairwise nucleotide divergence between apparent paralogous coding sequences (within one species) is similar to that observed between some of the apparent orthologous coding sequences (e.g., RedCol-P2a to RedCol-N, 5.3% compared with RedCol-P2a to B&WCol-Pb, also 5.3%). This suggests that the duplicates within a species are not evolving independently and the possibility that relatively high levels of recombination might be occurring between paralogous pRNase genes. One caveat to this possibility is that positive selection, causing rapid evolution of the pancreatic-type coding sequences, could be influencing these levels of divergence, that is, the duplicates are not evolving at relatively equal rates.
To further test this hypothesis on sequences that should have little or no selection acting on them, 1,744 bp of noncoding contiguous DNA, consisting of upstream 5' untranslated, intronic, and 3' UTRs, were sequenced for 13 of the pRNase genes and subjected to phylogenetic analyses. Figure 4 presents phylograms obtained from maximum likelihood analysis performed on the noncoding and coding sequences for the 13 taxa. In support of the idea that concerted evolution is occurring between the pRNase duplicates, the noncoding sequences of duplicates within a species are more closely related to each other than to their orthologues, with high bootstrap support. ShimodairaHasegawa tests were performed, as before, with the coding and, separately, the noncoding sequence data and these 2 tree topologies, as well as the proposed biological topology, for the 13 sequences only. The noncoding sequences were consistent only with the topology generated for them in maximum likelihood (P < 0.0005 for the other 2 topologies). For the coding sequences, none of the 3 topologies were significantly better than the others (proposed biological topology, P = 0.46; noncoding topology, P = 0.16), but clearly, the coding data are more consistent with the former than the latter. To test if these results were simply due to the disparity in the length of sequences analyzed (1,744 bp of noncoding vs. 468 bp of coding), the 1,744 bp of noncoding sequence was divided into smaller subsections and reanalyzed as above (data not shown). The 1,744 bp of noncoding sequence was divided up in several different ways before analysis. First, the noncoding sequence was divided into the first 540 bp, the second 540 bp, and the final 664 bp. Second, just the 884 bp of intron and 3'UTR were analyzed. Lastly, this 884 bp was divided into 2 approximately equal halves and analyzed. Although there was somewhat lower bootstrap support for some of the clades, neither were there significant differences in the best tree topologies nor were any of the subdivided noncoding data consistent with the proposed biological or coding sequence tree topology. These results strongly suggest that the intron and other noncoding sequences adjacent to the pRNase-coding sequence are being highly homogenized between duplicates within each of the 4 colobine species; thus, the pRNase gene duplicates are not evolving totally independently.
|
This nonindependent evolution of apparent paralogous sequences within each of these species could be due to biased gene conversion or the occurrence of independent duplications and deletions after speciation. Discerning between these 2 mechanisms is difficult, because both could be occurring to some degree. However, if either mechanism was predominant, different predictions would be made with respect to shared amino acid replacements between paralogues within and orthologues between each species. Independent duplications and deletions through unequal exchange occur after speciation; thus, this scenario would predict that paralogues within a species would share neutral amino acid replacements not seen in the proteins of the other species. Conversely, if the duplication event occurred before speciation and biased gene conversion was the predominant mechanism of nonindependent evolution, then there should be a significant number of shared amino acid replacements between apparent orthologues that arose before the speciation events.
Amino Acid Replacements and Selection of Colobine pRNase
This can be tested with our data by generating phylogenetic reconstructions of the most parsimonious amino acid replacements with respect to specific lineages or branches in a tree relating the sequences. We performed this test on both the proposed biological tree configuration (see fig. 3B) and a tree configuration that can be best explained by postspeciation duplications and deletions (see fig. 4B) using the software package MacClade 4 (Maddison DR and Maddison WP 2000
) and a PROTPARS matrix (Felsenstein 1993
) to translate the gene sequences (data not shown). The results show that for the recent duplication and deletion tree configuration, the majority of amino acid replacements map to the terminal branches of the P-type sequences with few, if any, replacements being shared uniquely by paralogues within a species. This would require many independent parallel changes in the P-type sequences. On the other hand, when the replacements are mapped onto the proposed biological configuration, the amino acid replacements are more evenly distributed throughout the branches leading from the putative, original duplication event before speciation to the terminal branches of the P-type sequences. Additionally, the tree configuration representing independent duplications and deletions requires a tree length of 92 amino acid replacements, whereas the proposed biological tree configuration requires only 83 replacements. These results further support the proposed biological tree as the true representation of the evolutionary history of the pRNase gene sequences and strongly suggest that biased gene conversion is the predominant mechanism for the nonindependent evolution of paralogues within each species.
Also of noteand consistent with the hypothesis of intergenic recombinationis the fact that the branch lengths of the noncoding sequences are relatively equal to each other, that is, these regions are evolving at similar rates. In comparison, the branch lengths for the coding sequences are not equal. From the point of the first duplication, the P-labeled (P-type) DNA sequences are evolving 2.83 (Langur-P/Langur-Na) to 5.97 (B&WCol-Pb/B&WCol-N) times faster than the N-labeled (N-type) sequences within the species. To compare branch lengths between coding and the adjacent noncoding sequence for each gene, the coding sequences were mapped onto the tree topology obtained from the 13 noncoding sequences (data not shown). Interestingly, when this is done and the branch lengths compared, the P-type coding sequences are evolving 1.713.44 times faster than their corresponding, adjacent noncoding sequences, whereas the N-type coding sequences are evolving slightly slower than the corresponding noncoding sequences (0.420.93). If it is assumed that the majority of each noncoding sequence is evolving at the neutral rate, then the P-type sequences are evolving 23 times faster than the neutral rate, suggesting the possibility that they have been positively selected (provided that the majority of change is not just at synonymous sites), against a background of strong homogenization due to intergenic recombination.
To more rigorously test for selection on these genes, rates of nonsynonymous and synonymous substitution (abbreviated dN for number of nonsynonymous changes per number of nonsynonymous sites and dS for number of synonymous changes per number of synonymous sites in the DNA sequence) were first calculated using pairwise comparisons performed in the FENS program (APJ de Koning and CB Stewart, program available from the authors). A dN/dS < 1 implies negative or purifying selection, dN/dS = 1 implies neutral or no selection, and dN/dS > 1 implies positive or adaptive selection. Although all P-type sequences had a dN/dS > 1, few were found to be significant at less than the 5% level (using the statistical z-tests performed by FENS), and significance level was dependent on the pairwise comparison and/or the dN/dS method used. Several of the N-type sequence comparisons had dN/dS > 1 as well, but these were always less than the P-type sequences in the same species. For example, using Ina's method (Ina 1995
), Proboscis-Na and -Nb had a dN/dS of 1.66 and 1.49 relative to 2.15 and 2.41 for Proboscis-Pa and -Pb, respectively. The number of nonsynonymous substitutions for the P-type sequences (ranging from 1723,
= 20.4) was always between 22.5 times higher than that of the N-type sequences (ranging from 8 to 10,
= 9), but the synonymous rate for the P-types was higher as well (ranging from 35,
= 4 compared with 26,
= 3).
In the previous analysis, individual pairs of sequences are compared and statistically analyzed as if each pair is independent of the others, when in reality they are not. To determine and compare dN/dS ratios in a simultaneous manner for a large group of sequences from many species, an alternative approach was performed using lineage-specific likelihood ratio tests for positive selection against the null hypothesis of dN/dS = 1 (Yang 1998
). Using this method with the nucleotide sequences and the tree topology shown in figure 6, 2 of the 3 P-type lineages were significant with respect to positive selection (RedCol-P1 and -P2 lineage, P = 0.015 and the Langur-P, Proboscis-Pa, and -Pb lineage, P = 0.088). The B&WCol-Pa and -Pb lineage showed no significance (P = 0.548). This lineage did have a relatively high number of nonsynonymous substitutions (7.3) but also had 1.8 synonymous substitutions, whereas the other 2 lineages mentioned above had zero. These analyses suggest that positive selection is occurring on at least some of the P-type sequences at the nucleotide level due to pressure for adaptive change at the protein sequence level in response to a new environment or function. We next examined whether there are specific amino acid residue changes shared uniquely by the P-type proteins, suggesting that those residues are important with respect to such adaptive change.
Loss of Arginines in the Highly Expressed pRNase Genes
To visualize the differences at the protein level between the DNA sequences generated in this study and which amino acid residue positions might have functional importance, the DNA sequences were translated to protein sequences and aligned (fig. 5). The inferred ancestral primate protein sequence is presented at the top and the bovine pRNase sequence at the bottom for comparison. The 8 cysteines, which are well known to stabilize the three-dimensional structure of ribonuclease through disulfide bridges (Klink et al. 2000
), as well as the active site lysine and 2 histidine residues, are conserved among almost all of the sequences. The one exception is the Langur-
a sequence, which would have a threonine substituted for the active site lysine-41, if it were actually transcribed and translated; this provides further evidence that this gene is, as it otherwise appears to be, a retroposed processed pseudogene. RNase A enzymes are known to bind 3 adjacent bases of a ribonucleotide chain in the active site cleft (Cuchillo et al. 1997
). In addition to the cysteine and active site residues, there are 13 other residues thought to help stabilize the binding of the ribonucleotide chain by interacting with the 3 nitrogenous bases as well as the 3 phosphate groups (see Nogues et al. 1998
for review). The majority of these residues are also conserved in the sequences presented here, but there are some exceptions. The Langur-Na sequence has a glycine-10 for arginine-10 substitution. This is one of the 2 residues thought to stabilize binding of the substrate at the phosphate group linking the second and third bound bases of the RNA molecule being cleaved (Richardson et al. 1990
; Boix et al. 1994
) and allows the enzyme to have endonuclease as opposed to exonuclease activity (Moussaoui et al. 1996
). Lysine-66, thought to stabilize the binding through the 5' phosphate of the first bound base (Matthew and Richards 1982
; Brunger et al. 1985
), has been substituted by glutamic acid in the B&WCol-Pa sequence. Glutamic acid-111, thought to be one of the 3 residues involved with stabilizing the second bound base, has been conservatively substituted by aspartic acid in the B&WCol-Pa and -Pb sequences. Site-directed mutagenesis and kinetic studies showed no effect when Glu-111 was mutated, but Glu-111 appears to be involved in catalysis when CpG is used as a substrate (Tarragona-Fiol et al. 1993
). Lysine-1 is thought to interact with the third bound base; however, this residue position is an arginine in the New World monkey, Cercopithecine, and Colobine N-type sequences presented in this study. The Colobine P-type sequences have a lysine, serine, or glycine at this residue, suggesting the possibility that simply the lack of an arginine residue is of functional importance here for the pancreatic-expressed ribonucleases.
|
Overall, there is an apparent trend of substitution in the Colobine P-type protein sequences to eliminate arginine residues. Ten such sites have been highlighted by bold characters in figure 5, with 7 of them exclusive to some or all of the Colobine P-type relative to the N-type proteins. Nine of these sites are arginines in the inferred ancestral primate sequence. One of these sites is an apparent gain of an arginine in the Colobine lineage with a subsequent substitution to Lysine in some of the Colobine P-type sequences. Interestingly, for 7 of these 10 positions, an arginine is not present at the corresponding residue in the bovine pRNase. This suggests a possible selective pressure to eliminate arginines that are not essential to the protein's function in foregut-fermenting animals. A similar trend has been noted for Colobine stomach lysozymes (Stewart and Wilson 1987
Figure 6 presents a summary of our working hypothesis of pRNase gene duplicates in foregut fermenting, colobine monkeys. We believe this phylogeny best represents the true evolutionary history of the colobine pRNase sequences. Our model proposes that soon after the origin of foregut fermentation in the ancestral colobine lineage, the single pRNase gene duplicated, presumably, under selective pressure for more gene product to better able the animal to digest copious amounts of bacterial RNA and thus absorb and recycle the nitrogen and phosphorus fixed into the bacteria during foregut fermentation (Barnard 1969
). One of the duplicates began to evolve changes in the protein-coding region, notably loss of arginine residues relative to lysines. Concomitantly, changes possibly occurred in the regulatory sequences of this duplicate to enhance its expression in pancreas. Evidence for this change in expression level is taken from the fact that (Beintema 1990
) was able to generate a single, clean P-type protein sequence from Hanuman langur pancreas tissue. At least one other duplication event occurred in the ancestor of the Asian colobine lineage, with the retroposition of a P-type sequence, suggesting that this gene is, or at least was, also expressed in germ line cells.
The rate of evolution at the protein sequence level of the P-type relative to N-type sequences is approximately 7 times faster, suggesting positive selection in the P-type sequences. The average number of replacements averaged over all equally parsimonious reconstructions for the P-type sequences ranges from 11.0 (Langur-P) to 18.0 (RedCol-P2), with the average being 13.7. In comparison, the average number of replacements for N-type sequences is 1.89. A significant proportion of these changes appear to be loss of arginines along the P-type lineages. In figure 6, independent losses of arginine residues have been mapped onto the protein sequence lineages. The type of mutation at the nucleotide level responsible for that arginine loss is also denoted, with respect to the CpG mutational mechanism. A methylated CpG dinucleotide that becomes deaminated will be repaired to TpG or CpA, which is estimated to have a frequency 10 times that of the neutral point mutation rate. Four of the 6 codons for arginine contain a CpG dinucleotide, and thus, the loss of arginines could simply be loss of that particular amino acid at neutral positions in the protein due to the elevated CpG dinucleotide mutation rate. Given that only 6 of the 17 independent arginine losses along colobine lineages appear to be due to CpG dinucleotide instability, we argue that the data strongly support the hypothesis of selection for arginine loss.
Three of the arginine (R) residues that changed along colobine lineages (R39, R91, and R98) have several equally parsimonious placements on the tree at the amino acid level. However, analysis of the DNA sequences for these amino acid positions reveals that, for 2 of the cases, one assignment is actually more parsimonious than the others. For example, R39 has 2 arrangements on the tree, each requiring just 2 independent changes. The arrangement where R39 was lost on the lineage leading to all of the P-type and
-type sequences requires one loss due to mutation at a CpG dinucleotide and a subsequent regain of the arginine along the lineage leading to the pseudogenes, via a non-CpG point mutation. The other, more likely, arrangement requires that R39 was lost independently twice, once along the lineage leading to the African colobine P-type sequences and once on the lineage leading to the Asian colobine P-type sequences. This arrangement is considered more likely here because both events would be due to a mutation at a CpG site. Similarly, the assignments chosen here for R98 (which requires a minimum of 4 changes) is for 4 independent losses, 3 being due to mutations at CpG sites. The mutation for the Langur-P sequence is a CGG codon mutating to a GGG codon. This mutation is at a CpG dinucleotide site but is not due to the mutational mechanism responsible for the elevated CpG mutation rate. The alternative would require 2 losses due to a CpG site and a regain with subsequent non-CpG loss in the Langur-P sequence. R91 has 3 equally parsimonious arrangements, with none of the possible mutations involving CpG sites, so in this case, one loss in the ancestor before duplication followed by 2 independent regains of the arginine in the
-type lineage and in the African colobine N-type sequences is shown in the figure. This was chosen over 2 losses/one regain or 3 losses because it is more conservative with respect to the total number of independent arginine losses. Interestingly, under this scenario, R91 is regained in the African N-type lineage and the Asian
-type lineage which we propose are not under selection for arginine loss. For comparison, there are only 2 independent losses that occur in the N-type sequences, the R91 loss that predates the first duplication event and R10 for the Langur-Na sequence. If considered from the opposite perspective, namely arginine gain, the same overall pattern is seen. One arginine gain that predates the duplication event, R78, and R91 being independently regained in the African colobine N-type and Asian colobine
-type sequences. So, in summary, these data strongly suggest that colobine pRNase P-type sequences are selectively losing arginines (and regaining arginines at a lower rate), relative to colobine N-type sequences.
| Conclusions |
|---|
|
|
|---|
In this study, we present data showing multiple pRNase gene sequences, and their corresponding translated protein sequences, for 2 African (black-and-white and red colobus) and 2 Asian (Hanuman langur and proboscis monkey) colobine species. Both sequencing of PCR-amplified DNA and genomic blot analyses are consistent with the following number and type of pRNase duplication events in the colobine clade. These data show that at least one duplication of pRNase occurred in the lineage leading to the modern day African and Asian colobine monkeys. Both Asian colobines as well as red colobus show evidence of a second pRNase duplication event. In the Asian colobines, this duplication is due to retroposition of a transcribed copy of the gene and was most likely "dead-on-arrival," suggesting this duplicate has had little, if any, impact on the evolution of functional pRNase duplicates. In contrast, the additional duplication in red colobus is of the entire gene, similar to the original duplication event, and thus likely due to segmental duplication or unequal crossover. Lineage-specific likelihood ratio tests for positive selection and episodic evolution suggest such selection on some of the P-type DNA sequences (Yang 1998
Beintema (1990)
showed that the langur pancreas had 5060 times the amount of pRNase relative to human (Beintema et al. 1984
). The cow, also a foregut fermenter, has higher levels of pRNase in the gut as well. Although artiodactyl ruminants have 3 duplicate pancreatic-like RNaseA genes, only one of these genes codes for a pancreatic RNase. In correspondence with this, one of the initial impetuses for this work was to find hypothetical positive selection events at nucleotide sites in the promoter of a single colobine pRNase gene that would be the cause of increased transcription and thus increased levels of protein in the pancreas of these species. We generated 1700 bp of DNA sequence upstream of the translational start site for 13 of the 23 gene sequences in hopes of finding altered promoter elements that could be responsible for such expression differences, such as a consensus pancreatic transcription factor-1binding site. This site is found in many genes expressed in the pancreas and usually occurs within 500 bp upstream of the 5' transcriptional start site (Samuelson et al. 1991
; Kruse et al. 1995
; Rose and MacDonald 1997
). No pancreatic promoter elements specific to the colobine sequences were found by visual examination or by TRANSFAC (Windgender et al. 2000). What we did find was a pattern of gene duplication and divergence for the colobine pRNase-coding region despite high levels of homogenization between paralogues by gene conversion.
Mosaic concerted evolution (Wen and Irwin 1999
) appears to be playing a significant role in the evolution of pRNase gene duplicates. There is an apparently high level of homogenization occurring in the 1,744 bp of noncoding sequence immediately adjacent to the 468 bp of coding sequence. This level of homogenization appears to extend into the coding region, but only to a certain degree, suggesting that the complete homogenization of the entire coding sequence is being prevented by selection for 2 functionally distinct pRNase proteins in these species. This supports the case for positive selection of a novel pRNase protein in the past because it would be expected that any new variant might be quickly overwritten. It also supports negative selection on both duplicates once the episodic events had created a reasonably well-adapted pRNase duplicate to act as a digestive enzyme in foregut fermenters, because these 2 distinct duplicates are being maintained. Although independent duplications and deletions occurring in each species separately cannot be ruled out, we favor gene conversion events between pRNase duplicates that existed prior to speciation as the predominant mechanism of this concerted evolution. The reasons we favor this hypothesis are the number of independent duplications and deletions required for the alternate hypothesis, apparent paralogous pRNase sequences sharing the same single nucleotide deletion in the coding region, and the results obtained from amino acid replacement reconstructions mapped onto the phylogenetic trees depicting both scenarios. In addition, this pattern is not without precedence. The gene duplicates of both the human red and green opsin (Zhao et al. 1998
) and the RHCE and RHD (Innan 2003
) genes appear to be undergoing a similar pattern of evolution. In both cases, high levels of apparent gene conversion are keeping the respective paralogues homogenized while divergence has occurred for a small region within each gene (and corresponding protein) that is critical for the functional difference of the 2 distinct types.
Following up on our earlier report (Schienman et al. 2000
), Zhang et al. (2002)
sequenced 2 of the pRNase genes from a douc langur (an Asian colobine), which they name, RNASE1B and RNASE1, respectively. Our results strongly confirm these findings and expand them to additional Asian species and to the African colobines as well. The authors report that analysis of the douc langur sequences suggest a duplication event of approximately 4 MYA. Due to the accepted divergence time of African and Asian colobines (Delson 1994
), our results would strongly suggest that the timing of the duplication event is prior to the African/Asian colobine split and is more on the order of 1012 MYA. The more likely scenario, consistent with both results, is that homogenization by intergenic recombination between pRNase duplicates is the cause of a more recent estimate of the event that created the douc langur pRNase duplicates.
These authors also report that the douc langur RNASE1B (P-type) has 6 times the ribonuclease activity relative to RNASE1 (N-type) at pH 6.3. This pH optimum for RNASE1B is within the unusual pH range of 67 (for a primate) of the colobine monkey small intestine (Kay and Davies 1994
) and suggests that this duplicate has been positively selected to function in this intestinal environment. They state that this change in optimal pH for RNASE1B function is the result of the loss of the overall positive charge of the protein at the amino acid level. Our work confirms this but also extends this result by showing that this change in net charge of the P-type proteins is predominantly due to the selective loss of positively charged arginine residues. There are at least 2 possible reasons why a loss of arginine residues might be favored in an enzyme that functions in the environment of the digestive tract of a foregut-fermenting animal (Stewart and Wilson 1987
). First is the fact that arginyl bonds are more sensitive than lysyl bonds to hydrolysis by pancreatic trypsin (Vonk and Western 1984
; Craik et al. 1985
). Second is the fact that arginine can be irreversibly modified by bacterial fermentation products such as diacetyl (Davies and Neuberger 1969
; Kaiser et al. 1985
). In the case of chicken LZM, this modification alters or "kills" the enzyme activity. In foregut-fermenting animals, not only will pancreatic trypsin be present in the small intestine but also bacterial fermentation products will be as well. Additionally, in such animals, there is a need to digest large amounts of bacterial RNA in order to be able to recycle sufficient levels of nitrogen and phosphorus sequestered by the foregut microbes (Barnard 1969
). From these facts, it follows that the loss of arginine residues in a protein required to have high activity and/or stability in the gut environment might be selected for. This hypothesis is strengthened by the fact that a similar pattern of arginine loss is seen in an unrelated enzyme, stomach lysozyme, also recruited to function in the unique digestive tract environment of foregut fermenters (Stewart and Wilson 1987
). To test the hypothesis that the P-type protein has evolved as a digestive enzyme by being more stable in the small intestine of a colobine or ruminant, in vitro experiments with P- and N-type purified proteins (including site-directed mutagenic variants) incubated with biologically relevant concentrations of diacetyl and pancreatic trypsin could be assayed for level and longevity of their RNase activity.
Gene duplication has long been thought by evolutionary biologists to be the source of novel gene function. One of the first hypotheses to describe what happens to paralogues after the duplication event was put forth by Ohno (1970)
. His hypothesis stated that after gene duplication, one gene copy would be entirely redundant and thus freed from all selective constraint. This copy would accumulate mutations at a neutral rate and in most cases become a nonfunctional pseudogene. Occasionally, these accumulated neutral mutations could eventually resurrect the gene with a new function. We believe our data to be another example that do not support this hypothesis. The biological scenario of foregut fermentation development suggests that the original gene duplicates were beneficial from the beginning via a protein dosage effect. This is in support of data seen by Hughes MK and Hughes AL (1993)
and Kondrashov et al. (2002)
. The former showed that duplicate genes of the tetraploid frog Xenopus laevis were not freed from functional constraint but were under strong purifying selection. The latter used data from a number of bacterial, archaeal, and eukaryotic genomes to compare the mode and strength of selection acting on recently duplicated genes. They hypothesized that gene duplications that persist are beneficial from their time of origin, due primarily to a protein dosage effect in response to variable environmental conditions. If the stability and activity of the original pRNase protein in the colobine lineage were curtailed as described above, then it follows that 2 gene copies enhance the ability to digest RNA in the small intestine. Also, in contrast to Ohno's hypothesis, our data support that one duplicate was not evolving neutrally due to redundancy but under strong positive selection despite high levels of homogenization by gene conversion similar to the situation seen in the human red and green opsin and RHCE and RHD genes (Zhao et al. 1998
; Innan 2003
). Our data may also be in support of a subfunctionalization model (see Hughes 2005
for a short review), where the original protein had 2 functions and the duplicates each evolved to specialize in only one of the functions. The expression of human pRNase in many tissues other than the gut along with data published by Sorrentino et al. (2003)
and Zhang (2003)
suggest that the original protein in colobines had a second function of high activity toward double-stranded RNA (not seen in bovine pRNase), perhaps acting as an agent against viruses with a dsRNA genome. Given that this activity seems to be dependent on noncatalytic positive charges on the exposed surface of the protein (Sorrentino et al. 2003
), the retention of this activity in both duplicates of the colobine lineage would be selected against by the unique environment of the colobine small intestine.
| Footnotes |
|---|
Claudia Schmidt-Dannert, Associate Editor
| References |
|---|
|
|
|---|
Auerbach MR. 1997. Pancreatic ribonuclease evolution in the primates: a molecular phylogenetic approach [dissertation]. Albany, NY: Department of Biological Sciences, University at Albany.
Barnard EA. 1969. Biological function of pancreatic ribonuclease. Nature 221:3404.[CrossRef][Medline]
Bauchop T, Martucci RW. 1968. Ruminant-like digestion of the langur monkey. Science 161:698700.
Beintema JJ. 1990. The primary structure of langur (Presbytis entellus) pancreatic ribonuclease: adaptive features in digestive enzymes in mammals. Mol Biol Evol 7:4707.[Abstract]
Beintema JJ, Blank A, Schieven GL, Dekker CA, Sorrentino S, Libonati M. 1988. Differences in glycosylation pattern of human secretory ribonucleases. Biochem J 255:5015.[Web of Science][Medline]
Beintema JJ, Gaastra W, Lenstra JA, Welling GW, Fitch WM. 1977. The molecular evolution of pancreatic ribonuclease. J Mol Evol 10:4971.[CrossRef][Web of Science][Medline]
Beintema JJ, Gaastra W, Scheffer AJ, Welling GW. 1976. Carbohydrate in pancreatic ribonucleases. Eur J Biochem 63:4418.[Web of Science][Medline]
Beintema JJ, Scheffer AJ, van Dijk H, Welling GW, Zwiers H. 1973. Pancreatic ribonuclease distribution and comparisons in mammals. Nat New Biol 241:768.[Web of Science][Medline]
Beintema JJ, Wietzes P, Weickmann JL, Glitz DG. 1984. The amino acid sequence of human pancreatic ribonuclease. Anal Biochem 136:4864.[CrossRef][Web of Science][Medline]
Boix E, Nogues MV, Schein CH, Benner SA, Cuchillo CM. 1994. Reverse transphosphorylation by ribonuclease A needs an intact p2-binding site. Point mutations at Lys-7 and Arg-10 alter the catalytic properties of the enzyme. J Biol Chem 269:252934.
Breukelman HJ, Beintema JJ, Confalone E, Costanzo C, Sasso MP, Carsana A, Palmieri M, Furia A. 1993. Sequences related to the ox pancreatic ribonuclease coding region in the genomic DNA of mammalian species. J Mol Evol 37:2935.[Medline]
Brunger AT, Brooks CL III, Karplus M. 1985. Active site dynamics of ribonuclease. Proc Natl Acad Sci USA 82:845862.
Coen E, Strachan T, Dover G. 1982. Dynamics of concerted evolution of ribosomal DNA and histone gene families in the melanogaster species subgroup of Drosophila. J Mol Biol 158:1735.[CrossRef][Web of Science][Medline]
Craik CS, Largman C, Fletcher T, Roczniak S, Barr PJ, Fletterick R, Rutter WJ. 1985. Redesigning trypsin: alteration of substrate specificity. Science 228:2917.
Cuchillo CM, Vilanova M, Nogues MV. 1997. Pancreatic ribonucleases. In: D'Alessio G, Riordan JF, editors. Ribonucleases: structures and functions. New York: Academic Press. p 271304.
D'Alessio G, Di Donato A, Parente A, Piccoli R. 1991. Seminal RNase: a unique member of the ribonuclease superfamily. Trends Biochem Sci 16:1046.[CrossRef][Web of Science][Medline]
D'Alessio G, Floridi A, De Prisco R, Pignero A, Leone E. 1972. Bull semen ribonucleases. 1. Purification and physico-chemical properties of the major component. Eur J Biochem 26:15361.[Web of Science][Medline]
Davies RC, Neuberger A. 1969. Modification of lysine and arginine residues of lysozyme and the effect on enzymatic activity. Biochim Biophys Acta 178:30617.[Medline]
Delson E. 1994. Evolutionary history of the colobine monkeys in paleoenvironmental perspective. In: Davies AG, Oates JF, editors. Colobine monkeys: their ecology, behaviour, and evolution. Cambridge, UK: Cambridge University Press. p 1143.
Dobson DE, Prager EM, Wilson AC. 1984. Stomach lysozymes of ruminants. I. Distribution and catalytic properties. J Biol Chem 259:1160716.
Domachowske JB, Rosenberg HF. 1997. Eosinophils inhibit retroviral transduction of human target cells by a ribonuclease-dependent mechanism. J Leukoc Biol 62:3638.[Abstract]
Felsenstein J. 1993. PHYLIP (phylogeny inference package) Version 3.5c. Seattle, WA: Distributed by the author, Department of Genetics, University of Washington.
Hughes AL. 2005. Gene duplication and the origin of novel proteins. Proc Natl Acad Sci USA 102:87912.
Hughes MK, Hughes AL. 1993. Evolution of duplicate genes in a tetraploid animal, Xenopus laevis. Mol Biol Evol 10:13609.[Abstract]
Ina Y. 1995. New methods for estimating the numbers of synonymous and nonsynonymous substitutions. J Mol Evol 40:190226.[CrossRef][Web of Science][Medline]
Innan H. 2003. A two-locus gene conversion model with selection and its application to the human RHCE and RHD genes. Proc Natl Acad Sci USA 100:87938.
Kaiser ET, Lawrence DS, Rokita SE. 1985. The chemical modification of enzymatic specificity. Annu Rev Biochem 54:565595.[CrossRef][Web of Science][Medline]
Kay RNB, Davies AG. 1994. Digestive physiology. In: Davies AG, Oates JF, editors. Colobine monkeys: their ecology, behaviour, and evolution. Cambridge, UK: Cambridge University Press. p 22949.
Klink TA, Woycechowsky KJ, Taylor KM, Raines RT. 2000. Contribution of disulfide bonds to the conformational stability and catalytic activity of ribonuclease A. Eur J Biochem 267:56672.[Web of Science][Medline]
Kondrashov FA, Rogozin IB, Wolf YI, Koonin EV. 2002. Selection in the evolution of gene duplications. Genome Biol 3:RESEARCH0008.[Medline]
Kruse F, Rose SD, Swift GH, Hammer RE, MacDonald RJ. 1995. Cooperation between elements of an organ-specific transcriptional enhancer in animals. Mol Cell Biol 15:438594.[Abstract]
Lee H. 1993. Evolution of old world monkey ribonuclease genes [dissertation]. Albany, NY: Department of Biological Sciences, University at Albany.
Libonati M, Furia A, Beintema JJ. 1976. Basic charges on mammalian ribonulease molecules and the ability to attack double-stranded RNA. Eur J Biochem 69:44551.[CrossRef]
Maddison DR, Maddison WP. 2000. MacClade 4: analysis of phylogeny and character evolution. Sunderland, MA: Sinauer Associates.
Matthew JB, Richards FM. 1982. Anion binding and pH-dependent electrostatic effects in ribonuclease. Biochemistry 21:498999.[CrossRef][Medline]
Meyerhans A, Vartanian JP, Wain-Hobson S. 1990. DNA recombination during PCR. Nucleic Acids Res 18:168791.
Moussaoui M, Guasch A, Boix E, Cuchillo C, Nogues M. 1996. The role of non-catalytic binding subsites in the endonuclease activity of bovine pancreatic ribonuclease A. J Biol Chem 271:468792.
Murthy BS, Sirdeshmukh R. 1992. Sensitivity of monomeric and dimeric forms of bovine seminal ribonuclease to human placental ribonuclease inhibitor. Biochem J 281(Pt 2):3438.
Nicholas KB, Nicholas HB Jr, Deerfield DW II. 1997. GeneDoc: analysis and visualization of genetic variation. EMBNEW News 4:14.
Nogues MV, Moussaoui M, Boix E, Vilanova M, Ribo M, Cuchillo CM. 1998. The contribution of noncatalytic phosphate-binding subsites to the mechanism of bovine pancreatic ribonuclease A. Cell Mol Life Sci 54:76674.[CrossRef][Web of Science][Medline]
Ohno S. 1970. Evolution by gene duplication. Berlin, Germany: Springer-Verlag.
Richardson RM, Pares X, Cuchillo CM. 1990. Chemical modification by pyridoxal 5'-phosphate and cyclohexane-1,2-dione indicates that Lys-7 and Arg-10 are involved in the p2 phosphate-binding subsite of bovine pancreatic ribonuclease A. Biochem J 267:5939.[Web of Science][Medline]
Rose SD, MacDonald RJ. 1997. Evolutionary silencing of the human elastase I gene (ELA1). Hum Mol Genet 6:897903.
Rosenberg HF, Dyer KD. 1995. Eosinophil cationic protein and eosinophil-derived neurotoxin. Evolution of novel function in a primate ribonuclease gene family. J Biol Chem 270:30234.
Rosenberg HF, Dyer KD, Tiffany HL, Gonzalez M. 1995. Rapid evolution of a unique family of primate ribonuclease genes. Nat Genet 10:21923.[CrossRef][Web of Science][Medline]
Samuelson LC, Wiebauer K, Howard G, Schmid RM, Koeplin D, Meisler MH. 1991. Isolation of the murine ribonuclease gene Rib-1: structure and tissue specific expression in pancreas and parotid gland. Nucleic Acids Res 19:693541.
Sasso MP, Carsana A, Confalone E, Cosi C, Sorrentino S, Viola M, Palmieri M, Russo E, Furia A. 1991. Molecular cloning of the gene encoding the bovine brain ribonuclease and its expression in different regions of the brain. Nucleic Acids Res 19:646974.
Schienman JE, Holt RA, Stewart CB. 2000. Adaptive evolution of pancreatic ribonuclease in primates. Poster Presentation at the Society for Molecular Biology and Evolution Meeting; 2000 June 1418; New Haven, CT: Yale University.
Snyder MR, Gleich GJ. 1997. Eosinophil-associated ribonucleases. In: D'Alessio G, Riordan JF, editors. Ribonucleases: structures and functions. New York: Academic Press. p 42544.
Sorrentino S, Naddeo M, Russo A, D'Alessio G. 2003. Degradation of double-stranded RNA by human pancreatic ribonuclease: crucial role of noncatalytic basic amino acid residues. Biochemistry 42:1018290.[CrossRef][Medline]
Stewart CB, Schilling JW, Wilson AC. 1987. Adaptive evolution in the stomach lysozymes of foregut fermenters. Nature 330:4014.[CrossRef][Medline]
Stewart CB, Wilson AC. 1987. Sequence convergence and functional adaptation of stomach lysozymes from foregut fermenters. Cold Spring Harb Symp Quant Biol 52:8919.
Strauss WM. 1998. Preparation of genomic DNA from mammalian tissue. In: Ausubel FM, Brent R, Kingston RE, Moore DD, Seidman JG, Smith JA, Struhl K, editors. Current protocols in molecular biology. New York: John Wiley & Sons, Inc. p 2.2.12.2.3.
Suzuki H, Parente A, Farina B, Greco L, La Montagna R, Leone E. 1987. Complete amino-acid sequence of bovine seminal ribonuclease, a dimeric protein from seminal plasma. Biol Chem Hoppe-Seyler 368:130512.[Medline]
Swofford DL. 1998. PAUP*: phylogenetic analysis using parsimony (*and other methods). Sunderland, MA: Sinauer Associates.
Tarragona-Fiol A, Eggelte HJ, Harbron S, Sanchez E, Taylorson CJ, Ward JM, Rabin BR. 1993. Identification by site-directed mutagenesis of amino acids in the B2 subsite of bovine pancreatic ribonuclease A. Protein Eng 6:9016.
Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG. 1997. The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res 25:487682.
Vonk HJ, Western JRH. 1984. Comparative biochemistry and physiology of enzymatic digestion. London: Academic Press.
Watanabe H, Katoh H, Ishii M, Komoda Y, Sanda A, Takizawa Y, Ohgi K, Irie M. 1988. Primary structure of a ribonuclease from bovine brain. J Biochem (Tokyo) 104:93945.
Wen Y, Irwin DM. 1999. Mosaic evolution of ruminant stomach lysozyme genes. Mol Phylogenet Evol 13:47482.[CrossRef][Web of Science][Medline]
Wingender E, Chen X, Hehl R, Karas H, Liebich I, Matys V, Meinhardt T, Pruss M, Reuter I, Schacherer F. 2000. TRANSFAC: an integrated system for gene expression regulation. Nucleic Acids Res 28:3169.
Yang Z. 1998. Likelihood ratio tests for detecting positive selection and application to primate lysozyme evolution. Mol Biol Evol 15:56873.[Abstract]
Zhang J. 2003. Parallel functional changes in the digestive RNases of ruminants and colobines by divergent amino acid substitutions. Mol Biol Evol 20:13107.
Zhang J, Zhang YP, Rosenberg HF. 2002. Adaptive evolution of a duplicated pancreatic ribonuclease gene in a leaf-eating monkey. Nat Genet 30:4115.[CrossRef][Web of Science][Medline]
Zhao Z, Hewett-Emmett D, Li WH. 1998. Frequent gene conversion between human red and green opsin genes. J Mol Evol 46:4946.[CrossRef][Web of Science][Medline]
Zimmer EA, Martin SL, Beverley SM, Kan YW, Wilson AC. 1980. Rapid duplication and loss of genes coding for the alpha chains of hemoglobin. Proc Natl Acad Sci USA 77:215862.
![]()
CiteULike
Connotea
Del.icio.us What's this?
Related articles in MBE:
- Duplication and Divergence of Two Distinct Pancreatic Ribonuclease Genes in Leaf-Eating African and Asian Colobine Monkeys
MBE 2006 23: 1817.[Extract] [FREE Full Text]
This article has been cited by other articles:
![]() |
M. J. Lawson and L. Zhang Sexy gene conversions: locating gene conversions on the X-chromosome Nucleic Acids Res., May 31, 2009; (2009) gkp421v1. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||



), with numbers directly above, denote independent losses of arginine residues on that lineage at that residue position relative to the start of the mature protein sequence. The open circles denote that the loss was due to a CpG mutation. The total number of postively charged arginines and lysines (+), total number of negatively charged glutamic and aspartic acids (), total number of arginines (Arg), arginine versus lysine ratio (R/K), isoelectric point (PI), and the overall charge of the protein at a pH of 7.0 (pH 7.0) appears in a table to the right of each species protein sequence.
) and (Ø), respectively. The letter immediately following the gene deletion symbol denotes for which species it has been deleted (B = black-and-white colobus, L = Hanuman langur, P = proboscis monkey, and F = both African colobines). The number of required gene duplications and deletions required for each tree is listed in parentheses directly above that tree. Sequences that we believe are an allelic pair is denoted by (). The proposed biological tree is based on the known phylogenetic relationships between the species groups and reasons discussed in the results section. The maximum likelihood cladogram was generated in PAUP* 4.0b8-10 (Swofford) using the General Time Reversible + G + I model with estimation of all parameters, heuristic search, starting trees via stepwise additions (10 replicates) with random seed; 100 bootstrap replicates with fast heuristic search, parameters estimated, and starting trees via stepwise addition with random addition sequence. For parameters not mentioned here, the default settings were used. The squirrel monkey pRNase sequence was used as the outgroup. The number directly above or below each lineage branch is the bootstrap support for that clade.

