Skip Navigation


MBE Advance Access originally published online on October 8, 2008
Molecular Biology and Evolution 2009 26(1):47-59; doi:10.1093/molbev/msn222
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Supplementary Data
Right arrow An erratum has been published
Right arrow All Versions of this Article:
26/1/47    most recent
msn222v2
msn222v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Kuraku, S.
Right arrow Articles by Kuratani, S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Kuraku, S.
Right arrow Articles by Kuratani, S.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2008. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oxfordjournals.org

Research Articles

Timing of Genome Duplications Relative to the Origin of the Vertebrates: Did Cyclostomes Diverge before or after?

Shigehiro Kuraku*,{dagger}, Axel Meyer{dagger} and Shigeru Kuratani*

* Laboratory for Evolutionary Morphology, RIKEN Center for Developmental Biology, Kobe, Japan
{dagger} Lehrstuhl für Zoologie und Evolutionsbiologie, Department of Biology, University of Konstanz, 78457 Konstanz, Germany

E-mail: shigehiro.kuraku{at}uni-konstanz.de.


    Abstract
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 ML Analysis for the...
 Discussion
 Conclusions
 Supplementary Material
 Acknowledgements
 References
 
Two rounds of whole-genome duplications are thought to have played an important role in the establishment of gene repertoires in vertebrates. These events occurred during chordate evolution after the split of the urochordate and cephalochordate lineages but before the radiation of extant gnathostomes (jawed vertebrates). During this interval, diverse agnathans (jawless fishes), including cyclostomes (hagfishes and lampreys), diverged. However, there is no solid evidence for the timing of these genome duplications in relation to the divergence of cyclostomes from the gnathostome lineage. We conducted cDNA sequencing in diverse early vertebrates for members of homeobox-containing (Dlx and ParaHox) and other gene families that would serve as landmarks for genome duplications. Including these new sequences, we performed a molecular phylogenetic census using the maximum likelihood method for 55 gene families. In most of these gene families, we detected many more gene duplications before the cyclostome–gnathostome split, than after. Many of these gene families (e.g., visual opsins, RAR, Notch) have multiple paralogs in conserved, syntenic genomic regions that must have been generated by large-scale duplication events. Taken together, this indicates that the genome duplications occurred before the cyclostome–gnathostome split. We propose that the redundancy in gene repertoires possessed by all vertebrates, including hagfishes and lampreys, was introduced primarily by genome duplications. Apart from subsequent lineage-specific modifications, these ancient genome duplication events might serve generally to distinguish vertebrates from invertebrates at the genomic level.

Key Words: conserved synteny • hagfish • genome duplication • lamprey • 2R hypothesis


    Introduction
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 ML Analysis for the...
 Discussion
 Conclusions
 Supplementary Material
 Acknowledgements
 References
 
Phylogenetic and genomic studies have shown to near certainty that the chordates experienced genome duplications (McLysaght et al. 2002Go; Lundin et al. 2003Go; Dehal and Boore 2005Go; Putnam et al. 2008Go). These events are implicated in the evolutionary path leading to gnathostomes after the divergence from the urochordate and cephalochordate lineages but before that of chondrichthyans (Holland et al. 1994Go; Miyata and Suga 2001Go; Venkatesh et al. 2007Go; Hufton et al. 2008Go). During this interval, diverse groups of agnathans branched off successively, two of which (hagfishes and lampreys) have survived to date (Forey and Janvier 1993Go). Classically, these two groups of agnathans were placed in a single taxon, the Cyclostomata (Duméril 1806Go). However, from subsequent morphological observations, it was believed that hagfishes diverged first in vertebrate phylogeny so that lampreys are a sister group of gnathostomes (Forey 1984Go; Maisey 1986Go). Recently, reanalysis of hagfish morphology from a developmental viewpoint provided evidence against this hypothesis (Ota et al. 2007Go), which suggested that further developmental analyses of hagfishes might provide more confident evidence of the monophyly of Cyclostomata. Furthermore, molecular phylogenetic analyses of diverse genes have strongly supported the monophyly of the Cyclostomata (summarized in Kuraku and Kuratani 2006Go). Now, another question is whether cyclostomes diverged before or after the genome duplications (fig. 1), which will be critical for a deeper understanding of vertebrate evolution at the molecular level.


Figure 1
View larger version (27K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 1.— Three possible scenarios for timings of 2R genome duplications. Expected tree topologies (hypotheses A–C) for gene phylogeny are illustrated for an imaginary gene family comprising one invertebrate outgroup (Inv.) and four gnathostome (Gna.) genes with an intact set of cyclostome (Cyc.) genes. Hypothesis A has not been explicitly proposed so far, whereas there are some reports supporting hypothesis B (Pendleton et al. 1993Go; Sharman and Holland 1998Go; Escriva et al. 2002Go; Force et al. 2002Go; Stadler et al. 2004Go) or C (Fried et al. 2003Go; Furlong et al. 2007Go). Expected numbers of gene duplications before/after the cyclostome–gnathostome split in this gene family are indicated in the middle. Timings of genome duplications are shown as black arrows on the species phylogeny.

 
The idea of genome duplications, first proposed by Ohno (1970)Go, has been strengthened by the observations that model vertebrates often have multiple (usually as many as four) duplicated genes (paralogs) corresponding to a single set of invertebrate orthologs (Holland et al. 1994Go; Sidow 1996Go; Spring 1997Go). This has been confirmed by the discoveries that similar arrays of genes are found on multiple chromosomes of model vertebrates (reviewed in Kasahara 2007Go). However, in cyclostomes, gene repertoires and the structures of landmark gene clusters, such as Hox and Dlx, tend to indicate an incomplete or degenerate state of their genomes (Pendleton et al. 1993Go; Sharman and Holland 1998Go; Neidert et al. 2001Go; Force et al. 2002Go; Irvine et al. 2002Go). This has prompted many researchers to believe that cyclostomes diverged in the midst of or before successive genome duplications (hypothesis B or C in fig. 1) and to attribute their "primitive" morphological traits to their possible intermediate genomic state (e.g., Donoghue and Purnell 2005Go). The picture is further obscured by possible gene duplications that have been imputed to the cyclostome lineage, as reported for Hox genes (Fried et al. 2003Go; Stadler et al. 2004Go), although it is unknown whether this represents a genome-wide phenomenon.

To date, investigations on intragenomic conserved syntenies (paralogons) have contributed to the identification of ancient genome duplications. However, signatures of genome duplications are obscured by subsequent lineage-specific gene duplications and gene losses. Therefore, simply counting the numbers of gene repertoires or conserved syntenies will not necessarily provide a reliable estimate for the scenario of genome evolution: the key to this question is the tree topology of molecular phylogenetic trees. For example, the timing of the genome duplication that occurred in the actinopterygian lineage was dated using only three protein-coding genes (Hoegg et al. 2004Go), and this result has been confirmed by subsequent analyses (Crow et al. 2006Go; Steinke et al. 2006Go). Hence, in principle, gene sampling from the animal groups in question and the knowledge of their phylogenetic relationships based on other independent information should lead us to a clear picture of genome evolution, even in the absence of large-scale genome sequencing. However, phylogenetic analyses of cyclostome genes have occasionally produced contradictory results. An analysis using 33 gene families that aimed to resolve the timing of the genome duplications failed to provide a unified scenario but suggested that one genome duplication occurred before the cyclostome–gnathostome split and the other occurred after (hypothesis B) (Escriva et al. 2002Go). However, this analysis was based on a relatively unsophisticated phylogenetic method—Neighbor-Joining (NJ) method using distance matrix inferred by Poisson correction. Moreover, a recent phylogenetic analysis based on the whole-genome sequencing of Branchiostoma floridae could not solve this question, either (Putnam et al. 2008Go). Therefore, a reanalysis is necessary.

In this study, to assess the timing of genome duplications in early vertebrate evolution, we performed a set of molecular phylogenetic analyses with all annotated cyclostome genes available in public databases in addition to ones sequenced in this study. By optimizing data sets and phylogenetic methods, together with thorough taxon sampling, we obtained consistent results suggesting that the two rounds of genome duplication occurred before the divergence between ancestors of cyclostomes and gnathostomes.


    Materials and Methods
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 ML Analysis for the...
 Discussion
 Conclusions
 Supplementary Material
 Acknowledgements
 References
 
General Molecular Phylogenetics
Sequences were retrieved from the NCBI Entrez Protein database (URL: http://www.ncbi.nlm.nih.gov/sites/entrez?db = Protein) and Ensembl (URL: http://www.ensembl.org/). Multiple alignments of amino acid sequences were constructed using MAFFT (Katoh et al. 2005Go) followed by manual improvement. Unambiguously aligned regions without gaps were used for tree inferences (for the alignments used in this study, see supplementary data set S1, Supplementary Material online). To select gene families, we used the NJ method (Saitou and Nei 1987Go) and the quartet-puzzling algorithm implemented in Tree-Puzzle 5.2 (Schmidt et al. 2002Go). We excluded teleost fishes and Xenopus laevis, which have undergone additional genome duplications in independent lineages (Bisbee et al. 1977Go; Hoegg et al. 2004Go; Crow et al. 2006Go), and instead included nonteleost actinopterygians and chondrichthyans, when available. Multiple invertebrate sequences with ordinary evolutionary rates, absence of many unique gaps in alignment and robust support of orthologies, were rigorously selected for outgroups. The maximum likelihood (ML) analysis for selected gene families was performed using Tree-Puzzle 5.2, by inputting all possible tree topologies in "user-defined trees" mode, assuming the JTT + F + {Gamma}8 model. Phylogenetic relationships within gnathostomes and outgroups were constrained based on a generally accepted species phylogeny (Meyer and Zardoya 2003Go; Cracraft and Donoghue 2004Go). Bootstrap probabilities were calculated with 1,000 replicates. Bayesian posterior probabilities were calculated using MrBayes 3.1 (Ronquist and Huelsenbeck 2003Go). Shimodaira–Hasegawa (SH) test and approximately unbiased (AU) test were performed using CONSEL (Shimodaira and Hasegawa 2001Go). For counting discrete numbers of gene duplications, NJ trees were also inferred with the data sets used in the ML analysis, assuming the JTT + {Gamma}4 model.

Probabilistic Counts of Gene Duplications
After performing ML analyses, the probability of each tree topology was calculated with the resampling of estimated log-likelihoods (RELL) approximation (Kishino et al. 1990Go) using CONSEL (Shimodaira and Hasegawa 2001Go). For each gene family, the number of gene duplications before the cyclostome–gnathostome split (Nbef) was calculated as

Formula (1)
where nbefi is the number of gene duplications on that branch in a given tree topology (topology i) and Pi is the RELL probability of the topology (fig. 2). Standard error was calculated based on bootstrap resampling of log-likelihood produced above by the ML method (1,000 replicates). Similarly, the number of gene duplications after the cyclostome–gnathostome split (Naft), the number of gene duplications in the cyclostome lineage (Ncyc), and the number of gene duplications unassigned to any branch (Nunk) were calculated as follows:

Formula (2)

Formula (3)

Formula (4)


Figure 2
View larger version (11K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 2.— Indices used for counting gene duplications. Nbef, Naft, and Ncyc represent the numbers of gene duplications estimated to occur in each branch indicated. Nunk represents the estimated number of gene duplications whose timings were not successfully assigned to any of Nbef, Naft, or Ncyc.

 
Assumption of the Two-Round Pattern
In gene families with three or four gnathostome paralogs, among all possible tree topologies, those representing the two-round (2R) pattern of gene duplication—for example, ((a, b), (c, d)), where ad represent four gnathostome paralogs—were selected, and they were categorized in hypotheses A–C (supplementary table S1, Supplementary Material online). When the topology did not allow us to distinguish between hypotheses A and B, the result was categorized as "hypothesis A/B." In gene families with only two gnathostome paralogs, a tree topology with a gene duplication before the cyclostome–gnathostome split is shown as "hypothesis A/B," whereas one with a gene duplication after that split is shown as "hypothesis B/C." Probabilities of these hypotheses were calculated by summing RELL probabilities of tree topologies that supported each hypothesis.

Selection of Gene Families
For 3,207 entries for protein-coding hagfish and lamprey genes in the NCBI Protein database, we surveyed gene families for further analysis. Our criteria were the length of any unambiguous alignment (≥150 amino acids), the availability of invertebrate orthologs as outgroups (e.g., orthologies of cyclostome genes to gnathostome genes are sometimes discussed carelessly in the absence of appropriate outgroups [Kawakoshi et al. 2006Go; Haitina et al. 2007Go]), the number of gnathostome paralogs corresponding to a single invertebrate ortholog (>2 paralogs) and the resolvability of gnathostome phylogeny (for genes excluded, see supplementary data set S2, Supplementary Material online). To reduce orthology/paralogy misidentification, we excluded gene families that are prone to gene duplications (i.e., gene families with more than two gene duplications in the lineage leading to humans during gnathostome evolution; e.g., globins, neurofilaments, and olfactory receptors). We also excluded gene families in which more than two paralogs are found on a single human chromosome because this study was intended to focus exclusively on inter- rather than intrachromosomal, gene duplications (note that, even under this criterion, tandem duplications cannot be completely excluded from our data set; see Discussion). Finally, we selected 55 gene families that contained at least one cyclostome gene (see supplementary table S2, Supplementary Material online). Of these, 21 had two gnathostome paralogs, whereas 34 had more than three gnathostome paralogs. These two family categories were analyzed separately. In gene families with two and three gnathostome paralogs, two and one duplicates were assumed to have been lost secondarily following the genome duplication event, respectively. Using the human genome as a reference, members of these gene families, totaling 150 genes, covered all the 22 autosomes and the X chromosome (supplementary fig. S1, Supplementary Material online).

Identification of Synteny Groups
Amino acid sequences of human protein-coding genes, sorted in physical order, were retrieved for each chromosome from Ensembl via the EnsMart interface. We performed pairwise BlastP searches (Altschul et al. 1997Go) reciprocally between chromosomes harboring members of each gene family. For interchromosomal gene pairs that showed high "bits" scores (>100), we inferred molecular phylogenetic trees using amino acid sequences of other animals as above. We excluded gene families in which family members were revealed to have duplicated before the split between invertebrate chordates and vertebrate lineages or within gnathostome lineages, even if those family members were located on the set of chromosomes in question.

The synteny group containing the opsin gene family, one of the gene families on which we focused in this study, was previously reported (Nordstrom et al. 2004Go). However, their results erroneously contained gene families whose members are located on more than two of human chromosome (HSA) 1, 3, 7, and X but were duplicated in a more ancient period of animal evolution, rather than the period of early vertebrate evolution. For example, in their study, plexin A2, B1, and B3 were proposed as components of these syntenic regions, although subfamilies A and B in the plexin family diverged in the early phase of animal evolution: plexin A2, B1, and B3 should not be regarded as the components of these synteny groups. In addition, some gene families in the synteny group they proposed, such as CLSTN1/2, KCNAB1/2, and STAG1/3, have another member that was duplicated concomitantly early in vertebrate evolution but is not located on either of HSA 1, 3, 7, and X.

In identifying the retinoic acid receptor (RAR)–related synteny group comprising HSA 3–12–17, we also detected syntenic regions on similar sets of HSAs, that is, HSA 2–7–12–17, 1–3–7–12, and 1–3–12–X. As described previously (McLysaght et al. 2002Go), these quadruplets of syntenic regions are juxtaposed in an interdigitated manner, possibly because of secondary interchromosomal rearrangements that occurred after the genome duplications.


    Results
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 ML Analysis for the...
 Discussion
 Conclusions
 Supplementary Material
 Acknowledgements
 References
 
Analysis on Homeobox-Containing Gene Families
We first focused on the homeobox-containing gene families, Hox, Dlx, and ParaHox. In the Hox gene family, we selected paralogous group 13 (Hox13) because of the presence of an intact set (HoxA13, B13, C13, and D13) of gnathostome paralogs and a high level of divergence compared with other paralogous groups (Ferrier et al. 2000Go). So far, two Hox13 genes that cover an entire homeobox have been reported for cyclostomes (LjHox13{alpha} [GenBank accession number AB293597] and LjHox13β [AB293598]; Kuraku et al. 2008Go). In the Dlx family, four genes (DlxADlxD; AY010116 [GenBank] –AY010119 [GenBank] ) were identified previously in the sea lamprey Petromyzon marinus (Neidert et al. 2001Go). In the Japanese lamprey Lethenteron japonicum, we performed reverse transcription–polymerase chain reaction (RT–PCR) (supplementary text S1, Supplementary Material online) and isolated two novel Dlx genes (LjDlxE and LjDlxF) as well as orthologs of the four genes (LjDlxALjDlxD) reported previously for P. marinus (AB292628 [GenBank] –AB292633 [GenBank] ; supplementary figs. S2 and S3, Supplementary Material online). In the ParaHox gene family, we focused on the Cdx and Gsh subfamilies because the Xlox subfamily, the third member of the ParaHox gene family, has only one gene in most gnathostomes. For each of the Cdx and Gsh families, in addition to the hagfish homologues previously reported (Furlong et al. 2007Go), we isolated one gene in L. japonicum, designated as LjCdxA (AB368821 [GenBank] ) and LjGshA (AB368822 [GenBank] ), respectively (supplementary text S1, Supplementary Material online).

To examine the molecular phylogenies of these cyclostome genes, we performed ML tree inferences. The Dlx family was divided into two subgroups to be analyzed separately—Dlx1/4/6 and Dlx2/3/5—because each of these has a urochordate ortholog (Stock et al. 1996Go; Irvine et al. 2007Go). In the rest of this article, the phrase "gene family" denotes a single set of invertebrate orthologs plus a group of vertebrate homologues that have derived from the ancestral invertebrate ortholog.

In all these five gene families (Hox13, Dlx1/4/6, Dlx2/3/5, Cdx, and Gsh), the results did not support a specific tree topology with high confidence, leaving multiple tree topologies within 1{sigma} of log-likelihood compared with the ML tree (table 1). In addition, support values for each node were low (fig. 3). We estimated the timings of gene duplications, based on the probabilistic method for counting gene duplications for each branch (Nbef, Naft, and Ncyc; see Materials and Methods). For all these five families, we detected more gene duplications before the cyclostome–gnathostome split than after (Nbef > Naft; table 1). We also analyzed which of the three possible hypotheses would be supported under the assumption of 2R gene duplication pattern (see Materials and Methods). None of these families supported a single hypothesis with high confidence (P > 0.70; table 1).


View this table:
[in this window]
[in a new window]

 
Table 1 ML Analysis for the Homeobox-Containing Genes

 

Figure 3
View larger version (41K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 3.— The ML trees for selected homeobox-containing gene families. Tree topologies supported by ML analysis are shown for Hox13 (A), Dlx1/4/6 (B), Dlx2/3/5 (C), Cdx (D), and Gsh (E). For simplicity, relationships within outgroups, gnathostome paralogs (Gna.), and two hagfish Gsx genes (see Furlong et al. 2007Go, for details) are not displayed. Numbers at nodes indicate Bayesian posterior probabilities (left) and intact bootstrap probabilities from ML analysis (right). A hyphen indicates that the phylogenetic relationship at the node was not fully resolved in the Bayesian analysis. Numbers in parentheses indicate supporting values obtained in analyses in which the hagfish–lamprey relationship is not constrained. Abbreviations for species names: Bf, Branchiostoma floridae; Ci, Ciona intestinalis; Dl, Diplosoma listerianum; Od, Oikopleura dioica; Cs, Ciona savignyii; Hc, Herdmania curvata; Hr, Halocynthia roretzi; Es, Euprymna scolopes; Nv, Nereis virens.

 
We also performed ML analyses for all these families without constraints on the phylogenetic relationships within gnathostomes and outgroups. In the ML trees of all families except for Dlx2/3/5, the relationships within gnathostomes were not properly reconstructed (supplementary fig. S4AE, Supplementary Material online). Additionally, in the Dlx1/4/6, Cdx, and Gsh families, the invertebrate genes that should be regarded as an outgroup did not form a single cluster (supplementary fig. S4B, D, and E, Supplementary Material online). For the Gsh and Cdx families, we performed further analyses by adding our lamprey sequences to the data sets used previously (Furlong et al. 2007Go). The ML trees did not support hypothesis C but hypothesis A (see supplementary fig. S4F and G, Supplementary Material online). Again, the relationships within the gnathostomes and outgroups were not properly reconstructed.


    ML Analysis for the 55 Gene Families without the Assumption of the "2R" Duplication Tree Topology
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 ML Analysis for the...
 Discussion
 Conclusions
 Supplementary Material
 Acknowledgements
 References
 
For all the 55 gene families selected with rigorous criteria (see Materials and Methods), we performed ML analyses (see supplementary table S3, Supplementary Material online). In 48 of these gene families, there were more than two tree topologies within 1{sigma} of log-likelihood compared with the ML tree (for details of log-likelihood, SH test, and AU test, see supplementary fig. S5, Supplementary Material online). In the ML trees of gene families with three or four gnathostome paralogs (n = 33), 13 families supported tree topologies with no gene duplication after the cyclostome–gnathostome split (hypothesis A; table 2A). Five families supported tree topologies with no gene duplication before the cyclostome–gnathostome split (hypothesis C), whereas 11 supported tree topologies with gene duplications both before and after the cyclostome–gnathostome split (hypothesis B; table 2A). Topologies of ML trees in four of the gene families did not allow us to distinguish between hypothesis A or B (designated hypothesis A/B). On the other hand, eight families rejected hypothesis C at a significant {sigma} level for log-likelihood, whereas only three (protein tyrosine phosphatase [PTP] R2, PTP RM/RK/RT/RL, and PACSIN1/2) and one (PTP R2) rejected hypotheses A and B, respectively (table 2A). In the ML trees of gene families with only two gnathostome paralogs (n = 21), 18 supported the tree topology with a gene duplication before the cyclostome–gnathostome split (hypothesis A/B), whereas only three supported a tree topology with a gene duplication after the cyclostome–gnathostome split (hypothesis B/C; table 2B). By contrast, none of these families rejected hypothesis A/B at a significant level of {sigma} for log-likelihood, whereas six rejected hypothesis B/C (table 2B).


View this table:
[in this window]
[in a new window]

 
Table 2 Summary of ML Analysis

 
Probabilistic Count of Gene Duplications for the 55 Gene Families
We calculated the numbers of gene duplications for each branch using the probabilistic method (see Materials and Methods; supplementary tables S4 and S5, Supplementary Material online). The total number of Nbef for all 55 families was approximately double the value for Naft (Nbef = 67.4 ± 6.4; Naft = 33.1 ± 10.2; table 3). Counts of discrete numbers of gene duplications in the ML and NJ trees produced similar results (table 3). The tendency of much higher Nbef than Naft was observed in total counts for both family categories, regardless of which cyclostome taxon (hagfishes and/or lampreys) was used (table 3).


View this table:
[in this window]
[in a new window]

 
Table 3 Total Estimated Numbers of Gene Duplications

 
To further characterize this tendency, we analyzed the distribution of Nbef and Naft among all families (fig. 4). For both family categories, the Naft value distributed mostly between 0 and 1.0 (fig. 4B and D). By contrast, the peak in the distribution of Nbef located at 0.5–1.0 for gene families with two gnathostome paralogs (fig. 4A), whereas it located at 1.0–2.0 for gene families with three to five gnathostome paralogs (fig. 4C).


Figure 4
View larger version (26K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 4.— Distributions of estimated numbers of gene duplications before/after the cyclostome–gnathostome split. (A, C) Histograms of the distribution of gene duplications before the cyclostome–gnathostome split. (B, D) Histograms of the distribution of gene duplications after the cyclostome–gnathostome split. (A, B) Results for gene families with two gnathostome paralogs. (C, D) Results for gene families with three to five gnathostome paralogs. For details of Nbef and Naft, see Materials and Methods.

 
The total number of gene duplications in the cyclostome lineage (Ncyc) was estimated at 10.2 when data for both hagfishes and lampreys were used (table 3). However, this value should be an underestimate because Ncyc is applicable only for gene families where multiple cyclostome paralogs are available (only 19 out of the 55 gene families). In the ML trees of eight of these 19 families, Ncyc was more than 0.5, suggesting high possibilities of cyclostome lineage–specific gene duplications (for phylogenetic trees of these gene families, see supplementary fig. S6, Supplementary Material online; for further discussion, see supplementary text S2, Supplementary Material online).

ML Analysis for the 55 Gene Families under the Assumption of the "2R" Duplication Tree Topology
Assumption of the 2R pattern of gene duplications is applicable only to gene families with three or four gnathostome paralogs (for discussion over relevance of 2R genome duplications, see supplementary text S2, Supplementary Material online). For these gene families (n = 33), we evaluated the probabilities of hypotheses A–C by summing the probabilities of tree topologies that were categorized in each hypothesis (see Materials and Methods). The hagfish–lamprey clusters supported by the ML trees were constrained (see supplementary text S2 and table S6, Supplementary Material online). Hypothesis A was supported by 15 gene families with the highest probabilities (table 2A). Hypotheses B and C were supported by nine and six gene families, respectively (table 2A). If a hypothesis with P < 0.05 was regarded as rejected, hypotheses A and C were rejected by two and ten gene families, respectively, whereas none of these families rejected hypothesis B (table 2A).

Analysis of Gene Families with Intensive Taxon Sampling
To gain a more definitive picture for cyclostome gene phylogenies, we focused on the gene families in which multiple cyclostome paralogs have been identified by intensive survey, namely the visual opsin and RAR families (fig. 5). For the former, five visual opsin paralogs are reported for lampreys (Hisatomi et al. 1991Go; Collin et al. 2003Go; Koyanagi et al. 2004Go), whereas gnathostomes generally possess five paralogs, including two paralogs (green and blue opsins) that have been lost secondarily in the mammalian lineage (Jacobs 1993Go). Our molecular phylogenetic study strongly suggested that each lamprey gene clusters with one of those gnathostome paralogs (bootstrap probabilities, 68–100%) (fig. 5A), suggesting that all the gene duplications occurred before the cyclostome–gnathostome split. We found that RHO, OPN1SW, and OPN1LW are located in syntenic regions comprising HSA 3, 7, and X (fig. 5B; see Materials and Methods). In addition, the chicken green opsin gene constitutes another syntenic region that includes the orthologs of PLXNA2, NFASC, WNT2B, LRRN5, MAPKAPK2, and KCND3, all of which are located within a 2.7-Mb region on chicken chromosome 26. This suggests that HSA 1, which harbors orthologs of the genes, is the fourth member of this synteny group.


Figure 5
View larger version (63K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 5.— Phylogenies and synteny groups of vertebrate visual opsin and RAR genes. (A) The ML tree of vertebrate visual opsin genes (280 amino acid sites; shape parameter for gamma distribution {alpha} = 0.92). (B) The visual opsin–related synteny group. (C) The ML tree of RAR genes (292 amino acid sites; {alpha} = 0.49). (D) The RAR-related synteny group. In (A) and (C), cyclostome genes are shown in red. Scale bar indicates 0.1 substitutions per site. Numbers at nodes indicate intact bootstrap probabilities (upper) and Bayesian posterior probabilities (lower). Relationships within the nodes shown with closed circles were constrained. Numbers in parentheses indicate supporting values obtained in analyses in which the hagfish–lamprey relationship was not constrained. In (B) and (D), human homologues derived from a single invertebrate gene are aligned horizontally and shown in matching colors.

 
In the RAR family, three human genes, {alpha}, β, and {gamma}, form a synteny group comprising HSA 17, 3, and 12, respectively (fig. 5D; see Materials and Methods). In this family, only one gene has been reported for cyclostomes (P. marinus RAR gene; U93433 [GenBank] , AY861455 [GenBank] ) (Escriva et al. 2006Go). We conducted an intensive RT–PCR survey of various tissues from cartilaginous fishes (the plownose chimaera, Callorhinchus callorhynchus, and the cloudy catshark, Scyliorhinus torazame), from a nonteleost actinopterygian fish (the Florida spotted gar, Lepisosteus platyrhincus), and from three cyclostome species (the inshore hagfish Eptatretus burgeri, the Japanese lamprey L. japonicum, and the short-headed lamprey Mordacia mordax; see supplementary text S1, Supplementary Material online). As with gnathostomes, we identified three paralogs for each cyclostome species, which were designated as RAR1, RAR2, and RAR3 (AB292622 [GenBank] –AB292624 [GenBank] ). In the ML tree, cyclostomes RAR1 and RAR3 clustered with gnathostomes RAR{gamma} and RAR{alpha}, respectively (fig. 5C). A clustering of cyclostome RAR2 with gnathostome RARβ was not supported in this tree but was supported in the second ML tree ({Delta}logL = 2.03 ± 4.16). Our probabilistic count of gene duplications detected a much higher Nbef than Naft (Nbef = 1.82 ± 0.32, Naft = 0.03 ± 0.18; table 4).


View this table:
[in this window]
[in a new window]

 
Table 4 Timing of Gene Duplications in Conserved Synteny Groups

 
It is worth noting that no evident clusterings of multiple cyclostome paralogs were detected in the ML trees of these families, as indicated by extremely small Ncyc values (Ncyc = 0.00 ± 0.00 and Ncyc = 0.04 ± 0.20, for the visual opsin and RAR families, respectively).

Analysis of Gene Families in Well-Studied Synteny Groups
Our data set contained gene families harbored by other well-characterized synteny groups: the NOTCH and bromodomain-containing BRD families in the major histocompatibility complex (MHC)-related synteny group (Kasahara et al. 1997Go) (table 4) and the amphiphysin and solute carrier family SLC4A families in the Hox-related synteny group (Larhammar et al. 2002Go) (table 4). The latter also contains the Dlx clusters. These synteny groups, conserved between chromosomes, were generated by an ancient genome duplication event (Kasahara et al. 1997Go; Larhammar et al. 2002Go). In the ML trees of these gene families, each cyclostome gene clusters with a gnathostome paralog (see supplementary fig. S7A and B, Supplementary Material online), supporting hypothesis A. Similar results were obtained for members of the RAR-related synteny group, which is extended up to an entire chromosomal level, namely the nuclear receptor–encoding NR1I family and the glucose transporter SLC2A family (table 4). Especially in the NR1I family, the P. marinus vitamin D receptor (VDR) gene clustered with gnathostome VDR genes with very high confidence (bootstrap probability, 98%; see supplementary fig. S7C, Supplementary Material online). Although the tree topology did not allow us to distinguish between hypotheses A and B, this gene family explicitly rejected hypothesis C at a significant level of {sigma} for log-likelihood (see supplementary fig. S7 and table S3, Supplementary Material online). In these gene families, although the bootstrap probabilities for the clusters consisting of a gnathostome paralog and a cyclostome paralog are not uniformly high (39–98%; see supplementary fig. S7, Supplementary Material online), our probabilistic count of gene duplications provided much higher values for Nbef than Naft (table 4), with the exceptions of the PTP LAR/{sigma}/RD and enolase {alpha}/β/{gamma} families.


    Discussion
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 ML Analysis for the...
 Discussion
 Conclusions
 Supplementary Material
 Acknowledgements
 References
 
Are Homeobox-Containing Genes Reliable Landmarks for Dating Genome Duplications?
Our current understanding of the timing of genome duplication events largely depends on analyses of gene repertoires in cyclostomes, especially the Hox, Dlx, and ParaHox genes. Our identification of six Dlx genes in the lamprey, as in other nonteleost gnathostomes (Stock et al. 1996Go; Stock 2005Go), shows that it is misleading to conclude that cyclostomes have a preduplication genomic state, simply based on the fewer number of Dlx genes known previously. In the Gsh family, we identified one lamprey gene, LjGshA, that clusters firmly with the hagfish genes (fig. 3E), suggesting that these cyclostome genes are orthologous to each other. In the Cdx family, our lamprey gene, LjCdxA, was placed close to one of the gnathostome paralogs (fig. 3D). This indicates that the hagfish Cdx gene has evolved too rapidly to reconstruct the clustering with an ortholog in the lamprey or that the lamprey LjCdxA gene is paralogous to the reported hagfish Cdx gene. If the latter is true, lampreys are expected to possess another ParaHox cluster or its remnant in addition to the one already reported for two hagfish species (Furlong et al. 2007Go).

An advantage of studying these landmark clusters in a phylogenetic context is that orthologies of harbored genes are guaranteed by their conserved genomic structures. Thus, we constrained phylogenetic relationships within gnathostome genes that reside in orthologous gene clusters, to focus only on the relationships between gnathostome and cyclostome paralogs. In our phylogenetic tree inferences including the newly identified lamprey genes, none of the five homeobox-containing gene families supported the hypothesis that genome duplications occurred after the cyclostome–gnathostome split (hypothesis C) (table 1 and fig. 3), which was recently supported by ParaHox genes (Furlong et al. 2007Go). In fact, our analysis has shown that these homeobox-containing gene families do not provide enough resolution to reconstruct the phylogenetic relationships even among gnathostomes and outgroups (supplementary fig. S3, Supplementary Material online). This indicates that conclusions on the timing of genome duplications drawn by analyses of these gene families and from other Hox paralogous groups (Force et al. 2002Go; Irvine et al. 2002Go; Fried et al. 2003Go; Stadler et al. 2004Go; Furlong et al. 2007Go), which are less divergent than Hox13 (Ferrier et al. 2000Go), cannot have sufficient level of reliability. This point strongly demonstrates the need for studies using a more reliable data set.

Evaluation of Phylogenetic Trees
In this study, we adopted the ML method, which facilitates "exhaustive" evaluation of every possible tree topology in a solid statistical framework (note that the Bayesian method, which has become increasingly popular, is powerful only in "heuristic" tree search; Felsenstein 1981Go; Holder and Lewis 2003Go). The ML method is relatively robust even when sequence data contain some degree of rate heterogeneity among lineages (Felsenstein 1978Go; Philippe et al. 2005Go), as might be expected in cases involving cyclostome genes, which have often evolved rapidly (Kuraku and Kuratani 2006Go). To estimate the relative timing of gene duplications, we applied a strategy for counting gene duplications in a probabilistic framework (see Materials and Methods). Unlike deterministic "best tree" approaches (e.g., Escriva et al. 2002Go; Putnam et al. 2008Go), from which information concerning alternative tree topologies is discarded, our approach is expected to yield a robust estimate that is not distorted by slight variations in taxon sampling, alignment construction, and model selection and thereby should allow us to extract much more phylogenetic information from the input data. This aspect was especially advantageous in our study where multiple tree topologies tended to be supported with similar log-likelihood values.

Our probabilistic approach produced results similar to those obtained in the total count of discrete numbers of gene duplications in the ML tree and NJ tree (table 3). However—as for each family separately—our approach provides a more realistic overview for the timing of gene duplications on a probabilistic basis. For example, in the ML tree of RAR (fig. 5C), we must assume three gene duplications before the cyclostome–gnathostome split, instead of two (plus a secondary loss of the gnathostome ortholog of cyclostome RAR2 and another secondary loss or unidentification of the cyclostome ortholog of gnathostome RARβ), to produce three gnathostome paralogs. However, our probabilistic count estimated Nbef at 1.82 ± 0.32. In the probabilistic count, the sums for Nbef, Naft, and Nunk for each gene family resulted in values close to the numbers that can produce the sets of gnathostome paralogs with minimum numbers of gene duplications (supplementary table S4, Supplementary Material online), which fits better parsimonious estimation.

Gene Repertoires in Cyclostome Genomes: How Many Paralogs?
In more than half of the gene families analyzed here (36 out of 55), only one reported cyclostome gene was available. This is thought to be mainly because of incomplete identification of cyclostome genes. Our cDNA survey did not only identify the first members for cyclostomes in some families (NOTCH and fringe) but also identify additional members to already reported cyclostome genes in other families (Cdx, Gsh, Dlx, RAR, RXR, and SLC2A). Among these genes, we focused on the RAR family, where we identified three paralogs in L. japonicum (LjRAR1LjRAR3) and performed intensive survey of additional paralogs and their orthologs in M. mordax and E. burgeri. With all the efforts on these three cyclostome species and different kinds of cDNA examined, we ended up with only these three paralogs (fig. 5C). Together with the identification of six Dlx genes in L. japonicum (supplementary fig. S2, Supplementary Material online), our survey has proved that cyclostomes have more redundant gene repertoires than previously thought. This compels a reconsideration of an idea that attributes fewer gene repertoires in cyclostomes to their primitive genomic status.

Timing of Genome Duplications: before the Cyclostome–Gnathostome Split or after?
Results of our ML analysis for 55 families again showed that these genome duplication events are not easily resolved, as suggested by numbers of tree topologies that could not be rejected by statistical analysis (supplementary fig. S5 and table S3, Supplementary Material online). Our analysis using these gene families provided results consistent with the monophyly of cyclostomes, the 2R pattern of genome duplications, and cyclostome lineage–specific gene duplications (discussed in supplementary text S2, Supplementary Material online).

The majority of the 55 families supported hypothesis A with the highest probabilities, regardless of the assumption of the 2R pattern of genome duplications (table 2). It is notable that hypothesis C was rejected with the highest frequency (table 2A). In this respect, gene families with only two gnathostome paralogs showed a clear result: none of them rejected hypothesis A/B, whereas six out of 21 families rejected hypothesis B/C (table 2B). This suggests that hypothesis C is not likely.

In the probabilistic count of gene duplications, the total number of Nbef was significantly larger than that of Naft (table 3; for details of each gene family, see supplementary table S5, Supplementary Material online); as shown in figure 1, if hypothesis B is true, more gene duplications should be detected after the cyclostome–gnathostome split than before, even if we assume that some of the duplicates have been lost secondarily. Moreover, the distributions of Naft did not resemble the unimodal distributions of Nbef but were skewed toward zero (fig. 4B and D). This suggests that a certain portion of Naft originates from statistical fluctuation. Thus, our probabilistic count of gene duplications supports hypothesis A.

Despite the large amount of support for hypothesis A, three of the 55 gene families in our data set explicitly supported hypothesis B or C and rejected hypothesis A (PACSIN 1/2, PTP R2, and PTP RM/RK/RT/RL families; see also supplementary table S3, Supplementary Material online). These might have been caused by a family-specific and small-scale (such as tandem) duplication event, which was not fully excluded at the step of selecting gene families (see Materials and Methods), or by methodological artifacts resulting from an inexhaustive data set, such as incomplete gene identification in cyclostomes. In the latter case, an unidentified cyclostome gene that has evolved less rapidly might cluster with one of multiple gnathostome paralogs. For example, in the Cdx family (fig. 3D), our lamprey sequence, LjCdxA, clustered with gnathostome Cdx4, suggesting that a time estimation of gene duplications based solely on one rapidly evolving cyclostome gene (hagfish Cdx genes, in this case) is sometimes misleading (for other examples, see supplementary fig. S6C and E, Supplementary Material online).

For better identification of orthology/paralogy and more precise estimation of the timing of gene duplication, we focused on two gene families, those for visual opsins and RARs, where intensive gene sampling has been performed. In both of these families, we proved that multiple gnathostome paralogs were duplicated through large-scale duplication events (fig. 5B and D) and that those events occurred before the cyclostome–gnathostome split (fig. 5A and C). It is unlikely that the scattered placement of multiple cyclostome paralogs in the trees of visual opsins and RAR family were produced by artifact, such as parallel sequence evolution between a gnathostome paralog and a cyclostome paralog. This is an unmistakable evidence for hypothesis A and suggests that the immediate common ancestor of cyclostomes and gnathostomes shared a similar level of redundancy in gene repertoires introduced by a large-scale duplication event.

The gene families shown in table 4 also provide a consistent result that gene duplications occurred before the cyclostome–gnathostome split, although gene sampling in cyclostomes is not thorough. Provided that gene families in the same synteny groups have shared their evolutionary history, this line of evidence suggests that the Hox and Dlx clusters contained in these synteny groups also duplicated before the cyclostome–gnathostome split, though they themselves did not provide sufficient resolution. Under the assumption that early vertebrates experienced only two rounds of genome duplications as reported (e.g., Dehal and Boore 2005Go), our analysis suggests that both of these rounds occurred before the cyclostome–gnathostome split.


    Conclusions
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 ML Analysis for the...
 Discussion
 Conclusions
 Supplementary Material
 Acknowledgements
 References
 
Our investigation, which exceeded previous studies in the amount of data and precision of methods, yielded no solid evidence of hypotheses previously proposed (hypothesis B and C) but provided consistent evidence that the last common ancestor of the extant vertebrates—including hagfishes and lampreys—emerged subsequent to the genome duplications that can be regarded as a genomic synapomorphy of all extant vertebrates (hypothesis A). Importantly, our approach and our conclusion have not been influenced by controversies over the monophyly of cyclostomes, the patterns of genome duplication (2R or not) and additional genome duplication(s) specific to the cyclostome lineage (see supplementary text S2, Supplementary Materials online). If our conclusion is correct, "primitive" morphological traits in cyclostomes may be attributable to secondary modification of their gene repertoires or changes in gene function and regulation introduced secondarily in either of the gnathostome or cyclostome lineages (see Shigetani et al. 2002Go, for an example), rather than to incomplete gene repertoires at a preduplication state of cyclostome genomes. Above all, this proposed scenario underlines the need to maintain awareness of the presence of more redundant cyclostome gene repertoires than previously thought. We also advocate caution in determining orthologies between gnathostomes and cyclostomes that might be the bases for the reasonable phylogenetic evaluation of phenotypic transitions from pre-vertebrates to vertebrates at the molecular level.


    Supplementary Material
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 ML Analysis for the...
 Discussion
 Conclusions
 Supplementary Material
 Acknowledgements
 References
 
Supplementary text S1 and S2, data sets S1 and S2, figs. S1S7, and tables S1S6 are available at Molecular Biology and Evolution online (http://www.mbe.oxfordjournal.org/). Accession numbers: Sequences identified in this study are deposited in GenBank (URL: http://www.ncbi.nlm.nih.gov/Genbank) under accession numbers AB292620 [GenBank] –AB292646 [GenBank] , AB368820 [GenBank] , and AB368821.


    Acknowledgements
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 ML Analysis for the...
 Discussion
 Conclusions
 Supplementary Material
 Acknowledgements
 References
 
We thank S. Moriyama, A. Takahashi, M. Nozaki, H. Kawauchi, K. Tamura, H. Aono, K. G. Ota, N. Iwabe, T. Miyata, and R. Kusakabe for animal tissues or nucleic acids and K. Katoh for critical reading of manuscript and valuable discussions. Our gratitude extends to anonymous reviewers for constructive comments and suggestions.


    Footnotes
 
Billie Swalla, Associate Editor


    References
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 ML Analysis for the...
 Discussion
 Conclusions
 Supplementary Material
 Acknowledgements
 References
 

    Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res (1997) 25:3389–3402.[Abstract/Free Full Text]

    Bisbee CA, Baker MA, Wilson AC, Haji-Azimi I, Fischberg M. Albumin phylogeny for clawed frogs (Xenopus). Science (1977) 195:785–787.[Abstract/Free Full Text]

    Collin SP, Knight MA, Davies WL, Potter IC, Hunt DM, Trezise AE. Ancient colour vision: multiple opsin genes in the ancestral vertebrates. Curr Biol (2003) 13:R864–R865.[CrossRef][Web of Science][Medline]

    Cracraft J, Donoghue MJ. Assembling the tree of life (2004) Oxford: Oxford University Press.

    Crow KD, Stadler PF, Lynch VJ, Amemiya C, Wagner GP. The "fish-specific" Hox cluster duplication is coincident with the origin of teleosts. Mol Biol Evol (2006) 23:121–136.[Abstract/Free Full Text]

    Dehal P, Boore JL. Two rounds of whole genome duplication in the ancestral vertebrate. PLoS Biol (2005) 3:e314.[CrossRef][Medline]

    Donoghue PC, Purnell MA. Genome duplication, extinction and vertebrate evolution. Trends Ecol Evol (2005) 20:312–319.[CrossRef][Medline]

    Duméril AMC. Zoologie analytique, ou méthode naturelle de classification des animaux, rendue plus facile a l'Aide de Tableaux synoptiques par (1806) Paris: Allais.

    Escriva H, Bertrand S, Germain P, Robinson-Rechavi M, Umbhauer M, Cartry J, Duffraisse M, Holland L, Gronemeyer H, Laudet V. Neofunctionalization in vertebrates: the example of retinoic acid receptors. PLoS Genet (2006) 2:e102.[CrossRef][Medline]

    Escriva H, Manzon L, Youson J, Laudet V. Analysis of lamprey and hagfish genes reveals a complex history of gene duplications during early vertebrate evolution. Mol Biol Evol (2002) 19:1440–1450.[Abstract/Free Full Text]

    Felsenstein J. Cases in which parsimony or compatibility methods will be positively misleading. Syst Zool (1978) 27:401–410.[Abstract/Free Full Text]

    Felsenstein J. Evolutionary trees from DNA sequences: a maximum likelihood approach. J Mol Evol (1981) 17:368–376.[CrossRef][Web of Science][Medline]

    Ferrier DE, Minguillon C, Holland PW, Garcia-Fernandez J. The amphioxus Hox cluster: deuterostome posterior flexibility and Hox14. Evol Dev (2000) 2:284–293.[CrossRef][Web of Science][Medline]

    Force A, Amores A, Postlethwait JH. Hox cluster organization in the jawless vertebrate Petromyzon marinus. J Exp Zool (2002) 294:30–46.[CrossRef][Web of Science][Medline]

    Forey P, Janvier P. Agnathans and the origin of jawed vertebrates. Nature (1993) 361:129–134.[CrossRef]

    Forey PL. Yet more reflections on agnathan-gnathostome relationships. J Vert Paleont (1984) 4:330–343.

    Fried C, Prohaska SJ, Stadler PF. Independent Hox-cluster duplications in lampreys. J Exp Zool B Mol Dev Evol (2003) 299:18–25.[Medline]

    Furlong RF, Younger R, Kasahara M, Reinhardt R, Thorndyke M, Holland PW. A degenerate ParaHox gene cluster in a degenerate vertebrate. Mol Biol Evol (2007) 24:2681–2686.[Abstract/Free Full Text]

    Haitina T, Klovins J, Takahashi A, Lowgren M, Ringholm A, Enberg J, Kawauchi H, Larson ET, Fredriksson R, Schioth HB. Functional characterization of two melanocortin (MC) receptors in lamprey showing orthology to the MC1 and MC4 receptor subtypes. BMC Evol Biol (2007) 7:101.[CrossRef][Medline]

    Hisatomi O, Iwasa T, Tokunaga F, Yasui A. Isolation and characterization of lamprey rhodopsin cDNA. Biochem Biophys Res Commun (1991) 174:1125–1132.[CrossRef][Web of Science][Medline]

    Hoegg S, Brinkmann H, Taylor JS, Meyer A. Phylogenetic timing of the fish-specific genome duplication correlates with the diversification of teleost fish. J Mol Evol (2004) 59:190–203.[CrossRef][Web of Science][Medline]

    Holder M, Lewis PO. Phylogeny estimation: traditional and Bayesian approaches. Nat Rev Genet (2003) 4:275–284.[CrossRef][Web of Science][Medline]

    Holland PW, Garcia-Fernandez J, Williams NA, Sidow A. Gene duplications and the origins of vertebrate development. Dev Suppl (1994) 1994:125–133.

    Hufton AL, Groth D, Vingron H, Lehrach AJ, Poustka M, Panopoulou G, Forthcoming. Early vertebrate whole genome duplications were predated by a period of intense genome rearrangement. Genome Res (2008) 18:1582–1591.[Abstract/Free Full Text]

    Irvine SQ, Cangiano MC, Millette BJ, Gutter ES. Non-overlapping expression patterns of the clustered Dll-A/B genes in the ascidian Ciona intestinalis. J Exp Zool B Mol Dev Evol (2007) 308:428–441.[Medline]

    Irvine SQ, Carr JL, Bailey WJ, Kawasaki K, Shimizu N, Amemiya CT, Ruddle FH. Genomic analysis of Hox clusters in the sea lamprey Petromyzon marinus. J Exp Zool (2002) 294:47–62.[CrossRef][Web of Science][Medline]

    Jacobs GH. The distribution and nature of colour vision among the mammals. Biol Rev Camb Philos Soc (1993) 68:413–471.[Medline]

    Kasahara M. The 2R hypothesis: an update. Curr Opin Immunol (2007) 19:547–552.[CrossRef][Web of Science][Medline]

    Kasahara M, Nakaya J, Satta Y, Takahata N. Chromosomal duplication and the emergence of the adaptive immune system. Trends Genet (1997) 13:90–92.[CrossRef][Web of Science][Medline]

    Katoh K, Kuma K, Toh H, Miyata T. MAFFT version 5: improvement in accuracy of multiple sequence alignment. Nucleic Acids Res (2005) 33:511–518.[Abstract/Free Full Text]

    Kawakoshi A, Hyodo S, Nozaki M, Takei Y. Identification of a natriuretic peptide (NP) in cyclostomes (lamprey and hagfish): cNP-4 is the ancestral gene of the NP family. Gen Comp Endocrinol (2006) 148:41–47.[CrossRef][Web of Science][Medline]

    Kishino H, Miyata T, Hasegawa M. Maximum likelihood inference of protein phylogeny and the origin of chloroplasts. J Mol Evol (1990) 30:151–160.

    Koyanagi M, Kawano E, Kinugawa Y, Oishi T, Shichida Y, Tamotsu S, Terakita A. Bistable UV pigment in the lamprey pineal. Proc Natl Acad Sci USA (2004) 101:6687–6691.[Abstract/Free Full Text]

    Kuraku S, Kuratani S. Time Scale for cyclostome evolution inferred with a phylogenetic diagnosis of hagfish and lamprey cDNA sequences. Zool Sci (2006) 23:1053–1064.[CrossRef][Web of Science][Medline]

    Kuraku S, Takio Y, Tamura K, Aono H, Meyer A, Kuratani S. Noncanonical role of Hox14 revealed by its expression patterns in lamprey and shark. Proc Natl Acad Sci USA (2008) 105:6679–6683.[Abstract/Free Full Text]

    Larhammar D, Lundin LG, Hallbook F. The human Hox-bearing chromosome regions did arise by block or chromosome (or even genome) duplications. Genome Res (2002) 12:1910–1920.[Abstract/Free Full Text]

    Lundin LG, Larhammar D, Hallbook F. Numerous groups of chromosomal regional paralogies strongly indicate two genome doublings at the root of the vertebrates. J Struct Funct Genomics (2003) 3:53–63.[CrossRef][Medline]

    Maisey JG. Heads and tails: a chordate phylogeny. Cladistics (1986) 2:201–256.

    McLysaght A, Hokamp K, Wolfe KH. Extensive genomic duplication during early chordate evolution. Nat Genet (2002) 31:200–204.[CrossRef][Web of Science][Medline]

    Meyer A, Zardoya R. Recent advances in the (molecular) phylogeny of vertebrates. Annu Rev Ecol Evol Syst (2003) 34:311–338.[CrossRef]

    Miyata T, Suga H. Divergence pattern of animal gene families and relationship with the Cambrian explosion. Bioessays (2001) 23:1018–1027.[CrossRef][Web of Science][Medline]

    Neidert AH, Virupannavar V, Hooker GW, Langeland JA. Lamprey Dlx genes and early vertebrate evolution. Proc Natl Acad Sci USA (2001) 98:1665–1670.[Abstract/Free Full Text]

    Nordstrom K, Larsson TA, Larhammar D. Extensive duplications of phototransduction genes in early vertebrate evolution correlate with block (chromosome) duplications. Genomics (2004) 83:852–872.[CrossRef][Web of Science][Medline]

    Ohno S. Evolution by gene duplication (1970) New York: Springer-Verlag.

    Ota KG, Kuraku S, Kuratani S. Hagfish embryology with reference to the evolution of the neural crest. Nature (2007) 446:672–675.[CrossRef][Web of Science][Medline]

    Pendleton JW, Nagai BK, Murtha MT, Ruddle FH. Expansion of the Hox gene family and the evolution of chordates. Proc Natl Acad Sci USA (1993) 90:6300–6304.[Abstract/Free Full Text]

    Philippe H, Zhou Y, Brinkmann H, Rodrigue N, Delsuc F. Heterotachy and long-branch attraction in phylogenetics. BMC Evol Biol (2005) 5:50.[CrossRef][Medline]

    Putnam NH, Butts T, Ferrier DE, et al. The amphioxus genome and the evolution of the chordate karyotype. Nature (2008) 453:1064–1071. (37 co-authors).[CrossRef][Web of Science][Medline]

    Ronquist F, Huelsenbeck JP. MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics (2003) 19:1572–1574.[Abstract/Free Full Text]

    Saitou N, Nei M. The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol (1987) 4:406–425.[Abstract]

    Schmidt HA, Strimmer K, Vingron M, von Haeseler A. TREE-PUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing. Bioinformatics (2002) 18:502–504.[Abstract/Free Full Text]

    Sharman AC, Holland PW. Estimation of Hox gene cluster number in lampreys. Int J Dev Biol (1998) 42:617–620.[Web of Science][Medline]

    Shigetani Y, Sugahara F, Kawakami Y, Murakami Y, Hirano S, Kuratani S. Heterotopic shift of epithelial-mesenchymal interactions in vertebrate jaw evolution. Science (2002) 296:1316–1319.[Abstract/Free Full Text]

    Shimodaira H, Hasegawa M. CONSEL: for assessing the confidence of phylogenetic tree selection. Bioinformatics (2001) 17:1246–1247.[Abstract/Free Full Text]

    Sidow A. Gen(om)e duplications in the evolution of early vertebrates. Curr Opin Genet Dev (1996) 6:715–722.[CrossRef][Web of Science][Medline]

    Spring J. Vertebrate evolution by interspecific hybridisation—are we polyploid? FEBS Lett (1997) 400:2–8.[CrossRef][Web of Science][Medline]

    Stadler PF, Fried C, Prohaska SJ, Bailey WJ, Misof BY, Ruddle FH, Wagner GP. Evidence for independent Hox gene duplications in the hagfish lineage: a PCR-based gene inventory of Eptatretus stoutii. Mol Phylogenet Evol (2004) 32:686–694.[CrossRef][Web of Science][Medline]

    Steinke D, Hoegg S, Brinkmann H, Meyer A. Three rounds (1R/2R/3R) of genome duplications and the evolution of the glycolytic pathway in vertebrates. BMC Biol (2006) 4:16.[CrossRef][Medline]

    Stock DW. The Dlx gene complement of the leopard shark, Triakis semifasciata, resembles that of mammals: implications for genomic and morphological evolution of jawed vertebrates. Genetics (2005) 169:807–817.[Abstract/Free Full Text]

    Stock DW, Ellies DL, Zhao Z, Ekker M, Ruddle FH, Weiss KM. The evolution of the vertebrate Dlx gene family. Proc Natl Acad Sci USA (1996) 93:10858–10863.[Abstract/Free Full Text]

    Venkatesh B, Kirkness EF, Loh YH, et al, (12 co-authors). Survey sequencing and comparative analysis of the elephant shark (Callorhinchus milii) genome. PLoS Biol (2007) 5:e101.[CrossRef][Medline]

Accepted for publication September 27, 2008.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Brief Funct Genomic ProteomicHome page
K. Nowick and L. Stubbs
Lineage-specific transcription factors and the evolution of gene regulatory networks
Briefings in Functional Genomics, January 16, 2010; (2010) elp056v1.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
L. Yu, X.-y. Wang, W. Jin, P.-t. Luan, N. Ting, and Y.-p. Zhang
Adaptive Evolution of Digestive RNASE1 Genes in Leaf-Eating Monkeys Revisited: New Insights from Ten Additional Colobines
Mol. Biol. Evol., January 1, 2010; 27(1): 121 - 131.
[Abstract] [Full Text] [PDF]


Home page
Phil Trans R Soc BHome page
D. Larhammar, K. Nordstrom, and T. A. Larsson
Evolution of vertebrate rod and cone phototransduction genes
Phil Trans R Soc B, October 12, 2009; 364(1531): 2867 - 2880.
[Abstract] [Full Text] [PDF]


Home page
Phil Trans R Soc BHome page
Y. Shichida and T. Matsuyama
Evolution of opsins and phototransduction
Phil Trans R Soc B, October 12, 2009; 364(1531): 2881 - 2895.
[Abstract] [Full Text] [PDF]


Home page
Integr. Comp. Biol.Home page
G. Zhang
An evo-devo view on the origin of the backbone: evolutionary development of the vertebrae
Integr. Comp. Biol., August 1, 2009; 49(2): 178 - 186.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
I. Braasch, J.-N. Volff, and M. Schartl
The Endothelin System: Evolution of Vertebrate-Specific Ligand-Receptor Interactions by Three Rounds of Genome Duplication
Mol. Biol. Evol., April 1, 2009; 26(4): 783 - 799.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Supplementary Data
Right arrow An erratum has been published
Right arrow All Versions of this Article:
26/1/47    most recent
msn222v2
msn222v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Kuraku, S.
Right arrow Articles by Kuratani, S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Kuraku, S.
Right arrow Articles by Kuratani, S.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?