MBE Advance Access originally published online on March 17, 2006
Molecular Biology and Evolution 2006 23(6):1192-1202; doi:10.1093/molbev/msk003
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Research Article |
Asymmetric Evolution in Two Fish-Specifically Duplicated Receptor Tyrosine Kinase Paralogons Involved in Teleost Coloration

* Lehrstuhl für Zoologie und Evolutionsbiologie, Department of Biology, University of Konstanz, Konstanz, Germany; and
Center for Junior Research Fellows, University of Konstanz, Konstanz, Germany
E-mail: axel.meyer{at}uni-konstanz.de.
| Abstract |
|---|
|
|
|---|
The occurrence of a fish-specific genome duplication (FSGD) in the lineage leading to teleost fishes is widely accepted, but the consequences of this event remain elusive. Teleosts, and the cichlid fishes from the species flocks in the East African Great Lakes in particular, evolved a unique complexity and diversity of body coloration and color patterning. Several genes involved in pigment cell development have been retained in duplicate copies in the teleost genome after the FSGD. Here we investigate the evolutionary fate of one of these genes, the type III receptor tyrosine kinase (RTK) colony-stimulating factor 1 receptor (csf1r). We isolated and shotgun sequenced two paralogous csf1r genes from a bacterial artificial chromosome library of the cichlid fish Astatotilapia burtoni that are both linked to paralogs of the pdgfrß gene, another type III RTK. Two pdgfrß-csf1r paralogons were also identified in the genomes of pufferfishes and medaka, and our phylogenetic analyses suggest that the pdgfrß-csf1r locus was duplicated during the course of the FSGD. Comparisons of teleosts and tetrapods suggest asymmetrical divergence at different levels of genomic organization between the teleost-specific pdgfrß-csf1r paralogons, which seem to have evolved as coevolutionary units. The high-evolutionary rate in the teleost B-paralogon, consisting of csf1rb and pdgfrßb, further suggests neofunctionalization by functional divergence of the extracellular, ligand-binding region of these cell-surface receptors. Finally, we hypothesize that genome duplications and the associated expansion of the RTK family might be causally linked to the evolution of coloration in vertebrates and teleost fishes in particular.
Key Words: genome duplication fish pigmentation csf1r pdgfrß cichlid
| Introduction |
|---|
|
|
|---|
Gene and genome duplications are considered major evolutionary forces adding to the genetic material that may be necessary for increasing the genomic and phenotypic complexity of vertebrates (Ohno 1970
Here, we present a comparative genomic study of two pgdfrß-colony-stimulating factor 1 receptor (csf1r) paralogons in the teleost lineage. These genes as well as the anciently paralogous gene tandem pdgfr
-kit belong to the family of type III receptor tyrosine kinases (RTKs) (Rousset et al. 1995
). We hypothesized that the type III RTK tandems represent an excellent setup for the study of the genomic consequences of large-scale duplication events in general and the FSGD in particular. Specifically, we were interested in the evolutionary fates of the two pgdfrß-csf1r paralogons after the FSGD with respect to evolutionary rates, exon/intron structure, synteny, genomic expansions/compactions, and the degree of conservation in noncoding regions. The reasons for choosing the two pgdfrß-csf1r paralogons were: (i) all four gene copies of the tandem, and the gene order within each paralogon, have been retained since the FSGD; (ii) type III RTK paralogons are linked to the ParaHox clusters, which contain developmentally important genes and which arose by the same succession of genome duplications in vertebrates (Spring 2002
; Ferrier et al. 2005
); (iii) comparative genomic data from other organisms, particularly from other teleost species, are available; and (iv) the genes have been well characterized, i.e., functional data are available for zebrafish and mouse demonstrating that members of the type III RTKs are highly interesting with respect to the evolution of pigmentation in the vertebrate lineage. For example, kit and csf1r have similar functions for the development of different neural crest-derived pigment cell types. While kit is essential for the development of the dark melanocytes in mammals and zebrafish (Parichy et al. 1999
), csf1r promotes the development of yellow xanthophores in zebrafish (Parichy et al. 2000
; Parichy and Turner 2003
). In view of several examples of other duplicated "pigmentation genes" in teleosts such as mitf (Altschmied et al. 2002
), sox10 (Lang et al. 2005b
), and tyrosinase (Hoegg et al. 2004
), the fact that teleosts possess more pigment cell types than tetrapods (Bagnara 1998
), the unique diversity of coloration and color patterning in teleosts, and the importance of pigmentation for speciation in teleosts as signaling cue for mate and species recognition (e.g., Salzburger et al. 2006
), we also investigated the pdgfrß-csf1r locus with regard to pigmentation. We chose a cich-lid fish for this study, as cichlids are well-suited models to examine teleost coloration within an evolutionary framework. For example, sexual selection on diverse male color patterns is considered to be a major factor promoting the explosive speciation and adaptive radiation in East African cichlids (Kocher 2004
; Salzburger and Meyer 2004
).
We isolated and shotgun sequenced two clones representing different pdgfrß-csf1r paralogons from a bacterial artificial chromosome (BAC) library of the haplochromine cichlid Astatotilapia burtoni (Lang et al. 2005b
). We identified two pdgfrß-csf1r paralogons in the genomes of medaka and pufferfishes, and phylogenetic reconstructions led us to conclude that the pdgfrß-csf1r locus was duplicated during the FSGD. We found pronounced differences in rates of molecular evolution between the two pdgfrß-csf1r paralogons prior to the divergence of cichlid, medaka, and pufferfishes at different levels of their genomic organization. Our results suggest asymmetrical coevolution of these paralogons resulting, most likely, in the evolution of new gene functions (neofunctionalization) in the entire B-paralogon (pdgfrßb and csf1rb). Furthermore, our results provide implications for the evolution of teleost coloration.
| Materials and Methods |
|---|
|
|
|---|
Identification of csf1r Genes in the Genomes of Cichlid and Other Teleosts
Based on the alignment of all available teleost csf1r sequences, we designed degenerate primer sets (Supplementary Table S1, Supplementary Material online) to amplify fragments of csf1r paralogs from genomic DNA of the cichlid A. burtoni as well as from other teleost fishes.
BAC Library Screening
Fragments of csf1r paralogs were polymerase chain reaction (PCR) amplified from genomic DNA of A. burtoni using specific primer sets for csf1ra (Fms-F-2961/Fms-R-3199 and Burt-Fms-F-1986/Burt-Fms-R-2186; Supplementary Table S1, Supplementary Material online) and specific PCR conditions for csf1rb (Fms2-F-1962/Fms2-R-2269; annealing temperature, 66°C; Supplementary Table S1, Supplementary Material online). Purified PCR products were labeled with the NEBlot Phototope Kit (NEB). A BAC library of A. burtoni (Lang et al. 2005b
) was then screened by hybridization of high-density filters with the probes according to the Phototope-Star Detection Kit (NEB). Positive clones were further characterized by PCR screening, sequencing, and restriction fragment patterning. Insert sizes were determined by NotI digestions.
Shotgun Sequencing and Contig Assembly
BAC DNA was obtained using the QIAGEN Large Construct Kit and sheared by sonication. The size fraction of 23 kb was blunt-end ligated into pUC18 vector (Roche) and electrotransformed into ElectroMAX DH10B T1 phage resistant cells (Invitrogen, Karlsruhe, Germany). Subclones were grown in selective medium; plasmid DNA was recovered by standard methods and sequenced directly from both ends with M13 standard primers on an ABI3100 sequencer (Applied Biosystems, Darmstadt, Germany). Sequences were quality-trimmed with PHRED (http://www.phrap.org). The Sequencher software (Gene Codes) was used to trim vector ends and for sequence assembly. Large-scale sequencing was performed until coverage of approximately 4x was obtained. Remaining gaps were closed with gap spanning primer sets so that we finally obtained a continuous sequence assembly with each base pair sequenced at least twice.
Sequence Annotation
Genomic structures of type III RTKs in cichlid were determined by sequence comparisons with the pufferfish Takifugu rubripes (Williams, Brenner, and Venkatesh 2002
) and through cDNA sequencing. Additional genes were predicted with GENSCAN (http://genes.mit.edu/GENSCAN.html) and repetitive elements were identified with RepeatMasker (http://www.repeatmasker.org/). We found orthologous loci in the genomes of the spotted green pufferfish (Tetraodon nigroviridis) (http://www.genoscope.cns.fr/externe/tetranew/) and medaka (Oryzias latipes) (http://dolphin.lab.nig.ac.jp/medaka/) by BLAT/Blast searches and deduced the sequences of their csf1r and pdgfrß paralogs by sequence comparisons to A. burtoni and Takifugu under inclusion of available express sequence tag sequences. The csf1ra-locus of zebrafish could not be included in genomic analyses due to an apparent misassembly of that region in the current zebrafish genome release (Zv4; http://www.ensembl.org/Danio_rerio/).
Sequence Analyses
Phylogenetic reconstructions of type III RTKs were performed with PAUP* (Swofford 2000
) using the maximum likelihood method (general time-reversible model, with a gamma substitution correction and a proportion of invariable sites; model parameters estimated from the data). For relative rate analyses, we applied the nonparametric rate test as implemented in MEGA3 (Kumar, Tamura, and Nei 2004
) and compared paralogs of csf1r as well as pdgfrß from teleosts with their murine ortholog as described in Van de Peer et al. (2001)
. Additionally, we performed local molecular clock tests with HyPhy (Pond, Frost, and Muse 2005
) using the GY94 codon model. To compare the BAC contigs of cichlid with those of medaka, pufferfishes, frog, chicken, mouse, and human, we downloaded the respective genomic sequences (Supplementary Table S2, Supplementary Material online) and aligned them applying the MLAGAN algorithm in mVISTA (http://genome.lbl.gov/vista/index.shtml). Conserved noncoding sequences (CNSs) were then analyzed using the MatInspector software (http://www.genomatix.de/) for the presence of putative binding sites for transcription factors that might be involved in pigment cell development such as Wnt (Dorsky, Moon, and Raible 1998
), Sox10 (Dutton et al. 2001
), Tfap2a (Knight et al. 2003
), Pax3 (Lang et al. 2005a
), and Mitf (Béjar, Hong, and Schartl 2003
). We used the DIVERGE software (Gu and Vander Velden 2002
) to identify amino acid residues that are likely to be involved in functional divergence after the duplication of Csf1r and Pdgfrß paralogs. Based on neighbor-joining trees constructed with DIVERGE, paralogous clades were selected. Posterior probabilities of functional divergence were estimated for each alignment position and assigned to the protein domain structure as obtained from PFAM (http://www.sanger.ac.uk/Software/Pfam/).
Analysis of Gene Expression
Total mRNA from various tissues of adult male and female of A. burtoni were extracted with TRIZOL LS (Invitrogen, Karlsruhe, Germany). First strand cDNA synthesis was mediated by SUPERSCRIPT II reverse transcriptase (Invitrogen, Karlsruhe, Germany). Reverse transcriptase-PCR (RT-PCR) experiments were performed with specific primer sets for csf1ra, csf1rb, pdgfrßa, and pdgfrßb (Supplementary Table S1, Supplementary Material online). RT-PCRs targeting gapdh were used as a control for the quality of the cDNA. All primers spanned at least one intron.
| Results |
|---|
|
|
|---|
Presence of Two pdgfrß-csf1r Loci in the Cichlid Genome
As two csf1r paralogs were found previously in the pufferfish T. rubripes (Williams, Brenner, and Venkatesh 2002
Duplication of the pdgfrß-csf1r Locus in Teleost Fish
Both csf1r paralogs are located downstream of pdgfrß genes, termed pdgfrßa and pdgfrßb in the following, just as in Takifugu and in the single locus of human and mouse (Williams, Brenner, and Venkatesh 2002
). A maximum likelihood phylogeny of type III RTKs including representative members of this gene family confirms that the obtained RTKs are duplicated copies of csf1r and pdgfrß (fig. 1A). The topology of the tree further suggests that the duplications of csf1r and pdgfrß in cichlid and pufferfish are the results of the FSGD (fig. 1A).
|
We also found two pdgfrß-csf1r loci in the current genome assemblies of the spotted green pufferfish and of medaka, but not in zebrafish, and obtained additional csf1r sequences from other teleosts from GenBank and using our degenerate primer sets for csf1r (Supplementary Tables 13, Supplementary Material online). The phylogenies of csf1r (fig. 1B) and pdgfrß genes (fig. 1C) are consistent with a duplication of these genes in the course of the FSGD.
Faster Evolution of B-Paralogon than of A-Paralogon RTKs
Nonparametric relative rate tests with MEGA revealed that both type III RTK genes from the B-paralogons evolved with a significantly faster rate of molecular evolution than their paralogs from the A-paralogons, both at the nucleotide level as well as at the protein level (table 1). Local molecular clock tests performed with HyPhy confirmed a faster rate in B-paralogon RTKs (P < 0.001) but suggested that the synonymous mutation rate is similar between A- and B-paralogons. Thus, it appears that not the neutral mutation rate but the selection pressure is differing between the two paralogons.
|
Intron Accumulation in B-Paralogon RTKs
In cichlid, medaka, and Tetraodon, pdgfrßb and csf1rb have additional introns compared to the A-paralogon genes. pdgfrßa and csf1ra both contain 21 exons, while pdgfrßb and csf1rb have 26 and 22 exons, respectively (Supplementary Figure S1, Supplementary Material online). An identical situation has been reported for Takifugu (Williams, Brenner, and Venkatesh 2002
Less Synteny in the B-Paralogon with the Single Tetrapod Locus
In the four teleost A-paralogons, several genes were predicted in the surrounding of the pdgfrßa-csf1ra gene tandem (fig. 2). In all B-paralogons, an enteropeptidase gene was predicted downstream of csf1rb. The comparison of the teleost paralogons to the single tetrapod loci shows that the teleost A-paralogon has retained a higher level of synteny with its tetrapod semiorthologs (fig. 2). The array of the four genes cdx1-pdgfrß-csf1r-A630042L21RIK was found for the tetrapod loci and for all teleost A-paralogons. In contrast, the synteny of the teleost B-paralogons with the tetrapod loci is restricted to the tandem array of pdgfrß-csf1r. Thus, it seems that more genomic rearrangements have occurred in the B-paralogon after duplication in teleosts, while the A-paralogon has retained more similarity with the ancestral region. Furthermore, in Tetraodon, the pdgfrß-csf1r paralogons are located on chromosome 1 and chromosome 7 (fig. 2). These two chromosomes are highly syntenic, containing over 25 paralogous gene pairs (Jaillon et al. 2004
; Woods et al. 2005
), which provides further evidence for the duplication of the pdgfrß-csf1r in the course of the FSGD.
|
Preferential Accumulation of Repetitive Elements in the B-Paralogon
By comparing both paralogons of A. burtoni, it became obvious that the B-paralogon is substantially larger compared to the A-paralogon, which is likely to result from the accumulation of repetitive elements in the B-paralogon (18.6% vs. 9.2% of their length is made up of repetitive elements; Supplementary Table S2, Supplementary Material online). Accumulation of repetitive elements particularly in the B-paralogon and an increase in length was also found in medaka but not in pufferfishes.
Less Conservation of Noncoding Regions in the B-Paralogon
Phylogenetic footprinting using VISTA revealed no sequence similarity in noncoding regions between both cichlid paralogons (not shown). Yet, no differences in gene expression among duplicated type III RTKs were apparent in cichlids (Supplementary Figure S2, Supplementary Material online). Next, we compared each cichlid paralogon individually with its orthologous and semiorthologous loci in other fish species and tetrapods, respectively (fig. 3). There was virtually no sequence similarity detected in noncoding regions between fishes and tetrapods for both paralogons. Interestingly, the overall sequence similarity among fishes was higher in the A-paralogon than in the B-paralogon in coding as well as in noncoding regions (fig. 3). Furthermore, we identified several CNSs in the surrounding of the pdgfrß-csf1r tandems with potential function in pigment cell transcription as they contain putative binding sites for transcription factors that are involved in pigment cell development (fig. 3).
|
Functional Divergence Between csf1r and pdgfrß Paralogs
One consequence of functional divergence after duplication is that some residues might be subject to altered functional constraints. This would lead to differences in evolutionary rates at these sites between paralogous proteins, i.e., type I functional divergence (Gu 1999
), which is defined as the probability that the rates in two clades are different enough between sites to be statistically independent (Gu and Vander Velden 2002
= 0 suggests a significant change of evolutionary rates at some sites between the two clades (Gu and Vander Velden 2002
> 0) has occurred in both type III RTKs after duplication with values of
Csf1r = 0.298 ± 0.086 (P < 0.01) and
Pdgfrß = 0.426 ± 0.110 (P < 0.001). The posterior probabilities of being functional divergence related for each site in the alignments are depicted in figure 4A for Csf1r and in figure 4B for Pdgfrb. We determined individual cutoff values for significance for both type III RTKs by removing residues with the highest posterior probability until
> 0 was no longer significant (Csf1r: 0.607; Pdgfrß: 0.636). Residues above this value are likely to contribute significantly to functional divergence between paralogs (Wang and Gu 2001
|
| Discussion |
|---|
|
|
|---|
Genome Duplications and the Expansion of the RTK Multigene Family
The multigene family of type III RTKs has been expanded substantially by gene and especially genome duplications during vertebrate evolution (Rousset et al. 1995
-kit and the pdgfrß-csf1r locus found in gnathostome genomes (see also Rousset et al. 1995
|
Asymmetric Coevolution in pdgfrß-csf1r Paralogons
After the duplication in the fish lineage, the pdgfrß-csf1r paralogons evolved at different rates of molecular evolution. The entire B-paralogon evolved faster than the A-paralogon, which becomes obvious at five levels of genomic organization:
- (i) pdgfrßb and csf1rb coding sequences evolve significantly faster than pdgfrßa and csf1ra, respectively, as revealed by nonparametric relative rate tests in cichlids and the other three teleost genomes. The phylogenetic tree of csf1r genes (fig. 1B) suggests that the rate acceleration of molecular evolution in csf1rb appeared in the lineage leading to acanthopterygians (represented here by cichlid, pufferfishes, medaka, and swordtail) after its split from the lineage leading to protacanthopterygians (salmon and trout). However, discrimination between the mechanisms that might lead to accelerated evolution, relaxed functional constraints, or positive Darwinian selection, is rather difficult due to more than 200 Myr that have passed after the duplication and due to the episodic nature of duplicates' sequence evolution over several hundreds of millions of years (Raes and Van de Peer 2003
).
- (ii) Type III RTKs of the teleost B-paralogon gained additional introns after duplication, while A-paralogon type III RTKs maintained the ancestral exon-intron structure.
- (iii) More genomic rearrangements have occurred in the chromosomal region of the B-paralogon type III RTKs, so that its synteny with the ancestral locus is restricted to the pdgfrßb-csf1rb gene tandem. In contrast, the synteny of the A-paralogon involves at least two additional flanking genes including cdx1, which links the A-paralogon to the ParaHox D cluster (Spring 2002
; Ferrier et al. 2005
).
- (iv) The B-paralogons of cichlid and medaka have accumulated more repetitive elements than the A-paralogons leading to a pronounced genomic expansion of the B-paralogon. The absence of this situation in the pufferfish genomes might be explained by the general compaction of their genomes (Brenner et al. 1993
).
- (v) Noncoding regions and consequently promoter sequences are less conserved in the B-paralogon. The proportion of gene duplicates in eukaryotic genomes that evolve asymmetrically has been determined previously to be 5%60%, depending on the age of the duplication event (Conant and Wagner 2003
; Wagner et al. 2005
). Van de Peer et al. (2001)
found asymmetric evolution in approximately half of FSGD duplicates in zebrafish (Van de Peer et al. 2001
). The situation of the pdgfrß-csf1r paralogons, however, is exceptional because it involves tightly linked genes and also includes their genomic environment. The kitb locus also underwent more genomic rearrangements than the kita locus in teleosts, including the loss of the pdgfr
b gene (Mellgren and Johnson 2005
). Similar to our analyses, the duplicated teleost HoxA clusters were recently characterized as "coevolutionary units" that diverge concertedly in an asymmetric manner in coding and noncoding regions (Wagner et al. 2005
). Hence, these results suggest that asymmetric divergence of fish-specific paralogons might be a rather common evolutionary pattern and that coevolutionary genomic units might not be restricted to the highly organized Hox clusters or to transcription factors in general.
- (ii) Type III RTKs of the teleost B-paralogon gained additional introns after duplication, while A-paralogon type III RTKs maintained the ancestral exon-intron structure.
Functional Divergence of Type III RTK Paralogs in Teleosts After Duplication
All four type III RTKs originating from two rounds of genome duplication (2R and 3R/FSGD) are still present at least in the genomes of cichlid, medaka, and pufferfishes. Thus, nonfunctionalization, the loss of a gene duplicate, seems to be a rather uncommon mechanism for type III RTKs, although it might have occurred in the zebrafish pdgfrßb-csf1rb paralogon. Furthermore, all four type III RTKs are expressed in a variety of adult cichlid tissues (Supplementary Figure S2, Supplementary Material online).
An important mechanism that can preserve both paralogs after duplication in a genome is subfunctionalization, the subdivision of ancestral gene functions among paralogs (Postlethwait et al. 2004
). However, all four type III RTKs were expressed nearly ubiquitously in adult cichlid (Supplementary Figure S2, Supplementary Material online), making expressional subfunctionalization in adult tissues rather unlikely. The similar expression of type III RTKs is surprising because of the pronounced sequence divergence in promoter regions between A- and B-paralogons within the cichlid genome. In pufferfish, small differences in gene expression between the type III RTK paralogs were found, but still similarities prevailed (Williams, Brenner, and Venkatesh 2002
). Nevertheless, it is possible that differences in cellular, temporal, and quantitative gene expression might exist between the four type III RTK genes. Subfunctionalization can also be achieved at the protein level (Hughes 1994
), but subdivision of protein functions would probably lead to more similar evolutionary rates of duplicates as observed here for the two type III RTK paralogons.
Several lines of evidence seem to suggest that neofunctionalization, the evolution of new gene functions, has occurred in B-paralogs of csf1r and pdgfrß. First, the observed asymmetry of evolutionary rates between the two paralogons and particularly the amount of amino acid exchanges suggest neofunctionalization. Second, csf1ra is required by macrophages and osteoclasts in teleosts, a function that is shared with the single Csf1r in amniotes (Parichy et al. 2000
; Roca et al. 2005
) and, therefore, is likely to be ancestral. Furthermore, the csf1ra gene is essential to recruit pigment cell precursors to xanthophore fates in zebrafish (Parichy et al. 2000
; Parichy and Turner 2003
). Recently, we found that csf1ra is also expressed in xanthophores in cichlids (W. Salzburger, I. Braasch, and A. Meyer, unpublished data). A function of Csf1r in pigment cell or neural crest development has not been found in mammals, which is not surprising, because mammals lost xanthophores (and iridophores) secondarily. It would thus be interesting to analyze, whether or not Csf1r is also involved in the development of xanthophores in amphibians. Most likely, the pigmentation function of the teleost csf1ra gene might have already been present in the single ancestral csf1r gene. Consequently, because csf1ra fulfills ancestral functions in fish, csf1rb might have evolved new functions, which would explain the asymmetric rates of molecular evolution of the csf1r paralogs. In the light of these observations, we propose neofunctionalization to be the most likely mechanism for the evolution of present day's functions of the entire B-paralogon.
Genome Duplication, Signaling Cascade Expansions, and the Evolution of Teleost Coloration
Type III RTKs function as membrane spanning cell-surface receptors that bind ligand homo- or heterodimers and are activated by homo- or heterodimerization, resulting in autophosphorylation of intracellular tyrosine residues and subsequent triggering of the signal transduction cascade (Schlessinger 2000
). Amino acid residues that are most likely involved in functional divergence of pdgfrß and csf1r paralogs are mainly found in the extracellular, ligand-binding, and receptor interacting domain, suggesting that new ligand-binding affinities and receptor interactions might have evolved in fish after the duplication of pdgfrß and csf1r. Therefore, it seems plausible that the FSGD and the associated multiplication of RTKs improved the number of possible ligand-receptor and receptor-receptor interactions and, thus, might have increased the complexity of signal transduction by type III RTKs, probably by functional specialization of signaling pathway paralogs.
What might have been the impact of genome duplications and the expansion of the type III RTK family for the evolution of teleost coloration? According to the model presented in figure 5B, genome duplications and the resulting expansion of the RTK multigene family coincide phylogenetically with the appearance of pigment cell innovations. The first round of genome duplication in the vertebrate lineage (1R) took place after their split from cephalochordates. This event coincides with the acquisition of the neural crest in vertebrates and of the neural crest-derived melanophores (Shimeld and Holland 2000
). The second round of genome duplication (2R), which probably took place in the gnathostome stem lineage after the divergence of agnathans (Holland et al. 1994
), correlates with the duplication of the proto-pdgfrproto-kit/csf1r locus (fig. 5B). The presence of xanthophores and iridophores in amphibians and reptiles (Bagnara 1998
) and their absence in lampreys (Mellgren and Johnson 2002
) suggests that these two pigment cell types arose only in the gnathostome lineage. The similar roles of kit and csf1r in the development of melanophores and xanthophores, respectively, seem to suggest that functional divergence of kit and csf1r was associated with the expansion of the pigment cell repertoire.
The FSGD (3R) occurred approximately 250350 MYA, close to the most recent common ancestor of teleosts after their divergence from the basal lineages including bichirs, sturgeons, gars, and bowfin (Hoegg et al. 2004
; Crow et al. 2006
). This genome duplication coincides with the emergence of fish-specific paralogs of csf1r, pdgfrß (this study), kit, pdgfr
(Mellgren and Johnson 2005
), and other pigmentation genes such as mitf (Altschmied et al. 2002
) or sox10 (Lang et al. 2005b
). Thus, duplicates originating from the FSGD might have been retained particularly in the context of pigmentation. Ever since this event, chromatophore sublineages, i.e., subpopulations of chromatophores that develop under different genetic control (Mellgren and Johnson 2002
), might have been present in teleosts. The csf1rb paralog was, most likely, lost secondarily in the lineage leading to zebrafish and probably underwent neofunctionalization in the lineage leading to acanthopterygians. In the light of the presented model, genome duplications, and the FSGD in particular are likely to have been responsible for the expansion of the type III RTK family and might have resulted in an increase in signal cascade complexity that could be causally linked to the complexity and diversity of coloration and color patterning in teleost fishes.
Finally, the present study underlines the suitability of cichlids as models in comparative genomic investigationsfor studying their unique evolutionary success but also as genomic models of perciform fishes, the most species-rich group of teleosts (Kocher et al. 2004
). Thus, we emphasize the importance of sequencing a cichlid genome in the near future.
| Supplementary Material |
|---|
|
|
|---|
Supplementary Tables S1S3 and Supplementary Figures S1S2 are available at Molecular Biology and Evolution online (http://www.mbe.oxfordjournals.org/).
| Acknowledgements |
|---|
|
|
|---|
We thank M. Lang, N. Siegel, and J. Luo for assistance in the lab and other members from the Meyer lab as well as J. S. Taylor and J.-N. Volff for valuable suggestions on the manuscript. This work was funded by a Marie Curie Fellowship of the European Union, grants from the Landesstiftung Baden-Württemberg GmbH, and from the Center for Junior Research Fellows, University of Konstanz (to W.S.) and by the Deutsche Forschungsgemeinschaft (DFG) and the University of Konstanz (to A.M.).
| Footnotes |
|---|
1 Present address: BioFuture Research Group "Evolutionary Fish Genomics", Physiological Chemistry I, Biozentrum, University of Würzburg, Germany
Adriana Briscoe, Associate Editor
| References |
|---|
|
|
|---|
Altschmied, J., J. Delfgaauw, B. Wilde, J. Duschl, L. Bouneau, J. N. Volff, and M. Schartl. 2002. Subfunctionalization of duplicate mitf genes associated with differential degeneration of alternative exons in fish. Genetics 161:259267.
Amores, A., A. Force, Y. L. Yan et al. (13 co-authors). 1998. Zebrafish hox clusters and vertebrate genome evolution. Science 282:17111714.
Bagnara, J. T. 1998. Comparative anatomy and physiology of pigment cells in nonmammalian tissues. Pp. 940 in J. J. Nordlund, R. E. Boissy, V. J. Hearing, R. A. King, and J.-P. Ortonne, eds. The pigmentary system: physiology and pathophysiology. Oxford University Press, New York.
Béjar, J., Y. Hong, and M. Schartl. 2003. Mitf expression is sufficient to direct differentiation of medaka blastula derived stem cells to melanocytes. Development 130:65456553.
Brenner, S., G. Elgar, R. Sandford, A. Macrae, B. Venkatesh, and S. Aparicio. 1993. Characterization of the pufferfish (Fugu) genome as a compact model vertebrate genome. Nature 366:265268.[CrossRef][Medline]
Christoffels, A., E. G. Koh, J. M. Chia, S. Brenner, S. Aparicio, and B. Venkatesh. 2004. Fugu genome analysis provides evidence for a whole-genome duplication early during the evolution of ray-finned fishes. Mol. Biol. Evol. 21:11461151.
Conant, G. C., and A. Wagner. 2003. Asymmetric sequence divergence of duplicate genes. Genome Res. 13:20522058.
Crow, K. D., P. F. Stadler, V. J. Lynch, C. Amemiya, and G. P. Wagner. 2006. The "fish-specific" Hox cluster duplication is coincident with the origin of teleosts. Mol. Biol. Evol. 23:121136.
Dehal, P., and J. L. Boore. 2005. Two rounds of whole genome duplication in the ancestral vertebrate. PLoS Biol. 3:e314.[CrossRef][Medline]
Dorsky, R. I., R. T. Moon, and D. W. Raible. 1998. Control of neural crest cell fate by the Wnt signalling pathway. Nature 396:370373.[CrossRef][Medline]
Dutton, K. A., A. Pauliny, S. S. Lopes, S. Elworthy, T. J. Carney, J. Rauch, R. Geisler, P. Haffter, and R. N. Kelsh. 2001. Zebrafish colourless encodes sox10 and specifies non-ectomesenchymal neural crest fates. Development 128:41134125.
Ferrier, D. E., K. Dewar, A. Cook, J. L. Chang, A. Hill-Force, and C. Amemiya. 2005. The chordate ParaHox cluster. Curr. Biol. 15:R820R822.[CrossRef][ISI][Medline]
Furlong, R. F., and P. W. Holland. 2002. Were vertebrates octoploid? Philos. Trans. R. Soc. Lond. B Biol. Sci. 357:531544.[CrossRef][ISI][Medline]
Gu, J., and X. Gu. 2003. Natural history and functional divergence of protein tyrosine kinases. Gene 317:4957.[CrossRef][ISI][Medline]
Gu, X. 1999. Statistical methods for testing functional divergence after gene duplication. Mol. Biol. Evol. 16:16641674.[Abstract]
Gu, X., and K. Vander Velden. 2002. DIVERGE: phylogeny-based analysis for functional-structural divergence of a protein family. Bioinformatics 18:500501.
Hoegg, S., H. Brinkmann, J. S. Taylor, and A. Meyer. 2004. Phylogenetic timing of the fish-specific genome duplication correlates with the diversification of teleost fish. J. Mol. Evol. 59:190203.[CrossRef][ISI][Medline]
Hoegg, S., and A. Meyer. 2005. Hox clusters as models for vertebrate genome evolution. Trends Genet. 21:421424.[CrossRef][ISI][Medline]
Holland, P. W., J. Garcia-Fernandez, N. A. Williams, and A. Sidow. 1994. Gene duplications and the origins of vertebrate development. Dev. Suppl. 125133.
Hughes, A. L. 1994. The evolution of functionally novel proteins after gene duplication. Proc. R. Soc. Lond. B Biol. Sci. 256:119124.[Medline]
Hughes, A. L., and R. Friedman. 2003. 2R or not 2R: testing hypotheses of genome duplication in early vertebrates. J. Struct. Funct. Genomics 3:8593.[CrossRef][Medline]
Jaillon, O., J. M. Aury, F. Brunet et al. (61 co-authors). 2004. Genome duplication in the teleost fish Tetraodon nigroviridis reveals the early vertebrate proto-karyotype. Nature 431:946957.[CrossRef][Medline]
Knight, R. D., S. Nair, S. S. Nelson, A. Afshar, Y. Javidan, R. Geisler, G. J. Rauch, and T. F. Schilling. 2003. Lockjaw encodes a zebrafish tfap2a required for early neural crest development. Development 130:57555768.
Kocher, T. D. 2004. Adaptive evolution and explosive speciation: the cichlid fish model. Nat. Rev. Genet. 5:288298.[CrossRef][ISI][Medline]
Kocher, T., R. D. Fernald, H. A. Hofmann, A. Meyer, N. Okada, and D. Penman. 2004. Genome sequence of a cichlid fish: the Tilapia (Oreochromis niloticus). http://hcqs.unh.edu/cichlid/.
Kumar, S., K. Tamura, and M. Nei. 2004. MEGA3: integrated software for molecular evolutionary genetics analysis and sequence alignment. Brief. Bioinform. 5:150163.
Lang, D., M. M. Lu, L. Huang, K. A. Engleka, M. Zhang, E. Y. Chu, S. Lipner, A. Skoultchi, S. E. Millar, and J. A. Epstein. 2005a. Pax3 functions at a nodal point in melanocyte stem cell differentiation. Nature 433:884887.[CrossRef][Medline]
Lang, M., T. Miyake, I. Braasch, D. Tinnemore, N. Siegel, W. Salzburger, C. T. Amemiya, and A. Meyer. 2005b. A BAC library of the East African haplochromine cichlid fish Astatotilapia burtoni. J. Exp. Zool. B Mol. Dev. Evol. 306B:3544.
Larhammar, D., L. G. Lundin, and F. Hallbook. 2002. The human Hox-bearing chromosome regions did arise by block or chromosome (or even genome) duplications. Genome Res. 12:19101920.
Leveugle, M., K. Prat, C. Popovici, D. Birnbaum, and F. Coulier. 2004. Phylogenetic analysis of Ciona intestinalis gene superfamilies supports the hypothesis of successive gene expansions. J. Mol. Evol. 58:168181.[CrossRef][ISI][Medline]
McLysaght, A., K. Hokamp, and K. H. Wolfe. 2002. Extensive genomic duplication during early chordate evolution. Nat. Genet. 31:200204.[CrossRef][ISI][Medline]
Mellgren, E. M., and S. L. Johnson. 2002. The evolution of morphological complexity in zebrafish stripes. Trends Genet. 18:128134.[CrossRef][ISI][Medline]
. 2005. kitb, a second zebrafish ortholog of mouse Kit. Dev. Genes Evol. 215:18.[CrossRef][ISI][Medline]
Meyer, A., and M. Schartl. 1999. Gene and genome duplications in vertebrates: the one-to-four (-to-eight in fish) rule and the evolution of novel gene functions. Curr. Opin. Cell Biol. 11:699704.[CrossRef][ISI][Medline]
Meyer, A., and Y. Van de Peer. 2005. From 2R to 3R: evidence for a fish-specific genome duplication (FSGD). Bioessays 27:937945.[CrossRef][ISI][Medline]
Naruse, K., M. Tanaka, K. Mita, A. Shima, J. Postlethwait, and H. Mitani. 2004. A medaka gene map: the trace of ancestral vertebrate proto-chromosomes revealed by comparative gene mapping. Genome Res. 14:820828.
Ohno, S. 1970. Evolution by gene duplication. Springer-Verlag, New York.
Parichy, D. M., D. G. Ransom, B. Paw, L. I. Zon, and S. L. Johnson. 2000. An orthologue of the kit-related gene fms is required for development of neural crest-derived xanthophores and a subpopulation of adult melanocytes in the zebrafish, Danio rerio. Development 127:30313044.[Abstract]
Parichy, D. M., J. F. Rawls, S. J. Pratt, T. T. Whitfield, and S. L. Johnson. 1999. Zebrafish sparse corresponds to an orthologue of c-kit and is required for the morphogenesis of a subpopulation of melanocytes, but is not essential for hematopoiesis or primordial germ cell development. Development 126:34253436.[Abstract]
Parichy, D. M., and J. M. Turner. 2003. Temporal and cellular requirements for Fms signaling during zebrafish adult pigment pattern development. Development 130:817833.
Pond, S. L., S. D. Frost, and S. V. Muse. 2005. HyPhy: hypothesis testing using phylogenies. Bioinformatics 21:676679.
Postlethwait, J., A. Amores, W. Cresko, A. Singer, and Y. L. Yan. 2004. Subfunction partitioning, the teleost radiation and the annotation of the human genome. Trends Genet. 20:481490.[CrossRef][ISI][Medline]
Postlethwait, J. H., I. G. Woods, P. Ngo-Hazelett, Y. L. Yan, P. D. Kelly, F. Chu, H. Huang, A. Hill-Force, and W. S. Talbot. 2000. Zebrafish comparative genomics and the origins of vertebrate chromosomes. Genome Res. 10:18901902.
Raes, J., and Y. Van de Peer. 2003. Gene duplication, the evolution of novel gene functions, and detecting functional divergence of duplicates in silico. Appl. Bioinform. 2:91101.
Roca, F. J., M. A. Sepulcre, G. Lopez-Castejon, J. Meseguer, and V. Mulero. 2005. The colony-stimulating factor-1 receptor is a specific marker of macrophages from the bony fish gilthead seabream. Mol. Immunol. 43:14181423.
Rousset, D., F. Agnes, P. Lachaume, C. Andre, and F. Galibert. 1995. Molecular evolution of the genes encoding receptor tyrosine kinase with immunoglobulinlike domains. J. Mol. Evol. 41:421429.[CrossRef][ISI][Medline]
Salzburger, W., and A. Meyer. 2004. The species flocks of East African cichlid fishes: recent advances in molecular phylogenetics and population genetics. Naturwissenschaften 91:277290.[ISI][Medline]
Salzburger, W., H. Niederstätter, A. Brandstätter, B. Berger, W. Parson, J. Snoeks, and C. Sturmbauer. 2006. Colour-assortative mating among populations of Tropheus moorii, a cichlid fish from Lake Tanganyika, East Africa. Proc. R. Soc. Lond. B Biol. Sci. 273:257266.[CrossRef]
Santini, S., J. L. Boore, and A. Meyer. 2003. Evolutionary conservation of regulatory elements in vertebrate Hox gene clusters. Genome Res. 13:11111122.
Schlessinger, J. 2000. Cell signaling by receptor tyrosine kinases. Cell 103:211225.[CrossRef][ISI][Medline]
Shimeld, S. M., and P. W. Holland. 2000. Vertebrate innovations. Proc. Natl. Acad. Sci. USA 97:44494452.
Spring, J. 1997. Vertebrate evolution by interspecific hybridisationare we polyploid? FEBS Lett. 400:28.[CrossRef][ISI][Medline]
. 2002. Genome duplication strikes back. Nat. Genet. 31:128129.[ISI][Medline]
Steinke, D., W. Salzburger, and A. Meyer. 2006. Novel relationships among ten fish model species revealed based on a phylogenomic analysis using ESTs. J. Mol. Evol. (in press).
Swofford, D. 2000. PAUP*: phylogenetic analysis using parsimony (* and other methods). Sinauer Associates, Sunderland, Mass.
Taylor, J. S., I. Braasch, T. Frickey, A. Meyer, and Y. Van de Peer. 2003. Genome duplication, a trait shared by 22000 species of ray-finned fish. Genome Res. 13:382390.
Taylor, J. S., Y. Van de Peer, I. Braasch, and A. Meyer. 2001. Comparative genomics provides evidence for an ancient genome duplication event in fish. Philos. Trans. R. Soc. Lond. B Biol. Sci. 356:16611679.[CrossRef][ISI][Medline]
Van de Peer, Y., J. S. Taylor, I. Braasch, and A. Meyer. 2001. The ghost of selection past: rates of evolution and functional divergence of anciently duplicated genes. J. Mol. Evol. 53:436446.[CrossRef][ISI][Medline]
Vandepoele, K., W. De Vos, J. S. Taylor, A. Meyer, and Y. Van de Peer. 2004. Major events in the genome evolution of vertebrates: paranome age and size differ considerably between ray-finned fishes and land vertebrates. Proc. Natl. Acad. Sci. USA 101:16381643.
Volff, J. N. 2005. Genome evolution and biodiversity in teleost fish. Heredity 94:280294.[CrossRef][ISI][Medline]
Wagner, G. P., K. Takahashi, V. Lynch, S. J. Prohaska, C. Fried, P. F. Stadler, and C. Amemiya. 2005. Molecular evolution of duplicated ray finned fish HoxA clusters: increased synonymous substitution rate and asymmetrical co-divergence of coding and non-coding sequences. J. Mol. Evol. 60:665676.[CrossRef][ISI][Medline]
Wang, Y., and X. Gu. 2001. Functional divergence in the caspase gene family and altered functional constraints: statistical analysis and prediction. Genetics 158:13111320.
W




