MBE Advance Access originally published online on February 28, 2007
Molecular Biology and Evolution 2007 24(5):1161-1180; doi:10.1093/molbev/msm036
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
© 2007 The Authors.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Research Articles |
Gene Relocations within Chloroplast Genomes of Jasminum and Menodora (Oleaceae) Are Due to Multiple, Overlapping Inversions


* School of Life Sciences and Biotechnology, Korea University, Seoul, Korea
Section of Integrative Biology and Institute of Cellular and Molecular Biology, University of Texas, Austin
E-mail: kimkj{at}korea.ac.kr.
| Abstract |
|---|
|
|
|---|
The chloroplast (cp) DNA sequence of Jasminum nudiflorum (OleaceaeJasmineae) is completed and compared with the large single-copy region sequences from 6 related species. The cp genomes of the tribe Jasmineae (Jasminum and Menodora) show several distinctive rearrangements, including inversions, gene duplications, insertions, inverted repeat expansions, and gene and intron losses. The ycf4psaI region in Jasminum section Primulina was relocated as a result of 2 overlapping inversions of 21,169 and 18,414 bp. The 1st, larger inversion is shared by all members of the Jasmineae indicating that it occurred in the common ancestor of the tribe. Similar rearrangements were also identified in the cp genome of Menodora. In this case, 2 fragments including ycf4 and rps4trnSycf3 genes were moved by 2 additional inversions of 14 and 59 kb that are unique to Menodora. Other rearrangements in the Oleaceae are confined to certain regions of the Jasminum and Menodora cp genomes, including the presence of highly repeated sequences and duplications of coding and noncoding sequences that are inserted into clpP and between rbcL and psaI. These insertions are correlated with the loss of 2 introns in clpP and a serial loss of segments of accD. The loss of the accD gene and clpP introns in both the monocot family Poaceae and the eudicot family Oleaceae are clearly independent evolutionary events. However, their genome organization is surprisingly similar despite the distant relationship of these 2 angiosperm families.
Key Words: chloroplast genome inversions gene duplications insertions inverted repeat expansions gene and intron losses Jasminum Menodora
| Introduction |
|---|
|
|
|---|
Chloroplast (cp) genome organization in land plants is highly conserved (Palmer 1991
The angiosperm family Oleaceae includes approximately 29 genera and 600 species that are widely distributed in tropical and temperate regions of the world (Heywood 1978
). The tribe Jasmineae contains only 2 genera, Jasminum and Menodora (Taylor 1945
; Johnson 1957
; Wallander and Albert 2000
). Jasminum is the largest genus of the Oleaceae with approximately 200 species (Rehder 1940
; Green 2004
) confined to the Old World. The genus is traditionally classified into 4 sections (Alternifolia, Unifoliolata, Pinnatifolia, and Trifoliolata) based on the leaf arrangement and the leaflet number (De Candolle 1844
); however, several systematic studies have suggested that most of these sections are artificial (Rohwer 1994
, 1995
; Green 1997
, 2001
). Molecular phylogenies (Lee H-L and Kim K-J, unpublished data) support the recognition of 3 clades: an alternate-leaved group (section Alternifolia), section Primulina (Green 2001
), and an opposite-leaved group that contains the majority of the species. In contrast, Menodora is a small genus with approximately 23 species (Steyermark 1932
; Turner 1991
) and is distributed in southwestern North America, southern South America, and southern Africa.
In this paper, we investigate the organization and evolution of cp genomes in the Oleaceae. Several unusual features of the genomes of Menodora and Jasminum are described, which provide valuable insights into cpDNA evolution. The comparisons identify multiple inversions, gene duplications, gene and intron losses, IR expansion, and a high incidence of dispersed repeats. Gene relocation caused by multiple inversions is a notable feature of the cp genomes of the Jasmineae. The phylogenetic order and timing of the major rearrangements are determined by comparative sequence analysis and phylogenetic analysis of the major lineages within the tribe.
| Materials and Methods |
|---|
|
|
|---|
Taxon Sampling
The taxa used in this study are listed in table 1. The cp genome of Jasminum nudiflorum was sequenced completely, whereas inversion regions only were sequenced from the following 5 species: Menodora longiflora; 1 species from each of the 3 major lineages of Jasminum (Jasminum le-ratii, Jasminum abyssinicum, Jasminum subhumile); and an outgroup species, Forsythia europaea. Seventeen species of Jasminum and 2 species of Menodora were used for comparative gene mapping. Twenty-five accessions of 24 species of Jasminum, 4 species of Menodora, and 21 species of other Oleaceae were used for the polymerase chain reaction (PCR) diagnosis of inversions and the comparative analysis of inversion endpoints. Forty-one ndhF and rbcL sequences representing all major clades of Oleaceae and outgroups were also used for estimating phylogenetic relationships and divergence times (table 1).
|
DNA Extraction, Enzyme Digestion, Filter Hybridization, and Restriction Site Mapping
Total DNA was isolated from the fresh leaves using the hexadecyltrimethylammonium bromide method (Doyle JJ and Doyle JA 1987), and DNA was further purified by ultracentrifugation in cesium chloride/ethidium bromide gradients. Purified total DNA was digested with 22 restriction endonucleases (AvaI, AvaII, AseI, BamHI, BanI, BanII, BclI, BglII, BstNI, BstXI, ClaI, DraI, EcoRI, Eco0109I, EcoRV, HaeII, HincII, HindIII, NciI, NsiI, XbaI, and XmnI), and the resulting fragments were separated on 11.6% agarose gels depending on the number of expected restriction fragments. Bidirectional transfer of the DNA from agarose gels to Zetabind nylon filters (AMF Cuno, Meriden, CT), radioactive labeling of 43 cloned tobacco cpDNA probes (Olmstead and Palmer 1992
Genome Sequencing and Annotation
For the complete sequencing of the J. nudiflorum cp genome, cpDNA was isolated and purified following the methods in Palmer (1986)
. The cpDNA was digested with a combination of 3 restriction enzymes (BamHI, SacI, and ClaI), and the resulting fragments were cloned into a pBluescript II vector. Inserted cpDNA fragments were shotgun sequenced using the BigDye3.0 terminal cycle sequencing kit (Applied Biosystems, Foster City, CA) and an ABI 3700 sequencer. Sequences were assembled using Sequencher (version 4.1; Gene Codes Corporation, Ann Arbor, MI). In order to fill gap regions, primers were designed using the complete cp sequences of Nicotiana and Panax (Shinozaki et al. 1986
; Kim and Lee 2004
).
A series of PCR products spanning the entire region containing the inversions were generated for J. le-ratii, J. abyssinicum, J. subhumile, M. longiflora, and F. europaea. The PCR cycle was as follows: 94 °C for 3 min, then 30 cycles of 94 °C for 30 s, 5068 °C for 1 min, and 6872 °C for 18 min (depending upon the size of the target region). PCR-amplified DNA was then purified using the QIAquick PCR purification kit (Qiagen, Hilden, Germany). Amplified cpDNA fragments were sequenced directly.
Gene annotations and comparative sequence analyses were performed using the current versions of Blast (Altschul et al. 1990
), ORF finder programs from National Center for Biotechnology Information, and DOGMA (Wyman et al. 2004
). The locations and secondary structures of tRNA genes were determined using tRNAscan-SE version 1.21 (Lowe and Eddy 1997
), DOGMA (Wyman et al. 2004
), and MFOLD version 3.0 (Zuker 2003
). Repeated sequences were identified using REPuter (Kurtz et al. 2001
).
PCR Diagnosis of Inversions and Determination of Inversion Endpoints
Ten PCR primers (table 1) were designed based on the sequence comparisons among 3 cp genome sequences of Jasminum (this study), Nicotiana (Shinozaki et al. 1986
), and Panax (Kim and Lee 2004
). Different combinations of these primers were used to test for the presence or absence of certain rearrangements. Positive PCR amplifications were sequenced and aligned to determine inversion endpoints. The standard PCR-amplification reactions were 30 cycles of 1 min denaturation at 94 °C, 1 min annealing at 5068 °C, and 28 min extension at 6872 °C.
Phylogenetic Analyses and Estimation of Divergence Times of Inversions
DNA sequence data of ndhF and rbcL for 41 taxa and the complete inversion regions (3135 kb) of 6 taxa were used for phylogenetic analysis. Forty-one ndhF and rbcL sequences representing all major clades of Oleaceae and outgroups were generated by methods described elsewhere (Olmstead et al. 1992
; Kim and Jansen 1995
). Sequence alignment, base substitution analysis, and phylogenetic analyses were performed using ClustalX (Higgins et al. 1996
), MEGA2 (Kumar et al. 2001
), PAUP 4.0 (Swofford 2002
), and MacClade Version 4.06 (Maddison WP and Maddison DR 2003
). The model of evolution was selected using the hierarchical likelihood ratio test as implemented in Modeltest 3.06 (Posada and Crandall 1998
).
Divergence times were estimated using the combined ndhF and rbcL data. A likelihood ratio test (Felsenstein 1981
) was performed on the maximum likelihood (ML) tree by comparing the scores of the ML tree with and without a molecular clock. This test rejected a molecular clock hypothesis; therefore, the penalized likelihood (PL) method (Sanderson 2002
) in the program r8s version 1.60 (Sanderson 2003
) was used. The size of the roughness penalty was specified by a smoothing parameter obtained from a cross-validation procedure. After cross-validation, the smoothing parameter was set to an absolute value of 1.0. The ML tree was converted to an ultrametric tree in PL using the truncated network (TN) algorithm, and in the process, zero length branches were collapsed. Divergence times were calibrated using Cornaceae and Oleaceae fossils (Kavadas 1956
; Muller 1981
; Takahashi et al. 2002
).
| Results |
|---|
|
|
|---|
The cp Genome Organization of J. nudiflorum
The cp genome of J. nudiflorum (GenBank accession number NC_008407) is 165,121-bp long and is divided into 4 regions by a pair of IRs (IRa and IRb) of 29,486 bp each. The LSC and SSC regions are 92,877 and 13,272 bp, respectively. The locations of the 133 genes in the J. nudiflorum cp genome are presented in figure 1. Nineteen genes (7 tRNAs, 4 rRNAs, and 8 protein-coding genes) are duplicated in the IR. In addition, a duplicate copy of trnG-GCC is located in the middle of the LSC region. Eleven protein-coding genes and a tRNA gene are located in the SSC region, whereas 61 protein-coding genes and 23 tRNA genes are located in the LSC region. Nine protein-coding genes and 6 tRNA genes each contain a single intron, and 2 protein-coding genes (rps12 and ycf3) have 2 introns (supplementary material SM 1, Supplementary Material online).
|
The J. nudiflorum cp genome consists of 57% coding sequence (50% protein and 7% RNA) and 43% noncoding sequences, including introns (12%) and intergenic spacers (31%). The overall A-T content of the J. nudiflorum cp genome is 62%. The A-T content in the noncoding region (65%) is higher than in the coding region (60%). The A-T content of the IR is 58%, whereas the A-T content in the LSC and SSC regions is 64% and 67%, respectively. The lower A-T content of the IR reflects the low A-T content in the 4 rRNA (45%) genes in this region.
The gene and intron content and gene order of J. nudiflorum differ from Nicotiana tabacum in several ways. For example, the cp genome of J. nudiflorum is missing most of accD and the 2 introns of clpP. Additionally, duplicate copies of trnG-GCC and ycf1 are present. The extra copy of ycf1 is due to the expansion of the IR at the IRbSSC boundary, accounting for much of the increase in size of the J. nudiflorum cp genome in comparison with tobacco. In contrast, rps19 is located on the LSC region because of the contraction of IR. One notable rearrangement of the J. nudiflorum cp genome is the movement of a 2.8-kb region containing ycf4 and psaI to the middle of the LSC region (fig. 1).
Comparative Gene Mapping of Oleaceae cp Genomes
The cp genomes were mapped for 32 representative species of all major genera of Oleaceae and 12 closely related outgroup species (table 1). The gene maps (not shown) indicate that all genera of the Oleaceae except Jasminum and Menodora have an identical gene order to N. tabacum. We sequenced the rearranged portions of the LSC region of the cp genomes for the representative species of Jasminum, Menodora, and the closely related genus Forsythia to better characterize the rearrangements in members of the Jasmineae.
Identification of Inversions among Members of the Oleaceae
Complete sequences for the region between psaA and psbB in the LSC region were generated from F. europaea, J. subhumile, J. le-ratii, and J. abyssinicum (fig. 2) and are approximately 32-kb long for each species. For M. longiflora, 2 additional regions of the cp genome extending from psbA to psaA (15,387 bp) and from trnK to rpl20 (18,522 bp) were sequenced because mapping data (not shown) suggested the presence of inversions in these regions (fig. 3).
|
|
The sequenced region of the outgroup F. europaea (tribe Forsythieae) is 31,495-bp long and includes 25 protein-coding and 8 tRNA genes (fig. 2). The gene content and order are identical to the Nicotiana cp genome. The sequence of J. subhumile (section Alternifolia) between psbA and psbB is 35,515-bp long, including an inversion of 21.8 kb not present in Nicotiana or Forsythia. Relative to the genomes of Nicotiana and Forsythia, one endpoint of the inversion is located between psaA and ycf3, and the other endpoint is located between ycf4 and cemA. In addition, the J. subhumile cp genome lacks more than 50% of accD but includes trnI-CAU and trnS-GGA in this region.
Sequences between psaA and psbB genes of J. le-ratii and J. abyssinicum (both members of the opposite-leaved group of Jasminum) are 31,507- and 31,044-bp long, respectively. These species have a similar gene order to J. subhumile, including one inversion of 19.3 kb possessing the same endpoints. Gene and intron content of this group are slightly different from F. europaea; they both lack most of accD and 2 introns of clpP but include trnI-CAU downstream from rbcL and rpl23 downstream from clpP.
The region between psaA and psbB in J. nudiflorum (section Primulina) is 36,404-bp long, and the gene order is similar to Forsythia and Nicotiana, except for the relocation of ycf4 and psaI. The 2.8-kb region that includes ycf4 and psaI is almost 18 kb upstream from its position in Forsythia and Nicotiana (fig. 1). One endpoint of this region is located between psaA and ycf4, and the other is between psaI and ycf3. Comparison of Jasminum, Forsythia, and Nicotiana cp genomes reveals that movement of this 2.8-kb region in J. nudiflorum is caused by 2 unequal, sequential inversions of 21 and 18 kb. The 21-kb inversion in J. nudiflorum is also present in other Jasminum species. One endpoint of this inversion is located between psaA and ycf3, and the other endpoint is between ycf4 and cemA. In addition, we hypothesize a 2nd 18-kb inversion in J. nudiflorum. One endpoint of the 2nd inversion is between ycf3 and cemA and is only 134 bp from the endpoint of the 21-kb inversion. The other endpoint of the 18-kb inversion is 2.8 kb from the 21-kb inversion endpoint. Therefore, the 2 inversions represent independent evolutionary events, even though 1 endpoint of each inversion is found in nearly the identical location. The incorporation of 134 bp of sequence outside of the 1st inversion endpoint at one end and the inclusion of 42 bp of this sequence at the other endpoint during the 2nd inversion clearly support the evolutionary ordering of these 2 inversions. The distribution pattern of these remnant sequences indicates that the 18-kb inversion occurred after the 21-kb inversion.
The gene order in the Menodora cp genome is distinct from Jasminum species as well as other genera of Oleaceae. Three inversions can explain the differences in the cp genome organization of Menodora (fig. 3). The 1st is 20 kb and is shared with all sampled species of Jasminum. Therefore, this represents a shared derived character among members of the tribe Jasmineae. The 2 endpoints of this inversion are identical to the 1st inversion (21 kb) in Jasminum (fig. 2). Two additional inversions (3rd and 4th inversions in fig. 3) are unique to Menodora. The 3rd inversion is approximately 14 kb, and the 2 endpoints are located between psaI and trnI-CAU and between trnT-UGU and rps4, and this inversion is nested within the 1st inversion (fig. 3, the trnI is truncated). The 4th inversion is the largest one at 59 kb. Because one endpoint of the 4th inversion is located within trnK-UUU, the 2 exons of trnK-UUU are located almost 59 kb apart. The remaining endpoint is between trnI-CAU and rps4 (fig. 3). The 3rd inversion occurred prior to the 4th inversion.
Distribution of Inversions in Oleaceae
We designed 10 primers (P1P10) to amplify the inversion endpoint regions (table 1 and fig. 4), and different combinations of these primers were used in PCR reactions to determine the phylogenetic distribution of the 4 Oleaceae inversions among 52 species (table 1).
|
A positive PCR amplification would be expected from the primer combinations of P1/P3 and P2/P4 for species with the 1st 21-kb inversion. In contrast, for the species without the 1st inversion, a positive PCR reaction would result from the primer combinations of P1/P2 and P3/P4. For the species with both the 1st and the 2nd inversions, the primer combinations of P2/P5 and P4/P6 would produce positive PCR reactions, whereas the primer combination of P4/P6 would produce a positive PCR reaction in the species without both inversions. For the 3rd and 4th inversions, a series of positive amplifications would be expected from the primer combinations of P6/P9, P5/P7, P1/P3, P8/P10, and P2/P4.
Representative positive and negative PCR results for 9 species are shown in figure 5. Lanes 14 represent species with only the 1st inversion (all members of Jasminum except section Primulina), lanes 56 represent species with the 1st and the 2nd inversions (section Primulina), lanes 78 are species with the 1st, 3rd, and 4th inversions (Menodora), and lane 9 represents the species without any inversions (Forsythia).
|
The PCR diagnosis (table 1 and fig. 5) indicates that the 1st inversion is shared by all sampled members of the tribe Jasmineae, whereas all other tribes of the Oleaceae lack this inversion. Our findings also indicate that the 2nd inversion is restricted to the section Primulina of Jasminum and that the 3rd and 4th inversions are unique to Menodora.
All PCR results agreed with our expectations with one exception (fig. 5); an unexpected amplification was observed for the primer combination of P2/P4 in J. nudiflorum. Based on the complete cp sequence, the primer combination of P2/P4 should not generate a PCR product; however, an amplicon was produced from total DNA. This region is between ycf3 and cemA, and we sequenced the fragment and compared it with sequences of this region from other species. The PCR product contains many base substitutions and insertions/deletions, suggesting that the fragments may not be cpDNA. The amplicon apparently came from the primer-specific sites (see table 1), but from cpDNA that underwent intracellular gene transfer prior to the 2nd inversion and is no longer resident in the cp genome. Filter hybridizations of J. nudiflorum in this region also showed weak hybridizing fragments (data not shown). These results suggest that the unexpected fragment of J. nudiflorum might be due to the transfer of this region to the nucleus or mitochondrion. Transfers of cp genes to the mitochondrion and nucleus are well documented in several lineages (Blanchard and Schmidt 1995
; Martin et al. 1998
; Cummings et al. 2003
; Matsuo et al. 2005
).
Identification of Inversion Endpoints
To identify the precise location of endpoints of the 4 inversions (figs. 2 and 3) in Jasminum and Menodora, amplified fragments of each primer pair were sequenced and aligned from representative species (table 1 and figs. 5 and 6). The sequences between primer pairs P1/P2 and P3/P4 for F. europaea, primer pairs P1/P3 and P2/P4 for J. le-ratii, J. abyssinicum, J. subhumile, M. longiflora, and primer pairs P1/P3, P2/P5, and P4/P6 for J. nudiflorum were aligned (fig. 6) for the 1st and 2nd inversion endpoints. Endpoints of the 1st inversion were easily identified from the aligned sequences because of the high level of sequence conservation across species. Distinctive, short, reverse complementary sequences are associated with both inversion endpoints. The sequence AAAAGAAA overlaps on the 2 different orientations of alignments, although the P2/P5 fragment of J. nudiflorum and the P2/P4 fragment of M. longiflora appear to have lost this overlapping sequence by deletion. This overlapping sequence corresponds to base positions 580587 upstream of psaA in the J. nudiflorum cp genome. Endpoints for the 2nd inversion in J. nudiflorum were also revealed by comparison of P4/P6 and P2/P5 fragments (fig. 6). The 2nd inversion is also associated with distinctive, short duplicated sequences of AAARGAAY. These sequence elements are duplicated at the endpoints and correspond to base positions 1,4251,431 upstream of psaI and 444450 upstream of cemA in the J. nudiflorum genome. Three bases are deleted on one endpoint. This sequence shows high identity to the 1st inversion endpoint (AAAAGAAA). One endpoint of the 2nd inversion is located 42 bp upstream of the 1st inversion endpoint (figs. 2 and 6).
|
The endpoints of the 3rd inversion in Menodora were identified by comparative alignment of sequences between P7/P8 of J. nudiflorum and P5/P7 or P8/P10 of M. longiflora. The endpoints of the 4th inversion in Menodora were identified by comparative sequence alignments between P9/P10 of J. nudiflorum and P6/P9 or P8/P10 of M. longiflora (figs. 3 and 4). One endpoint of the 3rd inversion is located 530 bp downstream of trnT-UGU, and the other endpoint is located 144 bp upstream of rps4 (arrow 3 in fig. 3; alignment not shown). The 3rd inversion is nested within the 1st inversion. Relocations of ycf4 and the rps4trnS-GGAycf3 regions between endpoints 1 and 3 were caused by the 1st and 3rd inversions, and this region was further modified by the 4th inversion. The distance between endpoint 3 and ycf4 is 779 bp, and the distance between endpoints 3 and 4 is 1,343 bp. The 1,343 bp consist of 403 bp of noncoding sequence and a 939-bp repeat. This repeat is essentially the same as the 779 bp of sequence except for a 160-bp insertion in the middle of the 779-bp fragment. One endpoint of the 4th inversion (4 in fig. 3) is located 266 bp (±73 bp) upstream of trnK-UUU exon 2, and the other endpoint is located 135 bp (±73 bp) downstream of matK (fig. 3). Size variation between P10 and P8 in Menodora species (lanes 7 and 8 of fig. 6) is due to an 852-bp insertion that is associated with the direct repeats of GARGAAGAAA at the endpoint of the 4th inversion.
Phylogenetic Distribution and Timing of Inversions
In order to determine the phylogenetic distribution and times of origin of the 4 inversions, we reconstructed a phylogeny of the Oleaceae using sequences of rbcL and ndhF. Thirty species representing most of the genera in the family were sequenced, and 10 outgroup sequences were downloaded from GenBank (table 1). The gene sequences were aligned, and phylogenetic trees were reconstructed from the combined gene sequences using ML methods with the transversion (TVM) + gamma distribution (G) model.
Among the outgroups, 2 genera of Carlemanniaceae (Carlemannia and Silvianthus) are sister to Oleaceae (fig. 7). The Oleaceae is monophyletic with the 3 tribes Myxopyreae, Fontanesieae, and Forsythieae forming the earliest diverging lineages. The tribe Jasmineae forms a strongly supported monophyletic group. Within tribe Jasmineae, 3 lineages are evident, and they correspond to 3 well-recognized taxonomic groups: the alternate-leaved Jasminum (section Alternifolia), the opposite-leaved Jasminum, and Menodora. Within the opposite-leaved Jasminum, 2 subgroups are present, one of which corresponds to section Primulina (J. nudiflorum and Jasminum mesnyi) and the other including Jasminum simplicifolium and J. le-ratii.
|
All sampled members of the Jasmineae share the 1st 21-kb inversion. The 2 species of section Primulina have a 2nd inversion that results in the relocation of the 2.8-kb ycf4psaI region. All sampled species of Menodora share the 3rd and the 4th inversions. The phylogenetic tree based on the gene sequence data clearly suggests that the 1st inversion originated prior to the other 3 inversions. The data also suggest that the 2nd inversion was independent of the 3rd and 4th inversions.
Sequence data for ndhF and rbcL were used to estimate the time of origin of the 4 inversions using a PL analysis. These estimates indicate that the Jasmineae originated in the upper Cretaceous (78.3 MYA) and that the extant species of Jasmineae diverged from a common ancestral species in the middle Eocene (42.1 MYA). The 1st cpDNA inversion occurred during this same time period (fig. 7). The 2nd inversion of the Primulina subgroup cp genome originated more recently, sometime between the Oligocene (30.3 MYA) and Quaternary (0 MYA). The 3rd and 4th inversions in Menodora occurred sometime between the middle Eocene (42.1 MYA) and middle Miocene (12.6 MYA).
In order to more accurately estimate the order and timing of the 4 inversions, we performed phylogenetic analyses of the tribe Jasmineae using cpDNA sequences between psaA and 5'-rps12, which includes the inversion regions for J. nudiflorum, J. le-ratii, J. abyssinicum, J. subhumile, M. longiflora, and F. europaea. The 6 aligned sequences were 34,607-bp long, and phylogenetic analyses using ML and general time reversible (GTR) + invariable site proportion (I) + G resulted in the tree in figure 8a. We also performed phylogenetic analyses of the accD gene region after excluding unalignable noncoding regions and all small gaps from the data matrix; the resulting matrix included 23,217 aligned base pairs, and the resulting ML tree using GTR + I + G is shown in figure 8b.
|
The topologies of the 2 trees (figs. 8a and b) differ in the position of Menodora. Menodora longifolia is sister to an alternate-leaved J. subhumile in tree A, whereas in tree B, M. longiflora is sister to the clade comprising the other 3 Jasminum species. The ML values (lnL) of tree A and tree B with gaps are 89806.04 and 89810.90, respectively. The lnL of the 2 trees without gaps are 48182.76 and 48177.34, respectively. The P values for the KishinoHasegawa test on tree A and tree B are 1.000 and 0.139, respectively. The P values for the KishinoHasegawa test on trees A and B without gaps are 0.320 and 1.000, respectively. Thus, the KishinoHasegawa test indicates that the 2 trees are not significantly different.
The phylogenetic analyses confirm our hypotheses regarding the evolutionary order of the inversion events. The 1st inversion arose prior to the other 3 inversions (character 1 in fig. 8). The 2nd inversion (character 2 in fig. 8) is independent of the 3rd and 4th inversions (characters 3 and 4 in fig. 8). The 3rd and the 4th inversions are restricted to the Menodora.
Other Rearrangements in Jasmineae cp Genomes
Duplicated Insertions of 3 tRNA Genes
Two copies of trnG-GCC are located in the LSC region of J. nudiflorum. The copy near psbZ (coordinate 39,000 in fig. 1) is at the same location in tobacco, and the copy downstream of rbcL (coordinate 63,000 in fig. 1) is the duplicate. The 2 copies have 100% sequence identity but are in reverse orientations. The noncoding sequences 42 bp upstream and 54 bp downstream of the gene are also identical in both locations, suggesting that trnG-GCC and the surrounding noncoding sequence are duplicated and inserted into a new location.
Similar duplicated insertions of tRNA genes are found in other lineages of Jasminum and Menodora. The trnI-CAU, which is usually located in the IR, also occurs downstream of rbcL in all species of Menodora and Jasminum except section Primulina. In addition, a duplicate copy of trnS-GCU, which is normally located near psbI (near coordinate 10,000; fig. 1), also occurs downstream of rbcL in the alternate-leaved species of Jasminum (represented by J. subhumile in fig. 2). All 3 duplicated insertions are located downstream of rbcL and are correlated with the reduction or loss of accD, even though the 3 tRNA genes were originally in several different regions of the cp genome.
Duplicated Insertion of rpl23
An extra copy of rpl23 is inserted downstream of clpP in the LSC region of J. le-ratii and J. abyssinicum. This gene is located between trnI-CAU and rpl2 in the IR region in tobacco. Copies of these genes in the LSC and IR regions show minor differences in length and sequence identity (supplementary material SM 2, Supplementary Material online). The IR copy is 282-bp long, and a 3-bp deletion as well as 4 base substitutions are present in the LSC region of J. le-ratii. In addition, 79 and 15 bp of noncoding sequence located upstream and downstream of both copies of rpl23, respectively, are duplicated. Different species of Jasminum exhibit variation between the 2 copies of rpl23. For example, the LSC region of J. abyssinicum terminates at position 228 because of a 2-bp deletion, causing a frameshift mutation. The duplicated segments of rpl23 and associated noncoding regions have identical sequences in all surveyed species, suggesting that this is a shared derived character of the opposite-leaved species of Jasminum.
Duplication of ycf1 by Expansion of the IR
Detailed comparisons of the IRSSC boundaries of Nicotiana, Panax, Arabidopsis, and J. nudiflorum (fig. 9) indicate that ycf1 is completely incorporated into the IR region in J. nudiflorum. This expansion makes the IR region of J. nudiflorum approximately 4 kb longer than the other species compared, and the SSC region of Jasminum is consequently shorter than these other genomes.
|
Small Inversion of trnT-UGU
The trnT-UGU is normally located on the opposite strand and upstream of trnL-UAA. However, in J. nudiflorum, it is located on the same strand as trnL-UAA (fig. 1, near coordinate 55,000). This small inversion is only present in J. nudiflorum and J. mesnyi. The change in strand of trnT-UGU is due to a small 219-bp inversion. In addition to the trnT-UGU gene, the inversion includes 24 bp upstream and 122 bp downstream of the gene sequence. Short IR sequences are conserved (GAA-TTCCTAT and ATAGGAACTTC) except one base indel at both ends of the inversion.
Interruption of trnK-UUU by Inversion
The trnK-UUU is unusual because it contains matK within its intron in most cp genomes. Yet, the cp genomes of Menodora have an interrupted trnK-UUU gene due to an inversion endpoint within the intron between matK and the 3' exon of trnK-UUU (fig. 3, inversion 4). As a result, the 5' exon is located upstream of matK, whereas the 3' exon is located upstream of the psaA gene. The 2 exons of the trnK-UUU gene are separated by 59 kb because of this large inversion (fig. 3).
Serial Loss of accD
All sampled species of Jasminum and Menodora are missing some or all of accD. The serial loss of this gene is restricted to the members of the tribe Jasmineae, thus this character supports the monophyly of the tribe Jasmineae (fig. 8, character 5). The accD is usually located between rbcL and psaI in cp genomes of land plants. Therefore, we sequenced the region from rbcL to psaI for 5 Jasmineae species that have lost portions of accD gene and compared these sequences with 6 other species in which complete accD is present. The length of this region in species with accD is conserved, ranging from 2,835 to 3,068 bp in 6 phylogenetically diverse species (Nicotiana, Arabidopsis, Panax, Spinacia, Atropa, and Forsythia). In contrast, the size of the region that corresponds to the rbcLpsaI intergenic spacer region among 5 Jasmineae species ranges from 2,922 to 5,271 bp. Furthermore, average sequence divergence for this region among Jasmineae species is 0.4428, which is far higher than the average of 0.2407 seen among the 6 diverse outgroups.
Distinctive sequence elements are present in the region that corresponds to rbcL and psaI of the Jasminum species (fig. 10). Nearly half of the 3' end of accD (9471,449 bp) remains in the alternate-leaved clade of Jasminum (represented by J. subhumile). In addition, a small fragment from the middle of accD (822907 bp) remains in all members of Jasminum (fig. 10). However, the middle section of the accD sequence is completely absent in Menodora. The data suggest that the loss of accD occurred sequentially with the 5' portion first, the 3' portion next, and finally the middle portion of the gene. Complete or partial gene fragments from other parts of cp genomes are inserted into this region (fig. 10). These fragments include an atpE gene fragment (172290 bp) in all species; a complete copy of trnI-CAU in all Jasminum except the Primulina subgroup (supplementary material SM 3, Supplementary Material online); complete trnS-GCU, partial trnP-UGG, and partial 5'-rps12 for the alternate-leaved species of Jasminum; and complete trnG-GCC for all species of Jasminum except the alternate-leaved Jasminum. The trnI-CAU and trnG-GCC genes are also inserted into this region in Menodora. In addition, a portion of the rpl23 gene is found in this region in Menodora and the Primulina subgroup of Jasminum. A massive duplication of a 12-bp repeated sequence in the middle of the noncoding region between rbcL and psaI occurs in J. nudiflorum (fig. 10). The consensus 12-bp sequence of MNNMAYYDGNNV is repeated 100 times in the region where accD has been reduced (supplementary material SM 4, Supplementary Material online). The insertion of these repeats probably originated a long time ago because the individual repeating units are very divergent. Another highly repetitive element occurs downstream of clpP in J. nudiflorum. In this case, there are 56 copies of a 27-bp repeat (YWTTTTYTTTWGDRAHHRWYAYTTHHT), and this insertion coincides with the loss of 2 introns in clpP (supplementary material SM 5, Supplementary Material online). These repeats probably originated more recently because there are no base substitutions among the individual copies of the repeat elements.
|
Intron Losses in clpP
Two introns in clpP are missing in the cp genome of J. nudiflorum. Filter hybridization and sequencing results suggest that the losses of 2 introns in clpP are confined to opposite-leaved Jasminum, with both introns being well conserved in alternate-leaved Jasminum (fig. 8, character 6). The clpP introns are also absent in Menodora cpDNA. Two independent intron losses are hypothesized in figure 8A, whereas single intron loss is supported in figure 8B. In addition to the clpP intron loss, further rearrangements are located downstream of clpP. The gene rpl23 typically found in the IR in other angiosperm cp genomes has been duplicated, and one copy occurs downstream of clpP in Menodora and the opposite-leaved species of Jasminum (figs. 2 and 3; supplementary material SM. 2, Supplementary Material online). In J. nudiflorum, the above-mentioned 27-bp sequence is repeated 56 times in this region (supplementary material SM 5, Supplementary Material online).
| Discussion |
|---|
|
|
|---|
The cp genomes of the Oleaceae tribe Jasmineae exhibit a number of unusual rearrangements; together these changes make the Jasminum and Menodora cp genomes unique among land plants. The most notable changes concern the relocation of genes due to multiple, overlapping inversions, and the separation of the 2 exons of trnK-UUU by 59 kb. Below, we discuss several unusual features of Oleaceae cp genomes and compare them with other rearranged land plant cp genomes.
Gene Relocations Caused by Multiple Inversions
The cp genomes of the tribe Jasmineae have experienced multiple movements of gene segments. In Jasminum section Primulina (J. mesnyi and J. nudiflorum), a 2.8-kb region containing ycf4, psaI, and associated noncoding sequences has been moved 18 kb by 2 consecutive inversions of 21 and 18 kb. The phylogenetic distribution of these 2 inversions (fig. 8, characters 1 and 2) suggests that the 21-kb inversion originated prior to the 18-kb inversion (figs. 2, 3, and 7). Divergence time estimates also indicate that the 1st inversion originated between the upper Cretaceous (78.3 MYA) and the middle Eocene (42.1 MYA), whereas the 2nd inversion originated between the lower Oligocene (30.3 MYA) and Quaternary (0 MYA). In Menodora, 2 regions, rps4trnS-GGAycf3 and ycf4, were relocated by 3 different inversions (fig. 3). The 1st inversion is shared among all members of Jasmineae, and the other 2 (inversions 3 and 4 in fig. 3) are unique to Menodora. The 3rd inversion likely occurred prior to the 4th inversion even though this inversion is not present in any species examined.
Relocations of genes have been documented in other angiosperm lineages, and 2 different mechanisms have been proposed to cause these events. In legumes (Palmer et al. 1988
; Perry et al. 2002
), successive expansion and contraction of the IR have resulted in the relocation of a block of 6 ribosomal protein genes (rps19rps8) from one end of the LSC region to the other end. Gene movements in Pelargonium can be attributed to both IR expansion and contraction and multiple inversions (Chumley et al. 2006
). In grasses (Quigley and Weil 1985
; Hiratsuka et al. 1989
; Doyle et al. 1992
), 3 inversions have occurred, which have caused the relocation of several genes, including trnG-GCC through psbD.
Short dispersed repeats have been suggested as one of the major factors promoting inversions (Palmer 1991
; Knox et al. 1993
). Such repeats are common in cp genomes that are highly rearranged, including algae, grasses, conifers, geranium, subclover, and Campanulaceae (Bowman and Dyer 1986
; Palmer, Nugent, and Herbon 1987
; Bowman et al. 1988
; Milligan et al. 1989
; Lidholm and Gustafsson 1991
; Doyle et al. 1992
; Knox et al. 1993
; Cosner et al. 1997
; Chumley et al. 2006
). Inversions have been shown to occur through intramolecular recombination between repeat elements (Palmer, Nugent, and Herbon 1987
; Ogihara et al. 1988
; Achaz et al. 2003
; Rocha 2003
; Pombert et al. 2006
). Both endpoints of the 1st inversion in Jasminum and Menodora are terminated by the short repeats of AAAAGAAA (fig. 6). The 2nd inversion is also associated with the similar short repeated sequences of AAARGAAY in J. nudiflorum, and 100 copies of MNNMAYYDGNNV sequence elements occur near one endpoint of this inversion (supplementary material SM 4, Supplementary Material online). The endpoints of the 1st inversion in Menodora are also associated with short repeats of AAAAGAAA. In contrast to the short repeats at the endpoints of the 1st and 2nd inversions, the ends of the 3rd and the 4th inversions have much longer repeat elements of 779 and 939 bp (fig. 10). These long duplicated sequences likely facilitated the 3rd and 4th inversions by intramolecular recombination in Menodora.
Several studies have also suggested that tRNAs play a role in promoting gene-order changes by nonhomologous recombination (Hiratsuka et al. 1989
; Knox et al. 1993
; Turmel et al. 2002
). Duplicated copies of 2 tRNAs, trnG-GCC and trnT-UGU, are located near the endpoints of inversions 3 and 4 in Menodora (fig. 3). Thus, the widespread occurrence of repeats and tRNAs near the inversion endpoints both in Menodora and Jasminum cp genomes is coincident with a series of inversions caused by intramolecular recombination.
Gene and Intron Loss
Gene and intron losses have been documented in a number of land plant cp genomes (Downie and Palmer 1992
; Raubeson and Jansen 2005
). Within the Jasmineae, accD has a series of losses in the species of Jasminum and Menodora examined, supporting the possible phylogenetic relationship within the tribe (figs. 7, 8, and 10). This gene has been lost completely in 4 other lineages of angiosperms, including the grasses (Hiratsuka et al. 1989
; Maier et al. 1995
; Katayama and Ogihara 1996
), Campanulaceae (Cosner et al. 1997
), Geraniaceae (Palmer, Nugent, and Herbon 1987
; Chumley et al. 2006
), and Lobeliaceae (Knox and Palmer 1999
). The loss of accD in Campanulaceae and Lobeliaceae is a shared derived feature of these families and supports their close phylogenetic relationship (Knox and Palmer 1999
). It is notable that the loss of accD is associated with hotspots of rearrangements in each of the families in which it has been lost. In the grasses, Menodora, and the Primulina subgroup of Jasminum in the region where accD normally resides, a pseudogene of rpl23 is inserted into the region between rbcL and psaI (fig. 10). The mechanism for insertion of different sequence elements in this region is uncertain. The accD gene loss may either accelerate gene relocations by unknown mechanisms, or various movements of genes may induce the sequential loss of accD.
Most cp genomes of land plants include 18 intron-containing genes, 15 of which have 2 exons and 3 (ycf3, rps12, and clpP) with 3 exons. A number of independent losses of introns have been detected in various genes, including rpoC1 (Downie et al. 1996
, 1998
; Hansen et al. 2006
), rpl16 (Downie and Palmer 1992
), clpP (Maier et al. 1995
; Stefanovic and Olmstead 2005
), rpl2 (Downie et al. 1991
; Doyle et al. 1995
; Stefanovic and Olmstead 2005
), rps12 (Hoot and Palmer 1994
; Freyer et al. 1995
), trnK (Wolfe et al. 1992
; Turmel et al. 2005
), and ycf3 (Stefanovic and Olmstead 2005
). In the Oleaceae, the loss of the 2 introns in clpP is confined to opposite-leaved Jasminum and Menodora (figs. 3 and 8).
Gene Duplications
Most gene duplications in cp genomes are caused by the expansion of the IR (Goulding et al. 1996
). These expansions tend to be relatively small and result in the duplication of only single genes or parts of genes, which often produce pseudogenes at the IRSC boundaries (Kim and Lee 2004
). In a number of cases, the IR has greatly expanded, including a 12-kb expansion in Nicotiana acuminata (Solanaceae; Goulding et al. 1996
), 11.5 kb in Mahonia (Berberidaceae; Kim and Jansen 1994
), 11 kb in Lobelia thuliana (Lobeliaceae; Knox and Palmer 1999
), and a remarkable 50 kb in Pelargonium (Geraniaceae; Palmer, Nugent, and Herbon 1987
; Chumley et al. 2006
). In J. nudiflorum, expansion of the IR has caused a duplication of the normally single-copy gene ycf1 (fig. 9). The duplication ycf1 of Jasminum is analogous to the rps15 duplication in the rye cp genome (Prombona and Subramanian 1989
).
Duplications of cp genes that do not involve the IR are uncommon, and only a few have been documented in cp genomes of Arabidopsis (Brassicaceae; Koch et al. 2005
), Pelargonium (Geraniaceae; Chumley et al. 2006
), Pinus (Pinaceae; Wakasugi et al. 1994
), and Trachelium (Campanulaceae; Cosner et al. 1997
). These duplications have involved mostly tRNAs and some protein-coding genes. The only duplication of a protein-coding gene in the Oleaceae involves rpl23, which has resulted in a 2nd copy of this gene in the LSC region of the opposite-leaved species of Jasminum. This gene is normally located between trnI-CAU and rpl2 (supplementary material SM 2, Supplementary Material online) in the IR region (supplementary material SM 3, Supplementary Material online). A very similar duplication of rp123 has been documented in the Poaceae, and in both the Oleaceae and Poaceae, the duplicated copy has been inserted into the intergenic region between rbcL and psaI where accD normally resides. Most cp gene duplications outside of the IR involve tRNAs, and this is certainly the case in the Oleaceae. Several tRNA duplications have occurred, including trnG-GCC in the LSC region of J. nudiflorum (fig. 1), trnI-CAU in the LSC species of Menodora and Jasminum except section Primulina, and trnS-GCU in the LSC region of alternate-leaved species of Jasminum (represented by J. subhumile) (fig. 10). In the Oleaceae, most of these tRNA duplications are associated with other rearrangements in the cp genomes. The mechanism for tRNA duplications is unknown, but they could be caused by duplicative transpositions.
Interruption of trnK-UUU
The most unusual rearrangement in the Oleaceae involves the separation of the 2 exons of trnK-UUU by 59 kb. The cp genomes of Menodora have an interrupted trnK-UUU gene because of an inversion breakpoint within the intron between the matK gene and the trnK-UUU 3' exon (fig. 3). The only other gene with exons that are separated by such a large distance is rps12, which has 3 exons. The 5' exon for this gene is located in the LSC region within an operon that includes 2 other genes, clpP and rpl20, and the two 3' exons are in the IR, approximately 30 kb from the 5' exon. In several cp genomes, transsplicing has been demonstrated for rps12 to produce a transcript (Fukuzawa et al. 1986
; Torazawa et al. 1986
; Giese et al. 1987
). Separation of the 2 trnK-UUU exons suggests 2 possible alternatives for the functionality of this gene in M. longiflora. Either the gene must be transspliced so that it can function as it does in other cp genomes or the gene is nonfunctional. We have no experimental evidence demonstrating transsplicing of trnK-UUU in Menodora. However, this gene is absent in 2 cp genomes, the fern Adiantum (Wolf et al. 2003
) and the holoparasite Epifagus (Wolfe et al. 1992
). In both of these genomes, matK, which is encoded in the trnK-UUU intron and is involved in intron splicing of trnK-UUU, is present and functional (Hauser et al. 2006
). The situation in Adiantum is very similar to Menodora because a large inversion in the cp genome has one endpoint in the trnK-UUU intron, but in this case the 2 exons were not found (Wolf et al. 2003
). In Epifagus, Wolfe et al. (1992)
suggested that matK may have an alternative function, which most likely involves splicing of other group II introns (Hauser et al. 2006
). Experimental studies of both trnK-UUU and matK are needed to investigate these 2 alternatives in Menodora.
In summary, cpDNA rearrangements in Jasminum and Menodora are confined to several regions of the genome. Gene duplications, insertions of highly repetitive DNA, and gene and intron losses are concentrated between the rbcL and psaI genes and near clpP. These rearrangements also correlate with 4 large inversions in the LSC regions. Overall, the combination of these molecular evolutionary events makes the Jasmineae genomes distinct from all previously examined land plant cp genomes. Duplicated insertions of coding or noncoding sequences into sites where genes or introns have been lost may compensate for the organizational instability caused by these rearrangements. Alternatively, insertions of duplicated elements may be caused by gene or intron losses. Although the genome organization of grasses and the Jasmineae is very similar, losses of major portions of accD and the clpP introns represent parallel evolutionary events. This is also the case for the occurrence of a large inversion that splits the trnK-UUU intron in Menodora and the fern Adiantum.
| Supplementary Materials |
|---|
|
|
|---|
Supplementary materials SM 1SM 5 are available at Molecular Biology and Evolution online (http://www.mbe.oxfordjournals.org/).
| Acknowledgements |
|---|
|
|
|---|
We thank several botanical gardens for providing plant material, including Kew Gardens, University of Connecticut, Fairchild Tropical Garden, The Royal Botanic Garden, Edinburgh, Arnold Arboretum, and Sisanbana Tropical Garden. All DNA materials were obtained from the plant DNA Bank in Korea. We also thank Mary Guisinger-Bellan, Katie Hansen, and 2 anonymous reviewers for comments on an earlier version of the manuscript. This research was supported by a grant (C00449 [GenBank] ) from Korea Research Foundation to K.J.K. and a National Science Foundation grant (DEB-0120709) to R.K.J.
Funding to pay the Open Access publication charges for this article was provided by the BK21 School of Life Sciences and Biotechnology Initiative, Korea University, Seoul, Korea.
| Footnotes |
|---|
Charles Delwiche, Associate Editor
| References |
|---|
|
|
|---|
Achaz G, Coissac E, Netter P, Rocha EPC. Associations between inverted repeats and the structural evolution of bacterial genomes. Genetics (2003) 164:12791289.
Altschul S, Gish W, Miller W, Myers E, Lipman D. Basic local alignment search tool. J Mol Biol (1990) 215:403410.[CrossRef][Web of Science][Medline]
Blanchard JL, Schmidt GW. Pervasive migration of organellar DNA to the nucleus in plants. J Mol Evol (1995) 41:397406.[CrossRef][Web of Science][Medline]
Bowman CM, Barker RF, Dyer TA. In wheat ctDNA, segments of ribosomal-protein genes are dispersed repeats, probably conserved by nonreciprocal recombination. Curr Genet (1988) 14:127136.[CrossRef][Web of Science][Medline]
Bowman CM, Dyer TA. The location and possible evolutionary significance of small dispersed repeats in wheat ctDNA. Curr Genet (1986) 10:931941.[CrossRef][Web of Science]
Chumley TW, Palmer JD, Mower JP, Boore JL, Fourcade HM, Caile PJ, Jansen RK, Forthcoming. The completed chloroplast genome sequence of Pelargonium x hortorum: organization and evolution of the largest and most highly rearranged chloroplast genome of land plants. Mol Biol Evol (2006) 23:116.
Cosner ME, Jansen RK, Palmer JD, Downie SR. The highly rearranged chloroplast genome of Trachelium caeruleum (Campanulaceae): multiple inversions, inverted repeat expansion and contraction, transposition, insertions/deletions, and several repeat families. Curr Genet (1997) 31:419429.[CrossRef][Web of Science][Medline]
Cosner ME, Raubeson LA, Jansen RK. Chloroplast DNA rearrangements in Campanulaceae: phylogenetic utility of highly rearranged genomes. BMC Evol Biol (2004) 4:27.[CrossRef][Medline]
Cummings MP, Nugent JM, Olmstead RG, Palmer JD. Phylogenetic analysis reveals five independent transfers of the chloroplast gene rbcL to the mitochondrial genome in angiosperms. Curr Genet (2003) 43:131138.[Web of Science][Medline]
De Candolle AP. Ordo CXXVIII. Jasmineae. In: Prodromus systematis naturalis regni vegetabilisDe Candolle A, ed. (1844) Vol. 8. Paris: Fortin, Masson et sociorum. 300316.
Downie SR, Katz-Downie DS, Rogers EJ, Zujewski HL, Small E. Multiple independent losses of the plastid rpoC1 intron in Medicago (Fabaceae) as inferred from phylogenetic analyses of nuclear ribosomal DNA internal transcribed spacer sequences. Can J Bot (1998) 76:791803.[CrossRef]
Downie SR, Llanas E, Katz-Downie DS. Multiple independent losses of the rpoC1 intron in angiosperm chloroplast DNAs. Syst Bot (1996) 21:135151.[CrossRef]
Downie SR, Olmstead RG, Zurawski G, Soltis DE, Soltis PS, Watson JC, Palmer JD. Six independent losses of the chloroplast DNA rpl2 intron in dicotyledons: molecular and phylogenetic implications. Evolution (1991) 45:12451259.[CrossRef][Web of Science]
Downie SR, Palmer JD. Use of chloroplast DNA rearrangements in reconstructing plant phylogeny. In: Molecular systematics of plantsSoltis PS, Soltis DE, Doyle JJ, eds. (1992) New York: Chapman and Hall. 1435.
Doyle JJ, Davis JI, Soreng RJ, Garvin D, Anderson MJ. Chloroplast DNA inversions and the origin of the grass family (Poaceae). Proc Natl Acad Sci USA (1992) 89:77227726.
Doyle JJ, Doyle JA. A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem Bull (1987) 19:1115.
Doyle JJ, Doyle JL, Ballenger JA, Palmer JD. The distribution and phylogenetic significance of a 50-kb chloroplast DNA inversion in the flowering plant family Leguminosae. Mol Phylogenet Evol (1996) 5:429438.[CrossRef][Web of Science][Medline]
Doyle JJ, Doyle JL, Palmer JD. Multiple independent losses of two genes and one intron from legume chloroplast genomes. Syst Bot (1995) 20:272294.[CrossRef]
Fan WH, Woelfle MA, Mosig G. Two copies of a DNA element, Wendy, in the chloroplast chromosome of Chlamydomonas reinhardtii between rearranged gene clusters. Plant Mol Biol (1995) 29:6380.[CrossRef][Web of Science][Medline]
Felsenstein J. Evolutionary trees from DNA sequences: a maximum likelihood approach. J Mol Evol (1981) 17:368376.[CrossRef][Web of Science][Medline]
Freyer R, Neckermann K, Maier RM, Kossel H. Structural and functional analysis of plastid genomes from parasitic plants: loss of an intron within the genus Cuscuta. Curr Genet (1995) 27:580586.[CrossRef][Web of Science][Medline]
Fukuzawa H, Kohchi T, Ohyama K, Umesono K, Inokuchi H, Ozeki H. Coding sequences for the chloroplast ribosomal protein S12 from the liverwort, Marchantia polymorpha, are separated far apart on different DNA strands. FEBS Lett (1986) 198:1114.[CrossRef][Web of Science]
Giese K, Subramanian AR, Larrinua IM, Bogorad L. Nucleotide sequence, promoter analysis, and linkage mapping of the unusually organized operon encoding proteins S7 and S12 in maize chloroplasts. J Biol Chem (1987) 262:1525115255.
Goulding SE, Olmstead RG, Morden CW, Wolfe KH. Ebb and flow of the chloroplast inverted repeat. Mol Gen Genet (1996) 252:195206.[CrossRef][Web of Science][Medline]
Green PS. A revision of the pinnate-leaved species of Jasminum. Kew Bull (1997) 52:933947.
Green PS. Studies in the genus Jasminum, XVII: sections Trifoliolata and Primulina. Kew Bull (2001) 56:903915.
Green PS. Oleaceae. In: Flowering plants, Dicotyledons: Lamiales (except Acanthaceae including Avicenniaceae)Kadereit JW, ed. (2004) Vol. 7. New York: Springer-Verlag. 296306.
Hachtel W, Neuss A, Stein JV. A chloroplast DNA inversion marks an evolutionary split in the genus Oenothera. Evolution (1991) 45:10501052.[CrossRef][Web of Science]
Hansen AK, Gilbert LE, Simpson BB, Downie SR, Cervi AC, Jansen RK. Phylogenetic relationships and chromosome number evolution in Passifloria. Syst Bot (2006) 31:138150.[CrossRef]
Hauser G, Olson R, Simon D, Johnson I, Sanders ER, Karol KG, McCourt RM, Zimmerly S. Origin and evolution of the chloroplast trnK (matK) intron: a model for evolution of group II intron RNA structures. Mol Biol Evol (2006) 23:380391.
Heywood VH. Flowering plants of the world (1978) New York: Mayflower Books. 335.
Higgins DG, Thompson JD, Gibson TJ. Using CLUSTAL for multiple sequence alignments. Methods Enzymol (1996) 266:383402.[Web of Science][Medline]
Hiratsuka J, Shimada H, Whittier R, et al, (16 co-authors). The complete sequence of the rice (Oryza sativa) chloroplast genome: intermolecular recombination between distinct tRNA genes accounts for a major plastid DNA inversion during the evolution of the cereals. Mol Gen Genet (1989) 217:185194.[CrossRef][Web of Science][Medline]
Hoot SB, Palmer JD. Structural rearrangements, including parallel inversions, within the chloroplast genome of Anemone and related genera. J Mol Evol (1994) 38:274281.[CrossRef][Web of Science][Medline]
Jansen RK, Palmer JD. A chloroplast DNA inversion marks an ancient evolutionary split in the sunflower family (Asteraceae). Proc Natl Acad Sci USA (1987) 84:58185822.
Johnson LAS. A review of the family Oleaceae. Contr New South Wales Natl Herb (1957) 2:395418.
Katayama H, Ogihara Y. Phylogenetic affinities of the grasses to other monocots as revealed by molecular analysis of chloroplast DNA. Curr Genet (1996) 29:572581.[Web of Science][Medline]
Kavvadas DS. Olive tree. In: In: Illustrated botanical-Phytological dictionaryKavvadas DS, ed. (1956) Vol. 3. Athens (Greece): G. P. Xenos. 13691401. Greek.
Kim KJ, Choi KS, Jansen RK. Two chloroplast DNA inversions originated simultaneously during early evolution in the sunflower family. Mol Biol Evol (2005) 22:17831792.
Kim YD, Jansen RK. Characterization and phylogenetic distribution of a chloroplast DNA rearrangement in the Berberidaceae. Plant Syst Evol (1994) 193:107114.[CrossRef]
Kim KJ, Jansen RK. ndhF sequence evolution and the major clades in the sunflower family. Proc Natl Acad Sci USA (1995) 92:1037910383.
Kim KJ, Lee HL. Complete chloroplast genome sequences from Korean ginseng (Panax schinseng Nees) and comparative analysis of sequence evolution among 17 vascular plants. DNA Res (2004) 11:247261.[Abstract]
Knox EB, Downie SR, Palmer JD. Chloroplast genome rearrangements and the evolution of giant lobelias from herbaceous ancestors. Mol Biol Evol (1993) 10:414430.[Web of Science]
Knox EB, Palmer JD. The chloroplast genome arrangement of Lobelia thuliniana Lobeliaceae: expansion of the inverted repeat in an ancestor of the Campanulales. Plant Syst Evol (1999) 214:4964.[CrossRef]
Koch MA, Dobes C, Matschinger M, Bleeker W, Vogel J, Kiefer M, Mitchell-Olds T. Evolution of the trnF(GAA) gene in Arabidopsis relatives and the Brassicaceae family: monophyletic origin and subsequent diversification of a plastidic pseudogene. Mol Biol Evol (2005) 22:10321043.
Kumar S, Koichiro T, Jakobsen IB, Nei M. MEGA2: molecular evolutionary genetics analysis software. Bioinformatics (2001) 17:12441245.
Kurtz S, Choudhuri JV, Ohlebusch E, Schleiermacher C, Stoye J, Giegerich R. REPuter: the manifold applications of repeat analysis on a genomic scale. Nucleic Acids Res (2001) 29:46334642.
Lidholm J, Gustafsson P. A three-step model for the rearrangement of the chloroplast trnK-psbA region of the gymnosperm Pinus contorta. Nucleic Acids Res (1991) 19:28812887.
Lowe TM, Eddy SR. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res (1997) 25:955964.
Maddison WP, Maddison DR. MacClade, v, 4.0 (release version 4.06) (2003) Sunderland (MA): Sinauer Associates.
Maier RM, Neckermann K, Igloi GL, Kosel H. Complete sequence of the maize chloroplast genome: gene content, hotspots of divergence and fine tuning of genetic information by transcript editing. J Mol Biol (1995) 251:614628.
Martin W, Stoebe B, Goremykin V, Hansmann S, Hasegawa M, Kowallik K. Gene transfer to the nucleus and the evolution of chloroplasts. Nature (1998) 393:162165.[CrossRef][Medline]
Matsuo M, Ito Y, Yamauchi R, Obokata J. The rice nuclear genome continuously integrates, shuffles, and eliminates the chloroplast genome to cause chloroplast-nuclear DNA flux. Plant Cell (2005) 17:665675.
Milligan BG, Hampton JN, Palmer JD. Dispersed repeats and structural reorganization in subclover chloroplast DNA. Mol Biol Evol (1989) 6:355368.[Abstract]
Muller J. Fossil pollen records of extant angiosperms. Bot Rev (1981) 47:1142.
Ogihara Y, Terachi T, Sasakuma T. Intramolecular recombination of chloroplast genome mediated by short direct-repeat sequences in wheat species. Proc Natl Acad Sci USA (1988) 85:85738577.
Olmstead RG, Michaels HJ, Scott KM, Palmer JD. Monophyly of the Asteridae and identification of their major lineages inferred from DNA sequences of rbcL. Ann Mo Bot Gard (1992) 79:249265.[CrossRef]
Olmstead RG, Palmer JD. A chloroplast DNA phylogeny of the Solanaceae: subfamilial relationships and character evolution. Ann Mo Bot Gard (1992) 79:346360.[CrossRef]
Palmer JD. Isolation and structural analysis of chloroplast DNA. In: Methods in enzymologyWeissbach A, Weissbach H, eds. (1986) Vol. 118. New York: Academic Press. 167186.[Web of Science]
Palmer JD. Contrasting modes and tempos of genome evolution in land plant organelles. Trends Genet (1990) 6:115120.[CrossRef][Web of Science][Medline]
Palmer JD. Plastid chromosomes: structure and evolution. In: Cell culture and somatic cell genetics in plants. Vol. 7A. The molecular biology of plastidsVasil IK, Bogorad L, eds. (1991) San Diego: Academic Press. 553.
Palmer JD, Downie SR, Nugent JM, Brandt R, Unseld M, Brennicke A, Schuster W, Borner T. The chloroplast and mitochondrial DNAs of Arabidopsis thaliana: conventional genomes in an unconventional plant. In: ArabidopsisSomerville C, Meyerowitz E, eds. (1994) Cold Spring Harbor, New York: Cold Spring Harbor Press. 3767.
Palmer JD, Nugent JM, Herbon LA. Unusual structure of geranium chloroplast DNA: a triple-sized inverted repeat, extensive gene duplications, multiple inversions, and two repeat families. Proc Natl Acad Sci USA (1987) 84:769773.
Palmer JD, Osorio B, Aldrich J, Thompson WF. Chloroplast DNA evolution among legumes: loss of a large inverted repeat occurred prior to other sequence rearrangements. Curr Genet (1987) 11:275286.[CrossRef][Web of Science]
Palmer JD, Osorio B, Thompson WF. Evolutionary significance of inversions in legume chloroplast DNAs. Curr Genet (1988) 14:6574.[CrossRef][Web of Science]
Palmer JD, Thompson WF. Rearrangements in the chloroplast genomes of mung bean and pea. Proc Natl Acad Sci USA (1981) 78:55335537.
Perry AS, Brennan S, Murphy DJ, Wolfe KH. Evolutionary re-organization of a large operon in Adzuki bean chloroplast DNA caused by inverted repeat movement. DNA Res (2002) 9:157162.[Abstract]
Pombert JF, Lemieux C, Turmel M. The complete chloroplast DNA sequence of the green alga Oltmannsiellopsis viridis reveals a distinctive quadripartite architecture in the chloroplast genome of early diverging ulvophytes. BMC Biol (2006) 4:3.[CrossRef][Medline]
Pombert JF, Otis C, Lemieux C, Turmel M. The chloroplast genome sequence of the green alga Pseudendoclonium akinetum (Ulvophyceae) reveals unusual structural features and new insights into the branching order of chlorophyte lineages. Mol Biol Evol (2005) 22:19031918.
Posada D, Crandall KA. Modeltest: testing the model of DNA substitution. Bioinformatics (1998) 14:817818.
Prombona A, Subramanian AR. A new rearrangement of angiosperm chloroplast DNA in rye (Secale cereale) involving translocation and duplication of the ribosomal rpS15 gene. J Biol Chem (1989) 264:1906019065.
Quigley F, Weil JH. Organization and sequence of 5 transfer-RNA genes and of an unidentified reading frame in the wheat chloroplast genomeevidence for gene rearrangements during the evolution of chloroplast genomes. Curr Genet (1985) 96:495503.
Raubeson LA, Jansen RK. A rare chloroplast DNA structural mutation is shared by all conifers. Biochem Syst Ecol (1992) 20:1724.[CrossRef][Web of Science]
Raubeson LA, Jansen RK. Chloroplast genomes of plants. In: Diversity and evolution of plantsgenotypic and phenotypic variation in higher plantsHenry R, ed. (2005) Oxfordshire: CABI Publishing. 4568.
Rehder A. Manual of cultivated trees and shrubs (1940) 2nd ed. New York: McMillan. 452481.
Rocha EPC. DNA repeats lead to the accelerated loss of gene order in bacteria. Trends Genet (2003) 19:600603.[CrossRef][Web of Science][Medline]
Rohwer JG. Seed characters in Jasminum (Oleaceae): unexpected support for De Candolles's section. Bot Jahrb Syst (1994) 116:299319.
Rohwer JG. Seed characters in Jasminum (Oleaceae). II. Evidence from additional species. Bot Jahrb Syst (1995) 117:299315.
Sanderson MJ. Estimating absolute rates of molecular evolution and divergence times: a penalized likelihood approach. Mol Biol Evol (2002) 19:101109.
Sanderson MJ. r8s: inferring absolute rates of molecular evolution and divergence times in the absence of a molecular clock. Bioinformatics (2003) 19:301302.
Saski C, Lee SB, Daniell H, Wood TC, Tomkins J, Kim HG, Jansen RK. Complete chloroplast genome sequence of Glycine max and comparative analyses with other legume genomes. Plant Mol Biol (2005) 59:309322.[CrossRef][Web of Science][Medline]
Shinozaki K, Ohime M, Tanaka M, et al, (23 co-authors). The complete nucleotide sequence of tobacco chloroplast genome: its gene organization and expression. EMBO J (1986) 5:20432049.[Web of Science][Medline]
Stefanovic S, Olmstead G. Down the slippery slope: plastid genome evolution in Convolvulaceae. J Mol Evol (2005) 61:292305.[CrossRef][Web of Science][Medline]
Steyermark JA. A revision of the genus Menodora. Ann Mo Bot Gard (1932) 19:87176.[CrossRef]
Strauss SH, Palmer JD, Howe GT, Doerksen AH. Chloroplast genomes of two conifers lack a large inverted repeat and are extensively rearranged. Proc Natl Acad Sci USA (1998) 85:38983902.[CrossRef]
Swofford DL. PAUP: phylogenetic analysis using parsimony and other methods. In: Version 4.0 (2002) Sunderland (MA): Sinauer Associates.
Takahashi M, Crane PR, Manchester SR. Hironoia fusiformis gen. et sp. nov.: a cornalean fruit from the Kamikitaba locality (Upper Cretaceous, Lower Coniacian) in northeastern Japan. J Plant Res (2002) 115:463473.[CrossRef][Web of Science][Medline]
Taylor H. Cyto-taxonomy and phylogeny of the Oleaceae. Brittonia (1945) 5:337367.[CrossRef]
Torazawa K, Hayashida N, Obokata J, Shinozaki K, Sugiura M. The 5' part of the gene for ribosomal protein S12 is located 30 kbp downstream from its 3' part in tobacco chloroplast genome. Nucleic Acids Res (1986) 14:3143.
Turmel M, Otis C, Lemieux C. The chloroplast and mitochondrial genome sequences of the charophyte Chaetosphaeridium globosum: insights into the timing of the events that restructured organelle DNAs within the green algal lineage that led to land plants. Proc Natl Acad Sci USA (2002) 99:1127511280.
Turmel M, Otis C, Lemieux C. The complete chloroplast DNA sequences of the charophycean green algae Staurastrum and Zygnema reveal that the chloroplast genome underwent extensive change during the evolution of Zygnematales. BMC Biol (2005) 3:22.[CrossRef][Medline]
Turner BL. An overview of the North American species of Menodora (Oleaceae). Phytologia (1991) 71:340356.
Wakasugi T, Tsudzuki J, Ito S, Nakashima K, Tsudzuki T, Sugiura M. Loss of all ndh genes as determined by sequencing the entire chloroplast genome of the black pine Pinus thunbergii. Proc Natl Acad Sci USA (1994) 91:97949798.
Wallander E, Albert VA. Phylogeny and classification of the Oleaceae based on rps16 and trnL-F sequence data. Am J Bot (2000) 87:18271841.
Wolf PG, Rowe CA, Sinclair RB, Hasebe M. Complete nucleotide sequence of the chloroplast genome from a leptosporangiate fern, Adiantum capillus-veneris L. DNA Res (2003) 10:5965.[Abstract]
Wolfe KH, Morden CW, Palmer JD. Function and evolution of a minimal plastid genome from a nonphotosynthetic parasitic plant. Proc Natl Acad Sci USA (1992) 89:1064810652.
Wyman SK, Jansen RK, Boore JL. Automatic annotation of organellar genomes with DOGMA. Bioinformatics (2004) 20:32523255.
Zuker M. Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res (2003) 31:34063415.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
G. Besnard, R. Rubio de Casas, P.-A. Christin, and P. Vargas Phylogenetics of Olea (Oleaceae) based on plastid and nuclear ribosomal DNA sequences: Tertiary climatic shifts and lineage differentiation times Ann. Bot., July 1, 2009; 104(1): 143 - 160. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||










