MBE Advance Access originally published online on March 14, 2008
Molecular Biology and Evolution 2008 25(6):1120-1128; doi:10.1093/molbev/msn057
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Research Articles |
On the Expansion of the Pentatricopeptide Repeat Gene Family in Plants

,

,¶
,
* Centre for Computational Systems Biology, University of Western Australia, Perth, Australia
The Centre for Gene Research, Nagoya University, Japan
ARC Centre of Excellence in Plant Energy Biology, University of Western Australia, Perth, Australia
Unité de Recherche en Génomique Végétale (INRA-CNRS-UEVE) Evry, France
|| Department of Botany and Plant Sciences, University of California, Riverside
¶ Institute of Biology, Humboldt University of Berlin, Berlin, Germany
E-mail: iansmall{at}uwa.edu.au
| Abstract |
|---|
|
|
|---|
Pentatricopeptide repeat (PPR) proteins form a huge family in plants (450 members in Arabidopsis and 477 in rice) defined by tandem repetitions of characteristic sequence motifs. Some of these proteins have been shown to play a role in posttranscriptional processes within organelles, and they are thought to be sequence-specific RNA-binding proteins. The origins of this family are obscure as they are lacking from almost all prokaryotes, and the spectacular expansion of the family in land plants is equally enigmatic. In this study, we investigate the growth of the family in plants by undertaking a genome-wide identification and comparison of the PPR genes of 3 organisms: the flowering plants Arabidopsis thaliana and Oryza sativa and the moss Physcomitrella patens. A large majority of the PPR genes in each of the flowering plants are intron less. In contrast, most of the 103 PPR genes in Physcomitrella are intron rich. A phylogenetic comparison of the PPR genes in all 3 species shows similarities between the intron-rich PPR genes in Physcomitrella and the few intron-rich PPR genes in higher plants. Intron-poor PPR genes in all 3 species also display a bias toward a position of their introns at their 5' ends. These results provide compelling evidence that one or more waves of retrotransposition were responsible for the expansion of the PPR gene family in flowering plants. The differing numbers of PPR proteins are highly correlated with differences in organellar RNA editing between the 3 species.
Key Words: Oryza sativa Arabidopsis thaliana Physcomitrella patens introns pentatricopeptide repeat RNA editing
| Introduction |
|---|
|
|
|---|
Pentatricopeptide repeat (PPR) proteins are characterized by tandem repeats of a degenerate 35 amino acid motif, discovered in silico (Small and Peeters 2000
Recent years have seen many experimental investigations of PPR function, motivated by the finding that mutants of several PPR genes in plants display embryo lethal or otherwise spectacular phenotypes (for recent reviews, see Andrés et al. 2007
; Saha et al. 2007
). PPR proteins have been shown to play crucial roles in virtually all stages of organellar gene expression. For example, PPR proteins are associated with both the transcription (Ikeda and Gray 1999
; Pfalz et al. 2006
) and translation machinery (Pusnik et al. 2007
) and involved in many stages of mRNA processing including splicing (Schmitz-Linneweber et al. 2006
; Falcon de Longevialle et al. 2007
), endonucleolytic cleavage (Hashimoto et al. 2003
), and RNA editing (Kotera et al. 2005
; Okuda et al. 2007
). PPR proteins from various higher plants also act to suppress the expression of mitochondrial genes associated with cytoplasmic male sterility (e.g., Desloire et al. 2003
; Gillman et al. 2007
). The growing body of experimental results on PPR protein functions is consistent with the fact that the majority of Arabidopsis PPR proteins are predicted to be targeted to mitochondria or chloroplasts (Lurin et al. 2004
; Small et al. 2004
). The common thread to the various roles implicated for PPR proteins is an RNA-binding activity, demonstrated in several cases (Lahmy et al. 2000
; Mancebo et al. 2001
; Nakamura et al. 2003
; Schmitz-Linneweber et al. 2005
, 2006
; Okuda et al. 2006
).
Computational scans of complete genome sequences reveal that nonplant organisms contain very few PPR-encoding genes (Lurin et al. 2004
; Andrés et al. 2007
). For example, PPR genes are virtually absent from prokaryotes (Pusnik et al. 2007
), and moreover the yeast, Drosophila and human genomes are predicted to contain only 5, 2, and 6 PPR genes, respectively. One of the yeast PPR genes, PET309, was the first PPR gene to be functionally described. It plays an essential role in translation of the mitochondrial COX1 gene (Manthey and McEwen 1995
; Tavares-Carreon et al. 2008
). Mutations in a human PPR protein, LRPPRC, give rise to Leigh syndrome French Canadian variant (Mootha et al. 2003
). LRPPRC has been demonstrated to be a mitochondrial mRNA stabilization factor (Xu et al. 2004
). The nonplant organism with the largest number of predicted PPR genes is the parasitic protozoan Trypanosoma brucei, with 28. Several of these have been shown to be essential for mitochondrial function (Pusnik et al. 2007
). Thus, evidence suggests that the few PPR proteins in nonplants also play roles in organellar (i.e., mitochondrial) gene expression.
The vast difference between the numbers of PPR genes in higher plants and nonplant organisms indicates that a massive expansion of the PPR gene family occurred during the evolution of plants. The expansion was commented on in earlier studies based on Arabidopsis PPR genes (Lurin et al. 2004
; Rivals et al. 2006
); however, its origins and significance remain largely a mystery. In the present study, we take advantage of newly completed genome sequencing results to perform a systematic genome-wide comparison of the PPR genes in 3 organisms, widely separate along the plant lineage: the dicot Arabidopsis, the monocot Oryza sativa (rice), and the moss Physcomitrella patens (Rensing et al. 2008
), hereafter often referred to as "moss" for simplicity. Our results enable us to draw firm conclusions on the causes and timing of the expansion of PPR genes in higher plants. The dramatic differences in numbers of different subclasses of PPR proteins between species also give insight into the evolution and mechanism of RNA editing in plant organelles.
| Materials and Methods |
|---|
|
|
|---|
Identification of PPR-Encoding Genes from Genomic Data
The genome sequence data and gene annotations used in this work were for Arabidopsis thaliana: Release 6 of the Arabidopsis annotation from The Arabidopsis Information Resource (TAIR; http://www.arabidopsis.org), O. sativa: the Osa1 Release 3 (Yuan et al. 2005
The hmmsearch program from the HMMER package (Eddy 1998
) was used to detect PPR motifs in protein and translated genomic sequences. The hidden Markov models used were identical to those used for detection of Arabidopsis PPR motifs (Lurin et al. 2004
). As multiple models based on PPR variants were used, multiple overlapping hits were usually obtained. In these cases, the highest scoring chain of nonoverlapping hits was retained and the alternative overlapping hits discarded.
As an initial screen, genomic nucleotide sequence data in the pseudochromosomes of Arabidopsis and rice and scaffolds of Physcomitrella were translated in all 6 frames. The hmmsearch program was applied to the translated sequence data to identify clusters of all PPR motifs (P, L, S, L2, E, E+, and DYW) separated by fewer than 200 base pairs. These clusters correspond to putative PPR genes and were visualized in modified versions of the FlagDB++ (Samson et al. 2004
) and GBrowse (Stein et al. 2002
), genome browsers. In the case of Physcomitrella, these clusters of motifs correspond to already annotated genes from the JGI genome release and these were used in this study, except where a JGI gene model did not include either a start or stop codon (or both). In these cases, the open reading frames (ORFs) were extended to include these. It is expected that future sequencing results and refinements will significantly improve many of the Physcomitrella models. Following our earlier study on Arabidopsis (Lurin et al. 2004
), several clusters of rice PPR motifs found in the translated genome-wide search fell outside existing gene models. Modified gene models were constructed using the PPR motif data and Genemark.hmm trained on rice sequences (Yuan et al. 2005
) to verify possible alternative exon–intron structures. Several of the Arabidopsis models we proposed earlier (Lurin et al. 2004
) have also been revised after reviewing the new data from rice. Approximately 20% of the Arabidopsis and rice PPR models differ from the current gene models released by TAIR and TIGR, respectively. The most common errors found in the earlier gene models were insertions of extra introns, leading to the noninclusion of sequences encoding PPR motifs, and fusions to downstream exons of a neighboring gene.
The nomenclature of our final rice PPR gene models follows the schema OsPPR_##g##### where the first 2-digit number indicates the chromosome and the second 5-digit number corresponds wherever possible to the equivalent number from the Osa1 gene model. Where this is not possible for models that have been split in 2 or lie entirely in a region predicted to be intergenic in the Osa1 annotation, an appropriate number lying between those of the adjacent Osa1 gene models has been chosen. The nomenclature of the Arabidopsis models follows the same logic except for the single digit chromosome number (AtPPR_#g#####). For Physcomitrella, PPR gene models are named PpPPR_#, numbered sequentially. All gene models can be browsed using the "PPR Genome Browser" based on the Gbrowse software (Stein et al. 2002
) at http://www.plantenergy.uwa.edu.au/applications/osatppr/index.html. This site also contains GFF files describing all gene models.
Sequence Comparisons
Clustal W (Thompson et al. 1994
) was used for protein sequence alignments and for calculating distance trees using the Neighbor-Joining (NJ) method. All the figures in the paper were obtained using default "slow, accurate" parameters; variations in gap opening and extension parameters were tested but made only minor differences to the trees and did not affect any of the conclusions reached in this work. Distance trees were visualized with A Tree Viewer (ATV) (Zmasek and Eddy 2001
) and drawn using a modified version of ATV to produce scalable vector graphics output.
| Results and Discussion |
|---|
|
|
|---|
The PPR Content of the Arabidopsis, Rice, and Moss Genomes
Complete sets of genes encoding PPR proteins in the genomes of Arabidopsis, rice, and moss were identified using techniques described in the Materials and Methods. Final results identified 450 PPR genes in Arabidopsis, 477 in rice, and 103 in moss. The raw numbers of PPR genes in these species are informative: moss diverged early in the evolution of land plants and the much smaller number of PPR genes encoded by this genome compared with those of Arabidopsis and rice suggests at face value that the bulk of the expansion of the PPR family occurred following the divergence of moss and the lineage leading to vascular plants. The number of PPR genes in Arabidopsis and rice are strikingly similar, particularly, so given that there are approximately twice as many predicted protein-coding genes in rice than Arabidopsis. We return to a discussion of this similarity later.
PPR genes can be divided into 4 subclasses based on their C-terminal domain structure and the presence of longer (L) or shorter (S) variant PPR motifs within the tandem arrays of the classic P PPR. Figure 1 displays the numbers of PPR genes divided by species and subclass. A large majority of moss PPR proteins belong to the P subclass, apart from a small set of DYW subclass proteins. No E subclass PPR genes are found in moss, compared with over 100 in Arabidopsis and rice. Note the similar number of genes in Arabidopsis and rice in the various PPR subclasses.
|
A peculiarity of PPR genes in Arabidopsis is that the large majority do not contain any introns (Lurin et al. 2004
|
Phylogenetic Comparison of PPR Proteins
The predicted protein sequences of all PPR genes in this study were aligned by ClustalW to produce a NJ tree, displayed in figure 3. Tree branches are colored by species, numbers of introns, PPR subfamily, and predicted targeting to organelles. The trees are available as high-resolution vectorial figures and in standard New Hampshire format as supplementary material (Supplementary Material online). It is evident from figure 3A that there are few species-specific clusters of PPR genes. In those regions of the tree dominated by Arabidopsis and rice genes, there generally appears an even mix of genes from both species. On the other hand, there is a clear separation between genes in the PPR subclasses (fig. 3C), which group in separate regions in the tree.
|
Targeting to plant organelles on the basis of N-terminal protein sequences for all PPR sequences was predicted by the program Predotar (Small et al. 2004
Evidence That the PPR Gene Family Expanded via Retrotransposition
One of the mechanisms of new gene formation in eukaryotes is retrotransposition, wherein a mature messenger RNA, associated with a retrotransposon, is reverse transcribed and integrated into the genome. For a recent review on the subject, see Babushok et al. (2007)
. Retrotranscribed copies of genes that originally contained introns are thus intron less. Noting the largely intron-less nature of PPR genes in Arabidopsis, Lurin et al. (2004)
suggested that retrotransposition may be responsible for the expansion of the PPR gene family in higher plants. The results of the present study provide compelling evidence that this is indeed the case. We find that the majority of PPR coding sequences in rice are also intron less, whereas the PPR genes of moss typically contain many introns (fig. 2). If retrotransposition was responsible for the expansion of the PPR gene family, the few PPR genes in Arabidopsis and rice with many introns would represent "ancient" PPR genes that predated, and provided the template for, the expanded number of intron-less genes. Support for this hypothesis is immediately revealed by inspection of figure 3A and B, where intron-rich Arabidopsis and rice PPR genes cluster among the intron-rich PPR genes of moss, which diverged early in the plant lineage. An example of a triplet of orthologous intron-rich Arabidopsis, rice, and moss PPR genes is displayed in figure 4A. In figure 4B and C, we also display examples of intron-less Arabidopsis and rice PPR genes along with their intron-containing orthologs in moss. These figures also contain paralogous pairs of moss and rice PPR genes arising from genome duplication events, to be discussed in a later section.
|
Further evidence of retrotransposon-mediated expansion of the PPR family is provided by the distribution of intron positions in intron-poor PPR genes. It has been noted that in eukaryotes that are intron poor, introns are preferentially located at the 5' ends of genes (Fink 1987
Relative intron positions, defined as the length of the ORF upstream of the start of the intron divided by the full length of the ORF, were calculated for all introns in PPR genes of Arabidopsis and rice. The distribution of all intron positions is uniform (data not shown); however, these data are dominated by the introns of the few intron-rich PPRs mentioned above and which, under the hypothesis of retrotransposition-mediated PPR family expansion, can be considered as "ancestral" genes predating the expansion. Following Sakurai et al. (2002)
, figure 5 shows the distribution of intron positions for those PPR genes with a single intron. There is a clear overrepresentation of introns in the 5' end of these genes, as is observed for all introns in intron-poor eukaryotes, and for the introns of those genes with a single intron in several other intron-rich eukaryotes (Sakurai et al. 2002
). In contrast, the distribution of intron position for all introns in Arabidopsis is uniform (Mourier and Jeffares 2003
). Similar distributions to that in figure 5 are found for Arabidopsis, rice, and moss separately, suggesting that intron-poor moss PPR genes have also been generated by reverse transcription. The fact that there are over 100 moss PPR genes and that several are intron less and orthologous to Arabidopsis and rice PPR genes indicates that at least some of the retrotransposition-mediated expansion of the PPR gene family occurred prior to the divergence of moss and vascular plants.
|
The PPR Gene Family Expanded prior to the Monocot/Dicot Divergence
As noted earlier, there is a striking similarity in the number of PPR genes in Arabidopsis and rice. These similarities extend to the breakdown of the number of these genes by subclass (fig. 1), intron content (fig. 2), and are also reflected in the topology of the phylogenetic tree for PPR genes in figure 3A. An extraordinarily large proportion of outermost branches in the phylogenetic tree are pairs of probably orthologous Arabidopsis and rice PPR genes (e.g., AtPPR_5g27270 and OsPPR_06g02120 in fig. 4A). Bootstrap support for these pairs is very strong (generally 100%) and the branch lengths for all these pairs are similar, consistent with the idea that these pairs diverged from each other at roughly the same time, presumably the date of the last common ancestor of rice and Arabidopsis. Recently, several genome-wide phylogenetic analyses of other protein families from rice and Arabidopsis have been conducted. From these studies, pairs of proteins from the outermost branches of phylogenetic trees presented, with maximum bootstrap support, were identified. The proportion of orthologous Arabidopsis/rice protein pairs among all pairs in these studies are compared with that found for PPR proteins in figure 6. It is clear that the PPR protein family stands out in its exceptionally high degree of interspecies conservation of individual proteins. These data on the conserved number and nature of PPR genes in Arabidopsis and rice and their phylogenetic relationships provide strong evidence that the complement of PPRs in these organisms existed prior to the divergence of monocots and dicots, with few examples of gain or loss of PPR genes since that event.
|
It should be noted that 2 regions of the phylogenetic tree in figure 3A do not conform to the general trend observed above and consist of groups of rice-specific and Arabidopsis-specific paralogs. These proteins in these regions are homologous to the restorer-of-fertility (Rf) genes mentioned in the Introduction and found in several plant species (Chase 2007
Effects of Ancient Genome Duplication Events
The history of plant genomes is one of duplications (Sterck et al. 2007
). Over 70% of the Arabidopsis genome contains regions that are remnants of a genome duplication that occurred between 20 and 60 MYA, after the split of monocots and dicots (Blanc et al. 2003
; Bowers et al. 2003
). There is also evidence of earlier genome duplication events in the Arabidopsis lineage (Maere et al. 2005
). Similarly, over 65% of the rice genome is composed of remnants of whole-genome or segmental duplications, including a recent (ca. 8 MYA) segmental duplication between regions of chromosomes 11 and 12 (Yu et al. 2005
), and recently, a genome duplication approximately 30–60 MYA has been detected for Physcomitrella using EST sequence data (Rensing et al. 2007
). The approximate times of these duplication events are marked in figure 7.
|
The high proportion of orthologous Arabidopsis and rice pairs in our phylogenetic analysis indicates that the retention of new PPR genes following genome duplication is very rare in both species. Indeed, using the data of (Blanc et al. 2003
It has been demonstrated that the extent to which both pairs of genes formed by genome duplication are retained varies depending on their function (Blanc and Wolfe 2004
). Rensing et al. (2007)
noted that patterns of retention of duplicate moss genes based on functional categorization are markedly different than for seed plants. The extent to which the differing levels of duplicate PPR gene retention found in moss and the flowering plants is a reflection of a functional divergence of PPR genes in either set remains to be seen.
Implications for PPR Function
Several of the findings in this study are consistent with recent experimental evidence on the functions of PPR genes. The extraordinary conservation of the number of orthologous PPR genes from Arabidopsis and rice over hundreds of millions of years argues for their essentiality—something that is confirmed by gene knockout studies (Lurin et al. 2004
; Cushing et al. 2005
). This conservation also suggests that orthologous PPR proteins in these organisms carry out the same function in Arabidopsis and rice and, presumably, other flowering plants.
Numerous PPR proteins have been shown to bind avidly to RNA (reviewed in Delannoy et al. 2007
) and at least some do so with a high degree of specificity in vitro and in vivo (Nakamura et al. 2003
; Schmitz-Linneweber et al. 2005
, 2006
; Okuda et al. 2006
). Domain-swapping experiments implicate the PPR motifs in RNA recognition (Okuda et al. 2007
), and indeed, apart from a very few exceptions (Schmitz-Linneweber et al. 2006
), PPR proteins do not contain other known RNA-binding motifs. The various extra domains associated with PPR proteins (generally C-terminal) are thought to determine the function of them once bound (splicing, cleavage, editing, RNA stability, enhancing, or blocking translation) (Lurin et al. 2004
). These extra domains can be used to subdivide the PPR family as shown in figure 1 for the major classes. There is a notable difference in the relative proportions of PPR subfamilies among the PPR genes of the angiosperms and moss. For example, Physcomitrella does not contain any E subclass PPRs (fig. 1), whereas these are abundant in Arabidopsis and rice. To date, the molecular function of only 2 E-type PPR proteins is known, and both are implicated in RNA editing. CRR4 (Kotera et al. 2005
) and CRR21 (Okuda et al. 2007
) are required for editing of independent sites on ndhD transcripts in Arabidopsis chloroplasts and are thought to play a role as specificity factors, with the PPR repeats binding the target transcript while the E domain is needed to recruit the (as yet unknown) editing enzyme (Okuda et al. 2006
, 2007
). RNA editing is prevalent in most angiosperm organelles, with over 400 sites catalogued in Arabidopsis (Giege and Brennicke 1999
) and rice (Notsu et al. 2002
) mitochondria and a further 30 or more in plastids (Tsudzuki et al. 2001
; Chateigner-Boutin and Small 2007
). Editing in Physcomitrella is a much rarer event but does occur (Miyata et al. 2002
; Miyata and Sugita 2004
); approximately 9 sites are predicted between the 2 organelles based on their genome sequences (Sugiura et al. 2003
; Terasawa et al. 2007
). As Physcomitrella contains no E-type PPR proteins, something else must take this role in moss. The most likely candidates are the 10 related DYW domain proteins and indeed we have recently proposed, based on sequence similarity and phylogenetic distribution, that the DYW domain could carry the catalytic activity need for cytosine deamination (Salone et al. 2007
). The correlation between the diversity of E and DYW PPR proteins and the number of editing sites in these 3 species is further support for this hypothesis.
| Conclusions |
|---|
|
|
|---|
Our study provides comprehensive genome-wide data on the content, nature, and evolutionary relationship of the PPR genes in 3 organisms: Arabidopsis, rice, and the moss P. patens. Here we have focused on the remarkable expansion of the family in plants and form 3 main conclusions: 1) that the expansion of the family is likely to have been mediated by retrotransposition; 2) that the expansion of the family occurred prior to the monocot/dicot divergence, 3) that since this time little gene loss or gain has occurred, implying considerable conservation of function; and 4) that the expansion of the E and DYW classes in rice and Arabidopsis is correlated with a similar striking increase in the number of RNA editing sites in these 2 species.
| Supplementary Material |
|---|
|
|
|---|
Supplementary material is available at Molecular Biology and Evolution online (http://www.mbe.oxfordjournals.org/) and at http://www.plantenergy.uwa.edu.au/applications/osatppr/index.html. GFF format files of all gene models used in this analysis. FASTA format files of all predicted protein sequences. NHX format file of the NJ tree. Adobe Illustrator files of the trees depicted in figure 3.
| Acknowledgements |
|---|
|
|
|---|
This work was supported by grants from the French Ministry of Education and Research, the French Australian S and T Programme (FR060030), the Australian Research Council (CE0561495), and the Western Australian State Government Centres of Excellence scheme.
| Footnotes |
|---|
Geoffrey McFadden, Associate Editor
| References |
|---|
|
|
|---|
Andrés C, Lurin C, Small ID. The multifarious roles of PPR proteins in plant mitochondrial gene expression. Physiol Plant (2007) 129:14–22.[CrossRef]
Babushok DV, Ostertag EM, Kazazian HH Jr. Current topics in genome evolution: molecular mechanisms of new gene formation. Cell Mol Life Sci (2007) 64:542–554.[CrossRef][Web of Science][Medline]
Blanc G, Hokamp K, Wolfe KH. A recent polyploidy superimposed on older large-scale duplications in the Arabidopsis genome. Genome Res (2003) 13:137–144.
Blanc G, Wolfe KH. Functional divergence of duplicated genes formed by polyploidy during Arabidopsis evolution. Plant Cell (2004) 16:1679–1691.
Bowers JE, Chapman BA, Rong J, Paterson AH. Unravelling angiosperm genome evolution by phylogenetic analysis of chromosomal duplication events. Nature (2003) 422:433–438.[CrossRef][Medline]
Chase CD. Cytoplasmic male sterility: a window to the world of plant mitochondrial-nuclear interactions. Trends Genet (2007) 23:81–90.[CrossRef][Web of Science][Medline]
Chateigner-Boutin AL, Small I. A rapid high-throughput method for the detection and quantification of RNA editing based on high-resolution melting of amplicons. Nucleic Acids Res (2007) 35:e114.
Cushing DA, Forsthoefel NR, Gestaut DR, Vernon DM. Arabidopsis emb175 and other ppr knockout mutants reveal essential roles for pentatricopeptide repeat (PPR) proteins in plant embryogenesis. Planta (2005) 221:424–436.[CrossRef][Web of Science][Medline]
Delannoy E, Stanley WA, Bond CS, Small ID. Pentatricopeptide repeat (PPR) proteins as sequence-specificity factors in post-transcriptional processes in organelles. Biochem Soc Trans (2007) 35:1643–1647.[CrossRef][Web of Science][Medline]
Desloire S, Gherbi H, Laloui W, et al, (14 co-authors). Identification of the fertility restoration locus, Rfo, in radish, as a member of the pentatricopeptide-repeat protein family. EMBO Rep (2003) 4:588–594.[CrossRef][Web of Science][Medline]
Eddy SR. Profile hidden Markov models. Bioinformatics (1998) 14:755–763.
Falcon de Longevialle AF, Meyer EH, Andres C, Taylor NL, Lurin C, Millar AH, Small ID. The pentatricopeptide repeat gene OTP43 is required for trans-splicing of the mitochondrial nad1 intron 1 in Arabidopsis thaliana. Plant Cell (2007) 19:3256–3265.
Fink GR. Pseudogenes in yeast? Cell (1987) 49:5–6.[CrossRef][Web of Science][Medline]
Geddy R, Brown GG. Genes encoding pentatricopeptide repeat (PPR) proteins are not conserved in location in plant genomes and may be subject to diversifying selection. BMC Genomics (2007) 8:130.[CrossRef][Medline]
Giege P, Brennicke A. RNA editing in Arabidopsis mitochondria effects 441 C to U changes in ORFs. Proc Natl Acad Sci USA (1999) 96:15324–15329.
Gillman JD, Bentolila S, Hanson MR. The petunia restorer of fertility protein is part of a large mitochondrial complex that interacts with transcripts of the CMS-associated locus. Plant J (2007) 49:217–227.[CrossRef][Web of Science][Medline]
Hashimoto M, Endo T, Peltier G, Tasaka M, Shikanai T. A nucleus-encoded factor, CRR2, is essential for the expression of chloroplast ndhB in Arabidopsis. Plant J (2003) 36:541–549.[CrossRef][Web of Science][Medline]
Hattori M, Hasebe M, Sugita M. Identification and characterization of cDNAs encoding pentatricopeptide repeat proteins in the basal land plant, the moss Physcomitrella patens. Gene (2004) 343:305–311.[CrossRef][Web of Science][Medline]
Ikeda TM, Gray MW. Characterization of a DNA-binding protein implicated in transcription in wheat mitochondria. Mol Cell Biol (1999) 19:8113–8122.
Karpenahalli MR, Lupas AN, Soding J. TPRpred: a tool for prediction of TPR-, PPR- and SEL1-like repeats from protein sequences. BMC Bioinformatics (2007) 8:2.[CrossRef][Medline]
Kim J, Shiu SH, Thoma S, Li WH, Patterson SE. Patterns of expansion and expression divergence in the plant polygalacturonase gene family. Genome Biol (2006) 7:R87.[CrossRef][Medline]
Kotera E, Tasaka M, Shikanai T. A pentatricopeptide repeat protein is essential for RNA editing in chloroplasts. Nature (2005) 433:326–330.[CrossRef][Medline]
Lahmy S, Barneche F, Derancourt J, Filipowicz W, Delseny M, Echeverria M. A chloroplastic RNA-binding protein is a new member of the PPR family. FEBS Lett (2000) 480:255–260.[CrossRef][Web of Science][Medline]
Li X, Duan X, Jiang H, et al, (13 co-authors). Genome-wide analysis of basic/helix-loop-helix transcription factor family in rice and Arabidopsis. Plant Physiol (2006) 141:1167–1184.
Lijavetzky D, Carbonero P, Vicente-Carbajosa J. Genome-wide comparative phylogenetic analysis of the rice and Arabidopsis Dof gene families. BMC Evol Biol (2003) 3:17.[CrossRef][Medline]
Lin K, Zhang DY. The excess of 5' introns in eukaryotic genomes. Nucleic Acids Res (2005) 33:6522–6527.
Lurin C, Andres C, Aubourg S, et al, (19 co-authors). Genome-wide analysis of Arabidopsis pentatricopeptide repeat proteins reveals their essential role in organelle biogenesis. Plant Cell (2004) 16:2089–2103.
Maere S, De Bodt S, Raes J, Casneuf T, Van Montagu M, Kuiper M, Van de Peer Y. Modeling gene and genome duplications in eukaryotes. Proc Natl Acad Sci USA (2005) 102:5454–5459.
Mancebo R, Zhou X, Shillinglaw W, Henzel W, Macdonald PM. BSF binds specifically to the bicoid mRNA 3' untranslated region and contributes to stabilization of bicoid mRNA. Mol Cell Biol (2001) 21:3462–3471.
Manthey GM, McEwen JE. The product of the nuclear gene PET309 is required for translation of mature mRNA and stability or production of intron-containing RNAs derived from the mitochondrial COX1 locus of Saccharomyces cerevisiae. EMBO J (1995) 14:4031–4043.[Web of Science][Medline]
Miyata Y, Sugita M. Tissue- and stage-specific RNA editing of rps 14 transcripts in moss (Physcomitrella patens) chloroplasts. J Plant Physiol (2004) 161:113–115.[CrossRef][Web of Science][Medline]
Miyata Y, Sugiura C, Kobayashi Y, Hagiwara M, Sugita M. Chloroplast ribosomal S14 protein transcript is edited to create a translation initiation codon in the moss Physcomitrella patens. Biochim Biophys Acta (2002) 1576:346–349.[Medline]
Mootha VK, Lepage P, Miller K, et al, (17 co-authors). Identification of a gene causing human cytochrome c oxidase deficiency by integrative genomics. Proc Natl Acad Sci USA (2003) 100:605–610.
Mourier T, Jeffares DC. Eukaryotic intron loss. Science (2003) 300:1393.
Nakamura T, Meierhoff K, Westhoff P, Schuster G. RNA-binding properties of HCF152, an Arabidopsis PPR protein involved in the processing of chloroplast RNA. Eur J Biochem (2003) 270:4070–4081.[Web of Science][Medline]
Nakamura T, Schuster G, Sugiura M, Sugita M. Chloroplast RNA-binding and pentatricopeptide repeat proteins. Biochem Soc Trans (2004) 32:571–574.[CrossRef][Web of Science][Medline]
Notsu Y, Masood S, Nishikawa T, Kubo N, Akiduki G, Nakazono M, Hirai A, Kadowaki K. The complete sequence of the rice (Oryza sativa L.) mitochondrial genome: frequent DNA sequence acquisition and loss during the evolution of flowering plants. Mol Genet Genomics (2002) 268:434–445.[CrossRef][Web of Science][Medline]
Okuda K, Myouga F, Motohashi R, Shinozaki K, Shikanai T. Conserved domain structure of pentatricopeptide repeat proteins involved in chloroplast RNA editing. Proc Natl Acad Sci USA (2007) 104:8178–8183.
Okuda K, Nakamura T, Sugita M, Shimizu T, Shikanai T. A pentatricopeptide repeat protein is a site recognition factor in chloroplast RNA editing. J Biol Chem (2006) 281:37661–37667.
Ooka H, Satoh K, Doi K, et al, (16 co-authors). Comprehensive analysis of NAC family genes in Oryza sativa and Arabidopsis thaliana. DNA Res (2003) 10:239–247.[Abstract]
Pfalz J, Liere K, Kandlbinder A, Dietz KJ, Oelmuller R. pTAC2, -6, and -12 are components of the transcriptionally active plastid chromosome that are required for plastid gene expression. Plant Cell (2006) 18:176–197.
Pusnik M, Small I, Read LK, Fabbro T, Schneider A. Pentatricopeptide repeat proteins in Trypanosoma brucei function in mitochondrial ribosomes. Mol Cell Biol (2007) 27:6876–6888.
Rensing SA, Ick J, Fawcett JA, Lang D, Zimmer A, Van de Peer Y, Reski R. An ancient genome duplication contributed to the abundance of metabolic genes in the moss Physcomitrella patens. BMC Evol Biol (2007) 7:130.[CrossRef][Medline]
Rensing SA, Lang D, Zimmer AD, et al, (70 co-authors). The Physcomitrella genome reveals evolutionary insights into the conquest of land by plants. Science (2008) 319:64–69.
Reyes JC, Muro-Pastor MI, Florencio FJ. The GATA family of transcription factors in Arabidopsis and rice. Plant Physiol (2004) 134:1718–1732.
Rivals E, Bruyere C, Toffano-Nioche C, Lecharny A. Formation of the Arabidopsis pentatricopeptide repeat family. Plant Physiol (2006) 141:825–839.
Roy SW, Gilbert W. Rates of intron loss and gain: implications for early eukaryotic evolution. Proc Natl Acad Sci USA (2005) 102:5773–5778.
Saha D, Prasad AM, Srinivasan R. Pentatricopeptide repeat proteins and their emerging roles in plants. Plant Physiol Biochem (2007) 45:521–534.[CrossRef][Web of Science][Medline]
Sakurai A, Fujimori S, Kochiwa H, Kitamura-Abe S, Washio T, Saito R, Carninci P, Hayashizaki Y, Tomita M. On biased distribution of introns in various eukaryotes. Gene (2002) 300:89–95.[CrossRef][Web of Science][Medline]
Salone V, Rudinger M, Polsakiewicz M, Hoffmann B, Groth-Malonek M, Szurek B, Small I, Knoop V, Lurin C. A hypothesis on the identification of the editing enzyme in plant organelles. FEBS Lett (2007) 581:4132–4138.[CrossRef][Web of Science][Medline]
Samson F, Brunaud V, Duchene S, De Oliveira Y, Caboche M, Lecharny A, Aubourg S. FLAGdb++: a database for the functional analysis of the Arabidopsis genome. Nucleic Acids Res (2004) 32:D347–D350.
Schmitz-Linneweber C, Williams-Carrier R, Barkan A. RNA immunoprecipitation and microarray analysis show a chloroplast pentatricopeptide repeat protein to be associated with the 5' region of mRNAs whose translation it activates. Plant Cell (2005) 17:2791–2804.
Schmitz-Linneweber C, Williams-Carrier RE, Williams-Voelker PM, Kroeger TS, Vichas A, Barkan A. A pentatricopeptide repeat protein facilitates the trans-splicing of the maize chloroplast rps12 pre-mRNA. Plant Cell (2006) 18:2650–2663.
Serrano M, Parra S, Alcaraz LD, Guzman P. The ATL gene family from Arabidopsis thaliana and Oryza sativa comprises a large number of putative ubiquitin ligases of the RING-H2 type. J Mol Evol (2006) 62:434–445.[CrossRef][Web of Science][Medline]
Small I, Peeters N, Legeai F, Lurin C. Predotar: a tool for rapidly screening proteomes for N-terminal targeting sequences. Proteomics (2004) 4:1581–1590.[CrossRef][Web of Science][Medline]
Small ID, Peeters N. The PPR motif—a TPR-related motif prevalent in plant organellar proteins. Trends Biochem Sci (2000) 25:46–47.[Web of Science][Medline]
Stein LD, Mungall C, Shu S, et al, (11 co-authors). The generic genome browser: a building block for a model organism system database. Genome Res (2002) 12:1599–1610.
Sterck L, Rombauts S, Vandepoele K, Rouze P, Van de Peer Y. How many genes are there in plants (and why are they there)? Curr Opin Plant Biol (2007) 10:199–203.[CrossRef][Web of Science][Medline]
Sugiura C, Kobayashi Y, Aoki S, Sugita C, Sugita M. Complete chloroplast DNA sequence of the moss Physcomitrella patens: evidence for the loss and relocation of rpoA from the chloroplast to the nucleus. Nucleic Acids Res (2003) 31:5324–5331.
Tavares-Carreon F, Camacho-Villasana Y, Zamudio-Ochoa A, Shingu-Vazquez M, Torres-Larios A, Perez-Martinez X. The pentatricopeptide repeats present in Pet309 are necessary for translation but not for stability of the mitochondrial COX1 mRNA in yeast. J Biol Chem (2008) 283:1472–1479.
Terasawa K, Odahara M, Kabeya Y, Kikugawa T, Sekine Y, Fujiwara M, Sato N. The mitochondrial genome of the moss Physcomitrella patens sheds new light on mitochondrial evolution in land plants. Mol Biol Evol (2007) 24:699–709.
Thompson JD, Higgins DG, Gibson TJ. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res (1994) 22:4673–4680.
Tsudzuki T, Wakasugi T, Sugiura M. Comparative analysis of RNA editing sites in higher plant chloroplasts. J Mol Evol (2001) 53:327–332.[CrossRef][Web of Science][Medline]
Wu KL, Guo ZJ, Wang HH, Li J. The WRKY family of transcription factors in rice and Arabidopsis and their origins. DNA Res (2005) 12:9–26.[Abstract]
Xu F, Morin C, Mitchell G, Ackerley C, Robinson BH. The role of the LRPPRC (leucine-rich pentatricopeptide repeat cassette) gene in cytochrome oxidase assembly: mutation causes lowered levels of COX (cytochrome c oxidase) I and COX III mRNA. Biochem J (2004) 382:331–336.[CrossRef][Web of Science][Medline]
Yu JJ, Wang W, Lin S, et al, (117 co-authors). The genomes of Oryza sativa: a history of duplications. PLoS Biol (2005) 3:e38.[CrossRef][Medline]
Yuan Q, Ouyang S, Wang A, et al, (12 co-authors). The institute for genomic research Osa1 rice genome annotation database. Plant Physiol (2005) 138:18–26.
Zmasek CM, Eddy SR. ATV: display and manipulation of annotated phylogenetic trees. Bioinformatics (2001) 17:383–384.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
W. Cai, D. Ji, L. Peng, J. Guo, J. Ma, M. Zou, C. Lu, and L. Zhang LPA66 Is Required for Editing psbF Chloroplast Transcripts in Arabidopsis Plant Physiology, July 1, 2009; 150(3): 1260 - 1271. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. C. Marti, E. Olmos, J. J. Calvete, I. Diaz, S. Barranco-Medina, J. Whelan, J. J. Lazaro, F. Sevilla, and A. Jimenez Mitochondrial and Nuclear Localization of a Novel Pea Thioredoxin: Identification of Its Mitochondrial Target Proteins Plant Physiology, June 1, 2009; 150(2): 646 - 657. [Abstract] |







