MBE Advance Access originally published online on October 16, 2007
Molecular Biology and Evolution 2008 25(1):62-68; doi:10.1093/molbev/msm227
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Research Articles |
Chromosome-Specific Distribution of Nucleotide Substitutions in Telomeric Repeats of Rice (Oryza sativa L.)

* National Institute of Agrobiological Sciences, 1-2, Kannondai 2-chome, Tsukuba, Ibaraki 305-8602, Japan
Institute of the Society for Techno-innovation of Agriculture, Forestry and Fisheries, 446-1, Ippaizuka, Kamiyokoba, Tsukuba, Ibaraki 305-0854, Japan
E-mail: mat{at}nias.affrc.go.jp.
| Abstract |
|---|
|
|
|---|
Examination of the genomic sequence of the telomere region makes it possible to understand the evolution of the structure of chromosomal ends. We compared the genomic sequences of 14 chromosomal ends of rice, Oryza sativa, L., on the basis of the variation in TTTAGGG repeats. In the proximal telomere repeats, nucleotide substitution occurred more frequently than in the more distal repeats. The most significant diversity was observed at the 1st, 2nd, or 3rd position of TTTAGGG, suggesting that T has been a target of mutation preferentially. Copies of ATTAGGG, CTTAGGG, GTTAGGG, TTCAGGG, TTGAGGG, or TATAGGG were arrayed in tandem, or the same subtypes were located close to each other. The substituted variants were accumulated in chromosomes 2L, 3L, 7L, and 10S but not in the ends of the other chromosomes. In contrast, deletion variants, almost all of which were TTTAGGG to TTAGGG, were dispersed over approximately 4.9% of the sequenced telomere repeats. In summary, the rice proximal telomeric arrays were composed of blocks of at least 6 types of substituted variants and the canonical sequence in a chromosome-specific manner. These results suggest that the variants might arise from the rapid expansion of a single mutation rather than from the gradual accumulation of random mutations.
Key Words: plant telomere tandem repeats mutation
| Introduction |
|---|
|
|
|---|
Telomere protein–DNA complexes form the ends of linear eukaryotic chromosomes and serve as protective caps that prevent fusion and degradation of the chromosomal ends (McEachern et al. 2000
To maintain the chromosomal ends, telomerase extends the telomeric repeats using its own RNA as a template. As telomerase reconstructs only the distal end of the repeat, the proximal repeats might not have been reconstructed for a long time on an evolutionary time scale. Therefore, the rate of accumulation of mutations might differ between the proximal and distal ends of the telomere. Do the most proximal arrays accumulate random nucleotide changes?
The rice species Oryza sativa L. is considered to be a model monocot plant because of its small genome size and synteny with other cereal crops (Devos 2005
). The International Rice Genome Sequencing Project (IRGSP) has completed a high-quality map-based sequencing of the genome of the japonica cultivar Nipponbare, and the whole-genomic sequence and its annotation are available in a public database (IRGSP 2005
; Rice Annotation Project 2007
). However, the highly repetitive regions, including the telomeres and centromeres, have not yet been fully characterized (Mizuno et al. 2006a
, 2006b
).
Here, we describe a comprehensive analysis of nucleotide substitutions in the telomeric arrays on the chromosome ends of rice. We obtained new genomic sequences for the ends of 6 chromosomes and compared them with 8 published genomic sequences. We elucidated the chromosome-specific distributions of substituted, deleted, or inserted telomere repeats; such distributions might be keys to investigations of the evolution of the structure of chromosomal ends. We address the comparative analysis of telomere variants among organisms. Finally, we discuss the mechanism for the generation of telomere variants and the functional effects in telomere homeostasis.
| Materials and Methods |
|---|
|
|
|---|
Screening, Mapping, and Sequencing of Fosmid Clones
The fosmid library of genomic DNA derived from the rice cultivar Nipponbare (O. sativa L. ssp. japonica, JP229579 in Genebank of National Institute of Agrobiological Sciences) was screened with overgo probes, as described previously (Chen et al. 2002
| Results |
|---|
|
|
|---|
Mapping and Sequencing of Rice Telomere Repeats
We obtained the telomere sequences at the ends of several rice chromosomes. Fosmid clones that contained telomere tandem repeats were screened and mapped to the ends of chromosomes 3S, 3L, 4S, 4L, 5S, and 10S. They contained copies of TTTAGGG, its variants, and the adjacent chromosome-specific sequences (table 1). Eight other fosmid clones that also contained telomere tandem repeats were drawn from public databases (table 1); they had previously been mapped to the ends of chromosomes 1S, 2S, 2L, 6L, 7S, 7L, 8S, and 9S (Fujisawa et al. 2006
|
Accumulation of Substituted Variants in Proximal Telomere Repeats
We compared the numbers of TTTAGGG variants at the distal parts (distal to the centromere) and proximal parts (proximal to the centromere) of the first 50 telomere repeats. Fifty copies of telomere repeats adjacent to the chromosome-specific region on chromosomes 2S, 2L, 3L, 4S, 4L, 5S, 7L, 8S, 9S, and 10S were compared (fig. 1). As chromosomes 1S, 3S, 6L, and 7S contained fewer than 50 copies of sequenced telomere repeats (table 1), they were not examined. The sum of total variants in the 10 ends studied was higher in about the first 30 units from the beginning of the telomere array than in the remaining units (fig. 1A). The sum of substitution variants was higher among the first 30 units than among the remainder (fig. 1B) but that of deletion or insertion variants was spread about evenly among units (fig. 1C). Therefore, nucleotide substitution occurred at a high rate over a span of approximately 30 units from the beginning of the telomere array.
|
Nucleotide Substitutions or Deletions in TTTAGGG Repeats
We characterized the rates of nucleotide change in the telomere repeats of 14 chromosomal ends. By comparing the 30-telomere units adjacent to the chromosome-specific region, we identified various kinds of nucleotide substitutions, deletions, and insertions in the TTTAGGG sequence (fig. 2). There were many nucleotide substitutions in chromosomes 2L, 3L, 7L, and 10S, and these chromosomes showed different patterns of substitution (fig. 2). Most substituted units contained 1-nt substitution, and there were no more than 2 substitutions in 1 unit (data not shown). Thus, we classified the patterns on the basis of the positions of the nucleotide substitutions (fig. 3):
- (i) 1st position in TTTAGGG. All types of substitutions (T–A, T–C, T–G) occurred, changing the sequence to ATTAGGG, CTTAGGG, or GTTAGGG. These TTTAGGG variants were found on chromosomes 2L, 3L, and 10S (fig. 3).
- (ii) 2nd position of TTTAGGG. Only one type of substitution (T–A) occurred, changing the sequence to TATAGGG (fig. 3). This type of substitution was previously reported from chromosomes 7L (Mizuno et al. 2006b
), but we found TATAGGG arrays on both chromosomes 7L and 3L (fig. 3).
- (iii) 3rd position of TTTAGGG. T–C or T–G substitutions changed the sequence to TTCAGGG or TTGAGGG (fig. 3). These TTTAGGG variants were specific to the end of chromosome 3L alone (fig. 3).
- (iv) 4th, 5th, 6th, or 7th position of TTTAGGG. There was no obvious clustering of TTTAGGG variants that had nucleotide substitutions at the 4th, 5th, 6th, or 7th positions of TTTAGGG.
- (ii) 2nd position of TTTAGGG. Only one type of substitution (T–A) occurred, changing the sequence to TATAGGG (fig. 3). This type of substitution was previously reported from chromosomes 7L (Mizuno et al. 2006b
|
|
In total, the rice telomeres contained at least 6 types of TTTAGGG variant: ATTAGGG, CTTAGGG, GTTAGGG, TATAGGG, TTCAGGG, and TTGAGGG.
Most deletions occurred at one of the Ts in TTT, changing the sequence from TTTAGGG to TTAGGG. Few deletions of a G in GGG were observed (fig. 4A). The T nucleotide in the 7-nt unit was deleted in 4.9% (44/897) of sequenced repeats. Consequently, changes in sequence from T to N occurred preferentially because the nucleotide substitution or deletion occurred most commonly at the 1st, 2nd, or 3rd position of TTTAGGG.
|
Distribution of TTTAGGG Variants
By comparing the TTTAGGG sequence or its variants, we found marked differences in distribution patterns depending on the subtype of variants. Nucleotide-substituted TTTAGGG variants were often arrayed in tandem, or the same subtypes were often close to each other (fig. 3). Among the 14 chromosomal ends, these variants were found only on the ends of chromosomes 2L, 3L, 7L, and 10S (fig. 3). The other ends had hardly any substituted repeats (data not shown). In contrast, TTTAGGG deletion or insertion variants were dispersed throughout the sequenced regions of almost all ends (typical examples: fig. 4B). Inversion of part of the array was observed in chromosomes 4L, 7S, and 9S (fig. 4C). These inversions were located adjacent to the chromosome-specific region and were followed by a 6-to 28-bp junction (fig. 4C). These results indicate that substitutions or inversions were accumulated in tandem and distributed in a chromosome-specific manner. On the other hand, TTTAGGG nucleotide deletion or insertion variants were dispersed over the ends of almost all chromosomes.
| Discussion |
|---|
|
|
|---|
Evolution of Proximal Telomere Arrays
We compared the genomic sequences of 14 chromosomal ends in rice. The rate of accumulation of telomere variants was higher in the proximal region than in the distal region (fig. 1), suggesting that the proximal region had been rarely reconstructed by telomerase on an evolutionary time scale. On the basis of the following evidence, we consider that this change was not due to the accumulation of random mutations. The telomere array was composed of blocks of canonical sequences or 6 types of TTTAGGG variants (fig. 3). A change from T occurred preferentially (fig. 2). The most commonly substituted sequence contained 1-nt substitution, and there were no more than 2 substitutions in 1 unit (fig. 3); thus, these characteristic telomeric sequences ended abruptly at the junction between the telomere and the chromosome-specific region. These results suggest that the variants might arise from the rapid expansion of a single mutation rather than from the gradual accumulation of random mutations.
This expansion of telomere variants has made it possible to characterize the rice chromosomal end. Copies of ATTAGGG, CTTAGGG, GTTAGGG, TATAGGG, TTCAGGG, or TTGAGGG were arrayed in tandem, or the same subtypes were close to each other at the ends of 4 of the chromosomes (fig. 3). Inversion of telomere repeats was observed adjacent to the beginning of the telomere array on the ends of 3 chromosomes (fig. 4C). Therefore, the proximal telomeric sequences are composed of blocks of at least 6 types of TTTAGGG variants and the canonical sequence in a chromosome-specific manner.
Propensity for Nucleotide Substitutions, Deletions, and Insertions among Organisms
We found 6 types of TTTAGGG variants in rice (fig. 3). In A. thaliana, arrays of TTCAGGG and TTAAGGG are reported as telomere-associated sequences (Richards et al. 1992
). Therefore, the TTCAGGG pattern is common between rice and A. thaliana (fig. 5A). But they could result from different origins. In humans, the telomeric array contains TGAGGG, TCAGGG, and TTGGGG on the Xp/Yp chromosome and on a few autosomes (Allshire et al. 1989
; Baird et al. 1995
; Coleman et al. 1999
; Baird et al. 2000
). Despite the fact that the length of a unit varies between rice and humans (Moyzis et al. 1988
), T–C or T–G substitution next to A might occur preferentially in both organisms (fig. 5A). However, other frequently observed variants—TTAAGGG in A. thaliana and TTGGGG in humans—have hardly ever been observed in this study of rice, although single TTAAGGG variants were observed on chromosomes 2S, 8S, and 10S in rice (fig. 2). In summary, T–C substitution next to A is common among these organisms.
|
We also addressed the propensity for nucleotide deletion or insertion among plants. The telomere of rice contained a nucleotide deletion or insertion at T in TTTAGGG (fig. 4A). It is interesting that the telomere sequences of Chlamydomonas and Asparagales are similar to that of rice but not identical: the insertion type in rice, TTTTAGGG, is present in Chlamydomonas (Petracek et al. 1990
Mechanism of Expansion of Substituted Variants among Chromosomal Ends
As the same substituted variants were close to each other on specific chromosomes (fig. 3), expansion of variants might have arisen from intrachromosomal processes such as sister-chromatid exchange or slip during DNA synthesis. The high frequency of DNA recombination in the subtelomeric region of rice (Wu et al. 2003
; Gaut et al. 2007
) might affect telomere–telomere DNA recombination. In addition to the expansion in single chromosomes, some ends had common substituted sequences (fig. 3). This common distribution may have resulted from interchromosomal telomere–telomere recombination, although there is a possibility that it has originated from independent mutations. As duplication of sequences was found on only 3 of the 14 ends, interchromosomal exchange might play only a minor role in the expansion of variants.
Do telomere variations inherit from generation to generation? We compared the genomic sequences between 2 individuals (fig. 3). The distribution of the 5 types of substituted variants in 3L, 7L, and 10S was almost identical, suggesting that it was stable and not temporal. However, the distribution in relatively distal parts of the sequenced repeats was not identical in 3L and 10S. It is possible that the region might have been deleted by telomere rapid deletion (Li and Lustig 1996
; Watson and Shippen 2007
) and subsequently reconstructed by telomerase.
Effect of Dispersed Deletion Variants on Telomere Homeostasis
Telomere arrays associate with telomere proteins to form specialized chromatin structures. The deletion process from TTTAGGG to TTAGGG occurred in approximately 4.9% of sequenced repeats and was spread all over the sequenced region (fig. 4B). Do many deletion variants work as alternatives to TTTAGGG? The effect of nucleotide substitution was previously examined on the basis of their binding affinity to RTBP1, a DNA-binding protein that recognizes the telomeric sequence in rice (Yu et al. 2000
). Although the internal 6-bp GGGTTT sequence in the 2-telomere repeat is critical for binding of RTBP1, it has been shown that RTBP1 can bind to deletion variants with less affinity (Yu et al. 2000
). Therefore, an abundance of dispersed TTAGGG sequences may not have much effect in the binding of RTBP1to telomere repeats.
Researchers have been viewing the chromosomal end as highly polymorphic and evolutionarily dynamic in various organisms (Mefford and Trask 2002
; Eichler and Sankoff 2003
; Kuo et al. 2006
). We elucidated the variations in the sequence and distribution of the first 30–50 telomere repeats among chromosomal ends of rice. As rice telomeres have arrays of 730–1500 copies of TTTAGGG repeats (Mizuno et al. 2006b
), genomic sequencing of all the arrays is not yet complete. It is possible that the mosaics of blocks of noncanonical telomere sequences in the remaining rice telomeres could have resulted from slips during DNA synthesis, high frequency of DNA recombination, and/or rapid deletion in the telomere region. Further analysis of the chromosome-specific distribution of variants would help to precisely determine the evolutionary history of rice chromosomal ends.
| Supplementary Material |
|---|
|
|
|---|
Detailed maps and finished sequences of the whole rice genome are available at our Web site (http://rgp.dna.affrc.go.jp/).
| Acknowledgements |
|---|
|
|
|---|
We thank Dr Rod A. Wing of the Arizona Genomics Institute for providing the Nipponbare fosmid library; F. Aota and K. Ohtsu for technical assistance; and Dr B. A. Antonio for critical reading of the manuscript. This study was supported by grant no. GD-2007 from the Ministry of Agriculture, Forestry, and Fisheries of Japan.
| Footnotes |
|---|
Charles Delwiche, Associate Editor
| References |
|---|
|
|
|---|
Allshire RC, Dempster M, Hastie ND. Human telomeres contain at least three types of G-rich repeat distributed non-randomly. Nucleic Acids Res (1989) 17:4611–4627.
Baird DM, Coleman J, Rosser ZH, Royle NJ. High levels of sequence polymorphism and linkage disequilibrium at the telomere of 12q: implications for telomere biology and human evolution. Am J Hum Genet (2000) 66:235–250.[CrossRef][Web of Science][Medline]
Baird DM, Jeffreys AJ, Royle NJ. Mechanisms underlying telomere repeat turnover, revealed by hypervariable variant repeat distribution patterns in the human Xp/Yp telomere. EMBO J (1995) 14:5433–5443.[Web of Science][Medline]
Blackburn EH. Telomeres and telomerase: their mechanisms of action and the effects of altering their functions. FEBS Lett (2005) 579:859–862.[CrossRef][Web of Science][Medline]
Burr B, Burr FA, Matz EC, Romero-Severson J. Pinning down loose ends: mapping telomeres and factors affecting their length. Plant Cell (1992) 4:953–960.
Cech TR. Beginning to understand the end of the chromosome. Cell (2004) 116:273–279.[CrossRef][Web of Science][Medline]
Chen M, Presting G, Barbazuk WB, et al, (39 co-authors). An integrated physical and genetic map of the rice genome. Plant Cell (2002) 14:537–545.
Coleman J, Baird DM, Royle NJ. The plasticity of human telomeres demonstrated by a hypervariable telomere repeat array that is located on some copies of 16p and 16q. Hum Mol Genet (1999) 8:1637–1646.
Devos KM. Updating the crop circle. Curr Opin Plant Biol (2005) 8:155–162.[CrossRef][Web of Science][Medline]
Eichler EE, Sankoff D. Structural dynamics of eukaryotic chromosome evolution. Science (2003) 301:793–797.
Fujisawa M, Yamagata H, Kamiya K, Nakamura M, Saji S, Kanamori H, Wu J, Matsumoto T, Sasaki T. Sequence comparison of distal and proximal ribosomal DNA arrays in rice (Oryza sativa L.) chromosome 9S and analysis of their flanking regions. Theor Appl Genet (2006) 113:419–428.[CrossRef][Web of Science][Medline]
Gaut BS, Wright SI, Rizzon C, Dvorak J, Anderson LK. Recombination: an underappreciated factor in the evolution of plant genomes. Nat Rev Genet (2007) 8:77–84.[CrossRef][Web of Science][Medline]
International Rice Genome Sequencing Project. The map-based sequence of the rice genome. Nature (2005) 436:793–800.[CrossRef][Medline]
Kilian A, Stiff C, Kleinhofs A. Barley telomeres shorten during differentiation but grow in callus culture. Proc Natl Acad Sci USA (1995) 92:9555–9559.
Kuo HF, Olsen KM, Richards EJ. Natural variation in a subtelomeric region of Arabidopsis: implications for the genomic dynamics of a chromosome end. Genetics (2006) 173:401–417.
Li B, Lustig AJ. A novel mechanism for telomere size control in Saccharomyces cerevisiae. Genes Dev (1996) 10:1310–1326.
McEachern MJ, Krauskopf A, Blackburn EH. Telomeres and their control. Annu Rev Genet (2000) 34:331–358.[CrossRef][Web of Science][Medline]
McKnight TD, Shippen DE. Plant telomere biology. Plant Cell (2004) 16:794–803.
Mefford HC, Trask BJ. The complex structure and dynamic evolution of human subtelomeres. Nat Rev Genet (2002) 3:91–102.[CrossRef][Web of Science][Medline]
Mizuno H, Ito K, Wu J, Tanaka T, Kanamori H, Katayose Y, Sasaki T, Matsumoto T. Identification and mapping of expressed genes, simple sequence repeats and transposable elements in centromeric regions of rice chromosomes. DNA Res (2006a) 13:267–274.
Mizuno H, Wu J, Kanamori H, Fujisawa M, Namiki N, Saji S, Katagiri S, Katayose Y, Sasaki T, Matsumoto T. Sequencing and characterization of telomere and subtelomere regions on rice chromosomes 1S, 2S, 2L, 6L, 7S, 7L and 8S. Plant J (2006b) 46:206–217.[CrossRef][Web of Science][Medline]
Moyzis RK, Buckingham JM, Cram LS, Dani M, Deaven LL, Jones MD, Meyne J, Ratliff RL, Wu JR. A highly conserved repetitive DNA sequence, (TTAGGG)n, present at the telomeres of human chromosomes. Proc Natl Acad Sci USA (1988) 85:6622–6626.
Petracek ME, Lefebvre PA, Silflow CD, Berman J. Chlamydomonas telomere sequences are A+T-rich but contain three consecutive G-C base pairs. Proc Natl Acad Sci USA (1990) 87:8222–8226.
Rice Annotation Project. Curated genome annotation of Oryza sativa ssp. japonica and comparative genome analysis with Arabidopsis thaliana. Genome Res (2007) 17:175–183.
Richards EJ, Ausubel FM. Isolation of a higher eukaryotic telomere from Arabidopsis thaliana. Cell (1988) 53:127–136.[CrossRef][Web of Science][Medline]
Richards EJ, Chao S, Vongs A, Yang J. Characterization of Arabidopsis thaliana telomeres isolated in yeast. Nucleic Acids Res (1992) 20:4039–4046.
Sykorova E, Leitch AR, Fajkus J. Asparagales telomerases which synthesize the human type of telomeres. Plant Mol Biol (2006) 60:633–646.[CrossRef][Web of Science][Medline]
Sykorova E, Lim KY, Kunicka Z, Chase MW, Bennett MD, Fajkus J, Leitch AR. Telomere variability in the monocotyledonous plant order Asparagales. Proc Biol Sci (2003) 270:1893–1904.
Watson JM, Shippen DE. Telomere rapid deletion regulates telomere length in Arabidopsis thaliana. Mol Cell Biol (2007) 27:1706–1715.
Wu J, Mizuno H, Hayashi-Tsugane M, et al, (22 co-authors). Physical maps and recombination frequency of six rice chromosomes. Plant J (2003) 36:720–730.[CrossRef][Web of Science][Medline]
Wu KS, Tanksley SD. Genetic and physical mapping of telomeres and macrosatellites of rice. Plant Mol Biol (1993) 22:861–872.[CrossRef][Web of Science][Medline]
Yang TJ, Yu Y, Chang SB, de Jong H, Oh CS, Ahn SN, Fang E, Wing RA. Toward closing rice telomere gaps: mapping and sequence characterization of rice subtelomere regions. Theor Appl Genet (2005) 111:467–478.[CrossRef][Web of Science][Medline]
Yu EY, Kim SE, Kim JH, Ko JH, Cho MH, Chung IK. Sequence-specific DNA recognition by the Myb-like domain of plant telomeric protein RTBP1. J Biol Chem (2000) 275:24208–24214.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
E. V. Shakirov, X. Song, J. A. Joseph, and D. E. Shippen POT1 proteins in green algae and land plants: DNA-binding properties and evidence of co-evolution with telomeric DNA Nucleic Acids Res., December 1, 2009; 37(22): 7455 - 7467. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||





