Skip Navigation


MBE Advance Access originally published online on December 7, 2007
Molecular Biology and Evolution 2008 25(2):287-300; doi:10.1093/molbev/msm252
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Supplementary Material
Right arrow Supplementary Material
Right arrow All Versions of this Article:
25/2/287    most recent
msm252v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Nystedt, B.
Right arrow Articles by Andersson, S. G. E.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Nystedt, B.
Right arrow Articles by Andersson, S. G. E.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2007. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oxfordjournals.org

Research Articles

Diversifying Selection and Concerted Evolution of a Type IV Secretion System in Bartonella

Björn Nystedt*,1, A. Carolin Frank*,1,2, Mikael Thollesson* and Siv G. E. Andersson*

* Department of Molecular Evolution, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden

E-mail: siv.andersson{at}ebc.uu.se.


    Abstract
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
We have studied the evolution of a type IV secretion system (T4SS), in Bartonella, which is thought to have changed function from conjugation to erythrocyte adherence following a recent horizontal gene transfer event. The system, called Trw, is unique among T4SSs in that genes encoding both exo- and intracellular components are located within the same duplicated fragment. This provides an opportunity to study the influence of selection on proteins involved in host–pathogen interactions. We sequenced the trw locus from several strains of Bartonella henselae and investigated its evolutionary history by comparisons to other Bartonella species. Several instances of recombination and gene conversion events where detected in the 2- to 5-fold duplicated gene fragments encompassing trwJIH, explaining the homogenization of the anchoring protein TrwI and the divergence of the minor pilus protein TrwJ. A phylogenetic analysis of the 7- to 8-fold duplicated gene coding for the major pilus protein TrwL displayed 2 distinct clades, likely representing a subfunctionalization event. The analyses of the B. henselae strains also identified a recent horizontal transfer event of almost the complete trwL region. We suggest that the switch in function of the T4SS was mediated by the duplication of the genes encoding pilus components and their diversification by combinatorial sequence shuffling within and among genomes. We suggest that the pilus proteins have evolved by diversifying selection to match a divergent set of erythrocyte surface structures, consistent with the trench warfare coevolutionary model.

Key Words: type IV secretion system • Bartonella, duplication • recombination • gene conversion • host–pathogen interaction


    Introduction
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
A common theme for bacteria that colonize eukaryotes is the presence of variable surface-exposed proteins that coevolve with the host proteins. These bacteria experience selection for variability on 2 timescales; within individual hosts to enable binding to a diverse set of host cell surface proteins and to avoid immune system recognition, and between hosts to increase survival in genetically different hosts. Two coevolutionary models of host–pathogen interactions have been proposed. The trench warfare model suggests that diversifying selection on bacterial surface proteins maintains genetic polymorphisms in the population. The coevolutionary arms race model suggests back and forth evolution that oscillates between the offense imposed by the pathogen on the host and the subsequent evolution of defense responses by the host. In this model, positive selection on favorable alleles yields genetic variation over time, yet produces little variation within the population.

Bacterial secretion systems are surface-exposed pili that mediate the transfer of effector molecules into recipient host cells. Studies of the pilus protein HrpA of the type III secretion system (T3SS) in the plant pathogen Pseudomonas syringae provide evidence for diversifying selection according to the trench warfare model (Guttman et al. 2006Go), whereas novel variants of the effector protein alleles are selected during host switching in line with the coevolutionary arms race model (Ma et al. 2006Go). Mechanistically, a recombinatorial process called terminal reassortment has been proposed to explain the extremely rapid evolution of the T3SS effectors when compared between a wide range of bacterial species (Stavrinides et al. 2006Go).

The focus of this study is on the evolution of type IV secretion systems (T4SSs). Prototype systems are found in the plant pathogen Agrobacterium tumefaciens that uses a T4SS called VirB to transfer oncogenic DNA to its host, the whooping cough agent Bordetella pertussis that translocates a toxin into the host using a T4SS, and the human-associated gastric bacterium Helicobacter pylori that transfers the effector protein CagA with the aid of its T4SS. A phylogenetic analysis of T4SSs showed that they have originated from plasmid conjugation systems multiple times independently (Frank et al. 2005Go). The system we have studied is the trw T4SS in Bartonella, which is thought to have switched function recently during its horizontal transfer into the Bartonella lineage from a role in plasmid conjugation to its current role in host interaction (Seubert et al. 2003Go; Frank et al. 2005Go; Fernandez-Lopez et al. 2006Go). Bartonella species are vector-borne bacteria that invade and replicate within the host erythrocytes without normally shortening the erythrocyte life span (Schulein et al. 2001Go). The Trw system is essential for erythrocyte colonization and is thought to mediate adherence to these host cells (Seubert et al. 2003Go).

The Bartonella Trw system is unique among T4SSs in that several genes in the cluster are duplicated (Seubert et al. 2003Go; Alsmark et al. 2004Go). For example, the gene encoding the major component of the T-pilus, trwL is present in 7 or 8 copies. In addition, a segment encompassing trwJ, trwH, and trwI is present in variable copy numbers in the different species. By homology to other T4SS, TrwJ is thought to mediate host cell attachment (Yeo et al. 2003Go), whereas TrwH and TrwI are associated with the anchoring of the pilus to the outer and inner membranes (Krall et al. 2002Go; Cascales and Christie 2004Go).

Studies of duplicated genes for other outer membrane proteins suggest that gene conversion is one of the most important mechanisms for generating diversity within a host (Deitsch et al. 1997Go). This can be manifested as the expression of alternative copies of surface proteins by the transfer of short donor sequence fragments, called functional pseudogenes, into a recipient expression cassette (Santoyo and Romero 2005Go). The power of combinatorial gene conversion to generate functional diversity can be illustrated with the major antigen (encoded by the tprK gene) in Treponema pallidum (Gray et al. 2006Go): at least 420,000 different sequence variants of this protein can be generated from 47 variable pseudogene sequences and 7 variable (V) regions in the expressed tprK gene (Santoyo and Romero 2005Go). The contribution of combinatorial gene conversion to antigenic variation of surface structures is by now well established in many pathogens, including Anaplasma marginale (Brayton et al. 2002Go, 2005Go), Borrelia hermsii (Plasterk et al. 1985Go), Borrelia burgdorferi (Zhang and Norris 1998Go), Neisseria gonorrhoeae (Hagblom et al. 1985Go; Haas and Meyer 1986Go), and Mycoplasma synoviae (Noormohammadi et al. 2000Go).

Gene conversion can also cause members of a multigene family to evolve in concert, as has been reported for essential cell components such as the tuf gene encoding elongation factor Tu and the ribosomal RNA (rRNA) genes. Studies of the duplicated tuf genes indicate transfer rates in the order of 10–8 to 10–9 per cell division (Abdulkarim and Hughes 1996Go), which is higher than the estimated nucleotide substitution rate. However, mutations in recombination–repair genes, such as recAB and mutSHL have been demonstrated to affect these frequencies, suggesting that the intrinsic rate of gene conversion may be species specific.

Until now, the influence of gene conversion on homogenization and diversification has been studied separately for cytoplasmic and outer membrane proteins, making it difficult to assess the relative contribution of mutation and selection to the observed patterns. Here, the trw cluster comes in handy: 1 of the duplicated regions contains genes for both exocellular (TrwJ) and intracellular pilus components (TrwI), providing the opportunity to start decoding the evolutionary forces that promotes shifts in the balance between homogenization and diversification.

In this study, we have examined the molecular evolution of the duplicated genes in the trw cluster using sequence data from 4 Bartonella species from various natural hosts, including humans, felines, and rodents. We show that genes with rapid evolution and extreme intragenomic sequence divergence are found on the same coduplicated segment as genes homogenized by gene conversion. This discrepancy is directly related to the intra- versus exocellular location of the encoded proteins. We also provide evidence for horizontal gene transfer, and finally we discuss the possible selective forces and dynamics shaping the evolution of duplicated genes for host-interaction proteins.


    Materials and Methods
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
Sequences and Accession Numbers
This study includes sequence data from 4 species of Bartonella and 4 strains of Bartonella henselae (fig. 1). Data were taken from the genome sequences of B. henselae strains Houston-1 (Bh-H1) (NC_005956) and Bartonella quintana (Bq) (NC_005955) and the trw gene sequences from Bartonella tribocorum (Bt) (AJ496288) and Bartonella grahamii (Bg) (AM905248). We performed de novo sequencing by fosmid library construction of the complete trw region from the 2 B. henselae strains Marseille (Bh-MAR) (AM905245) and IndoCat-11 (Bh-IC11) (AM905246) and of the trwL region (including korA) from a long-range polymerase chain reaction (PCR) product of B. henselae strain Cheetah (Bh-CH) (AM905247).


Figure 1
View larger version (13K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 1.— Phylogeny and gene order structure of the trw locus in Bartonella. Maximum likelihood phylogeny as inferred from the concatenated alignments of the single-copy genes trwFED (left). Schematic representation of the Trw region with tandem repeat segments within brackets (top right). The trwDtrwN genes (DN) and the regulatory genes korA and korB are indicated. The copy numbers and host species for each strain are outlined (right). ST: sequence type. * Only the trwL region has been sequenced for Bh-CH, and thus this sequence is not included in the trwFED phylogeny. {dagger} The trwH gene is lacking from the third tandem repeat of this segment in Bt. {ddagger} The strain was isolated from an incidentally infected human patient.

 
Fosmid Library Construction and Sequencing
Whole genome fosmid libraries were constructed using the Epicentre Fosmid Library Construction Kit. Genomic DNA was prepared from 2 week's B. henselae cultures on horse blood agar plates. The DNA was physically sheared with a 50-µl Hamilton syringe, end repaired, and size separated by pulse-field gel electrophoresis in 1.2% agarose (SeaKem Gold; Cambrex Bio Science, East Rutherford, NJ) in 0.5x Tris-Borate-EDTA buffer in a GenNavigator System apparatus (Amersham Biosciences, Uppsala, Sweden) at 14 °C and 5.7 V/cm for a total of 35 h split into 4 phases: 1 s for 3 h, 5 s for 10 h, 10 s for 15 h, and 15 s for 7 h. Fragments of 35–45 kb were cut from the gel. The gel was digested with GELase, and the DNA was extracted by a single phenol step, followed by NaCl precipitation, and washed with 70% ethanol. The sample was desalted by microdialysis (0.025 µm Millipore membrane filters, 30 min), ligated to the CopyControl pCC1FOS fosmid vector, and packaged into phages that were used to infect T1 resistant Escherichia coli (EPI300-T1). The bacteria were screened on chloramphenicol plates (12 µg/ml), grown overnight in liquid selective media, and frozen in 15% glycerol in –80 °C for long-time storage.

Screening for trw-positive clones was performed with the Gen Images AlkPhos direct labeling and detection system with chemiluminescence detection with CDP-Star (GE Healthcare, Uppsala, Sweden). Samples of 2-µl fosmid-containing E. coli cultures were added directly onto a nylon filter, and positive clones were identified by DNA–DNA overnight hybridization at 65 °C with a labeled PCR product from the Bh-H1 trwK gene, followed by detection by exposure on Hyperfilm ECL.

Fosmids were induced to high copy numbers (CopyControl Induction Solution) and prepared using standard plasmid preparation protocols (QIAGEN, Valencia, CA). Shotgun sequencing was performed on the purified fosmid with the insert DNA, using a MegaBace1000 capillary sequenator. Published fosmids were sequenced to a coverage of >8x, complemented by PCR sequencing if needed to an average consensus sequence quality value of >85, and at least one read with a quality value >20 in both directions at each site.

Sequence Alignments and Distance Measures
For all coding genes, protein alignments were made with Kalign (Lassmann and Sonnhammer 2005Go) and backtranslated to nucleotide sequences, with the trailing gap parameter set to t = 0. Nucleotide spacers were aligned with Kalign (t = 0). All alignments were manually inspected and edited if needed.

Distance measures must be carefully interpreted in the light of recombining sequences. Hence, we used as distance measures both corrected and uncorrected substitution frequencies, as well as nucleotide identity and diversity index calculations (see below), depending on the particular sequences involved and the question at hand. Nonsynonymous (dN) and synonymous (dS) substitution frequencies were calculated using the method of (Yang and Nielsen 2000Go) as implemented in yn00 in the PAML package (Yang 2000Go), also giving the pairwise percent nucleotide identity. Uncorrected nonsynonymous (pN) and synonymous (pS) substitution frequencies were obtained by synonymous nonsynonymous analysis program (Korber 2000Go).

Sequence Feature Predictions and Gene Segments
The transmembrane regions of aligned genes were predicted using PolyPhobius (http://phobius.cgb.ki.se/) (Käll et al. 2005Go), which simultaneously predicts both signal peptide (SP) regions and transmembrane regions from protein sequence data. We used stability of aligned positions (Loytynoja and Milinkovitch 2001Go) to analyze protein alignment ambiguities for each of the possible parameter combinations in ClustalW, with the gap extension penalty ranging from 0.1 to 0.4 (step = 0.1) and the gap opening penalty ranging from 8 to 15 (step = 1). Segments of the gene supported by less than 90% of the protein alignments were consider ambiguous.

Tree Reconstructions
The phylogenies of the alignments were inferred by maximum likelihood as implemented in Treefinder (Jobb et al. 2004Go), with 100 bootstrap replicates. The Akaike information criterion was used to select among 56 nt substitution models, using Modeltest 3.7 (Posada and Crandall 1998Go). The models obtained were general time reversible (GTR) + I (trwFED), GTR + I + G (trwJ), GTR + G (trwI), and transversion model + I + G (trwL).

Alignment Visualizations
Sequence alignments with up to ~10 sequences were directly visualized by a novel method: Each sequence was schematically plotted as a line/arrow (spacer/gene) in a spread-out fashion. At each position in the alignment, lines were drawn to indicate identical nucleotides. Unique nucleotides and gaps were represented above (or below) each sequence as stars and lines, respectively. For visibility, nonsegregating sites were removed from the alignments in some plots, and lines were not drawn for nucleotides shared across more than 3 sequences.

Detection of Recombination and Gene Conversion
Potentially recombined trwJ sequences were identified using the 4-sequence method implemented in 4SIS (Maynard Smith and Smith 1998Go). For each analysis, 1 gene copy (the "child") was analyzed for similarity on polymorphic sites to 2 other gene copies (the "parents"), using a fourth gene copy as outgroup. The program optimizes a chi-square value to identify potential recombination breakpoints in the analyzed gene, based on biased similarity on informative sites, with the P value being equal to the fraction of 1,000 permutations with a chi-square value larger than for the original data.

We also detected recombination using the 4 methods RDP (Martin and Rybicki 2000Go), GENECONV (Padidam et al. 1999Go), MaxChi (Maynard Smith 1992Go), and Chimaera (Posada and Crandall 2001Go) as implemented in the program RDP3 (Martin and Rybicki 2000Go), with default settings.

Potential gene conversion events in the concatenated trwJIH region (including spacers) and among the trwL genes were detected using test of (Sawyer 1989Go) in GENECONV (Sawyer 1999Go). Parameters were set to equal weights for all nucleotides (seqtype = all) and no mismatches allowed (gscale = 0). Clusters of repeated gene families (more than 60% amino acid similarity over at least 50% of the protein length) in B. henselae and B. quintana were detected using Blastclust (Altschul et al. 1990Go). Each cluster with at least 3 repeated genes in the 2 genomes combined was aligned with Kalign (t = 0) and analyzed for gene conversion events with GENECONV (same parameters as above).

Identity Plots
Segments of differential sequence conservation were detected by calculating the Shannon–Wiener diversity index (H) in sliding windows along the alignments of the trwJIH segment (window size = 40 nt, step size = 20 nt) and the trwL genes (window size = 20 nt, step size = 10 nt). The Shannon–Wiener index at each position i was here defined as:

Formula
where fiA is the frequency of A:s at site i and n equals the number of possible states (n = 5 if there are 5 or more sequences in the alignment, otherwise n equals the number of sequences). A high similarity at a given site will give a low Shannon–Wiener index and a low similarity will give a high value.


    Results
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
Evolution of the trw Cluster in Bartonella
We have studied the evolution of the trw cluster that codes for a conjugative T4SS using Bartonella as the model system. Included in the analysis were 4 different Bartonella species: B. henselae (Bh), B. quintana (Bq), B. tribocorum (Bt), and B. grahamii (Bg) (fig. 1). To increase the resolution of the analyses, we compared the sequences of the trw gene cluster in 3 different strains of B. henselae: Houston-1 (Bh-H1, type strain), Marseille (Bh-MAR) and IndoCat-11 (Bh-IC11), with some sequence data also collected from the strain Cheetah (Bh-CH). The B. henselae strains represent 3 of the 8 sequence types (ST1, ST4, and ST8) identified in a previous multi locus sequence typing analysis of polymorphisms in 9 core genes (Lindroos et al. 2006Go).

We inferred the evolutionary relationships of the trw region based on the last 3 genes in the trw cluster, trwFED (the positional homologs to A. tumefaciens virB 9, 10, 11). This gene set supported a clade containing the human pathogens B. quintana and the feline-associated species B. henselae to the exclusion of the rodent-associated species B. grahamii and B. tribocorum (fig. 1) in accordance with previously estimated species trees (Zeaiter et al. 2002Go; Frank et al. 2006Go), providing evidence for evolution by vertical inheritance of trw in the Bartonella lineages. The nonsynonymous (dN) and the synonymous (dS) substitution frequencies of the trwFED genes between Bh-H1 and Bq were dN = 0.07 and dS = 0.45, which is close to the average of all orthologs in these 2 completely sequenced genomes (dN = 0.06, dS = 0.49) (table 1).


View this table:
[in this window]
[in a new window]

 
Table 1 Synonymous and Nonsynonymous Substitution Frequencies for trwI (upper rows) and trwFED (lower rows) withina,b and between Genomes

 
The analyses further suggested a tight relationship of all the 3 B. henselae strains, with the ST1-strain Bh-H1 forming a clade together with the ST8-strain Bh-MAR (97% bootstrap support), to the exclusion of the ST1-strain Bh-IC11 (fig. 1). This can be compared with a previous MLST-based analysis, where no differences were found between Bh-H1 and Bh-IC11, whereas Bh-MAR differed from these 2 in 6 of the 9 sequenced genes (Lindroos et al. 2006Go). Thus, different segments of the genome seem to have different evolutionary histories.

Gene-Specific Phylogenetic Signals in the Tandemly Repeated trwJIH Segment
The Bartonella Trw locus differs from previously described T4SS in that several genes within the locus are present in multiple copies (fig. 1). One segment encompassing the genes trwJ, trwI, and trwH (the positional homologs of virB5, virB6, and virB7) is present in 2 (Bq, Bh-H1, Bh-IC11), 3 (Bh-MAR), 4 (Bg), or 5 (Bt) copies, implying lineage-specific duplication–deletion processes. Because the order of genes in all 4 species is trwJ-I-H-J-I-H rather than trwJ-J-I-I-H-H or variants thereof, we assume that all 3 genes in the segment encompassing trwJ-I-H were duplicated simultaneously.

There is a striking difference in sequence conservation between genes that lie in the same coamplified fragment, yielding gene-specific tree topologies for the trwJ, trwI, and trwH genes. A phylogenetic analysis of the trwI and trwH genes revealed species-specific clades with bootstrap support of 100%, with trwI showing this pattern even down to the strain-specific level in B. henselae (fig. 2A). In contrast, the trwJ sequences tended to form clades consisting of positional homologs (fig. 2B). For example, the trwJ copies from the 3 B. henselae strains formed 2 separate clades with at least 98% bootstrap support, one clade containing all trwJ1 copies and the other clade containing all trwJ2/3 copies. By and large, the topology of the trwJ tree is congruent with the estimated trw tree (fig. 1), given some duplication and deletion events.


Figure 2
View larger version (38K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 2.— Upper panels: Maximum likelihood phylogenies for the nucleotide alignments of (A) trwI and (B) trwJ. The trees were inferred without an outgroup and the maximum likelihood bootstrap values above 80% are shown. Lower panels: alignment visualization of (A) the trwI and (B) the trwJ gene copies in the B. henselae strains. Each arrow represents a gene copy in the alignment. At each position in the alignment, groups of homologs with shared nucleotides are connected by lines. For visibility, only groups with 2 or 3 members are connected. Red stars above the genes indicate unique sites. All nonsegregating sites were omitted from the alignments, as well as the HV region of trwJ.

 
Because 1 part of the trwJ gene displayed extreme sequence divergence, we examined if the pattern of vertical descent in trwJ holds true for the entire gene. The trwJ alignment was divided into a SP region (nucleotide position 1–72), a hypervariable (HV) region (nucleotide position 73–780), and a V region (nucleotide position 780–1128). Phylogenetic trees based both on the HV region (supplementary fig. 1A, Supplementary Material online) and a concatenation of the SP + V regions (supplementary fig. 1B, Supplementary Material online) were congruent with the tree inferred from the complete trwJ alignment (fig. 2B) for all clades with strong bootstrap support (>85%). Hence, the signal of vertical descent is evident both in the hypervariable and the more conserved parts of the trwJ gene.

To visualize the completely different phylogenetic signals in the trwJ and trwI genes directly from the alignments, we plotted the shared nucleotides between the 7 copies from the 3 B. henselae strains for trwJ and trwI separately. The result highlighted the similarity between copies in the same genome for trwI (fig. 2A) and between positional homologs for trwJ (fig. 2B). We also noted an access of unique sites in Bh-H1 trwJ2 compared with the other trwJ copies in the 3 strains.

Composite Genes and Gene Conversion
Visual inspection of the trwJ copies indicated heterogeneous pairwise similarities between close homologs along the alignment. One explanation for this could be composite genes, that is, genes created by recombination of 2 parental sequences. We tested this hypothesis using 4SIS (Maynard Smith and Smith 1998Go), looking for recombined genes by comparing 1 gene with 2 potential parental sequences. Each trwJ copy in the B. henselae strains was tested, using the closest phylogenetic homolog (always being a positional homolog from another strain) as 1 parental sequence, whereas the other parental sequence was assumed to be the second trwJ copy from the same genome as the analyzed gene (for simplicity excluding Bh-MAR trwJ3 from this analysis). Bartonella grahamii trwJ1 was used as an outgroup in all analyses. Three of the 6 analyzed trwJ genes (Bh-H1 trwJ2, Bh-IC11 trwJ1, and Bh-IC11 trwJ2) showed statistically significant evidence of recombination (P < 0.01) (supplementary fig. 2, Supplementary Material online). Similar results were obtained with RDP3 (Martin and Rybicki 2000Go), with significant recombination signals (P < 0.05) found by at least 3 of the 4 recombination detection methods used (see Materials and Methods) (fig. 3). Even though a much more extensive sampling of strains would be needed in order to fully explain the evolutionary history of the trwJ genes, these examples provide evidence that recombined copies are common and contribute to the divergence of the trwJ genes.


Figure 3
View larger version (10K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 3.— Detection of recombinant trwJ copies by RDP3 (Martin and Rybicki 2000Go). Each dark bar represents a trwJ gene, with the gene name indicated to the upper left corner. White bars below a gene indicate a recombination event, with the presumed parental sequence indicated to the right. Recombination events supported (P < 0.05) by at least 3 of the 4 methods tested (see Materials and Methods) are shown.

 
In striking contrast to trwJ, the trwI copies formed strong intragenomic clades in the phylogenetic analysis (fig. 2A). Calculating the Shannon–Wiener diversity index in sliding windows along the concatenated alignments of trwJ (excluding the HV region), trwI, trwH, and their preceding spacers showed extreme differences in intragenomic diversity along the segment, with the lowest diversity focused around trwI in all genomes (fig. 4A–F). Given the signal of vertical descent in trwJ and that the trwJIH genes seem to have been copied together, the low intragenomic diversity in trwI is unlikely to be due to recent independent trwJIH duplications in each genome. We also tested whether strong negative selection on trwI could explain the low intragenomic diversity. We calculated the average dN and dS values for each pair of trwI genes between any genome pair and compared with the corresponding values for the single-copy trwFED genes (table 1). Between species, the dN/dS values were somewhat increased for trwI (dN/dS = 0.22–0.42) compared with trwFED (dN/dS = 0.08–0.15), indicating that negative selection is not particularly strong on trwI compared with the trwFED genes. Negative selection is also unlikely to operate on synonymous sites, but both dN and dS values were much lower within than between genomes (table 1).


Figure 4
View larger version (20K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 4.— Intragenomic diversity index plotted against nucleotide position in the alignments of the repeated segments including trwJ, trwI, and trwH and their upstream spacers. Panels shown for the sequences from (A) B. tribocorum, (B) B. grahamii, (C) B. henselae (Bh-H1), (D) B. henselae (Bh-MAR), (E) B. henselae (Bh-IC11), and (F) B. quintenz. The bars above the plots indicate the locations and copy numbers of the genes. Putative gene conversion fragments larger than 200 nt as inferred by Sawyer's test are shown as lines, with the stars indicating fragments between strains.

 
Gene conversion between gene copies may explain strong intragenomic homogenization on both synonymous and nonsynonymous sites, without a decreased substitution frequency between genomes. Using runs test of (Sawyer 1989Go) on the concatenated trwJIH segments above, gene conversion fragments larger than 200 nt were detected in all genomes, covering various parts of the trwI and trwH genes and their preceding spacers (fig. 4AF). One intergenomic fragment was also detected between the second copy in Bh-H1 and the 2 copies in Bh-IC11, indicating either recombination between strains or representing a segment where gene conversion has not yet distorted the signal of vertical inheritance between the strains. Such partial gene conversion, or recombinations between strains, would introduce artifacts in the distance measures, perhaps explaining the elevated average of the calculated dS values between the B. henselae strains for trwI (dS = 0.15–0.17) compared with trwFED (dS = 0.02–0.03) (table 1).

Rapid Evolution of trwJ by Short Indels and Repeats
Whereas the conserved SP and V regions in trwJ showed stable alignments and contained virtually no indels, the HV region ranged in size from 87 amino acids (Bh-H1 trwJ2) to 191 amino acids (Bg trwJ4). The most notable aspect of the trwJ (HV) alignments between close homologs from the B. henselae strains was the high frequency of indels, even in otherwise almost identical sequence segments, typically without distorting the open reading frame. For example, between Bh-IC11 trwJ2, Bh-MAR trwJ2, and Bh-MAR trwJ3 we found 5 indels of 2, 13, 24, 3 (fig. 5), and 12 nt in size. The indel of 24 nt was associated with a short imperfect repeat of 18 nt, with the first copy present in all 3 strains (fig. 5). Similar patterns of indels were seen in the alignments of the 3 trwJ1 copies of the B. henselae strains, with 6 indels of 18, 36, 30, 9, 3, and 3 nt in size. An imperfect triple repeat of 18 nt was also found in a homologous stretch of the B. tribocorum copies trwJ2 and trwJ5. Some of the indels were flanked by short stretches containing more nucleotide differences than seen elsewhere in the alignment, as observed for example around the 2 and 13 nt gaps in Bh-MAR trwJ3 compared with the Bh-MAR trwJ2 and Bh-ICII trwJ2 (fig. 5).


Figure 5
View larger version (36K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 5.— Examples of indels in a region of otherwise high similarity of the HV region of trwJ between positional homologs from the B. henselae strains. The black rectangles indicate an imperfect repeat. The given position refers to the complete trwJ alignment.

 
We also used the corrected (dN and dS) and uncorrected (pN and pS) substitution frequencies as a measure of the sequence divergence of the conserved SP + V regions for all trwJ pairs within and across genomes. In contrast to trwI, the divergence of trwJ (SP + V) within each genome was as high as the divergence between genomes (supplementary fig. 3, Supplementary Material online), with the maximum pairwise intragenomic divergence of trwJ (SP + V) for each genome ranging from dN = 0.26, dS = 0.44 in B. quintenz to dN = 0.60, dS = 2.85 for B. tribocorum To compare, no copy pair of trwFED between any genomes exceeded dN = 0.10 and dS = 0.87. The dN/dS (and pN/pS) ratios were generally higher for the trwJ homologs compared with the single-copy trwFED genes, but always below 1 (the only exception being Bg trw3 with a dN/dS > 1 when compared with Bg trwJ4, Bt trwJ2, and Bt trwJ5) (supplementary figs. 3 and 4, Supplementary Material online).

Diverged Clades of the Tandemly Repeated trwL Genes
The main pilus component protein TrwL is encoded by 7 to 8 gene copies located in a single tandem array in the investigated Bartonella genomes (fig. 1). The trwL copies displayed segmental conservation patterns as shown by the Shannon–Wiener diversity index (supplementary fig. 6, Supplementary Material online), where the most conserved parts corresponded to 2 transmembrane segments linked by a cytoplasmic region in the second half of the gene.

Phylogenetic inference strongly supported a division of the trwL copies into 2 major clades, 1 clade (trwL:B) containing the last gene copy from each genome and 1 clade (trwL:A) containing all the remaining gene copies (supplementary fig. 5, Supplementary Material online). The same division into 2 major clades was seen in a splits decomposition network of the trwL genes, also revealing some conflicting signals in the link between the 2 clades (fig. 6). The low bootstrap support values for most of the internal nodes of the trwL:A clade (supplementary fig. 5, Supplementary Material online) were also explained by the network analysis, with many trwL:A genes displaying long terminal branches radiating out from a shared core network structure (fig. 6).


Figure 6
View larger version (38K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 6.— A splits decomposition network of the trwL nucleotide alignments. The trwL:A and the trwL:B clades are indicated with dashed lines.

 
Whereas the trwL:B copies appear to follow the trwFED phylogeny (supplementary fig. 5, Supplementary Material online), the trwL:A copies of B. quintenz, B. grahamii, and B. tribocorum tended to form intragenomic clades, whereas the trwL:A copies from the B. henselae strains seemed to be distributed over the entire tree (fig. 6, supplementary fig. 5 [Supplementary Material online]). Unlike the homogenized trwI genes, the species-specific clades of trwL seemed to be completely independent of the trwFED phylogeny, with every species clade being roughly equidistant to every other species clade. This was quantified for B. tribocorum, using the average synonymous substitution frequencies for all pairwise comparisons of the trwL:A copies in 2 genomes as a distance measure. Comparing B. tribocorum with the other species gave high and virtually identical dS averages (Bt:Bg 1.4, Bt:Bq 1.5, Bt:Bh 1.3–1.5), which can be compared with the single-copy trwFED genes, where the dS values are considerably lower and directly reflect the trwFED phylogeny (Bt:Bg 0.37, Bt:Bq 0.84, Bt:Bh 0.85–0.86). The average intragenomic dS value of the trwL:A genes in B. tribocorum is 0.25, thus approaching the magnitude of the trwFED genes in the 2 distinct species B. tribocorum and B. grahamii (dS = 0.37), and 25 times larger than the average dS values of the intragenomic clade of the B. tribocorum trwI genes (dS = 0.01).

Horizontal Transfer of a Large trwL Segment
The 4 B. henselae strains represent 2 completely different versions of the trwL region. Bh-H1 and Bh-MAR are very similar over most of the trwL region, and so are Bh-IC11 and Bh-CH (fig. 7A). From the alignments, we identified the exact boundaries of this pairwise similarity, stretching from the end region of korA to the first half of the trwL7 (Bh-H1:Bh-MAR) and trwL6 (Bh-IC11:Bh-CH) copies, respectively (fig. 7B and C), thus including all the genes in the trwL:A clade identified above.


Figure 7
View larger version (26K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
FIG. 7.— Analysis of the trwL region, showing distinctly different variants among the Bh strains. (A) Each gene is connected to its closest homolog in the genome outlined above/below, with the percent nucleotide identity indicated according to the grayscale (upper right). The korA gene and the trwL genes (L1L8) are indicated. (B, C) Alignment visualization of the 5' (B) and 3' (C) ends of a segment with strong pairwise similarity among the 4 strains. Arrows represent the aligned genes of the left (B) and right (C) framed areas in (A). At each position in the alignment, sequences with identical nucleotides are connected. For visibility, nonsegregating sites are not connected. Lines and stars above (or below) the genes indicate gaps and unique sites, respectively.

 
We inferred the closest homolog for each of the trwL:A copies between each pair of strains based on the lowest synonymous substitution frequency (excluding the last copy in the array). Within each group, the synonymous substitution frequencies for the closest homologs range from dS = 0 to 0.16 (average 0.04) for Bh-H1:Bh-MAR and dS = 0 to 0.001 (average 0.002) for Bh-IC11:Bh-CH, showing complete gene order conservation and >99% nt sequence identity over the complete region, including spacers. This is consistent with the estimated distances between strains as inferred from the trwFED genes (table 1). In striking contrast, dS values for the closest trwL pairs between representatives for the 2 groups range from dS = 0.07 to 1.18 (average 0.59) for Bh-MAR:Bh-ICII. These values overlap with the values for the closest homologs between the B. henselae strains and B. quintenz, for example, dS = 0.81–1.43 (average 1.05) for Bh-MAR:Bq. Because the dS values for single-copy genes located within and around the trw region are similar between all the B. henselae strains (sequence data are lacking for these genes in Bh-CH), the differences between the trwL arrays cannot be explained by a gradual process of nucleotide substitutions and gene duplications/deletions. Instead, the data suggest a replacement of this large trwL segment by a recent horizontal gene transfer into one of the lineages from a rather distantly related strain.

Upstream of the horizontally transferred trwL segment, the 4 B. henselae strains display between all high similarity (fig. 7B). However, at the downstream end of the segment, the Bh-H1 strain clearly differed from the 3 other strains (fig. 7C) throughout the 2 last genes in the trwL array including the intervening spacer and to some degree also in trwM and at the beginning of trwK. Segmental differences in sequence similarity were observed for trwK between the B. henselae strains (supplementary fig. 7A, Supplementary Material online), and significant (P < 0.05) recombination signals in this gene were detected among the B. henselae strains by at least 3 of the 4 methods tested with RDP3 (see Materials and Methods) (supplementary fig. 7B, Supplementary Material online).

Gene Conversion on a Genome Scale
In order to determine whether gene conversion is specifically targeting the trw locus, we roughly estimated the overall gene conversion pattern for repeated genes in the B. henselae and B. quintenz genomes. On the complete gene set of B. henselae and B. quintenz genes, we used Blastclust to detect repeat gene families with at least 3 members (both genomes combined) with more than 60% sequence similarity over more than 50% of their length. This resulted in 35 clusters, with the number of members ranging from 3 to 20 (average 5.3 members per cluster), excluding the clusters formed by the repeated trw genes and 2 clusters with too few polymorphisms for analysis. In 12 of the 35 clusters (34%), intragenomic gene conversion fragments larger than 100 nt were detected, suggesting that gene conversion occurs frequently among duplicated genes in Bartonella.


    Discussion
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
In this paper, we present the first detailed evolutionary analysis of the Bartonella T4SS, Trw, which is required for invasion of the erythrocytes in the mammalian host (Schulein et al. 2001Go). This system was recently acquired to one of the Bartonella lineages by horizontal gene transfer, followed by a functional switch from plasmid conjugation to host association (Seubert et al. 2003Go; Frank et al. 2005Go; Fernandez-Lopez et al. 2006Go). The Bartonella trw cluster contains duplications unique among T4SSs, providing an excellent model system for dissecting the molecular evolutionary forces of host-interaction proteins versus intracellular proteins, and the evolutionary dynamics of gene duplications.

Gene duplication is one of the most important sources of gene innovation (Ohno 1970Go; Gevers et al. 2004Go). An important feature of tandem duplications is that the initial duplication event is the most unlikely, whereas subsequent duplication–deletions easily occur by homologous recombination during replication. As a consequence, natural selection is needed in order to maintain tandem duplications within a population (Pettersson et al. 2005Go). The advantages of multiple gene copies may be 1) increased expression (e.g., dosage compensation of defective enzymes), 2) variability within the genome (e.g., antigenic variation), and 3) increased potential for new functions (e.g., neo- or subfunctionalization). The Trw system in Bartonella appears to display examples of all the above dynamics, likely providing the variation and innovation required both for the initial functional switch and for successful host adaptation and chronic infection.

A notable feature of the duplicated pilus component TrwJ is the extreme intragenomic divergence and the rapid evolution with frequent indels scattered over the first half of the trwJ homologs, resulting in a 2-fold difference in size of this region (from 87 amino acids in Bh-H1 trwJ2 to 191 amino acids in Bg trwJ4). A similar, but less pronounced, divergence is seen in the surface-exposed regions of the duplicated major pilus protein TrwL.

One possible explanation for the sequence variability among the gene duplicates is that one of the copies is expressed, whereas the others serve as sequence reservoirs that feed divergences into the expressed site, as demonstrated for many exocellular proteins (Hagblom et al. 1985Go; Plasterk et al. 1985Go; Haas and Meyer 1986Go; Brayton et al. 2002Go, 2005Go). However, none of the duplicated trw genes in Bartonella appear to be silent, because all copies are complete open reading frames preceded by ribosomal binding sites (Schroder and Dehio 2005Go), and evolve under purifying selection (dN/dS < 1). Likewise, expression of the duplicated trwJIH variants is probably coregulated, with conserved putative KorA/KorB-regulated promoters preceding all copies (Schroder and Dehio 2005Go).

Reoccurring rapid gene amplification, followed by divergent loss of duplicated copies has been suggested as a common molecular mechanism to select for mutations in demand (Andersson et al. 1998Go). In this model, selection for dosage compensation may rapidly increase the gene copy number to the order of hundreds and then quickly revert to a single copy when a favorable allele has emerged (Andersson et al. 1998Go; Pettersson et al. 2005Go). Although duplication–deletions clearly represent important mutational processes in the evolution of the trw cluster, this specific model would be difficult to reconcile with the patterns of vertical descent of the trwJ copies.

Yet another possibility for the escalation of sequence divergence would be that this region is replicated separately from the rest of the genome during later growth stages (Lindroos et al. 2006Go), which could induce a different spectrum of mutations in this part of the genome. However, the trwFED genes as well as the genes flanking the trw cluster evolve at similar substitution rate as the Bartonella core genes, suggesting that an altered pattern or rate of nucleotide changes in the region covering the trw cluster cannot explain the data.

We favor divergence mechanisms that involve combinatorial sequence shuffling between copies through recombination events within and between relatively distantly related lineages. With multiple tandem copies such as in the trwL array, the pool of genetic variants can easily be increased by new chimerical sequences created by intragenomic shuffling and recurrent duplication–deletions of genes. If Bartonella lineages can also recombine with each other, the potential for novel genetic diversity increases dramatically. Indeed, the extensive divergence of (Bh-H1:Bh-MAR) compared with (Bh-IC11:Bh-CH) in the trwL array is difficult to reconcile without invoking horizontal gene transfer from a rather distantly related lineage. Likewise, the many unique sequence stretches between otherwise closely related trwJ homologs in the B. henselae strains might also be explained by frequent recombination events from a large pan-genomic pool of divergent trwJ alleles. Importantly, the dynamic of this system is so high that it may well explain the "apparent absences" in some species of virB2 and virB7 (homologs of trwL and trwJ, respectively) as indicated by ordinary Blast searches (Medini et al. 2006Go). Thus, our results refute models suggesting that T4SSs have been built up in a stepwise manner by independent recruitment of units to the complex (Medini et al. 2006Go).

We infer duplication–deletions, gene conversion between duplicated genes and sequence exchange between genomes to be part of the mutational engine that underlies the dynamic structure of the trw cluster, with the outcome being dependent on the selective constraints driving the evolution of the individual genes.

Selection for Erythrocyte Binding on trwJ
The trw pili could be targets for the immune response, as demonstrated for the YscF protein at the base of the T3SS in Yersinia (Matson et al. 2005Go), possibly explaining the rapid evolution and the sequence divergence between the Bartonella isolates. However, in lack of evidence for differential expression of the pilus protein gene copies, it appears for now as if all structural variants are exposed simultaneously on the cell surface, making it difficult to explain the strong intragenomic divergence solely in terms of an immune evasion mechanism.

An increased binding potential of the erythrocytes has been suggested as an alternative hypothesis to explain the variability in the Trw pilus proteins in Bartonella (Seubert et al. 2003Go). Mammalian erythrocytes are known to have highly variable glycoprotein surface structures that have been suggested to evolve under 2 opposing selection pressures (Gagneux and Varki 1999Go). Because the nonnucleated erythrocytes cannot serve as hosts for viruses, a broad erythrocyte-binding potential may act as a decoy defense mechanism by distracting viruses from their target tissues (Gagneux and Varki 1999Go), whereas pathogens as for example the malarian parasite Plasmodium falciparum use the same glycoprotein structures for adherence and invasion to the erythrocytes themselves (Orlandi et al. 1992Go; Sim et al. 1994Go). The erythrocyte surface protein glycophorin A has been shown to be subject to both positive and diversifying selection in humans and other primates (Baum et al. 2002Go; Altheide et al. 2006Go). Likewise, and possibly as a consequence, the erythrocyte-binding antigen-175 in P. falciparum and duffy binding protein in Plasmodium vivax have also been shown to be subject to diversifying selection (Baum et al. 2003Go).

Given the extreme variation in the erythrocyte surface structures both within and between individual host species, selection for a diverse arsenal of binding structures appears likely and may explain the extreme variation of the trwJ gene copies within and between isolates, whereas evasion of the host immune system may explain the observed rapid evolution of this pilus protein.

It is well known that B. henselae is able to invade the erythrocytes in their natural feline hosts but not in incidentally infected humans. The molecular basis for this host specificity is not known, but the Trw pilus is a prime suspect in that it is required for the erythrocyte invasion in vivo (Seubert et al. 2003Go). If so, host specificity may also explain the need for a larger number of trwJ copies in the rodent-associated B. tribocorum and B. grahamii that are believed to have a wider host range compared with the feline- and human-associated B. henselae and B. quintana.

Selection for Functional Conservation of trwI
In contrast to the highly diverged pilus proteins, the trwI (and trwH) copies are nearly identical within genomes and show a normal rate of protein evolution between lineages, despite being located on the same duplicated fragment as trwJ. This pattern resembles multiple rRNA genes and their repeated sequences (Liao 2000Go; Ganley and Kobayashi 2007Go), and as for the rRNA genes, we suggest that concerted evolution through frequent fixation of gene conversion events between gene copies best explains the observed patterns for trwI. The observation that intragenomic trwI copies appear less diverged than their upstream spacer regions suggests that the homogenization is a selected feature in trwI. If so, what would be the selective advantages behind homogenization of the anchoring proteins?

As for the VirB T4SS complex in A. tumefaciens (Dang et al. 1999Go; Sagulenko et al. 2001Go), the tight interactions of the Trw anchoring proteins will likely impose strong restrictions on amino acid substitutions, where a substitution in one protein may require a compensatory substitution in another protein (Moyle et al. 1994Go; Goh et al. 2000Go; Mintseris and Weng 2005Go) (even though this view has been recently questioned; Hakes et al. 2007Go). Multiple copies of an anchoring gene may thus pose a serious problem because all copies must fit with the rest of the structure. Frequent gene conversion offers an opportunity for continuous homogenization, efficiently purging deleterious mutations that arise in one copy, while also allowing for beneficial compensatory substitutions in one gene to quickly spread to the other copies. The homogenized trwI copies are likely to be hot spots for homologous recombination during replication, thereby preserving the duplication–deletion dynamics of the region, despite the high divergence of the trwJ copies.

Selection for Subfunctionalization and Pilus Elongation in trwL
The 2 distinct phylogenetic clades of the genes coding for the major pilus protein TrwL is evidence that 2 trwL gene variants have been retained side-by-side in the Bartonella lineages over a considerable evolutionary time. This pattern is consistent with a functional separation between the 2 clades dating back to, or shortly after, the horizontal transfer of Trw into the Bartonella lineage. In other T4SSs, the TrwL homolog constitutes the main part of the pilus and also attaches the pilus to the outer membrane (Cascales and Christie 2003Go). We thus speculate that the 2 clades represent a subfunctionalization event, where one clade is responsible for the attachment to the outer membrane and the other clade provides the building blocks for the actual pilus. If so, the increased copy number of 1 clade could result in a relative increase of pilus proteins compared with anchoring proteins and an increase in pilus length that might be advantageous for rapid adhesion to the erythrocyte in the blood stream of the mammalian host. Expression level-dependent pilus lengths have previously been demonstrated for the main pilus protein of the Type III Pilus of Corynebacterium diphteriae (Swierczynski and Ton-That 2006Go) and is consistent with the idea that retained gene duplications are often immediately beneficial through dosage effects (Kondrashov et al. 2002Go).

Evolutionary Tests and Duplicated Genes
Even though the strikingly different evolutionary patterns in trwJ, trwI, and trwL provide strong evidence per se that selective forces are operating on some or all these genes, proper quantitative evolutionary analyses of the trw genes are difficult. Many assumptions underlying standard selective tests, such as D of (Tajima 1989Go), D* and F* of (Fu and Li 1993Go), and elevated Ka/Ks ratios are violated by frequent recombination, rapid evolution, and nonindependent substitutions between sites. Because duplications, recombination, and rapid evolution appear to be recurrent themes for host-interaction proteins, evaluating the influence of selection versus neutrality remains a major challenge for these important systems. In order to better understand how and why new infectious disease agents emerge, we need better evolutionary models, broadening our horizon beyond point mutations. This will require massive pan-genome sampling as well as the development of new statistical tools.


    Supplementary Material
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
Supplementary figures 1–7 are available at Molecular Biology and Evolution online (http://www.mbe.oxfordjournals.org/).


    Acknowledgements
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 
We thank Petter Johansson and Anna Åsman for assistance with the experimental work. This work was supported by the Linnaeus Centre for Bioinformatics in Uppsala (M. Thollesson) and the Swedish Research Council, the Swedish Foundation for Strategic Research, the Knut and Alice Wallenberg Foundation, the Göran Gustzfsson Foundation, and the European Union (S.G.E. Andersson).


    Footnotes
 
1 These authors contributed equally to this work. Back

2 Present address: Smurfit Institute of Genetics, University of Dublin, Trinity College Dublin, Dublin, Ireland. Back

Edward Holmes, Associate Editor


    References
 TOP
 Abstract
 Introduction
 Materials and Methods
 Results
 Discussion
 Supplementary Material
 Acknowledgements
 References
 

    Abdulkarim F, Hughes D. Homologous recombination between the tuf genes of Salmonella typhimurium. J Mol Biol (1996) 260:506–522.[CrossRef][Web of Science][Medline]

    Alsmark CM, Frank AC, Karlberg EO, et al, (13 co-authors). The louse-borne human pathogen Bartonella quintana is a genomic derivative of the zoonotic agent Bartonella henselae. Proc Natl Acad Sci USA (2004) 101:9716–9721.[Abstract/Free Full Text]

    Altheide TK, Hayakawa T, Mikkelsen TS, Diaz S, Varki N, Varki A. System-wide genomic and biochemical comparisons of sialic acid biology among primates and rodents: evidence for two modes of rapid evolution. J Biol Chem (2006) 281:25689–25702.[Abstract/Free Full Text]

    Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol (1990) 215:403–410.[CrossRef][Web of Science][Medline]

    Andersson DI, Slechta ES, Roth JR. Evidence that gene amplification underlies adaptive mutability of the bacterial lac operon. Science (1998) 282:1133–1135.[Abstract/Free Full Text]

    Baum J, Thomas AW, Conway DJ. Evidence for diversifying selection on erythrocyte-binding antigens of Plasmodium falciparum and P. vivax. Genetics (2003) 163:1327–1336.[Abstract/Free Full Text]

    Baum J, Ward RH, Conway DJ. Natural selection on the erythrocyte surface. Mol Biol Evol (2002) 19:223–229.[Abstract/Free Full Text]

    Brayton KA, Kappmeyer LS, Herndon DR, Dark MJ, Tibbals DL, Palmer GH, McGuire TC, Knowles DP Jr. Complete genome sequencing of Anaplasma marginale reveals that the surface is skewed to two superfamilies of outer membrane proteins. Proc Natl Acad Sci USA (2005) 102:844–849.[Abstract/Free Full Text]

    Brayton KA, Palmer GH, Lundgren A, Yi J, Barbet AF. Antigenic variation of Anaplasma marginale msp2 occurs by combinatorial gene conversion. Mol Microbiol (2002) 43:1151–1159.[CrossRef][Web of Science][Medline]

    Cascales E, Christie PJ. The versatile bacterial type IV secretion systems. Nat Rev Microbiol (2003) 1:137–149.[CrossRef][Web of Science][Medline]

    Cascales E, Christie PJ. Definition of a bacterial type IV secretion pathway for a DNA substrate. Science (2004) 304:1170–1173.[Abstract/Free Full Text]

    Dang TA, Zhou XR, Graf B, Christie PJ. Dimerization of the Agrobacterium tumefaciens VirB4 ATPase and the effect of ATP-binding cassette mutations on the assembly and function of the T-DNA transporter. Mol Microbiol (1999) 32:1239–1253.[CrossRef][Web of Science][Medline]

    Deitsch KW, Moxon ER, Wellems TE. Shared themes of antigenic variation and virulence in bacterial, protozoal, and fungal infections. Microbiol Mol Biol Rev (1997) 61:281–293.[Abstract]

    Fernandez-Lopez R, Garcillan-Barcia MP, Revilla C, Lazaro M, Vielva L, de la Cruz F. Dynamics of the IncW genetic backbone imply general trends in conjugative plasmid evolution. FEMS Microbiol Rev (2006) 30:942–966.[Web of Science][Medline]

    Frank AC, Alsmark CM, Thollesson M, Andersson SG. Functional divergence and horizontal transfer of type IV secretion systems. Mol Biol Evol (2005) 22:1325–1336.[Abstract/Free Full Text]

    Frank AC, Berglund E, Andersson SGE. The genomes of pathogenics, pathogenic Bartonella species. In: —Hacker J, Dobrindt U, eds. (2006) Pathogenomics. Weinheim (Germany): Wiley-VCH Verlag GmbH & Co. 281–300.

    Fu YX, Li WH. Statistical tests of neutrality of mutations. Genetics (1993) 133:693–709.[Abstract]

    Gagneux P, Varki A. Evolutionary considerations in relating oligosaccharide diversity to biological function. Glycobiology (1999) 9:747–755.[Abstract/Free Full Text]

    Ganley AR, Kobayashi T. Highly efficient concerted evolution in the ribosomal DNA repeats: total rDNA repeat variation revealed by whole-genome shotgun sequence data. Genome Res (2007) 17:184–191.[Abstract/Free Full Text]

    Gevers D, Vandepoele K, Simillon C, Van de Peer Y. Gene duplication and biased functional retention of paralogs in bacterial genomes. Trends Microbiol (2004) 12:148–154.[CrossRef][Web of Science][Medline]

    Goh CS, Bogan AA, Joachimiak M, Walther D, Cohen FE. Co-evolution of proteins with their interaction partners. J Mol Biol (2000) 299:283–293.[CrossRef][Web of Science][Medline]

    Gray RR, Mulligan CJ, Molini BJ, Sun ES, Giacani L, Godornes C, Kitchen A, Lukehart SA, Centurion-Lara A. Molecular evolution of the tprC, D, I, K, G, and J genes in the pathogenic genus Treponema. Mol Biol Evol (2006) 23:2220–2233.[Abstract/Free Full Text]

    Guttman DS, Gropp SJ, Morgan RL, Wang PW. Diversifying selection drives the evolution of the type III secretion system pilus of Pseudomonas syringe. Mol Biol Evol (2006) 23:2342–2354.[Abstract/Free Full Text]

    Haas R, Meyer TF. The repertoire of silent pilus genes in Neisseria gonorrhoeae: evidence for gene conversion. Cell (1986) 44:107–115.[CrossRef][Web of Science][Medline]

    Hagblom P, Segal E, Billyard E, So M. Intragenic recombination leads to pilus antigenic variation in Neisseria gonorrhoeae. Nature (1985) 315:156–158.[CrossRef][Medline]

    Hakes L, Lovell SC, Oliver SG, Robertson DL. Specificity in protein interactions and its relationship with sequence diversity and coevolution. Proc Natl Acad Sci USA (2007) 104:7999–8004.[Abstract/Free Full Text]

    Jobb G, von Haeseler A, Strimmer K. TREEFINDER: a powerful graphical analysis environment for molecular phylogenetics. BMC Evol Biol (2004) 4:18.[CrossRef][Medline]

    Käll L, Krogh A, Sonnhammer EL. A HMM posterior decoder for sequence feature prediction that includes homology information. Bioinformatics (2005) 21:i251–i257.[Abstract]

    Kondrashov FA, Rogozin IB, Wolf YI, Koonin EV. Selection in the evolution of gene duplications. Genome Biol (2002) 3:R8:1–9.

    Korber B. HIV signature and sequence variation analysis. In: Computational analysis of HIV molecular sequences—Rodrigo AG, Learn GH, eds. (2000) Dordrecht (The Netherlands): Kluwer Academic Publishers. 55–72.

    Krall L, Wiedemann U, Unsin G, Weiss S, Domke N, Baron C. Detergent extraction identifies different VirB protein subassemblies of the type IV secretion machinery in the membranes of Agrobacterium tumefaciens. Proc Natl Acad Sci USA (2002) 99:11405–11410.[Abstract/Free Full Text]

    Lassmann T, Sonnhammer EL. Kalign–an accurate and fast multiple sequence alignment algorithm. BMC Bioinformatics (2005) 6:298.[CrossRef][Medline]

    Liao D. Gene conversion drives within genic sequences: concerted evolution of ribosomal RNA genes in bacteria and archaea. J Mol Evol (2000) 51:305–317.[Web of Science][Medline]

    Lindroos H, Vinnere O, Mira A, Repsilber D, Naslund K, Andersson SG. Genome rearrangements, deletions, and amplifications in the natural population of Bartonella henselae. J Bacteriol (2006) 188:7426–7439.[Abstract/Free Full Text]

    Loytynoja A, Milinkovitch MC. SOAP, cleaning multiple alignments from unstable blocks. Bioinformatics (2001) 17:573–574.[Abstract/Free Full Text]

    Ma W, Dong FF, Stavrinides J, Guttman DS. Type III effector diversification via both pathoadaptation and horizontal transfer in response to a coevolutionary arms race. PLoS Genet (2006) 2:e209.[CrossRef][Medline]

    Martin D, Rybicki E. RDP: detection of recombination amongst aligned sequences. Bioinformatics (2000) 16:562–563.[Abstract/Free Full Text]

    Matson JS, Durick KA, Bradley DS, Nilles ML. Immunization of mice with YscF provides protection from Yersinia pestis infections. BMC Microbiol (2005) 5:38.[CrossRef][Medline]

    Maynard Smith J. Analyzing the mosaic structure of genes. J Mol Evol (1992) 34:126–129.[Web of Science][Medline]

    Maynard Smith J, Smith NH. Detecting recombination from gene trees. Mol Biol Evol (1998) 15:590–599.[Abstract]

    Medini D, Covacci A, Donati C. Protein homology network families reveal step-wise diversification of type III and type IV secretion systems. PLoS Comp Biol (2006) 12:e173.

    Mintseris J, Weng Z. Structure, function, and evolution of transient and obligate protein-protein interactions. Proc Natl Acad Sci USA (2005) 102:10930–10935.[Abstract/Free Full Text]

    Moyle WR, Campbell RK, Myers RV, Bernard MP, Han Y, Wang X. Co-evolution of ligand-receptor pairs. Nature (1994) 368:251–255.[CrossRef][Medline]

    Noormohammadi AH, Markham PF, Kanci A, Whithear KG, Browning GF. A novel mechanism for control of antigenic variation in the haemagglutinin gene family of Mycoplasma synoviae. Mol Microbiol (2000) 35:911–923.[CrossRef][Web of Science][Medline]

    Ohno S. Evolution by gene duplication. (1970) New York: Springer.

    Orlandi PA, Klotz FW, Haynes JD. A malaria invasion receptor, the 175-kilodalton erythrocyte binding antigen of Plasmodium falciparum recognizes the terminal Neu5Ac(alpha 2-3) Gal- sequences of glycophorin. A J Cell Biol (1992) 116:901–909.[CrossRef]

    Padidam M, Sawyer S, Fauquet CM. Possible emergence of new geminiviruses by frequent recombination. Virology (1999) 265:218–225.[CrossRef][Web of Science][Medline]

    Pettersson ME, Andersson DI, Roth JR, Berg OG. The amplification model for adaptive mutation: simulations and analysis. Genetics (2005) 169:1105–1115.[Abstract/Free Full Text]

    Plasterk RH, Simon MI, Barbour AG. Transposition of structural genes to an expression sequence on a linear plasmid causes antigenic variation in the bacterium Borrelia hermsii. Nature (1985) 318:257–263.[CrossRef][Medline]

    Posada D, Crandall KA. Modeltest: testing the model of DNA substitution. Bioinformatics (1998) 14:817–818.[Abstract/Free Full Text]

    Posada D, Crandall KA. Evaluation of methods for detecting recombination from DNA sequences: computer simulations. Proc Natl Acad Sci USA (2001) 98:13757–13762.[Abstract/Free Full Text]

    Sagulenko E, Sagulenko V, Chen J, Christie PJ. Role of Agrobacterium VirB11 ATPase in T-pilus assembly and substrate selection. J Bacteriol (2001) 183:5813–5825.[Abstract/Free Full Text]

    Santoyo G, Romero D. Gene conversion and concerted evolution in bacterial genomes. FEMS Microbiol Rev (2005) 29:165–183.

    Sawyer S. Statistical tests for detecting gene conversion. Mol Biol Evol (1989) 6:526–538.[Abstract]

    Sawyer S. GENECONV: a computer package for the statistical detection of gene conversion. In: Distributed by the author (1999) St Louis (MO): Department of Mathematics, Washington University.

    Schroder G, Dehio C. Virulence-associated type IV secretion systems of Bartonella. Trends Microbiol (2005) 13:336–342.[CrossRef][Web of Science][Medline]

    Schulein R, Seubert A, Gille C, Lanz C, Hansmann Y, Piemont Y, Dehio C. Invasion and persistent intracellular colonization of erythrocytes. A unique parasitic strategy of the emerging pathogen Bartonella. J Exp Med (2001) 193:1077–1086.[Abstract/Free Full Text]

    Seubert A, Hiestand R, de la Cruz F, Dehio C. A bacterial conjugation machinery recruited for pathogenesis. Mol Microbiol (2003) 49:1253–1266.[CrossRef][Web of Science][Medline]

    Sim BK, Chitnis CE, Wasniowska K, Hadley TJ, Miller LH. Receptor and ligand domains for invasion of erythrocytes by Plasmodium falciparum. Science (1994) 264:1941–1944.[Abstract/Free Full Text]

    Stavrinides J, Ma W, Guttman DS. Terminal reassortment drives the quantum evolution of type III effectors in bacterial pathogens. PLoS Pathog (2006) 2:e104.[CrossRef][Medline]

    Swierczynski A, Ton-That H. Type III pilus of corynebacteria: pilus length is determined by the level of its major pilin subunit. J Bacteriol (2006) 188:6318–6325.[Abstract/Free Full Text]

    Tajima F. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics (1989) 123:585–595.[Abstract/Free Full Text]

    Yang Z. Phylogenetic analysis by maximum likelihood. PAML. Version 3.0 (2000) London: University College.

    Yang Z, Nielsen R. Estimating synonymous and nonsynonymous substitution rates under realistic evolutionary models. Mol Biol Evol (2000) 17:32–43.[Abstract/Free Full Text]

    Yeo HJ, Yuan Q, Beck MR, Baron C, Waksman G. Structural and functional characterization of the VirB5 protein from the type IV secretion system encoded by the conjugative plasmid pKM101. Proc Natl Acad Sci USA (2003) 100:15947–15952.[Abstract/Free Full Text]

    Zeaiter Z, Fournier PE, Ogata H, Raoult D. Phylogenetic classification of Bartonella species by comparing groEL sequences. Int J Syst Evol Microbiol (2002) 52:165–171.[Abstract]

    Zhang JR, Norris SJ. Genetic variation of the Borrelia burgdorferi gene vlsE involves cassette-specific, segmental gene conversion. Infect Immun (1998) 66:3698–3704.[Abstract/Free Full Text]

Accepted for publication November 11, 2007.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Phil Trans R Soc BHome page
M. A. Ragan and R. G. Beiko
Lateral genetic transfer: open issues
Phil Trans R Soc B, August 12, 2009; 364(1527): 2241 - 2251.
[Abstract] [Full Text] [PDF]


Home page
Phil Trans R Soc BHome page
A. Norman, L. H. Hansen, and S. J. Sorensen
Conjugative plasmids: vessels of the communal gene pool
Phil Trans R Soc B, August 12, 2009; 364(1527): 2275 - 2289.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
M. Castellanos and D. Romero
The Extent of Migration of the Holliday Junction Is a Crucial Factor for Gene Conversion in Rhizobium etli
J. Bacteriol., August 1, 2009; 191(15): 4987 - 4995.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Supplementary Material
Right arrow Supplementary Material
Right arrow All Versions of this Article:
25/2/287    most recent
msm252v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Nystedt, B.
Right arrow Articles by Andersson, S. G. E.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Nystedt, B.
Right arrow Articles by Andersson, S. G. E.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?