Abstract
The bacterial strains and plasmids used in this study are listed in Table 1. Both S. meliloti and A. tumefaciens were grown in LB broth supplemented with 100 µg streptomycin ml1 and 30 µg gentamicin ml1 (where appropriate) at 30 °C while E. coli was grown in the same medium supplemented with 5 µg gentamicin ml1 at 37 °C. Antibiotic concentrations were doubled in solid medium. The oligonucleotides used for primer extension reactions are also listed in Table 1.
Table 1. Bacterial strains, plasmids and primer extension oligonucleotides used in this study
Sequence analysis.
The first-generation promoter prediction was based upon an alignment of previously verified S. meliloti promoter sequences (Fig. 1, sequences 111). These sequences or sequence 10 (the rRNA operon promoter) were used to derive a search string that could be used to scan the S. meliloti genome for matches. A standard weight matrix was also derived based upon nucleotide frequency (percentage conservation) at each of the 30 nucleotide positions in the alignment of promoters. For the purposes of this initial investigation only linker regions (between the 10 and 35 hexanucleotides) of 17 nt were considered and in promoters possessing 18 nt linkers, a non-conserved position was eliminated in the linker sequence to facilitate construction of the matrix. The second-generation matrix was constructed as described for the first except that all 25 verified promoter sequences (Fig. 1) were used. The program PatScan (Dsouza et al., 1997) was used to search the S. meliloti genome for matching sequences. In the case of the matrices, PatScan allows the input of a minimum threshold score such that only 30-mer sequences in the genome that match or exceed the score are reported. All sequences in this work were aligned manually.
|
Primer extension.
Primer extension reactions were conducted using total bacterial RNA and oligonucleotides listed in Table 1 as previously described (MacLellan et al., 2005). Extension and sequencing reactions were separated on 8 % denaturing polyacrylamide gel containing 7 M urea. Signals were recorded on a phosphor screen and images were processed using ImageQuant v. 5.2.
Enzyme assays and other techniques.
Estimates of promoter activity were obtained by cloning DNA fragments into the transcriptional reporter vector pOT1 (Allaway et al., 2001) and measuring green fluorescent protein (GFP) fluoresence in a Tecan Safire fluorimeter. Cells were grown overnight then used to inoculate fresh volumes of LB broth to an OD600 of ∼0.05. Cultures were grown to an OD600 of 0.70.8, washed once in 0.85 % saline and resuspended in 1 vol. saline. Two hundred microlitre volumes were deposited in black microtitre plates and fluorescence was measured with an excitation wavelength of 405 nm and an emission wavelength of 505 nm. The optical densities of equivalent volumes were measured at 600 nm in clear microtitre plates. Specific fluorescence was obtained by dividing relative fluorescence by the optical density.
DNA sequencing and oligonucleotide synthesis was conducted by MOBIX lab (McMaster University, Hamilton).
Putative promoter sequences were predicted in two stages using a set of 11 experimentally verified promoter sequences from the existing literature in the first stage (first-generation prediction) and an expanded set of verified promoter sequences (consisting of an additional 14 promoter sequences verified in our laboratory) for the second stage (second generation) of predictions.First-generation prediction of novel promoter sequences
Recently the characterization and mutational analysis of a promoter found upon the pSymB megaplasmid in S. meliloti led to the discovery of a small RNA gene that plays a role in regulating the replication of the plasmid (MacLellan et al., 2005). We noticed a striking conservation of sequence (CTTGAC) at the 35 region of the small RNA (incA1) promoter with other S. meliloti promoter sequences (Bae et al., 1989; Fisher et al., 1987; Leong et al., 1985; Osteras et al., 1995; Ronson et al., 1987) previously reported in the literature. We therefore compiled 11 experimentally verified promoter sequences that were related to one another (most obviously at the 35 hexanucleotide sub-sequences) with the idea of using them to predict the location of other related promoter sequences in the S. meliloti genome. These 11 promoter sequences are listed (and aligned) in Fig. 1 (sequences 111). Two strategies were employed to identify sequences in the genome that had similarity to these sequences. First, we used the program PatScan (Dsouza et al., 1997) to perform a string pattern search of the S. meliloti genome (see ) using IUPAC symbols that constituted a degenerate consensus of the verified promoter sequences and a consensus based solely upon the rRNA operon promoter sequence. Amongst over 160 hits that were within ∼250 bp on the proper strand upstream of a predicted ORF we identified two hits occurring upstream of genes involved in transcription (suhR, rpoD), twelve involved in translation [four tRNA genes (SMc01378, SMc01936, SMc00758, SMc00303), two aminoacyl synthetases (cysS, trpS) and six ribosomal proteins (rluD, rpmE, rpmH, rpmJ, rplM, rpsT)] and hits linked with a number of other physiologically important or essential genes including secE (protein translocation subunit), topA (topoisomerase I), two proteases (ptrB, SMc03769), fixN3 (cytochrome c oxidase subunit), expA1 (one of a 21-gene cluster required for exopolysaccharide synthesis), and a predicted chaperone gene, grpE. Eleven hits were linked with solute transporter loci and 14 were linked with transcriptional regulatory protein genes. These hits are tabulated and aligned in Fig. 2.
|
We also constructed a weight-based matrix that we could use to assign scores to sub-sequences in the genome based on similarity to the verified promoter sequences. Best hits using this strategy were defined based on those sequences whose score equalled or exceeded an arbitrary discriminating threshold value. This first-generation weight matrix was used to scan the S. meliloti genome. The maximum score attainable with this matrix was 1860 and no sequence in the S. meliloti genome attained this score. Using a cutoff score of 1300, many more candidate promoter hits were obtained, including sequences linked to genes SMc02206 (tRNA), cspA6 (a cold-shock protein), dnaE1 (DNA polymerase subunit), pip3 (proline iminopeptidase), rpoZ (RNA polymerase omega subunit) and nuoA1 (the first gene in an 11-gene cluster encoding subunits of an NADH dehydrogenase complex) (Fig. 2).
The hits tabulated in Fig. 2 were selected from the first-generation prediction stage as their function is generally well established in bacteria and several are either very important or essential to cellular viability. These fall into well-defined physiological groups such as genes involved in translation or in ribosomal structure and function. The degree of sequence conservation within the 10 and 35 sub-sequences aligned in Fig. 2 is in most cases striking and obvious. The 5' nucleotide of the 35 region is almost always a C or a G (as is the case with all of the input sequences used for the analyses) but the weight matrix analysis with low frequency specified some hits with an A (but never a T) in this position. One of the first indications of the broad predictive power of our analysis came with the subsequent but independent identification of two promoters in the genome (repA2 and pcaI, sequences 12 and 13 in Fig. 1) that indeed possess an A at the 5' nucleotide of the 35 hexanucleotide. Thus, our sequence analysis suggested that such promoters may exist and we subsequently identified two of these promoters in the course of unrelated experimental investigations in our laboratory. Although the absolute frequency of the nucleotides found at this position may change as more promoters are identified, it seems that the hierarchy of nucleotide identity at the 5' position of the 35 sub-sequence is: C>G>>A>>T. All of the 31 predicted promoters listed in Fig. 2 are of course putative and we targeted 14 of these for subsequent experimental verification.
Experimental verification of predicted putative promoter sequences
One way of providing support for a predicted promoter element is to find that the sequence is positionally conserved in the genomes of related organisms. In the case of S. meliloti, we chose A. tumefaciens as a related alphaproteobacterium since the chromosomes of these organisms show a high degree of collinearity (Wood et al., 2001). Sequences upstream of homologous genes in A. tumefaciens were compared with the S. meliloti intergenic sequences encompassing four predicted promoters (rpmE, rpoD, topA, secE) from the list in Fig. 2. For each locus, the intergenic sequences were manually aligned (Fig. 3) and in each case the critical 10 and 35 sub-sequences were completely conserved (except for a single nucleotide substitution in the 35 region of the secE predicted promoters). In secE and rpmE, the conserved hexanucleotide sub-sequences are flanked by rather extensive degrees of sequence conservation, but in the case of rpoD and topA, the conserved 10 and 35 sub-sequences are distinct in otherwise poorly conserved regions. This suggests that these discrete regions have experienced the selective pressure that would be expected for functionally important sequences such as promoter elements. It also suggests that, with whatever facility we can predict a subset of S. meliloti promoters, this same ability can be extended to A. tumefaciens and probably other related genera (see Discussion).
|
Most often, the 10 and 35 regions of promoters are inferred from the experimental determination of the transcriptional start site for a given locus. We therefore performed primer extension reactions to determine the transcriptional start sites for 14 of the genetic loci listed in Fig. 2 and examined whether these start sites correlated with the promoter elements we predicted for these genes. The targets we examined were rpmJ, rpmE, ptrB, topA, rpoD, metC, ropB1, trpS, rpsT, secE, trkH, and three tRNA genes (SMc00758, SMc01378, SMc02206). The reverse transcriptase products are shown in Fig. 4(a). No extension product was visible for ptrB (not shown). Several of the reactions (rpmJ, topA rpoD, ropB1, SMc02206) generated multiple extension products but in every case (except for rpoD) the longest extension product (marked with an arrow) indicated a transcriptional start site that corresponds to the 10 and 35 regions that were predicted for each locus (Fig. 4b). In most cases, the shorter extension products (most of which are not shown due to space limitations) are probably due to premature termination of the reverse transcription reactions. In the case of rpmE, only a single extension product was observed but the transcriptional start site did not correspond to the predicted promoter element for this locus (not shown). However, we note that the extension reaction in this case terminates near a GC-rich sequence that is annotated on the S. meliloti genome website as a rho-independent transcriptional terminator element. We therefore suspect that this product also represents a premature termination product. The distance between this GC-rich element and what may be the actual transcriptional start site (based on our promoter prediction) is only 17 nt and we therefore could not design a new primer to test this hypothesis. As shown in Fig. 3, however, the sequence we predict to be the rpmE promoter is positionally conserved in the related bacterium A. tumefaciens, suggesting that our prediction is most likely correct. In the case of rpoD, an extension product (see arrow) exactly correlates with the promoter we predicted for this locus but a longer extension product was also synthesized (not shown). The nucleotide sequence upstream of this additional start site was not obviously related to the confirmed promoter but nonetheless may represent an alternative promoter at this locus.
|
This experimental approach established that 12 of 13 (∼92 %) of the promoter elements we predicted and targeted for verification (not including ptrB, for which no information was obtained) are indeed genuine promoters and it is reasonable to suspect that the lone candidate that was not confirmed (rpmE) may be the result of premature termination of the primer extension reaction. In a circumstance that parallels the situation in E. coli, an A residue is the most frequently (50 %) utilized transcription start nucleotide (Fig. 4b). The 12 predicted and confirmed promoter elements are listed in Fig. 1 (sequences 1425) and all of the (now 25) verified promoter sequences were used to construct a second-generation weight matrix.
Second-generation weight-based matrix
We compiled the newly verified 12 promoter sequences (this work) with the original 11 sequences and with the two additional promoter sequences independently verified in our laboratory (sequences 12 and 13, Fig. 1) to generate a list of 25 verified promoter sequences (Fig. 1, sequences 125). Using these 25 verified promoter sequences we constructed a second-generation (and presumably more robust) weight matrix that we again used to scan intergenic regions in the S. meliloti genome using the program PatScan. The weight matrix in this case only identifies putative promoters that have a 17 nt linker region between the 35 and 10 sub-sequences.
The highest possible score with this matrix is 1484 and no sequence in the genome obtained this score. Five randomly generated 30-mer sequences (with the same G+C content as S. meliloti intergenic DNA: 56 %) had a mean score of only 710±59 (SE). Of the sequences used to construct the matrix (Fig. 1) the promoter for the rRNA gene has the highest score (1376) while the promoter for the trpE gene has the lowest score (1116). The highest scoring sequence (with a 17 nt linker) in the genome had a score of 1364 and was a hit linked to gene SMa0229, a hypothetical ORF whose predicted product has similarity to the translation elongation factor GreA. Hits linked to genes cysS and topA (both with a score of 1360) were the next highest scoring sequences in the genome. Using an arbitrary threshold score of >1190, 411 hits were obtained. Ninety-five (23 %) of these hits were not obviously linked to annotated genes. Some of these hits occurred in large intergenic regions while others occurred close to annotated genes but on the opposite strand. Seven of these orphan promoter-like sequences were closely linked (within ∼200 nt) to orphan rho-independent transcriptional termination signals (that are annotated on the S. meliloti genome website), raising the possibility that together these elements may indicate the presence of small RNA genes. The number of orphan promoters closely linked with orphan terminators is likely an underestimation since a great many sequences that are good candidates for termination sequences are currently not annotated.
Eighteen and 12 hits, respectively, were closely linked with transcriptional regulatory protein genes and oxidoreductase/dehydrogenase genes. Twenty-nine hits were linked with transporter genes. As befitting the frequency distribution of transporter types in the genome (Galibert et al., 2001), most of the hits link to ABC transporter genes. These, along with other selected predictions, are listed in Table 2. Most of the hits listed in Table 2 come from the second-generation weight matrix analysis but some of the hits (including several tRNA gene and ribosomal protein gene promoter predictions) are duplications from Fig. 2 (from the first round of promoter predictions) and are included solely to facilitate sequence comparison amongst genes within a functional family. Table 2 also includes 32 hits linked to genes involved in translation and RNA metabolism and other hits to physiologically important or essential genes. Except for those few that we verified using primer extension, all of the sequences in Table 2 are of course putative but we expect roughly the same prediction success rate as demonstrated from the first generation of promoter predictions (that is 12 of 13 or minimally ∼90 %).
Table 2. List of predicted S. meliloti promoters
Characterization of the S. meliloti promoter
Based upon the 25 verified S. meliloti promoter sequences (Fig. 1), we tabulated the frequency of each nucleotide at each of the positions delimited by the inferred 35 and 10 hexanucleotide sub-sequences and at each of 20 nucleotide positions upstream and downstream of the promoter (Fig. 5a). A plot of A+T frequency across the regions shows a defined profile of A/T-richness at the 10 and 35 sequences. The region upstream of the 35 hexanucleotide is also frequently rich in either A or T residues and 18 of the 25 sequences carry stretches of four or more contiguous A or T residues in the 20 nt that precede the 35 region. These sequences may function analogously to the UP elements found in the 40 to 60 region of many bacterial promoters (Aiyar et al., 1998; Ross et al., 1998). In contrast, the DNA region directly downstream of the 10 region is distinctly GC-rich, a situation reminiscent of the discriminator sequences found in some E. coli promoters (Zacharias et al., 1990, 1991). A plot of conservation (percentage abundance) of the most abundant residue at each position (Fig. 5a) shows that the 35 region is well demarcated but that the 10 sub-sequence is ill-defined and more diverse than is the 35 sequence. The 35 region has the consensus sequence CTTGAC and this assignment is supported by the facts that the 5' and 3' terminal C residues are conserved in 68 % and 64 % of the sequences (respectively) (Fig. 5a) while the proximal flanking nucleotides on either end of the hexanucleotide show considerably less bias. We deduce that the 10 region has the consensus sequence CTATAT but except for nucleotides 3 and 4 (A,T), which are conserved in 88 % and 80 % of the sequences (respectively), there is less pronounced nucleotide bias at any of the positions (see S. meliloti sequence in Fig. 5b). The overall consensus (CTTGACCTATAT) specifies that 19 of the 25 verified promoters listed in Fig. 1 have a linker region that consists of 17 nt. The 35 and 10 S. meliloti sub-sequences bear striking similarity to the canonical E. coli promoter consensus (Harley & Reynolds, 1987; Lisser & Margalit, 1993) and these sequences are directly compared in Fig. 5(b). At the 35 sub-sequence, both E. coli and S. meliloti promoters share a core sequence of TTGAC but they are shifted or skewed relative to one another by a 1 nt step, at least as evidenced by the conservation of nucleotides in these regions (Fig. 5b).
|
Given the significant similarity between these S. meliloti promoters and the canonical E. coli promoter, we cloned approximately 200 bp regions from upstream of the genes rpoD, secE, rpmJ, rpmE, ropB1, SMc1378, rpsT and topA (see list in Fig. 2) into the gfp transcriptional reporter pOT1 and promoter activity was measured in E. coli, A. tumefaciens and S. meliloti. As a control for these experiments, we used the incA promoter, which is known to be expressed in E. coli (MacLellan et al., 2005). Most of the promoters were highly active in both A. tumefaciens and S. meliloti, but other than the control incA, none were appreciably active in E. coli. In this host, the most active promoter belonged to rpmE (with activity 1.9-fold above background) (Table 3). We expect that several of these housekeeping genes encode factor-independent promoters and thus should technically be capable of activity in heterologous hosts. In some instances, however, promoter activity may be dependent on host-specific transcriptional regulators not present in E. coli. Nevertheless, this survey is quite consistent with the common finding in our laboratory that S. meliloti promoters rarely display activity in E. coli, a circumstance that provided a major motivation towards better defining at least a subset of S. meliloti promoter sequences.
Table 3. Activity of selected S. meliloti promoters in parental or heterologous host strains
Despite the conservation of core sequence (TTGAC) between the E. coli and S. meliloti 35 hexanucleotides we have concluded that the S. meliloti consensus is completed by a 5' C residue (CTTGAC) as opposed to the E. coli consensus, which is completed by a 3' A residue (TTGACA) based on the frequency of conserved nucleotides. Using site-directed mutagenesis on two S. meliloti promoters (the incA1 and incA2 gene promoters) we sought to provide experimental support for the idea that the 5' C residue of the consensus is functionally important for promoter activity and, conversely, that the 3' nucleotide most proximal to CTTGAC has little functional importance. To do this, we cloned both promoter regions into the gfp transcriptional reporter vector pOT1 and monitored promoter activity as specific fluorescence in both A. tumefaciens and E. coli cells. We found it necessary to use the closely related bacterium A. tumefaciens as host for the test plasmids because the products expressed from the incA1 and incA2 promoters (∼56 nt untranslated RNAs) mediate incompatibility against the resident megaplasmids of S. meliloti (MacLellan et al., 2005). We know from this and previous work that patterns of expression in A. tumefaciens and S. meliloti are comparable.
As shown in Table 4, the incA1 (pTH1560) and incA2 (pTH1982) promoters are highly active in both A. tumefaciens and E. coli. Mutations in the nucleotide 3' of the consensus (pTH1985 and pTH1983, respectively) have little or no impact on promoter activity in both species. We expected this result for A. tumefaciens cells since that nucleotide falls outside the consensus hexanucleotide and is less biased than the other adjacent nucleotides. In the case of E. coli, however, we found this surprising since we expected that a T in that nucleotide position (which is found much less frequently than an A in that position) might lower activity if E. coli RNA polymerase continued to recognize the E. coli consensus hexanucleotide that is incidentally formed in plasmids pTH1560 and pTH1983 but not plasmids pTH1985 and pTH1982. In all cases, when the inferred 5' nucleotide of the S. meliloti consensus (CTTGAC) was mutated to form TTTGAC (plasmids pTH1991 and pTH1984), promoter activity was significantly affected. For reasons we have not determined, the influence of the mutation is much more dramatic in the incA2 promoter than in the incA1 promoter. Although the incA1 and incA2 promoter regions are highly similar, sequence differences elsewhere in the promoters are presumably responsible for this effect. In any case, the weight matrix analysis suggests that the 35 hexanucleotide rarely if ever begins with a T residue and this is consistent with the dramatic loss of promoter activity from promoters that carry this nucleotide in that position.
Table 4. Mutational analysis of S. meliloti promoter 35 sub-sequence
The ability to predict promoter elements in the alphaproteobacteria is only now emerging as the result of the accumulation of experimentally verified promoter sequences and access to the genomic sequences of these bacteria. Except for the σ54 promoters (Dombrecht et al., 2002), we are aware of no concerted effort to predict promoters in any members of the group. Our effort represents a fundamental first step in this direction and as a result of this work we report the experimentally verified sequences of an additional 12 S. meliloti promoters. As more sequences are verified, this work can be extended and refined. We provide a list (Table 2) of nearly 100 predicted promoters but these represent only a fraction of sequences in the S. meliloti genome that appear to be related to the consensus we have defined in this work.The structure demonstrated for S. meliloti promoters appears to be conserved amongst other bacteria in the alphaproteobacteria for the following reasons: (1) we showed that the promoter sequences for the rpmE, rpoD, topA and secE genes were conserved between S. meliloti and A. tumefaciens (Fig. 3); (2) homologues of the incA1 and incA2 genes whose promoters have been verified have homologues in A. tumefaciens, Rhizobium etli, R. leguminosarum and Brucella spp. (Chai & Winans, 2005; Dombrecht et al., 2002; Izquierdo et al., 2005; MacLellan et al., 2005; Venkova-Canova et al., 2004) and their promoters are nearly identical; and (3) a small compilation (eight genes) of Bradyrhizobium japonicum promoters (Beck et al., 1997) demonstrated the same general pattern of nucleotide conservation as described in this work.
We have not experimentally deduced which sigma factor recognizes the confirmed and putative promoters that have been predicted in this work. However, the sigma factor is likely to be rpoD, the vegetative sigma factor, since many of the confirmed and predicted loci associated with these promoters encode products required for translation and transcription, and other essential or important functions.
In this work we have identified consensus 10 and 35 hexameric promoter sequences defining what we believe to be a subset of S. meliloti promoters. Further experimental work will be required to better define these sequences from a functional point of view and to determine whether the E. coli paradigm of conserved hexamers is appropriate for S. meliloti promoters, particularly with regard to the poorly conserved 10 region. From the perspective of predicting related sequences (putative promoters) in the alphaproteobacteria, however, the identification of patterns of nucleotide conservation described in this work is a productive first step towards more effective promoter prediction in these bacteria.
This work was supported with funding from Genome Canada through the Ontario Genomics Institute and with funding from the Ontario research and development challenge fund. The work was initially supported by a grant from the Natural Sciences and Engineering Research Council to T. M. F. We are grateful to Marie Elliot, Alison Cowie and Richard Morton for input and comments on the manuscript.References
Allaway, D., Schofield, N. A., Leonard, M. E., Gilardoni, L., Finan, T. M. & Poole, P. S. (2001). Use of differential fluorescence induction and optical trapping to isolate environmentally induced genes. Environ Microbiol 3, 397406.[CrossRef][Medline]
Argaman, L., Hershberg, R., Vogel, J., Bejerano, G., Wagner, E. G., Margalit, H. & Altuvia, S. (2001). Novel small RNA-encoding genes in the intergenic regions of Escherichia coli. Curr Biol 11, 941950.[CrossRef][Medline]
Bae, Y. M., Holmgren, E. & Crawford, I. P. (1989). Rhizobium meliloti anthranilate synthase gene: cloning, sequence, and expression in Escherichia coli. J Bacteriol 171, 34713478.
Beck, C., Marty, R., Klausli, S., Hennecke, H. & Gottfert, M. (1997). Dissection of the transcription machinery for housekeeping genes of Bradyrhizobium japonicum. J Bacteriol 179, 364369.
Chai, Y. & Winans, S. C. (2005). A small antisense RNA downregulates expression of an essential replicase protein of an Agrobacterium tumefaciens Ti plasmid. Mol Microbiol 56, 15741585.[Medline]
Chen, S., Lesnik, E. A., Hall, T. A., Sampath, R., Griffey, R. H., Ecker, D. J. & Blyn, L. B. (2002). A bioinformatics based approach to discover small RNA genes in the Escherichia coli genome. Biosystems 65, 157177.[CrossRef][Medline]
DelVecchio, V. G., Kapatral, V., Elzer, P., Patra, G. & Mujer, C. V. (2002). The genome of Brucella melitensis. Vet Microbiol 90, 587592.[CrossRef][Medline]
Dombrecht, B., Marchal, K., Vanderleyden, J. & Michiels, J. (2002). Prediction and overview of the RpoN-regulon in closely related species of the Rhizobiales. Genome Biol 3, RESEARCH0076.[Medline]
Dsouza, M., Larsen, N. & Overbeek, R. (1997). Searching for patterns in genomic data. Trends Genet 13, 497498.[Medline]
Fisher, R. F., Brierley, H. L., Mulligan, J. T. & Long, S. R. (1987). Transcription of Rhizobium meliloti nodulation genes. Identification of a nodD transcription initiation site in vitro and in vivo. J Biol Chem 262, 68496855.
Galibert, F., Finan, T. M., Long, S. R. & 53 other authors (2001). The composite genome of the legume symbiont Sinorhizobium meliloti. Science 293, 668672.
Gustafson, A. M., O'Connell, K. P. & Thomashow, M. F. (2002). Regulation of Sinorhizobium meliloti 1021 rrnA-reporter gene fusions in response to cold shock. Can J Microbiol 48, 821830.[CrossRef][Medline]
Halling, S. M., Peterson-Burch, B. D., Bricker, B. J., Zuerner, R. L., Qing, Z., Li, L. L., Kapur, V., Alt, D. P. & Olsen, S. C. (2005). Completion of the genome sequence of Brucella abortus and comparison to the highly similar genomes of Brucella melitensis and Brucella suis. J Bacteriol 187, 27152726.
Harley, C. B. & Reynolds, R. P. (1987). Analysis of E. coli promoter sequences. Nucleic Acids Res 15, 23432361.
Izquierdo, J., Venkova-Canova, T., Ramirez-Romero, M. A., Tellez-Sosa, J., Hernandez-Lucas, I., Sanjuan, J. & Cevallos, M. A. (2005). An antisense RNA plays a central role in the replication control of a repC plasmid. Plasmid 54, 259277.[Medline]
Kaneko, T., Nakamura, Y., Sato, S. & 14 other authors (2002). Complete genomic sequence of nitrogen-fixing symbiotic bacterium Bradyrhizobium japonicum USDA110 (supplement). DNA Res 9, 225256.[CrossRef][Medline]
Leong, S. A., Williams, P. H. & Ditta, G. S. (1985). Analysis of the 5' regulatory region of the gene for delta-aminolevulinic acid synthetase of Rhizobium meliloti. Nucleic Acids Res 13, 59655976.
Lisser, S. & Margalit, H. (1993). Compilation of E. coli mRNA promoter sequences. Nucleic Acids Res 21, 15071516.
MacLellan, S. R., Smallbone, L. A., Sibley, C. D. & Finan, T. M. (2005). The expression of a novel antisense gene mediates incompatibility within the large repABC family of alpha-proteobacterial plasmids. Mol Microbiol 55, 611623.[CrossRef][Medline]
Osteras, M., Driscoll, B. T. & Finan, T. M. (1995). Molecular and expression analysis of the Rhizobium meliloti phosphoenolpyruvate carboxykinase (pckA) gene. J Bacteriol 177, 14521460.
Papp, P. P. (2004). tRNA gene containing an attachment site of a temperate phage is functional: proof by converting attB to amber suppressor. GenBank accession no. AJ698943.
Paulsen, I. T., Seshadri, R., Nelson, K. E. & 28 other authors (2002). The Brucella suis genome reveals fundamental similarities between animal and plant pathogens and symbionts. Proc Natl Acad Sci U S A 99, 1314813153.
Ronson, C. W., Nixon, B. T., Albright, L. M. & Ausubel, F. M. (1987). Rhizobium meliloti ntrA (rpoN) gene is required for diverse metabolic functions. J Bacteriol 169, 24242431.
Ross, W., Aiyar, S. E., Salomon, J. & Gourse, R. L. (1998). Escherichia coli promoters with UP elements of different strengths: modular structure of bacterial promoters. J Bacteriol 180, 53755383.
Thony, B. & Hennecke, H. (1989). The 24/12 promoter comes of age. FEMS Microbiol Rev 5, 341357.[CrossRef][Medline]
Venkova-Canova, T., Soberon, N. E., Ramirez-Romero, M. A. & Cevallos, M. A. (2004). Two discrete elements are required for the replication of a repABC plasmid: an antisense RNA and a stem-loop structure. Mol Microbiol 54, 14311444.[CrossRef][Medline]
Wood, D. W., Setubal, J. C., Kaul, R. & 49 other authors (2001). The genome of the natural genetic engineer Agrobacterium tumefaciens C58. Science 294, 23172323.
Zacharias, M., Goringer, H. U. & Wagner, R. (1990). The signal for growth rate control and stringent sensitivity in E. coli is not restricted to a particular sequence motif within the promoter region. Nucleic Acids Res 18, 62716275.
Zacharias, M., Theissen, G., Bradaczek, C. & Wagner, R. (1991). Analysis of sequence elements important for the synthesis and control of ribosomal RNA in E coli. Biochimie 73, 699712.[Medline]
Received 4 December 2005; revised 26 February 2006; accepted 28 February 2006.