Abstract
The GenBank accession number for the complete sequence of the RYSV genome is AB011257.
Footnotes
†,Rice yellow stunt virus (RYSV), synonymous with Rice transitory yellowing virus, is a species in the genus Nucleorhabdovirus of the Rhabdoviridae (Walker et al., 2000). RYSV has a non-segmented, negative-sense single-stranded RNA genome. We have previously sequenced the RYSV genomic RNA from cDNA clones representing the 3' leader (Wang et al., 1999), the nucleocapsid protein (N) (Fang et al., 1994), the phosphoprotein (P, formerly designated non-structural protein) (Zhu et al., 1997), a protein (P3) of unknown function (Chen et al., 1998), the matrix protein (M) (Luo et al., 1998) and the glycoprotein (G) (Luo & Fang, 1998), as well as the 5' trailer region (Wang et al., 1999). In this paper, we report the sequence of the genomic region between the glycoprotein gene and the 5' trailer, thus completing the sequence of the entire RYSV genome (14 042 nucleotides). In this region, a large open reading frame (ORF) encoding the polymerase (L) and a small ORF (ORF 6) capable of encoding 93 amino acids (aa) were found on the viral complementary (vc) strand. Furthermore, we present evidence that the polypeptide encoded by ORF 6, named P6, is a virion-associated protein. Thus RYSV encodes seven genes and has, among the characterized rhabdoviruses, the unique genome organization 3' leader-N-P-3-M-G-6-L-5' trailer. In comparison to the basic gene order represented by the genome of Vesicular stomatitis virus (VSV), the RYSV genome has two additional genes: gene 3 and gene 6. Gene 3 is located between the P and M genes where additional ORFs were identified in the genomes of all plant rhabdoviruses examined so far [Sonchus yellows net virus (SYNV) (Scholthof et al., 1994), Lettuce necrotic yellows virus (LNYV) (Wetzel et al., 1994) and Northern cereal mosaic virus (NCMV) (Tanno et al., 2000)] and the insect rhabdovirus Sigma virus (SigmaV) (Teninges et al., 1993). However, in the position between the G and L genes where the RYSV gene 6 is interposed, the presence of the extra gene(s) has only been described for some animal rhabdoviruses [Adelaide River virus (ARV) (Wang et al., 1994), Bovine ephemeral fever virus (BEFV) (McWilliam et al., 1997), Rabies virus (RABV) (Tordo et al., 1986), Infectious haematopoietic necrosis virus (IHNV) (Schutze et al., 1995), Viral haemorrhagic septicaemia virus (VHSV) (Basurco & Benmansour, 1995) and Snakehead rhabdovirus (SHRV) (GenBank accession no. AF147498)]. The products of these extra viral genes are either non-structural or unidentified.
The nucleotide sequence of the RYSV genomic region spanning the untranscribed intergenic spacer 5' to the G gene to that 3' to the trailer was obtained by sequencing 10 overlapping cDNA clones isolated from a cDNA library of the RYSV genomic RNA (Fang et al., 1994) using a genome-walking strategy initiated with a probe from the G gene. This region consisted of 6560 nucleotides (nt) and each nucleotide was determined by sequencing both strands of at least two different clones. Sequence analysis using the DNASIS software (Hitachi Software Engineering) revealed two ORFs on the vc strand each of which is bordered by the stretch of nucleotides 5'-AAAUAAAACCCCAACA-3', similar to the gene junction sequences found between the other RYSV genes.
The large ORF located at the 3' region of the vc strand could encode a protein of 1967 aa with a deduced molecular mass of 223·6 kDa, and probably encodes the L protein. The transcription initiation sequence of the L gene has been determined to be 5'-AACA-3' by 5'RACE analysis performed on poly(A+) RNA from RYSV-infected rice plants (Luo & Fang, 1998). This sequence motif is identical to the 5' end sequence of other RYSV genes (N, P, M and G) and similar to that of gene 3 (5'-AACU-3') or gene 6 (see below). As in the case of genes G, M and 6 of RYSV, non-viral nucleotides were found at the 5' terminus of the mRNA for the L gene preceding the initiation sequence (Luo & Fang, 1998), suggesting the possibility that the initiation of transcription of RYSV genes proceeds via a cap snatching mechanism as originally demonstrated for influenza virus (Krug, 1981). To determine the exact 3' end of the L gene, 3'RACE was performed as described by Fang et al. (1994). The termination sequence of the L gene was defined as 5'-AAAUAAAAA-3', which is consistent with the conserved termination sequence of other RYSV genes. We thus conclude that the L gene contains a 39 nt 5'-untranslated sequence and a 52 nt 3'-untranslated sequence, and in total is composed of 5988 nt extending from positions 7860 to 13847 relative to the 3' end of the RYSV genomic RNA.
The RYSV L protein is the smallest of all characterized non-segmented negative-strand RNA viruses (NNSV) except for the 1608 aa L protein of Borna disease virus (Briese et al., 1994). Nevertheless, the RYSV L protein harbours multiple functional domains typical of the RNA polymerases of NNSV, e.g. the catalytic domain, the RNA template-binding site and a metal-binding motif (data not shown). A phylogenetic tree was generated by comparison of the amino acid sequence of the RYSV L protein with sequences of the L protein of 32 NNSV (Fig. 1). It is clear that the RYSV L protein is most closely related to the L protein of SYNV, also a nucleorhabdovirus (Choi et al., 1992), with a sequence similarity of 36·9 %. However, the RYSV L protein is distinct from the L proteins of other rhabdoviruses and most members of other families of the Mononegavirales in that it is an acidic protein with a calculated isoelectric point of 6·22 (Fig. 1). Inspection of the RYSV L protein sequence revealed an overall Asp+Glu composition of 12·1 % and a Lys+Arg content of 10·9 %. More significantly, the N-terminal 110 amino acids contained 30 % Asp+Glu and 7·3 % Lys+Arg. This acidic domain is not present in the L proteins of the order Mononegavirales, and its function is unknown.
|
Gene 6 is located between the G and L genes. The junction sequence between the G gene and gene 6 is 5'-UAAUAAAAACCCAAUA-3' on the vc strand, where UAAUAAAAA represents the termination sequence of the G gene, and AAUA the initiation sequence of gene 6 as determined by 5'RACE (Luo & Fang, 1998).
The 3' end of gene 6 was determined by 3'RACE. Gene 6 terminates with the sequence 5'-AAAUAAAA-3' followed by a tetranucleotide CCCC as an untranscribed intergenic spacer before the L gene (Fig. 2). Thus gene 6 is flanked by two junction sequences which are homologous to the RYSV intergenic consensus sequence. Since 5' and 3'RACE have provided evidence for the presence of the mRNA for gene 6 in RYSV-infected rice plants, this confirms that RYSV encodes an extra gene located between the G and L genes that is unique to plant rhabdoviruses.
|
These analyses indicated that gene 6 consists of 568 nt extending from positions 7288 to 7855 relative to the 3' end of the RYSV genomic RNA. Gene 6 contains a 38 nt 5'-untranslated leader sequence, a 251 nt 3'-untranslated region and a small ORF (ORF 6) of 279 nt that is capable of encoding a polypeptide (P6) of 93 aa with a calculated molecular mass of 10·5 kDa (Fig. 2). Comparison of the nucleotide sequence of the RYSV gene 6 with that of the large non-coding region preceding the L gene found in the genomes of Hendra virus (Wang et al., 2000) and lyssaviruses (Le Mercier et al., 1997; Tordo et al., 1986) revealed sequence similarity of 38·3 % to Hendra virus and 36·6 % to RABV. The cytorhabdovirus NCMV also has a small ORF (52 aa) between the G and L genes (Tanno et al., 2000). However, little sequence similarity was found when compared this small putative peptide with the RYSV P6.
RYSV P6 is an acidic protein with an isoelectric point of 3·49, and has five potential phosphorylation sites (consensus pattern S/T-X-X-D/E) and one possible aspartic protease motif (D-T-G). Homology analysis and pair-wise comparison of the P6 amino acid sequence was conducted against GenBank/EMBL and SWISSPROT entries, but no clear similarity was found with any protein from these databases. Small non-virion (NV) genes preceding the L gene are present in all characterized novirhabdoviruses (Basurco & Benmansour, 1995; Schutze et al., 1995; Johnson et al., 2000). The RYSV P6 has very limited sequence similarities with the NV proteins: 22·6 % to IHNV and 25·4 % to VHNV only slightly higher than those generated from random sequences (about 20 %).
To elucidate whether RYSV P6 is a viral protein, SDS-PAGE analysis of purified RYSV virions was performed. Since the P6 protein was not visible in the SDS-PAGE profile of the RYSV virion proteins using Coomassie blue R-250 staining (Fang et al., 1994), an antiserum against glutathione S-transferase (GST)P6 fusion protein was raised and used in immunoblot analysis. ORF 6 was inserted into the BamHI site of the pGEX-3X vector (Amersham Pharmacia) in-frame with the GST gene. Following transformation of E. coli BL21 with the recombinant clone pGEX-3X-6 and induction by IPTG, the GSTP6 fusion protein was expressed and purified with the Bulk GST Purification Module kit (Amersham Pharmacia) (Fig. 3a). Rabbit anti-GSTP6 antiserum was prepared as previously described (Luo et al., 1998). The total proteins from purified RYSV, healthy leafhoppers and viruliferous leafhoppers were extracted, separated on a 16 % Tris/Tricine gel (Schagger & von Jagow, 1987), and electro-transferred onto ImmobilonTM-P PVDF membrane (Millipore). Immunoblots were done as described by Fang et al. (1994) using rabbit anti-GSTP6 antiserum diluted 1 : 3000. P6 protein (10·5 kDa) was detected in purified virions and also in viruliferous leafhoppers, which transmit RYSV among rice plants (Fig. 3b). Such a protein band was not detected in the total protein extracted from RYSV-infected rice plants in a similar immunoblot assay (data not shown), probably due to the very low P6 content in infected rice plants. Thus, unlike the NV proteins of novirhabdoviruses and as the first case in the family Rhabdoviridae, the protein encoded by the small ORF between the G and L genes in the RYSV genome is associated with purified virions and appears to be a viral structural protein.
|
As described above, the RYSV P6 protein contains five putative phosphorylation motifs. To determine if these sites are candidate targets of phosphorylation, in vitro phosphorylation analysis was performed. Purified GSTP6 fusion protein was phosphorylated in vitro, along with purified GST protein as a control, in a 10 µl assay mixture containing the reaction buffer, 1 µCi [α-32P]ATP (3000 Ci mmol-1, 10 µCi µl-1) and 1 unit casein kinase II (CKII, Promega). After incubation for 1520 min at 37 °C the reactions were stopped, and resolved on a 15 % SDS-PAGE gel followed by autoradiography (Chigaev et al., 2001). The results showed that the GSTP6 fusion protein was phosphorylated by CKII, but the GST protein was not (Fig. 3c). To determine the phosphorylated amino acid residues, gel pieces containing 32P-labelled bands were excised, and the proteins were recovered from gel and precipitated with trichloroacetic acid. The recovered proteins were partially hydrolysed in 6 M HCl at 110 °C for 1 h. The released amino acids were mixed with phosphoamino acid markers and resolved on a thin-layer chromatography plate (Merck). Markers were visualized by spraying the plate with ninhydrin and compared with positions of the 32P-labelled spots (Fig. 3d). Both Thr and Ser were found to be 32P-labelled. In vitro phosphorylation and phosphoamino acid analysis showed that P6 can be phosphorylated by CKII in vitro and the phosphorylated amino acid residues are Thr and Ser.
Although P6 appears to be a virion structural protein and can be phosphorylated, its function remains unknown. P6 has a limited (24 %) sequence similarity with the N-terminal 110 aa of the SYNV L protein. Moreover, P6 contains 32 Asp+Glu (34·4 %) and 4 Lys+Arg (4·3 %) with a large net negative charge, similar to the N-terminal acidic domain of the RYSV L protein (see above), although they share only 18 % sequence similarity. This suggests that P6 may have a close evolutionary relationship with the L protein. Reverse genetic studies have demonstrated that although the NV protein of IHNV was not required for virus replication in cell cultures, it greatly improved virus growth (Biacchesi et al., 2000). Therefore, the small gene preceding the L gene may play an important role in rhabdovirus replication.
References
Biacchesi, S., Thoulouze, M. I., Bearzotti, M., Yu, Y. X. & Bremont, M. (2000). Recovery of NV knockout infectious hematopoietic necrosis virus expressing foreign genes. J Virol 74, 1124711253.
Briese, T., Schneemann, A., Lewis, A. J., Park, Y. S., Kim, S., Ludwig, H. & Lipkin, W. I. (1994). Genomic organization of Borna disease virus. Proc Natl Acad Sci U S A 91, 43624366.
Chen, X. Y., Luo, Z. L. & Fang, R. X. (1998). Structure analysis of the rice yellow stunt rhabdovirus gene 3. Chin Sci Bull 43, 745748. (in Chinese).
Chigaev, A., Lu, G., Shi, H., Asher, C., Xu, R., Latter, H., Seger, R., Garty, H. & Reuveny, E. (2001). In vitro phosphorylation of COOH termini of the epithelial Na+ channel and its effects on channel activity in Xenopus oocytes. Am J Physiol Renal Physiol 280, F10301036.
Choi, T.-J., Kuwata, S., Koonin, E. V., Heateon, L. A. & Jackson, A. O. (1992). Structure of the L (polymerase) protein gene of Sonchus yellow net virus. Virology 189, 3139.[CrossRef][Medline]
Fang, R. X., Wang, Q., Xu, B. Y., Pang, Z., Zhu, H. T., Mang, K. Q., Gao, D. M., Qin, W. S. & Chua, N. H. (1994). Structure of the nucleocapsid protein gene of rice yellow stunt rhabdovirus. Virology 204, 367375.[CrossRef][Medline]
Johnson, M. C., Simon, B. E., Kim, C. H. & Leong, J. A. (2000). Production of recombinant snakehead rhabdovirus: the NV protein is not required for viral replication. J Virol 74, 23432350.
Krug, R. M. (1981). Priming of influenza viral RNA transcription by capped heterologous RNAs. Curr Top Microbiol Immunol 93, 125149.[Medline]
Le Mercier, P., Jacob, Y. & Tordo, N. (1997). The complete Mokola virus genome sequence: structure of the RNA-dependent RNA polymerase. J Gen Virol 78, 15711576.[Abstract]
Luo, Z. L. & Fang, R. X. (1998). Structure analysis of the rice yellow stunt rhabdovirus glycoprotein gene and its mRNA. Arch Virol 143, 24532459.[CrossRef][Medline]
Luo, Z., Chen, X., Gao, D. & Fang, R. (1998). The gene 4 of rice yellow stunt rhabdovirus encodes the matrix protein. Virus Genes 16, 277280.[CrossRef][Medline]
McWilliam, S. M., Kongsuwan, K., Cowley, J. A., Byrne, K. A. & Walker, P. J. (1997). Genome organization and transcription strategy in the complex GNS-L intergenic region of bovine ephemeral fever rhabdovirus. J Gen Virol 78, 13091317.[Abstract]
Schagger, H. & von Jagow, G. (1987). Tricine-sodium dodecyl sulfate-polyacrylamide gel electrophoresis for the separation of proteins in the range from 1 to 100 kDa. Anal Biochem 166, 368379.[CrossRef][Medline]
Scholthof, K. B., Hillman, B. I., Modrell, B., Heaton, L. A. & Jackson, A. O. (1994). Characterization and detection of sc4: a sixth gene encoded by sonchus yellow net virus. Virology 204, 279288.[CrossRef][Medline]
Schutze, H., Enzmann, P. J., Kuchling, R., Mundt, E., Niemann, H. & Mettenleiter, T. C. (1995). Complete genomic sequence of the fish rhabdovirus infectious haematopoietic necrosis virus. J Gen Virol 76, 25192527.
Tanno, F., Nakatsu, A., Toriyama, S. & Kojima, M. (2000). Complete nucleotide sequence of Northern cereal mosaic virus and its genome organization. Arch Virol 145, 13731384.[CrossRef][Medline]
Teninges, D., Bras, F. & Dezelee, S. (1993). Genome organization of the sigma rhabdovirus: six genes and a gene overlap. Virology 193, 10181023.[CrossRef][Medline]
Tordo, N., Poch, O., Ermine, A., Keith, G. & Rougeon, F. (1986). Walking along the rabies genome: is the large G-L intergenic region a remnant gene? Proc Natl Acad Sci U S A 83, 39143918.
Walker, P. J., Benmansour, A., Dietzgen, R. & 7 other authors (2000). Family Rhabdoviridae. In Virus Taxonomy. Seventh Report of the International Committee on Taxonomy of viruses, pp. 563583. Edited by M. H. V. van Regenmortel, C. M. Fauquet, D. H. L. Bishop, E. B. Carstens, M. K. Estes, S. M. Lemon, J. Maniloff, M. A. Mayo, D. J. McGeoch, C. R. Pringle & R. B. Wickner. San Diego: Academic Press.
Wang, Y., McWilliam, S. M., Cowley, J. A. & Walker, P. J. (1994). Complex genome organization in the GNS-L intergenic region of Adelaide River rhabdovirus. Virology 203, 6372.[CrossRef][Medline]
Wang, Q., Chen, X. Y., Luo, Z. L. & Fang, R. X. (1999). Sequence analysis of leader and trailer regions of rice yellow stunt rhabdovirus and characterization of their in vivo transcripts. Sci China Ser C Life Sci 42, 5056.
Wang, L. F., Yu, M., Hansson, E., Pritchard, L. I., Shiell, B., Michalski, W. P. & Eaton, B. T. (2000). The exceptionally large genome of Hendra virus: support for creation of a new genus within the family Paramyxoviridae. J Virol 74, 99729979.
Wetzel, T., Dietzgen, R. G. & Dale, J. L. (1994). Genomic organization of lettuce necrotic yellows rhabdovirus. Virology 200, 401412.[CrossRef][Medline]
Zhu, H. T., Chen, X. Y., Luo, Z. L., Fang, R. X. & Gao, D. M. (1997). Nucleotide sequence of the rice yellow stunt rhabdovirus gene 2. Chin J Virol 13, 369375 (in Chinese).
Received 3 March 2003; accepted 17 April 2003.