Abstract
Footnotes
†Present address: School of Biomedical Sciences, Queen's University Belfast, 97 Lisburn Road, Belfast, BT9 7BL, UK.Porcine Transmissible gastroenteritis virus (TGEV) is a member of the Coronaviridae, a family of enveloped, positive-strand RNA viruses comprising the genera Coronavirus and Torovirus (González et al., 2003; Siddell et al., 2005). Based on genetic and antigenic criteria, TGEV has been classified as belonging to coronavirus group I, which also includes Human coronavirus 229E (HCoV-229E), human coronavirus NL63, Porcine epidemic diarrhea virus, Canine coronavirus (CCoV) and Feline coronavirus (FCoV) (González et al., 2003; van der Hoek et al., 2004).
Coronaviruses feature exceptionally large RNA genomes of about 30 kb and employ a unique transcription mechanism, called discontinuous extension of minus strands, to synthesize an extensive set of 5' leader-containing, subgenome-length RNAs encoding the viral structural proteins and several species-specific (accessory) proteins (Sawicki & Sawicki, 1995, 1998; Zúñiga et al., 2004; Siddell et al., 2005). Except for the nucleocapsid protein, which has recently been shown to be involved in coronavirus replication (Almazán et al., 2004; Schelle et al., 2005), all of the functions required for viral RNA synthesis are thought to be encoded by the replicase gene (Ziebuhr, 2005). This gene occupies about two-thirds of the genome (more than 20 000 nt) and comprises two large ORFs called 1a and 1b. ORF1a encodes the replicase polyprotein pp1a and ORFs 1a and 1b together encode a C-terminally extended version of pp1a, which is called pp1ab and requires ribosomal frameshifting for expression of its ORF1b-encoded portion (Brierley et al., 1987). Proteolytic processing by two or three proteases yields up to 16 processing end products called non-structural proteins (nsp) 116 that, together with several cellular proteins and the nucleocapsid protein, form the active replicasetranscriptase (Ziebuhr et al., 2000; Shi & Lai, 2005; Ziebuhr, 2005). The replicasetranscriptase has been demonstrated (or predicted) to include protease, RNA polymerase, helicase, endo- and exoribonuclease, ribose 2'-O-methyltransferase and ADP-ribose 1''-phosphatase (ADRP) activities (Gorbalenya et al., 1989; Lu et al., 1995; Ziebuhr et al., 1995, 2000; Seybert et al., 2000; Snijder et al., 2003; Bhardwaj et al., 2004; Ivanov & Ziebuhr, 2004; Ivanov et al., 2004a, b; Putics et al., 2005). The biologically relevant reactions catalysed by these enzymes are only slowly beginning to emerge and our understanding of the complex enzymology involved in coronavirus RNA synthesis is limited. For TGEV, none of the replicase gene-encoded activities, except for the 3C-like (main) protease (3CLpro, Mpro) (Anand et al., 2002, 2003; Hegyi & Ziebuhr, 2002), have so far been characterized.
In this study, we characterized two enzymic activities residing in TGEV nsp3, the largest pp1a/pp1ab processing product. Previous sequence analysis revealed that, despite significant sequence diversity in this part of the genome (Ziebuhr et al., 2001; Snijder et al., 2003), there are several domains in nsp3 that, with few exceptions, are conserved in coronaviruses. The typical arrangement of these domains (from N to C terminus) is as follows: acidic domain, papain-like protease 1 (PL1pro), X domain, papain-like protease 2 (PL2pro) and Y domain (Fig. 1) (Ziebuhr et al., 2001). The papain-like protease activities have previously been characterized for Murine hepatitis virus (MHV) (Dong & Baker, 1994; Bonilla et al., 1997; Kanjanahaluethai & Baker, 2000), HCoV-229E (Herold et al., 1998, 1999; Ziebuhr et al., 2001) and Severe acute respiratory syndrome coronavirus (SARS-CoV) (Thiel et al., 2003; Harcourt et al., 2004). Whilst most coronaviruses encode two papain-like protease domains, SARS-CoV nsp3 contains only one of these domains. Because its position in nsp3 corresponds to that of the PL2pro domains of other coronaviruses, the SARS-CoV papain-like protease is called PL2pro (Snijder et al., 2003; Thiel et al., 2003). Also, avian Infectious bronchitis virus (IBV) employs only PL2pro to process the N-terminal pp1a/pp1ab region (Lim & Liu, 1998; Lim et al., 2000). In this case, however, remnants of an (inactive) PL1pro domain have been identified, indicating that IBV has lost its PL1pro activity in the course of evolution (Fig. 1) (Ziebuhr et al., 2001). Upstream of PL2pro, all coronaviruses encode a so-called X domain (Gorbalenya et al., 1991), which has recently been shown for HCoV-229E and SARS-CoV to mediate ADRP activity (Putics et al., 2005).
|
The activity and substrate specificity of the TGEV PL1pro were characterized by expressing the domain together with flanking regions in rabbit reticulocyte lysates. Nucleotides 3154319 of the TGEV (strain Purdue 46) genomic sequence (GenBank accession no. AJ271965[GenBank] ) were amplified by RT-PCR using poly(A) RNA isolated from TGEV-infected cells as a template. The amplicon was inserted into pBlueScript II KS+ DNA (Stratagene) and the resultant plasmid, p31/32, was used as template to amplify the coding sequence of the pp1a/pp1ab residues Thr785Gln1335. The PCR forward primer contained a bacteriophage T7 RNA polymerase promoter sequence followed by a translation initiation codon, and the reverse primer contained a translation stop codon. From this PCR product, m7G(5')ppp(5')G-capped RNA was transcribed by using T7 RNA polymerase and subsequently translated in rabbit reticulocyte lysates in the presence of [35S]methionine by using previously described protocols (Ziebuhr et al., 2001). The expressed protein, 7851335, essentially contained the nsp3 acidic and PL1pro domains preceded by a predicted cleavage site, 878GG|G880 (Fig. 2a). Following incubation of the translation products for up to 8 h, the proteins were analysed by SDS-PAGE and autoradiography. As a negative control in this experiment, a mutant version of 7851335, 7851335_C1093A, was used, in which the codon of the presumed active-site nucleophile, Cys1093, was replaced with an alanine codon by using PCR-based mutagenesis methods. The data shown in Fig. 2(b) suggest that 7851335 (apparent molecular mass, 78 kDa) undergoes autoproteolysis to produce cleavage products of approximately 68 and 10 kDa. By contrast, the control protein, 7851335_C1093A, remained stable. The data are consistent with previous predictions (Eleouet et al., 1995; Ziebuhr et al., 2001) and lead us to conclude that the TGEV PL1pro domain has proteolytic activity and Cys1093 has an essential (probably catalytic) function. Surprisingly, the apparent mass of the larger processing product of 68 kDa differed significantly from its calculated mass (51 kDa). To obtain additional evidence for cleavage at the predicted sequence, we expressed the predicted processing products, 785879 and 8801335, in separate reactions. The two proteins were analysed together with the products obtained from the 7851335 autoprocessing reaction in the same gel (data not shown). We observed co-migration of the autoprocessing products with the two separately expressed (predicted) cleavage products, further supporting the predictions on the site of cleavage and confirming an aberrant migration of the C-terminal cleavage product in SDS-PAGE. Independent support for the site of cleavage also came from site-directed mutagenesis. Substitution with Ala of each of the three consecutive Gly residues at positions 878880 abolished cleavage, suggesting a requirement for Gly residues at the cleavage site (Fig. 2c). To determine unambiguously the scissile bond, we performed an N-terminal radiosequence analysis of the C-terminal cleavage product (Ziebuhr et al., 2001). In this case, we used a mutant form of the protein, 7851335_V884M/V890M, in which two Val residues downstream of the presumed cleavage site were replaced with Met. The mutated RNA was translated in the presence of [35S]methionine to produce two cleavage products that co-migrated with those derived from the parental protein, 7851335 (data not shown), confirming that the two substitutions did not affect the autoprocessing activity of the protein. The processing products obtained from protein 7851335_V884M/V890M were separated by SDS-PAGE and transferred electrophoretically onto a PVDF membrane (Bio-Rad). The C-terminal processing product was identified by autoradiography, excised from the membrane and subjected to radiosequence analysis using previously described protocols (Ziebuhr et al., 2001). The sequence data (Fig. 2d) provided conclusive evidence that TGEV PL1pro cleaves the triple-glycine sequence between Gly879 and Gly880.
|
The data obtained in this study, together with our previous predictions on an upstream site, 110Gly|Ala111 (Fig. 1) (Ziebuhr et al., 2001), led us to predict that TGEV nsp2 comprises the pp1a/pp1ab residues Ala111Gly879. To further corroborate this idea, we performed an additional set of experiments. First, we sought to produce an nsp2-specific antiserum. To this end, residues Lys526Asp713 were expressed in Escherichia coli as a fusion with the maltose-binding protein (MBP). The protein was purified by amylose-affinity chromatography and used to immunize rabbits. The resultant antiserum, T3, was used in immunoprecipitation experiments as described previously (Ziebuhr et al., 1995). Briefly, 7x106 ST cells (ATCC no. CRL-1746) were mock-infected or infected with TGEV at an m.o.i. of 100 TCID50 per cell. Newly synthesized proteins were labelled metabolically with 30 µCi (1110 kBq) [35S]methionine (ml medium)1 between 8 and 11 h post-infection. Subsequently, cytoplasmic lysates were prepared and incubated with either T3 serum or the corresponding preimmune serum for 45 min at 4 °C. Immune complexes were isolated and analysed by SDS-PAGE using 17·3 % gels. As shown in Fig. 2(e), antiserum T3 specifically precipitated a protein of ∼85 kDa from virus-infected cells. The protein was not detectable in lysates obtained from mock-infected cells or if the corresponding preimmune serum was used. The size of the protein corresponded to the calculated size of fully processed nsp2, indicating that, besides the 879G/G880 site, the predicted 110G|A111 site was also cleaved in infected cells. The corresponding N-terminal processing product, nsp1, remains to be identified.
In a final set of experiments, we characterized the so-called X domain, which is located between the PL1pro and PL2pro domains in nsp3 (Fig. 3). Recently, we have shown that the X-domain homologues of HCoV-229E and SARS-CoV are highly specific phosphatases that act to convert ADP-ribose 1''-phosphate (Appr-1''-p) to ADP-ribose (Putics et al., 2005). Intriguingly, the enzymic activity (although being conserved among coronaviruses) proved to be dispensable for HCoV-229E RNA synthesis and production of virus progeny in cultured cells, suggesting that the activity is involved in a non-essential regulatory function or provides a selective advantage only in the infected host (Putics et al., 2005). To further evaluate the biological significance of this newly identified activity, we investigated whether the TGEV X domain is functional. The coding sequence of the TGEV X domain (pp1a/pp1ab residues 13121486, Fig. 3) was expressed as an MBP fusion as decribed previously for the homologues from SARS-CoV and HCoV-229E. Examination of the ADRP activity of the amylose affinity-purified and factor Xa-cleaved protein provided convincing evidence that the protein had the predicted activity, although it was less active than the bacterially expressed homologue from HCoV-229E. To test whether this low activity is an intrinsic property of the TGEV protein, we expressed three additional versions of this protein with slightly extended or shortened N- and C-terminal protein segments. As shown in Fig. 3, variations of the domain borders strongly affected the ADRP activities of the expressed proteins. Under the conditions used in our assay, the protein TGEV-X-2 (and, as expected, our positive control, HCoV-229E X) dephosphorylated the substrate, Appr-1''-p, completely. TGEV-X-1 proved to be less active, whilst TGEV-X-3 and TGEV-X-4 were completely inactive [also after longer incubation times (data not shown)]. The data confirm unambiguously that the TGEV X domain has the predicted ADRP activity and delimit the minimal catalytically active domain to pp1a/pp1ab residues Asp1320Ser1486. The data further suggest that the enzymic activity of the TGEV X domain critically depends on the integrity of N-terminal structure elements and thus could be modulated (or even inhibited) by the adjacent domains in nsp3.
|
In conclusion, X domain-associated ADRP activities have now been identified in members of three different coronavirus (sub)groups (Putics et al., 2005). Furthermore, X-domain homologues are conserved in the genomes of toroviruses, hepatitis E viruses, rubellaviruses and alphaviruses (Gorbalenya et al., 1991; Draker et al., 2006) and exist in many cellular organisms (so-called macro-domain proteins) (Karras et al., 2005). The function of these ubiquitous domains, which is probably promoted by ADPribose binding and the associated ADRP activity, remains to be identified.
References
Anand, K., Palm, G. J., Mesters, J. R., Siddell, S. G., Ziebuhr, J. & Hilgenfeld, R. (2002). Structure of coronavirus main proteinase reveals combination of a chymotrypsin fold with an extra α-helical domain. EMBO J 21, 32133224.[CrossRef][Medline]
Anand, K., Ziebuhr, J., Wadhwani, P., Mesters, J. R. & Hilgenfeld, R. (2003). Coronavirus main proteinase (3CLpro) structure: basis for design of anti-SARS drugs. Science 300, 17631767.
Bhardwaj, K., Guarino, L. & Kao, C. C. (2004). The severe acute respiratory syndrome coronavirus Nsp15 protein is an endoribonuclease that prefers manganese as a cofactor. J Virol 78, 1221812224.
Bonilla, P. J., Hughes, S. A. & Weiss, S. R. (1997). Characterization of a second cleavage site and demonstration of activity in trans by the papain-like proteinase of the murine coronavirus mouse hepatitis virus strain A59. J Virol 71, 900909.[Abstract]
Brierley, I., Boursnell, M. E. G., Binns, M. M., Bilimoria, B., Blok, V. C., Brown, T. D. K. & Inglis, S. C. (1987). An efficient ribosomal frame-shifting signal in the polymerase-encoding region of the coronavirus IBV. EMBO J 6, 37793785.[Medline]
Dong, S. & Baker, S. C. (1994). Determinants of the p28 cleavage site recognized by the first papain-like cysteine proteinase of murine coronavirus. Virology 204, 541549.[CrossRef][Medline]
Draker, R., Roper, R. L., Petric, M. & Tellier, R. (2006). The complete sequence of the bovine torovirus genome. Virus Res 115, 5668.[CrossRef][Medline]
Eleouet, J.-F., Rasschaert, D., Lambert, P., Levy, L., Vende, P. & Laude, H. (1995). Complete sequence (20 kilobases) of the polyprotein-encoding gene 1 of transmissible gastroenteritis virus. Virology 206, 817822.[CrossRef][Medline]
Genschik, P., Hall, J. & Filipowicz, W. (1997). Cloning and characterization of the Arabidopsis cyclic phosphodiesterase which hydrolyzes ADP-ribose 1'',2''-cyclic phosphate and nucleoside 2',3'-cyclic phosphates. J Biol Chem 272, 1321113219.
González, J. M., Gomez-Puertas, P., Cavanagh, D., Gorbalenya, A. E. & Enjuanes, L. (2003). A comparative sequence analysis to revise the current taxonomy of the family Coronaviridae. Arch Virol 148, 22072235.[CrossRef][Medline]
Gorbalenya, A. E., Koonin, E. V., Donchenko, A. P. & Blinov, V. M. (1989). Coronavirus genome: prediction of putative functional domains in the non-structural polyprotein by comparative amino acid sequence analysis. Nucleic Acids Res 17, 48474861.
Gorbalenya, A. E., Koonin, E. V. & Lai, M. M. C. (1991). Putative papain-related thiol proteases of positive-strand RNA viruses. Identification of rubi- and aphthovirus proteases and delineation of a novel conserved domain associated with proteases of rubi-, α- and coronaviruses. FEBS Lett 288, 201205.[CrossRef][Medline]
Harcourt, B. H., Jukneliene, D., Kanjanahaluethai, A., Bechill, J., Severson, K. M., Smith, C. M., Rota, P. A. & Baker, S. C. (2004). Identification of severe acute respiratory syndrome coronavirus replicase products and characterization of papain-like protease activity. J Virol 78, 1360013612.
Hegyi, A. & Ziebuhr, J. (2002). Conservation of substrate specificities among coronavirus main proteases. J Gen Virol 83, 595599.
Herold, J., Gorbalenya, A. E., Thiel, V., Schelle, B. & Siddell, S. G. (1998). Proteolytic processing at the amino terminus of human coronavirus 229E gene 1-encoded polyproteins: identification of a papain-like proteinase and its substrate. J Virol 72, 910918.
Herold, J., Siddell, S. G. & Gorbalenya, A. E. (1999). A human RNA viral cysteine proteinase that depends upon a unique Zn2+-binding finger connecting the two domains of a papain-like fold. J Biol Chem 274, 1491814925.
Ivanov, K. A. & Ziebuhr, J. (2004). Human coronavirus 229E nonstructural protein 13: characterization of duplex-unwinding, nucleoside triphosphatase, and RNA 5'-triphosphatase activities. J Virol 78, 78337838.
Ivanov, K. A., Hertzig, T., Rozanov, M., Bayer, S., Thiel, V., Gorbalenya, A. E. & Ziebuhr, J. (2004a). Major genetic marker of nidoviruses encodes a replicative endoribonuclease. Proc Natl Acad Sci U S A 101, 1269412699.
Ivanov, K. A., Thiel, V., Dobbe, J. C., van der Meer, Y., Snijder, E. J. & Ziebuhr, J. (2004b). Multiple enzymatic activities associated with severe acute respiratory syndrome coronavirus helicase. J Virol 78, 56195632.
Kanjanahaluethai, A. & Baker, S. C. (2000). Identification of mouse hepatitis virus papain-like proteinase 2 activity. J Virol 74, 79117921.
Karras, G. I., Kustatscher, G., Buhecha, H. R., Allen, M. D., Pugieux, C., Sait, F., Bycroft, M. & Ladurner, A. G. (2005). The macro domain is an ADP-ribose binding module. EMBO J 24, 19111920.[CrossRef][Medline]
Lim, K. P. & Liu, D. X. (1998). Characterization of the two overlapping papain-like proteinase domains encoded in gene 1 of the coronavirus infectious bronchitis virus and determination of the C-terminal cleavage site of an 87-kDa protein. Virology 245, 303312.[CrossRef][Medline]
Lim, K. P., Ng, L. F. P. & Liu, D. X. (2000). Identification of a novel cleavage activity of the first papain-like proteinase domain encoded by open reading frame 1a of the coronavirus Avian infectious bronchitis virus and characterization of the cleavage products. J Virol 74, 16741685.
Lu, Y., Lu, X. & Denison, M. R. (1995). Identification and characterization of a serine-like proteinase of the murine coronavirus MHV-A59. J Virol 69, 35543559.[Abstract]
Putics, Á., Filipowicz, W., Hall, J., Gorbalenya, A. E. & Ziebuhr, J. (2005). ADP-ribose-1''-monophosphatase: a conserved coronavirus enzyme that is dispensable for viral replication in tissue culture. J Virol 79, 1272112731.
Sawicki, S. G. & Sawicki, D. L. (1995). Coronaviruses use discontinuous extension for synthesis of subgenome-length negative strands. Adv Exp Med Biol 380, 499506.[Medline]
Sawicki, S. G. & Sawicki, D. L. (1998). A new model for coronavirus transcription. Adv Exp Med Biol 440, 215219.[Medline]
Schelle, B., Karl, N., Ludewig, B., Siddell, S. G. & Thiel, V. (2005). Selective replication of coronavirus genomes that express nucleocapsid protein. J Virol 79, 66206630.
Seybert, A., Hegyi, A., Siddell, S. G. & Ziebuhr, J. (2000). The human coronavirus 229E superfamily 1 helicase has RNA and DNA duplex-unwinding activities with 5'-to-3' polarity. RNA 6, 10561068.[Abstract]
Shi, S. T. & Lai, M. M. C. (2005). Viral and cellular proteins involved in coronavirus replication. Curr Top Microbiol Immunol 287, 95131.[Medline]
Siddell, S. G., Ziebuhr, J. & Snijder, E. J. (2005). Coronaviruses, toroviruses, and arteriviruses. In Topley and Wilson's Microbiology and Microbial Infections, 10th edn, pp. 823856. Edited by B. W. J. Mahy & V. ter Meulen. London: Hodder Arnold.
Snijder, E. J., Bredenbeek, P. J., Dobbe, J. C. & 7 other authors (2003). Unique and conserved features of genome and proteome of SARS-coronavirus, an early split-off from the coronavirus group 2 lineage. J Mol Biol 331, 9911004.[CrossRef][Medline]
Thiel, V., Ivanov, K. A., Putics, Á. & 9 other authors (2003). Mechanisms and enzymes involved in SARS coronavirus genome expression. J Gen Virol 84, 23052315.
van der Hoek, L., Pyrc, K., Jebbink, M. F. & 7 other authors (2004). Identification of a new human coronavirus. Nat Med 10, 368373.[CrossRef][Medline]
Ziebuhr, J. (2005). The coronavirus replicase. Curr Top Microbiol Immunol 287, 5794.[Medline]
Ziebuhr, J. & Siddell, S. G. (1999). Processing of the human coronavirus 229E replicase polyproteins by the virus-encoded 3C-like proteinase: identification of proteolytic products and cleavage sites common to pp1a and pp1ab. J Virol 73, 177185.
Ziebuhr, J., Herold, J. & Siddell, S. G. (1995). Characterization of a human coronavirus (strain 229E) 3C-like proteinase activity. J Virol 69, 43314338.[Abstract]
Ziebuhr, J., Snijder, E. J. & Gorbalenya, A. E. (2000). Virus-encoded proteinases and proteolytic processing in the Nidovirales. J Gen Virol 81, 853879.
Ziebuhr, J., Thiel, V. & Gorbalenya, A. E. (2001). The autocatalytic release of a putative RNA virus transcription factor from its polyprotein precursor involves two paralogous papain-like proteases that cleave the same peptide bond. J Biol Chem 276, 3322033232.
Zúñiga, S., Sola, I., Alonso, S. & Enjuanes, L. (2004). Sequence motifs involved in the regulation of discontinuous coronavirus subgenomic RNA synthesis. J Virol 78, 980994.
Received 9 October 2005; accepted 12 November 2005.