We use cookies to improve your experience. By continuing to browse this site, you accept our cookie policy.×
Skip main navigation
Aging Health
Bioelectronics in Medicine
Biomarkers in Medicine
Breast Cancer Management
CNS Oncology
Colorectal Cancer
Concussion
Epigenomics
Future Cardiology
Future Medicine AI
Future Microbiology
Future Neurology
Future Oncology
Future Rare Diseases
Future Virology
Hepatic Oncology
HIV Therapy
Immunotherapy
International Journal of Endocrine Oncology
International Journal of Hematologic Oncology
Journal of 3D Printing in Medicine
Lung Cancer Management
Melanoma Management
Nanomedicine
Neurodegenerative Disease Management
Pain Management
Pediatric Health
Personalized Medicine
Pharmacogenomics
Regenerative Medicine
Short CommunicationFree Access

In silico identification of conserved cis-acting RNA elements in the SARS-CoV-2 genome

    Bader Y Alhatlani

    *Author for correspondence:

    E-mail Address: balhatlani@qu.edu.sa

    Department of Medical Applied Sciences, Unayzah Community College, Qassim University, Unayzah, Saudi Arabia

    Published Online:https://doi.org/10.2217/fvl-2020-0163

    Abstract

    Aim: The aim of this study was to computationally predict conserved RNA sequences and structures known as cis-acting RNA elements (CREs) in the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) genome. Materials & methods: Bioinformatics tools were used to analyze and predict CREs by obtaining viral sequences from available databases. Results: Computational analysis revealed the presence of RNA stem-loop structures within the 3′ end of the ORF1ab region analogous to previously identified SARS-CoV genomic packaging signals. Alignment-based RNA secondary structure predictions of the 5′ end of the SARS-CoV-2 genome also identified conserved CREs. Conclusion: These CREs may be potential vaccine and/or antiviral therapeutic targets; however, further studies are warranted to confirm their roles in the SARS-CoV-2 life cycle.

    In December 2019, a novel coronavirus, initially named 2019-nCoV, was found to be the causative agent for an outbreak of pneumonia in patients that visited a wet market in Wuhan, China [1]. The virus was subsequently renamed severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and identified to be from the Betacoronavirus genus [2]. SARS-CoV-2 is responsible for the ongoing global pandemic of coronavirus disease 2019, which as of 22 May 2020 has caused more than 5 million confirmed cases and more than 332,000 deaths worldwide according to the WHO. Although there have been great efforts to develop an effective vaccine or specific antiviral treatment, neither is available yet. The Coronaviridae family contains a variety of viruses that cause a wide range of diseases, including respiratory, enteric, hepatic and neurological diseases in human and animals [3]. Most of the human coronaviruses usually cause mild symptoms; however, two human coronaviruses, severe acute respiratory syndrome (SARS-CoV) and Middle East respiratory syndrome (MERS-CoV), were identified to be highly pathogenic in humans [4,5]. Coronaviruses (CoVs) are generally grouped into four genera: Alphacoronavirus, Betacoronavirus, Gammacoronavirus and Deltacoronavirus [6,7].

    CoVs have the largest known genomes among RNA viruses, which range from about 26 to 32 kb in length and contain an enveloped positive-sense, single-stranded RNA molecule (+ve ssRNA) that is capped at the 5′ end and polyadenylated at the 3′ end [8]. SARS-CoV2 shares genomic features with other SARS-like CoVs with complete genomic similarities of about 88% to bat SARS-like CoVs and 79% to SARS-CoV [9]. The SARS-CoV-2 genome is organized into about 13 open reading frames (ORFs) and two-thirds of the genome is occupied by the 5′-terminal region overlapping ORF1a and ORF1b, which are translated from the genomic RNA to encode the replicase polyproteins pp1a and pp1b [9,10].

    Like most RNA viruses, the genomes of CoVs contain cis-acting RNA elements (CREs) and stem-loop structures that interact with RNA and viral or host proteins to form RNA–RNA or RNA–protein interactions to facilitate viral replication, translation and genome packaging [11,12]. While often these CREs are located at the 5′ and 3′ untranslated regions (UTRs), they can also be found within the coding regions of CoV genomes [13–15]. These important regions of the viral genome may be potential targets for SARS-CoV-2 antiviral therapeutics. Therefore, the aim of this study was to locate cis-acting regulatory elements within the SARS-CoV-2 genome using bioinformatics approach.

    Materials & methods

    GenBank accession numbers of viral sequences

    Viral genomic sequences were retrieved from GenBank (National Center for Biotechnology Information [NCBI]). Virus strains and accession IDs used in this study were: (NC_045512.2) for SARS-CoV-2, (NC_004718.3) for SARS-CoV and (MG772933) for bat SARS-like CoV.

    Bioinformatics analysis

    The RNA secondary structures of the viral genomic sequences were predicted using the online Mfold web server at http://unafold.rna.albany.edu/?q=mfold/ and RNAfold web server at http://rna.tbi.univie.ac.at/cgi-bin/RNAWebSuite/RNAfold.cgi [16,17]. In addition, the LocARNA web server at http://rna.informatik.uni-freiburg.de/LocARNA/Input.jsp was used for the alignment-based prediction of consensus RNA secondary structures at the 5′-terminal region of SARS-CoV-2, bat SARS-like CoV and SARS-CoV genomes [18]. The VARNA web applet at http://varna.lri.fr/ was then used to draw RNA secondary structures [19].

    Results

    Previous studies on betacoronaviruses such as mouse hepatitis coronavirus (MHV), MERS-CoV and SARS-CoV revealed that the ORF1b region may contain CREs suggested to function as packaging signals (PSs) [20,21]. These CREs are functionally and structurally conserved within the same lineages of betacoronaviruses [14]. A previous bioinformatics study on the SARS-CoV genome predicted a stable stem-loop RNA structure at the 3′ end of ORF1b, encompassing nucleotides (nts) 19888–19950, to be the putative core PS (PScore) of SARS-CoV [22]. A further functional analysis identified the PS of the SARS-CoV as a 580 nt sequence encompassing viral genomic RNA (nt range: 19712–20294) that fold into a RNA secondary structure, including the PScore and binds to the nucleocapsid (N) protein [21]. SARS-CoV-2 shares 87.6 and 79% complete genome similarities with bat SARS-like CoV and SARS-CoV, respectively [9,23].

    Hence, in this study, bioinformatics analysis was used to identify if the ORF1b region of SARS-CoV-2 possesses cis-acting RNA elements similar to that observed in SARS-CoV and other CoVs. To test this hypothesis, the SARS-CoV-2 ORF1b region was analyzed for RNA secondary structures and compared with the closely related SARS-CoV and bat SARS-like CoV sequences. SARS-CoV-2 RNA sequences spanning nts 15,000–21,541 were first analyzed using the RNAfold web server to predict the minimum free energy [17]. Results predicted the lowest free energy was located between nts 19,000 and 20,300 of the SARS-CoV-2 ORF1b region (Figure 1). Because this position overlaps with the genomic PS of SARS-CoV, which was identified at nt positions 19,712–20,294, the Mfold web server was then used to predict the RNA secondary structures spanning nts 19,712–20,294 of the SARS-CoV-2 ORF1b region and were compared with those of SARS-CoV and bat SARS-like CoV [16]. The Mfold analysis resulted in differences in the predicted RNA secondary structures between the three sequences; however, two stable stem-loops were identified to be similar and observed in all three viruses (Figure 2A–C). The predicted two stem-loops (named as SL1 and SL2) were located at nt positions 19,900–20,000, 19,839–19,943 and 19,903–20,000 of the SARS-CoV-2, SARS-CoV and bat SARS-like CoV genomes, respectively (Figure 3A–C). The upper part of SL1, which contains 38 nts, is structurally conserved between the three viruses, with covariation in the sequences. In the case of SARS-CoV-2, the SL1 is 70 nts and longer than those predicted in SARS-CoV and bat SARS-like CoV, whereas its length in SARS-CoV is 51 nts and only the upper part of SL1 is formed in bat SARS-like CoV (Figure 3A–C). The SL2 is much shorter in SARS-CoV-2, with only 26 nts, compared with that predicted in SARS-CoV and bat SARS-like CoV, which is 51 nts in length for both viruses. However, the genetic sequences of the upper part of SL2 were more conserved among the three viruses than SL1, with only a two nucleotide difference among SARS-CoV-2, SARS-CoV and bat SARS-like CoV. The first covariant is located at the stem and includes a C-G base pair in the SARS-CoV-2 sequence in place of a U-G base pair found in SARS-CoV and bat SARS-like CoV (Figure 3A–C). This single nucleotide difference from a C to U maintains the base pairing of the stem and does not change the amino acid (Leu) of ORF1b (Figure 3A–C). The second covariant is located at the U-rich loop and it is the third base in the loop in which this nucleotide is a U in the SL2 of SARS-CoV-2. It is a G and an A in the SARS-CoV and bat SARS-like CoV sequences, respectively. However, this single nucleotide difference changed the amino acid from Leu to Phe at this position in the SARS-CoV-2 genome (Figure 3A–C). It should be noted that the SL2 that was the previously predicted to function as the putative core PS (PScore) of SARS-CoV genome [21,22]. Therefore, it is reasonable to assume that the predicted RNA stem-loop structures of SARS-CoV-2 may also have the same role as a putative genomic packaging signal, given the conservation of the RNA structures and SL1 and SL2 sequences and similarities of the genomic locations of these predicted RNA structures within ORF1b.

    Figure 1. Prediction of minimum free energy structures within the severe acute respiratory syndrome coronavirus 2 ORF1b region.

    The SARS-CoV-2 genome is about 30 kb in length and organized in 13 ORFs. The viral genome is flanked by the 5′ and 3′ UTRs. The ORF1b region is highlighted in red (upper panel) and a mounting plot represents the prediction of the MFE structures spanning from nt position 15,000–21,541 of the SARS-CoV-2 genome sequences (NC_045512.2) is shown with the highest peak of MFE prediction located between 19,000 and about 20,300 nt of the SARS-CoV-2 ORF1b region (lower panel). Note that the mountain plot indicates secondary structure in a plot of height versus sequence position, where the peaks represent the probabilities of RNA base pairing.

    MFE: Minimum free energy; nt: Nucleotide; ORF: Open reading frame; SARS-CoV-2: Severe acute respiratory syndrome coronavirus 2; UTR: Untranslated region.

    Figure 2. Prediction of ORF1b RNA secondary structures.

    RNA secondary structures located at the 3′ end of the SARS-CoV-2 ORF1b region (nucleotide positions: 19,712–20,294) were predicted using the Mfold web server (A) and compared with RNA secondary structures at the same position for SARS-CoV (NC_004718.3) (B) and bat SARS-like CoV (MG772933) (C). The two identical RNA stem-loops (SL1 and SL2) are indicated in a rectangle among the three viruses.

    SARS-CoV-2: Severe acute respiratory syndrome coronavirus 2.

    Figure 3. Comparison of the predicted RNA secondary structures among Severe Acute Respiratory Syndrome Coronavirus 2.

    The viral RNA sequences in the 3′ end of the ORF1b region from 19,900 to 20,019 nucleotide for both SARS-CoV-2 and bat SARS-like CoV and in the region of 19,830–19,949 nucleotide for SARS-CoV, were folded using the Mfold web server and drawn using the VARNA web applet to compare predicted RNA stem-loops. The similarities of the RNA structures of the predicted SL1 and SL2 among the three viruses are highlighted in green and blue, respectively. In addition, the sequence variation in these two stem-loops are indicated in black. These two RNA stem-loops are part of the previously identified SARS-CoV genomic PS, with SL2 being identified as the PScore [21,22].

    SARS-CoV-2: Severe acute respiratory syndrome coronavirus 2.

    Cis-acting RNA element secondary structures and sequences have also been previously described at the 5′ end of a number of CoVs, including MHV, bovine coronavirus (BCoV), MERS-CoV and SARS-CoV [15]. RNA secondary structures at the 5′ region of the SARS-CoV genome were previously predicted to fold into eight stem-loops (SL1 to SL8) [15]. To predict the RNA secondary structures at the 5′-proximal SARS-CoV-2 sequence and compare it to the 5′ ends of SARS-CoV and bat SARS-like CoV, the first 474 nts of the SARS-CoV-2 genomic sequence was analyzed using the Mfold web server. It should be noted that during the preparation of this manuscript, a recent study predicted conserved RNA structures within the SARS-CoV-2 genome, including RNA elements at the 5′ and 3′ ends [24]. The Mfold analysis predicted a SARS-CoV-2 5′-terminal RNA secondary structure model with eight RNA stem-loops (SL1–SL8) that was identical to the SARS-CoV model previously proposed by Yang et al. (Figure 4). In addition, the predicted model in this study was similar to that recently described in the bioinformatics study by Rangan et al., except that seven stem-loops (SL1–SL7) were predicted. Herein, an additional stem-loop (SL8) was identified that was consistent with the SARS-CoV 5′ RNA structure model [15,24]. SL1, SL2 and SL4 are located within the 5′ UTRs of coronavirus genomes and they are structurally conserved among at least three coronavirus genera, with SL2 being the most conserved RNA secondary structure [25]. In addition, these results demonstrated that the conserved core leader of the transcriptional regulatory sequence (TRS-L) region required for subgenomic RNA synthesis is located within SL3, which is a finding consistent with previous studies of SARS-CoV and BCoV (Figure 4) [25]. The predicted SL4, which has been shown to be conserved in all CoVs, is longer than the three preceding stem-loops and contains a short upstream ORF that is found in most CoVs [13,25,26]. Moreover, a long stem-loop RNA structure that contains three hairpin substructures (termed as SL5A, 5B and 5C) was also identified (Figure 4) [14]. Part of the SL5 is located in the 5′ UTR; however, the AUG initiation codon of the nsp1 is located downstream of SL5C at a position similar with previous studies on SARS-CoV (Figure 4). It should be noted that SL5ABC is conserved among betacoronaviruses, suggesting essential roles, such as viral replication, of these stem-loops structures as CREs in the life cycle of CoVs. In agreement with a previous study on SARS-CoV, the loops of SL5A and SL5B contain the conserved 5′-UUUCGU-3′ motifs and this is equivalent to the conserved 5′-UUYCGU-3′ loop sequences found in SL5ABC of alphacoronaviruses [13,25]. Bioinformatics analysis predicted three stem-loops (SL6, SL7 and SL8) in the nsp1 coding sequence similar to that found in the 5′-terminal region of SARS-CoV (Figure 4). However, these RNA structures are known to be less conserved between CoV lineages than the stem-loops found within the 5′ UTR of CoV genomes.

    Figure 4. Computational prediction of RNA secondary structures model at the severe acute respiratory syndrome coronavirus 2 5′-terminal region.

    Schematic representation of the RNA secondary structures at the 5′ region of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) genome (nucleotide positions: 1–474) was generated using the Mfold web server and drawn using the VARNA web applet (ΔG = -155.50 kcal/mol). The eight RNA stem-loops (SL1–SL8) are indicated and the conserved core leader transcriptional regulatory sequence (TRS) region located within the SL3 is highlighted in gray. In addition, the conserved 5′-UUUCGU-3′ motifs found within SL5A and SL5B are indicated in boxes and the AUG start codon of ORF1ab is indicated in red. The SARS-CoV-2 reference genome sequence obtained from GenBank (NC_045512.2) was used.

    To further confirm the conservation of the RNA secondary structures located at the 5′ regions among SARS-related viruses, the first 474 nts of SARS-CoV-2, SARS-CoV and bat SARS-like CoV genomes were aligned and folded using the LocARNA web server [18]. Sequence alignments indicated that all the RNA stem-loops were highly conserved among the three viruses, with SL2 and SL3 being the most conserved among the other stem-loops with sequence covariation (Figure 5). This high degree of conservation was expected for SL2 and SL3, because SL2 is suggested to be the most conserved RNA element in the CoV 5′ UTR region. SL3, which is found in SARS-like CoVs and BCoVs, contains TRS-L sequences that have an essential role in subgenomic RNA synthesis [13,27]. These results indicated high conservation of RNA elements at the 5′ ends of SARS-like CoV genomes, hence suggesting that these RNA secondary structures also function as CREs in the SARS-CoV-2 life cycle.

    Figure 5. Alignment-based prediction of the RNA structures at the 5′-terminal regions of severe acute respiratory syndrome coronavirus 2 genomes.

    The first 474 nucleotides of SARS-CoV-2, bat SARS-like CoV and SARS-CoV genomes were aligned using LocARNA to predict conserved RNA stem-loops at the 5′ ends. Colors indicate the conservation of base pair types in which red represents conservation of one base pair and yellow and green indicate conservation of two and three base pairs, respectively. Dark color represents that all three viruses contained this base pair, whereas light color shows that one or two of the sequences did not contain the base pairs.

    SARS-CoV-2: Severe acute respiratory syndrome coronavirus 2.

    Discussion

    Cis-acting RNA elements have been described and characterized in several RNA virus genomes, including picornaviruses, coronaviruses and noroviruses [11,28,29]. These CREs play important functional roles in the virus life cycle and usually contribute to viral RNA replication, translation and genome packaging [11]. Previous studies have identified stem-loop RNA structures located at the 3′ end of the SARS-CoV ORF1b, which are recognized by the SARS-CoV structural N protein and suggested to act as the genomic PS [21,22]. Therefore, this work aimed to use bioinformatics tools to identify if the genome of the newly emerged SARS-CoV-2 shares conserved CREs present within the 5′ end and the ORF1ab regions of the viral genome.

    Computational analysis predicted two conserved stem-loops within the 3′ end of the ORF1b region (named as SL1 and SL2), the latter of which have been previously described to be part of the SARS-CoV PS and was named as PScore [22]. SL2 is more conserved among the three viruses than SL1 and the top of SL2 is featured with a hexaloop that contains a U-rich motif. It should be noted that genomic PSs for different CoVs consist of RNA structure elements that vary in length and genomic location within the same lineages [14,30]. For example, the genomic PS of MERS-CoV, which is a lineage C Betacoronavirus, was identified in a similar position as SARS-CoV at the 3′ end of the ORF1ab region [31]. Moreover, the genomic PS is functionally and structurally conserved in lineage A betacoronaviruses that contain a 95 nt stem-loop RNA structure and located within the 3′ region of ORF1b [32]. However, the genomic PS of transmissible gastroenteritis virus is located at the 5′-terminal end of the viral genome [33]. Although this study cannot conclude the functionality of the predicted RNA stem-loop structures located at the SARS-CoV-2 ORF1b region and their roles in the SARS-CoV-2 life cycle, it is postulated that they may similarly function as cis-acting elements. Hence, they could be a putative genomic PS for SARS-CoV-2 because of: the similar position of the viral genome where these RNA elements are located at the 3′ end of ORF1b; and and the conserved RNA secondary structures and sequences of these predicted stem-loops when compared with those found in the closely related SARS-CoV and bat SARS-like CoV.

    This study also used genome sequences at the 5′-proximal region of the SARS-CoV-2 genome to predict conserved RNA secondary structures and compare to SARS-related CoVs, such as SARS-CoV and bat SARS-like CoV. The predicted model in this study was similar to a previous model described for SARS-CoV, in which eight RNA stem-loops (SL1–SL8) were identified [15]. In addition, the RNA structures at the 5′ region of SARS-CoV-2 were similar to structures recently described in a bioinformatics study; however, the only difference was that SL8 was not identified in the previous study [24]. The RNA secondary structures of SL1, SL2 and SL4 are conserved across the three CoV genera, whereas SL1, SL2 and SL4 and SL5ABC are only conserved among betacoronaviruses [15].

    Conclusion

    In summary, this study used computational tools to predict cis-acting RNA motifs in the SARS-CoV-2 RNA genome. Bioinformatics analysis suggested that the 3′ end of the SARS-CoV-2 ORF1b region may contain RNA structure elements structurally conserved with other SARS-CoVs and analogous to the SARS-CoV genomic packaging signal. This study also demonstrated and confirmed predicted RNA secondary structures at the 5′-proximal region of the SARS-CoV-2 genome. However, further studies are warranted to: biochemically confirm these CREs; and to investigate their roles in the virus life cycle. These important regions within the SARS-CoV-2 genome could then be targeted to develop a vaccine and/or antiviral therapeutics.

    Summary points
    • Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is responsible for the ongoing global pandemic of coronavirus disease 2019.

    • Similar to most RNA viruses, the SARS-CoV-2 genome contains cis-acting RNA elements that interact with RNA and viral or host protein.

    • A previous bioinformatics study on the SARS-CoV genome predicted a stable stem-loop RNA structure at the 3′ end of ORF1b, to be the putative core PS (PScore).

    • Bioinformatics analysis identified two stable stem-loops that are similar and observed in the 3′ end of three SARS-CoVs ORF1b region (including SARS-CoV-2).

    • One of the identified stem-loop (SL2) was the previously predicted to function as the putative core PS (PScore) of SARS-CoV genome.

    • Therefore, it is reasonable to assume that the predicted RNA stem-loop structures of SARS-CoV-2 may also have the same role as a putative genomic packaging signal.

    • The Mfold analysis predicted a SARS-CoV-2 5′-terminal RNA secondary structure model that was identical to the SARS-CoV model previously proposed.

    • Alignment-based prediction of the RNA structures at the 5′-terminal regions were highly conserved among three SARS-like CoVs.

    • Further studies are warranted to: biochemically confirm these cis-acting RNA elements; and to investigate their roles in the virus life cycle.

    Acknowledgments

    The author would like to acknowledge S Vashist for the discussion and helpful comments on this manuscript.

    Financial & competing interests disclosure

    The author has no relevant affiliations or financial involvement with any organization or entity with a financial interest in or financial conflict with the subject matter or materials discussed in the manuscript. This includes employment, consultancies, honoraria, stock ownership or options, expert testimony, grants or patents received or pending, or royalties.

    No writing assistance was utilized in the production of this manuscript.

    References

    • 1. Zhu N, Zhang D, Wang W et al. A novel coronavirus from patients with pneumonia in China, 2019. N. Engl. J. Med. 382(8), 727–733 (2020).
    • 2. Coronaviridae Study Group of the International Committee on Taxonomy of Viruses. The species severe acute respiratory syndrome coronavirus: classifying 2019-nCoV and naming it SARS-CoV-2. Nat. Microbiol. 5(4), 536 (2020).
    • 3. Weiss SR, Leibowitz JL. Coronavirus pathogenesis. Adv. Virus Res. 81, 85–164 (2011).
    • 4. Drosten C, Günther S, Preiser W et al. Identification of a novel coronavirus in patients with severe acute respiratory syndrome. N. Engl. J. Med. 348(20), 1967–1976 (2003).
    • 5. Zaki AM, van Boheemen S, Bestebroer TM, Osterhaus AD, Fouchier RA. Isolation of a novel coronavirus from a man with pneumonia in Saudi Arabia. N. Engl. J. Med. 367(19), 1814–1820 (2003).
    • 6. Woo PCY, Lau SKP, Lam CSF et al. Discovery of seven novel mammalian and avian coronaviruses in the genus deltacoronavirus supports bat coronaviruses as the gene source of alphacoronavirus and betacoronavirus and avian coronaviruses as the gene source of gammacoronavirus and deltacoronavi. J. Virol. 86(7), 3995–4008 (2012).
    • 7. King AM, Adams MJ, Carsten EB, Lefkowitz EJ. Virus Taxonomy: Ninth Report of the International Committee on Taxonomy of Viruses. Science Direct, CA, USA (2011).
    • 8. Su S, Wong G, Shi W et al. Epidemiology, genetic recombination and pathogenesis of coronaviruses. Trends Microbiol. 24(6), 490–502 (2016).
    • 9. Lu R, Zhao X, Li J et al. Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding. Lancet 395(10224), 565–574 (2020).
    • 10. Wu A, Peng Y, Huang B et al. Commentary genome composition and divergence of the novel coronavirus (2019-nCoV) originating in China. Cell Host Microbe 27(3), 325–328 (2020).
    • 11. Liu Y, Wimmer E, Paul AV. Cis-acting RNA elements in human and animal plus-strand RNA viruses. Biochim. Biophys. Acta 1789(9–10), 495–517 (2009).
    • 12. Sola I, Mateos-Gomez PA, Almazan F, Zuñiga S, Enjuanes L. RNA-RNA and RNA-protein interactions in coronavirus replication and transcription. RNA Biol. 8(2), 237–248 (2011).
    • 13. Madhugiri R, Fricke M, Marz M, Ziebuhr J. RNA structure analysis of alphacoronavirus terminal genome regions. Virus Res. 194, 76–89 (2014).
    • 14. Madhugiri R, Fricke M, Marz M, Ziebuhr J. Coronavirus cis-acting RNA elements. Adv. Virus Res. 96, 127–163 (2016).
    • 15. Yang D, Leibowitz JL. The structure and functions of coronavirus genomic 3′ and 5′ ends. Virus Res. 206, 120–133 (2015).
    • 16. Zuker M. Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res. 31(13), 3406–3415 (2003).
    • 17. Lorenz R, Bernhart SH, Siederdissen Höner zu C et al. ViennaRNA Package 2.0. Algorithms Mol. Biol. 6(1), 26 (2011).
    • 18. Will S, Joshi T, Hofacker IL, Stadler PF, Backofen R. LocARNA-P: accurate boundary prediction and improved detection of structural RNAs. RNA 18(5), 900–914 (2012).
    • 19. Darty K, Denise A, Ponty Y. VARNA: interactive drawing and editing of the RNA secondary structure. Bioinform 25(15), 1974–1975 (2009).
    • 20. Fosmire JA, Hwang K, Makino S. Identification and characterization of a coronavirus packaging signal. J. Virol. 66(6), 3522–3530 (1992).
    • 21. Hsieh P-K, Chang SC, Huang CC et al. Assembly of severe acute respiratory syndrome coronavirus RNA packaging signal into virus-like particles Is nucleocapsid dependent. J. Virol. 79(22), 13848–13855 (2005).
    • 22. Qin L, Xiong B, Luo C et al. Identification of probable genomic packaging signal sequence from SARS_CoV genome by bioinformatics analysis. Acta Pharmacol. Sin. 24(6), 489–496 (2003).
    • 23. Andersen KG, Rambaut A, Lipkin WI, Holmes EC, Garry RF. The proximal origin of SARS-CoV-2. Nat. Med. 26(4), 450–452 (2020).
    • 24. Rangan R, Zheludev IN, Das R. RNA genome conservation and secondary structure in SARS-CoV-2 and SARS-related viruses: a first look. RNA. doi:10.1261/rna.076141.120 (2020) (Epub ahead of print).
    • 25. Chen SC, Olsthoorn RCL. Group-specific structural features of the 5′-proximal sequences of coronavirus genomic RNAs. Virology 401(1), 29–41 (2010).
    • 26. Raman S, Brian DA. Stem-loop IV in the 5′ untranslated region is a cis-acting element in bovine coronavirus defective interfering RNA replication. J. Virol. 79(19), 12434–12446 (2005).
    • 27. Liu P, Li L, Millership JJ, Kang H, Leibowitz JL, Giedroc DP. A U-turn motif-containing stem-loop in the coronavirus 5′ untranslated region plays a functional role in replication. RNA 13(5), 763–780 (2007).
    • 28. Alhatlani B, Vashist S, Goodfellow I. Functions of the 5′ and 3′ ends of calicivirus genomes. Virus Res. 206, 134–143 (2015).
    • 29. Goodfellow I, Chaudhry Y, Richardson A et al. Identification of a cis-acting replication element within the poliovirus coding region. J. Virol. 74(10), 4590–4600 (2000).
    • 30. Masters PS. Coronavirus genomic RNA packaging. Virology doi:10.1016/j.virol.2019.08.031 (2019) (Epub ahead of print).
    • 31. Hsin WC, Chang CH, Chang CY et al. Nucleocapsid protein-dependent assembly of the RNA packaging signal of Middle East respiratory syndrome coronavirus. J. Biomed. Sci. 25(1), 47 (2018).
    • 32. Chen SC, van den Born E, van den Worm SH, Pleij CW, Snijder EJ, Olsthoorn RC. New structure model for the packaging signal in the genome of group IIa coronaviruses. J. Virol. 81(12), 6771–6774 (2007).
    • 33. Escors D, Izeta A, Capiscol C, Enjuanes L. Transmissible gastroenteritis coronavirus packaging signal is located at the 5′ end of the virus genome. J. Virol. 77(14), 7890–7902 (2003).