Bridging the gap between in vitro and in vivo RNA folding

Kathleen A. Leamy; Sarah M. Assmann; David H. Mathews; Philip C. Bevilacqua

doi:10.1017/S003358351600007X

Bridging the gap between in vitro and in vivo RNA folding

Published online by Cambridge University Press: 24 June 2016

David H. Mathews and

Kathleen A. Leamy: Affiliation:
Department of Chemistry, Pennsylvania State University, University Park, PA 16802, USA Center for RNA Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA
Sarah M. Assmann: Affiliation:
Center for RNA Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA Department of Biology, Pennsylvania State University, University Park, PA 16802, USA Plant Biology Graduate Program, Pennsylvania State University, University Park, PA 16802, USA
David H. Mathews: Affiliation:
Department of Biochemistry and Biophysics, Department of Biostatistics and Computational Biology, Center for RNA Biology, University of Rochester Medical Center, Rochester, NY 14642, USA
Philip C. Bevilacqua*: Affiliation:
Department of Chemistry, Pennsylvania State University, University Park, PA 16802, USA Center for RNA Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA Plant Biology Graduate Program, Pennsylvania State University, University Park, PA 16802, USA Department of Biochemistry and Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA
*: *Author for correspondence: Philip C. Bevilacqua, Department of Chemistry, Pennsylvania State University, University Park, PA 16802, USA and Center for RNA Molecular Biology, Pennsylvania State University, University Park, PA 16802, USA. Tel.: 1-814-863-3812; Fax: 1-814-865-2927; Email: [email protected]

Article contents

Abstract
Introduction
Setting the stage
Bridging the gap between in vitro and in vivo RNA folding using in vivo-like studies
Future directions
References

Rights & Permissions

Abstract

Deciphering the folding pathways and predicting the structures of complex three-dimensional biomolecules is central to elucidating biological function. RNA is single-stranded, which gives it the freedom to fold into complex secondary and tertiary structures. These structures endow RNA with the ability to perform complex chemistries and functions ranging from enzymatic activity to gene regulation. Given that RNA is involved in many essential cellular processes, it is critical to understand how it folds and functions in vivo. Within the last few years, methods have been developed to probe RNA structures in vivo and genome-wide. These studies reveal that RNA often adopts very different structures in vivo and in vitro, and provide profound insights into RNA biology. Nonetheless, both in vitro and in vivo approaches have limitations: studies in the complex and uncontrolled cellular environment make it difficult to obtain insight into RNA folding pathways and thermodynamics, and studies in vitro often lack direct cellular relevance, leaving a gap in our knowledge of RNA folding in vivo. This gap is being bridged by biophysical and mechanistic studies of RNA structure and function under conditions that mimic the cellular environment. To date, most artificial cytoplasms have used various polymers as molecular crowding agents and a series of small molecules as cosolutes. Studies under such in vivo-like conditions are yielding fresh insights, such as cooperative folding of functional RNAs and increased activity of ribozymes. These observations are accounted for in part by molecular crowding effects and interactions with other molecules. In this review, we report milestones in RNA folding in vitro and in vivo and discuss ongoing experimental and computational efforts to bridge the gap between these two conditions in order to understand how RNA folds in the cell.

Type: Review
Information: Quarterly Reviews of Biophysics , Volume 49 , 2016 , e10

DOI: https://doi.org/10.1017/S003358351600007X [Opens in a new window]
Copyright: Copyright © Cambridge University Press 2016

1. Introduction

According to the classical view of biology, RNA has three roles, as a messenger (mRNA) that shuttles information between DNA and proteins, as an adaptor (tRNA) that translates the information stored in mRNA into protein sequence, and as a structural molecule (rRNA) that is part of the ribosome (Fig. 1). Research over the last 25 years has revealed that RNA carries out many other essential functions in the cell. RNA regulates gene expression at the transcriptional and translational levels, and this regulation often arises from the structures adopted by various RNA classes, including ribozymes, riboswitches, and RNA–protein complexes (Doudna & Cech, Reference Doudna and Cech2002; Serganov & Nudler, Reference Serganov and Nudler2013). Since RNA is single stranded it can fold back on itself forming a plethora of secondary and tertiary interactions, as well as complex folding motifs, binding pockets, and active site clefts (Fig. 1). Misfolding and mutations of RNA are characteristics of many cancers and diseases; for example, triplet repeat expansion diseases are associated with Huntington's disease, myotonic dystrophy, and Fragile X syndrome (Osborne & Thornton, Reference Osborne and Thornton2006). Single nucleotide polymorphisms (SNPs) that alter the structural ensemble of RNA sequences also have been associated with genetic diseases (Halvorsen et al. Reference Halvorsen, Martin, Broadaway and Laederach2010). Accordingly, understanding RNA structures and their dynamic regulation is an integral aspect of understanding RNA function.

Fig. 1. The Classical View (top) and the Modern View (bottom) of RNA's role in biology. In the classical view of biology, RNA (top) serves as a messenger molecule between DNA and proteins and proteins have all the main functions in cells. Messenger RNA serves to translate information from DNA to proteins. The modern view of biology (bottom) has emerged in the last 25 years as the field learns more about the many functions of RNA. Non-coding RNA (ncRNA) has vast regulatory functions, some of which include immune responses (‘ppp’ = 5′-triphosphate, which activates PKR) (Nallagatla et al. Reference Nallagatla, Hwang, Toroney, Zheng, Cameron and Bevilacqua2007), thermosensors, ribozymes, riboswitches, and genome editing. In the modern view of biology, proteins still have most cellular functions, but RNA plays essential roles in the cell beyond its classical functions.

The negatively charged phosphate backbone and diverse folds of RNA lead it to interact with cellular components, including metal ions, ligands, and proteins. Binding interactions with these species can change the fold of the RNA (Fig. 2). Monovalent and divalent metal ions are essential for the catalysis of small self-cleaving and large ribozymes both for folding and for active site catalysis (Serganov & Patel, Reference Serganov and Patel2007; Swisher et al. Reference Swisher, Su, Brenowitz, Anderson and Pyle2002). Small molecule-binding refolds riboswitches to regulate gene expression in a positive or negative mode (see Fig. 2) (Garst et al. Reference Garst, Edwards and Batey2011; Serganov & Patel, Reference Serganov and Patel2007).

Fig. 2. RNA interactions with RNA-binding proteins (RBP, left), metal ions (central), and ligands (star, right) can result in structure changes. Unlike typical in vitro conditions, there are other molecules and complex solution conditions in vivo that can interact with RNA and change its structure. These structure changes can result in an RNA with less structure (top left) more structure (top right), or an alternate conformation than the structure that is prevalent in vitro (bottom). Also shown (bottom) are the bacterial expression platforms of riboswitches that switch between two mutually exclusive structures that turn a gene ON (left) or OFF (right) by exposing or sequestering the Shine–Dalgarno sequence (blue).

Functional RNAs, such as tRNA, ribozymes, and riboswitches, are often found in ribonucleoprotein (RNP) complexes, which can help fold metastable RNA structures or induce a conformational change. The Protein Data Bank (PDB) has over 2000 annotated RNA-binding proteins, which include RNA chaperones, helicases, dsRNA-binding proteins, tRNA synthetases, ribonucleases (RNases) and RNA recognition motifs (RRMs) (Gerstberger et al. Reference Gerstberger, Hafner and Tuschl2014). Highly studied RNPs include the ribosome, non-plant RNase P, and the splicesome, which are responsible for the synthesis of proteins, maturation of the 5′-end of tRNAs, and splicing of pre-mRNAs, respectively. Remarkably it is the RNA component that is responsible for catalysis in these three RNPs, while the protein component provides scaffolding (Guerrier-Takada et al. Reference Guerrier-Takada, Gardiner, Marsh, Pace and Altman1983; Nissen et al. Reference Nissen, Hansen, Ban, Moore and Steitz2000).

Crowding plays critical but poorly understood roles in RNA folding. The cellular environment is very complex with up to 40% of the cytosol taken up by macromolecules (Minton, Reference Minton2001; Zimmerman & Trach, Reference Zimmerman and Trach1991). In addition, small molecule metabolites, polyamines and other species occupy volume and interact with RNAs. Macromolecular crowding can drive the compaction of RNA and proteins, while small molecules can either stabilize or destabilize RNAs through interactions with the RNA molecule (Minton, Reference Minton2001).

Nearly all of the biological components that influence RNA structure and function in vivo – biological ion compositions, ligands, proteins, and crowding – are missing during typical in vitro experiments (expanded upon in Table 1). A major goal of current research is to add back these components in order to more closely mimic in vivo conditions. We group studies of RNA folding into three approaches: (1) in vitro studies in dilute solutions; (2) in vivo studies in living cells; and (3) in vivo-like studies that mimic in vivo conditions. We also discuss how in silico methods facilitate each of these approaches. There are advantages and limitations to working in each of these conditions, and experiments in each can yield unique insights into the biological functions of RNAs. Structures and folding pathways of RNA have been studied mostly in dilute in vitro conditions, resulting in fundamental insights into RNA structure and function. However, there is a deep desire to understand how Nature works, and the in vivo environment is very different from typical in vitro solution conditions (Table 1 and Fig. 3). In particular, the majority of thermodynamic experiments studying the energetics of folding and associated pathways (Freier et al., Reference Freier, Kierzek, Jaeger, Sugimoto, Caruthers, Neilson and Turner1986b; Schroeder & Turner, Reference Schroeder and Turner2009) have been conducted in non-biological salt concentrations (London, Reference London1991; Lusk et al. Reference Lusk, Williams and Kennedy1968; Minton, Reference Minton2001; Romani, Reference Romani2007; Truong et al. Reference Truong, Sidote, Russell and Lambowitz2013). There are also myriad RNA–protein interactions in vivo, many of which profoundly affect RNA folding and function.

Fig. 3. Artist's rendition of in vitro conditions (left), in vivo conditions (right) and in vivo-like conditions (center). Typical in vitro solutions are dilute with high monovalent ion concentrations that are very different from cellular conditions. The cellular environment is complex with monovalent and divalent salts, macromolecules, cosolutes, and organelles. In vivo-like conditions (center) bridge in vitro and in vivo conditions and are more complex than in vitro conditions with added synthetic crowding agents and proteins and physiological ion concentrations. However, in vivo-like conditions are still much less complex than those prevailing in vivo.

Table 1. Comparison of in vitro and in vivo solution conditions

Conditions in vitro are the conditions historically used to study RNA. Typical values in the literature are listed in the table, although actual values differ across various studies. In vivo-like conditions, not provided in this table, typically emulate at least one of the conditions missing during in vitro experiments.

^a Typically Na⁺ is used in vitro although K⁺ is found. Freier et al. (Reference Freier, Kierzek, Caruthers, Neilson and Turner1986a), Xia et al. (Reference Xia, Santalucia, Burkard, Kierzek, Schroeder, Jiao, Cox and Turner1998).

^b Feig & Uhlenbeck (Reference Feig, Uhlenbeck, Gesteland, Cech and Atkins1999).

^c Typically Mg²⁺ is used. Herschlag & Cech (Reference Herschlag and Cech1990), Tanner & Cech (Reference Tanner and Cech1996).

^d Alberts et al. (Reference Alberts, Bray, Lewis, Roberts and Watson1994), London (Reference London1991), Romani (Reference Romani2007).

^e Lusk et al. (Reference Lusk, Williams and Kennedy1968), Truong et al. (Reference Truong, Sidote, Russell and Lambowitz2013).

Over the last few years, in vivo experiments probing RNA structure in living cells have revealed significant differences in many RNA structures as compared to in vitro (Kwok et al. Reference Kwok, Ding, Tang, Assmann and Bevilacqua2013; Rouskin et al. Reference Rouskin, Zubradt, Washietl, Kellis and Weissman2014; Tyrrell et al. Reference Tyrrell, Mcginnis, Weeks and Pielak2013). In vivo studies, while desirable because of their biological relevance, are at the same time limited in that they typically elucidate only the ensemble structure of each RNA transcript, do not deconvolute RNA–protein interactions versus RNA self-structure, and cannot easily perturb or control solution conditions. In particular, biophysical studies that can be readily conducted under highly controlled in vitro conditions are often simply not feasible in vivo. In an effort to gain more insight into the structure and function of RNA in the cellular environment, recent studies have focused on the folding pathway, structure, and function of RNAs under in vivo-like conditions, which mimic conditions in the cell (Desai et al. Reference Desai, Kilburn, Lee and Woodson2014; Dupuis et al. Reference Dupuis, Holmstrom and Nesbitt2014; Kilburn et al. Reference Kilburn, Roh, Behrouzi, Briber and Woodson2013; Nakano et al. Reference Nakano, Miyoshi and Sugimoto2014; Strulson et al. Reference Strulson, Molden, Keating and Bevilacqua2012, Reference Strulson, Yennawar, Rambo and Bevilacqua2013).

In silico prediction and modeling of RNA structure is an important tool used in all three of the above approaches to provide additional insight into RNA structure and function (Dawson & Bujnicki, Reference Dawson and Bujnicki2016; Seetin & Mathews, Reference Seetin and Mathews2012a). Prediction of canonical base pairs, for example, provides testable hypotheses for RNA structure and also provides frameworks for interpreting experimental results. Likewise, experimental data aid in improving in silico structure prediction.

In this review, we discuss major achievements in describing and understanding RNA folding and structure through in vitro, in vivo, and in silico efforts. The next section introduces the reader to in vitro studies of RNA folding, which set the stage for in vivo and in vivo-like studies of RNA folding. We focus on recent efforts to understand how RNA folds in the cell by bridging the gap between knowledge of RNA structure and folding in vitro and in vivo, which has led to an emerging field that studies RNA under in vivo-like conditions. We also discuss ways in which the accuracy of in silico modeling could be improved with experimentally derived in vivo structure probing data. We conclude by discussing advances needed under cellular-like conditions to better understand how RNA folds in the cell.

2. Setting the stage

2.1 In vitro studies of RNA folding

Most of what we currently know about RNA structure and folding comes from studies completed in vitro, under experimental conditions that favor a folded state. Such studies are typically conducted in dilute solutions with high concentrations (~1 M) of monovalent ions (Freier et al. Reference Freier, Kierzek, Jaeger, Sugimoto, Caruthers, Neilson and Turner1986b) and/or (~10 mM) divalent ions (Herschlag & Cech, Reference Herschlag and Cech1990), especially Mg²⁺, or under conditions that facilitate population of a desired folding intermediate, for example, by renaturing the RNA at an unusual temperature or salt concentration (Baird et al. Reference Baird, Westhof, Qin, Pan and Sosnick2005). These solution conditions are advantageous for studying folding because they can be chosen such that the RNA folds in an apparent two-state manner or the RNA populates just a single intermediate, but have the drawback that they differ profoundly from in vivo conditions, which have predominantly ~140 mM K⁺ and 0·5–3 mM Mg²⁺ (expanded upon in Table 1).

An advantage of using high concentrations of monovalent salts is that they compete with trace polyvalent metal ions and hydroxide ions for the phosphate backbone thereby reducing RNA degradation. In addition, high monovalent salt conditions minimize end fraying of RNA hairpins, favoring two-state folding (Freier et al. Reference Freier, Kierzek, Jaeger, Sugimoto, Caruthers, Neilson and Turner1986b). As we describe below, the thermodynamics and kinetics of systems, ranging from simple RNAs, such as hairpins and bulges, to complex RNAs and RNPs, such as ribozymes and the ribosome, have been well characterized. Many aspects of the RNA-folding process can be understood by the application of techniques and the systematic manipulation of conditions only possible under in vitro or under in vivo-like environments.

2.1.1 Major advances: elucidating RNA folding pathways in vitro

With the invention of various enzymological methods, such as PCR, cloning, T7 transcription and chemical synthesis, RNA preparation has advanced to the point where RNA of almost any sequence and length can be studied (Hoseini & Sauer, Reference Hoseini and Sauer2015; Li et al. Reference Li, Wen, Shen, Lu, Huang and Chang2011; Milligan et al. Reference Milligan, Groebe, Witherell and Uhlenbeck1987; Mullis, Reference Mullis1990). A wide variety of techniques have been applied to the study of RNA in vitro (Table 2). The earliest studies on RNA were conducted on homoribopolymers, such as polyU and polyA, which revealed that stacking – the non-bonded interactions between the surfaces of the bases – contributes to RNA stability (Richards et al. Reference Richards, Flessel and Fresco1963; Suurkuusk et al. Reference Suurkuusk, Alvarez, Freire and Biltonen1977). These studies also provided the first indications that individual RNAs adopt structure. An early breakthrough was from studies of tRNA, which could be isolated from living systems owing to its high cellular abundance, which led to insights into RNA tertiary structure. The cloverleaf base pairing of tRNA had been first predicted from sequence alignments of sequence variants (Levitt, Reference Levitt1969). Solving the crystal structure of tRNA confirmed its cloverleaf secondary structure and revealed novel tertiary interactions (Kim et al. Reference Kim, Quigley, Suddath, Mcpherson, Sneden, Kim, Weinzierl and Rich1973; Robertus et al. Reference Robertus, Ladner, Finch, Rhodes, Brown, Clark and Klug1974). The crystal structure of tRNA provided the first direct evidence that RNAs can form complex structures, akin to those of proteins, and that stacking, base pairing, and tertiary contacts all contribute to the adoption of complex three-dimensional (3D) structures (Sussman et al. Reference Sussman, Holbrook, Warrant, Church and Kim1978). With the advent of chemical synthesis techniques, ~100–200 mer of DNA and eventually ~50 mer RNA of any sequence could be made (Matteucci & Caruthers, Reference Matteucci and Caruthers1981; Scaringe et al. Reference Scaringe, Wincott and Caruthers1998; Sierzchala et al. Reference Sierzchala, Dellinger, Betley, Wyrzykiewicz, Yamada and Caruthers2003), with a plethora of atomic modifications. Semi-synthetic approaches were then developed that combine enzymological and chemical synthesis to facilitate the introduction of mutations both at the nucleotide and functional group levels in RNAs of any size (Moore & Sharp, Reference Moore and Sharp1992).

Table 2. Common experimental techniques used to study RNA structure and folding

Thermodynamic and kinetic studies under in vitro conditions provide insight into the complex folding pathways of many functional RNAs. Ribozymes and riboswitches are ideal for the study of RNA folding because their function serves as a readout for the occupancy of the native state (Banerjee et al. Reference Banerjee, Jaeger and Turner1993; Crothers et al. Reference Crothers, Cole, Hilbers and Shulman1974; Mitchell & Russell, Reference Mitchell and Russell2014; Mitchell et al. Reference Mitchell, Jarmoskaite, Seval, Seifert and Russell2013; Rook et al. Reference Rook, Treiber and Williamson1998). Major themes are that large RNAs fold on a rugged pathway through populated intermediates, largely in a hierarchical manner, where secondary structures form before tertiary contacts, as demanded by the topologies of these complex RNAs (Fig. 4) (Brion & Westhof, Reference Brion and Westhof1997; Mitchell & Russell, Reference Mitchell and Russell2014; Solomatin et al. Reference Solomatin, Greenfeld, Chu and Herschlag2010; Tinoco & Bustamante, Reference Tinoco and Bustamante1999; Wan et al. Reference Wan, Suh, Russell and Herschlag2010). It is informative to consider these principles on several specific RNAs. Using temperature-dependent nuclear magnetic resonance (NMR) and relaxation kinetics, the mechanism of tRNA unfolding was elucidated (Crothers et al. Reference Crothers, Cole, Hilbers and Shulman1974; Hilbers et al. Reference Hilbers, Robillard, Shulman, Blake, Webb, Fresco and Riesner1976; Stein & Crothers, Reference Stein and Crothers1976). Five distinct transitions were mapped to the four arms and the tertiary contacts (Crothers et al. Reference Crothers, Cole, Hilbers and Shulman1974). Secondary structures form on a fast timescale (μs to ms) followed by folding of the tertiary structure on a slower timescale (ms to s). In the presence of monovalent metal ions, multiple thermal unfolding transitions are observed for these processes (Stein & Crothers, Reference Stein and Crothers1976). These transitions merge into one as Mg²⁺ concentrations are increased, revealing that Mg²⁺ induces an apparent two-state folding. Larger functional RNAs, ribozymes, and riboswitches also fold in a hierarchical manner in vitro (Fig. 4a ).

Fig. 4. Depiction of the hierarchical RNA folding pathway and folding funnels for non-cooperative and cooperative folding. (a) RNA folds in a hierarchical manner in which secondary structures form followed by tertiary structure. Hierarchical folding can be (b) rugged and non-cooperative in which the pathway intermediates are populated and the RNA can form misfolds (M_i) before populating the native state (N), or folding can occur in a (c) cooperative manner in which the intermediates do not populate and the RNA folds in a single transition.

The Azoarcus group I ribozyme was used to determine the influence of tertiary interactions on RNA folding (Fig. 5). This ribozyme has been shown to fold quickly, with ~80% of the ribozyme folded into the native state in under 50 ms in 15 mM Mg²⁺ (Rangan et al. Reference Rangan, Masuida, Westhof and Woodson2003). To determine the roles of tertiary interactions in ribozyme folding, the tertiary contact between the P9 GAAA tetraloop and its J5/5a receptor were perturbed (Chauhan & Woodson, Reference Chauhan and Woodson2008). While the WT ribozyme folded in a cooperative manner to the native state, the tetraloop mutant occupied many previously hidden intermediates on the folding pathway, even at 50 mM Mg²⁺. This study indicated that tertiary contacts promote cooperative RNA folding.

More recently, methods have been developed to study RNA folding on the nucleotide level and at the millisecond timescale (Merino et al. Reference Merino, Wilkinson, Coughlan and Weeks2005; Scalvi et al. Reference Scalvi, Woodson, Sullivan, Chance and Brenowitz1997; Zhuang et al. Reference Zhuang, Bartley, Babcock, Russell, Ha, Herschlag and Chu2000). Experiments using hydroxyl radical mapping yielded insight into the pathway of tertiary structure formation and folding kinetics in the Tetrahymena Group I Intron (Sclavi et al. Reference Sclavi, Sullivan, Change, Brenowitz and Woodson1998). Combined with time resolved small-angle X-ray scattering (SAXS) (Roh et al. Reference Roh, Guo, Kilburn, Briber, Irving and Woodson2010), hydroxyl radical footprinting on the Tetrahymena ribozyme folding pathway uncovered an initial collapse of structure on the millisecond timescale during the dead time of the instrument. During the subsequent time course, tertiary contacts and several intermediates were elucidated (Sclavi et al. Reference Sclavi, Sullivan, Change, Brenowitz and Woodson1998).

The folding pathways of large functional RNAs have proven to be quite complex with intermediates that can be trapped for minutes to hours (Banerjee & Turner, Reference Banerjee and Turner1995; Chadalavada et al. Reference Chadalavada, Senchak and Bevilacqua2002; Zarrinkar et al. Reference Zarrinkar, Wang and Williamson1996). For example, 90% of the Tetrahymena ribozyme is found in a misfolded state that transitions to the native state with hour timescale kinetics (Banerjee & Turner, Reference Banerjee and Turner1995), and the hepatitis delta virus (HDV) ribozyme folds through numerous intermediates, some long-lived (Chadalavada et al. Reference Chadalavada, Senchak and Bevilacqua2002). Long-lived misfolded intermediates are often very similar in structure to the native RNA and typically arise from a secondary structure mispairing or an incorrect 3D topology (Mitchell et al. Reference Mitchell, Jarmoskaite, Seval, Seifert and Russell2013; Treiber et al. Reference Treiber, Rook, Zarrinkar and Williamson1998; Wan et al. Reference Wan, Suh, Russell and Herschlag2010). For instance, a long-lived intermediate occurs in the Tetrahymena ribozyme where P3 is docked correctly but the topology of the ribozyme is incorrect (Mitchell & Russell, Reference Mitchell and Russell2014; Mitchell et al. Reference Mitchell, Jarmoskaite, Seval, Seifert and Russell2013). To fold into the native state, this misfold needs to undergo a global unwinding of structure. Importantly, the extent to which these pathways and intermediates are populated in vivo is unknown. Indeed, some of these folding intermediates are affected by the method by which the RNA is purified. For example, the wild-type HDV ribozyme has the optimal rate of catalysis when the ribozyme is folded co-transcriptionally, as opposed to being renatured prior to assay (Chadalavada et al. Reference Chadalavada, Cerrone-Szakal and Bevilacqua2007). In addition, choice of flanking sequences can profoundly affect the activity of small and large ribozymes (Cao & Woodson, Reference Cao and Woodson1998; Chadalavada et al. Reference Chadalavada, Knudsen, Nakano and Bevilacqua2000).

2.1.2 Major advances: applying biophysical techniques to study RNA folding in vitro

Using optical melting, a set of thermodynamic parameters have been established to estimate folding free energies from sequence and structure alone (Andronescu et al. Reference Andronescu, Condon, Turner and Mathews2014; Lu et al. Reference Lu, Turner and Mathews2006; Turner & Mathews, Reference Turner and Mathews2010; Xia et al. Reference Xia, Santalucia, Burkard, Kierzek, Schroeder, Jiao, Cox and Turner1998). The nearest-neighbor model predicts the free energy and stability of an RNA from each base pair's nearest neighbor, along with initiation, symmetry, and terminal-AU base pair terms. Nearest-neighbor terms for certain loops, those regions without canonical base pairs, have also been determined (Mathews et al. Reference Mathews, Disney, Childs, Schroeder, Zuker and Turner2004). As noted below, these experimental parameters have been incorporated in RNA structure prediction programs that find the lowest free energy structures for an input RNA sequence (Mathews, Reference Mathews2006; Reeder et al. Reference Reeder, Hochsmann, Rehmsmeier, Voss and Giegerich2006; Seetin & Mathews, Reference Seetin and Mathews2012a). Parameters to account for complicated tertiary interactions and loops are still being revised (Liu et al. Reference Liu, Shankar and Turner2010b, Reference Liu, Diamond, Mathews and Turner2011; Lu et al. Reference Lu, Turner and Mathews2006). The nearest-neighbor parameters currently available were measured under highly folding in vitro conditions of 1 M NaCl.

Fig. 5. Different RNA structures can be populated under in vitro, in vivo, and in vivo-like conditions. RNA structures induced by the cellular environment, including proteins and crowding, are shown in the two outermost structures. The conditions in vitro favor the population of a structure that may not always be the functional RNA structure (center two structures). Depending on the in vivo-like conditions chosen, specific RNA structures will be populated.

Low-resolution methods provide information about the structure of RNA on both the global and nucleotide length scales. Although these techniques do not give atomic resolution, they have significantly faster throughput than crystallography or NMR structures while still providing insight into the fold and function of RNA. SAXS and Förster Resonance Energy Transfer (FRET) provide low-resolution information on the overall fold of an RNA. RNA is particularly amenable to SAXS because the phosphate backbone is electron-rich and scatters X-rays well. Different solution conditions can be prepared and examined quickly by SAXS to elucidate RNA structural changes. The structures of several functional RNAs and RNA–protein complexes have been explored using SAXS, including ribozymes, riboswitches bound and unbound to ligand, and the spliceosome (Pollack, Reference Pollack2011). FRET studies, in which acceptor and donor fluorophores are attached to the RNA at key locations, have helped elucidate folding intermediates (Walter, Reference Walter2001). Using single-molecule FRET, or smFRET, the Tetrahymena ribozyme was found to fold into multiple conformations, nearly all of which were active, indicating that the ribozyme populates multiple native states (Solomatin et al. Reference Solomatin, Greenfeld, Chu and Herschlag2010). Upon exposure to denaturant, the ribozyme re-populated the native conformations, indicating the results are independent of original conformation. Both SAXS and smFRET have been applied to RNA folding under in vivo-like conditions, as discussed below (Paudel & Rueda, Reference Paudel and Rueda2014; Strulson et al. Reference Strulson, Yennawar, Rambo and Bevilacqua2013).

Structure probing methods serve essential roles in elucidating the structures of functional RNAs at the nucleotide level. Several chemical probes have been employed to attack and modify the RNA bases, sugar, and backbone, in order to reveal the base pairing status of the nucleotides. Commonly used chemical probes include dimethyl sulfate (DMS), carbodiimide tosylate (CMCT), and SHAPE reagents, which allow selective 2′-hydroxyl acylation – each of which is analyzed by primer extension via reverse transcription. Commonly used enzymatic probes are RNases T1, V1, and S1. Targets of these probes and methods of readout are provided in Fig. 6. Structure probing of RNAs in vitro has revealed very complex structures, as well as binding sites of ligands, metal ions, and proteins. As discussed below, structure probing with chemical probes can be used in vivo as well.

Fig. 6. RNA modifications by DMS, SHAPE reagents (selective 2′-hydroxyl acylation analyzed by primer extension), and CMCT (1-cyclohexyl-(2-morpholinoehyl)carbodiimide metho-p-toluene). SHAPE reagents modify the 2′-hydroxyl on the sugar of all four ribonucleobases. SHAPE reagents include 1M7 (1-methyl-7-nitroisatoic anhydride), NMIA (N-methylisotoic anhydride), and NAI (2-methylnicotinic acid imidazolide). DMS modifies the N1 of A and the N3 of C as well as the N7 of G. CMCT modifies N3 of U and N1 of G. The chemical modifications (except N7 of G) can be detected immediately by RT followed by gel electrophoresis or high-throughput sequencing, and the enzymatic cleavages, which cleave single- and double-stranded RNA, can be read out through gel electrophoresis or high-throughput sequencing.

Very recently, RNA structure in vitro has been probed genome-wide at the nucleotide level, utilizing the power of next-generation sequencing. Several methods have been developed to map entire transcriptomes. Parallel Analysis of RNA Structure (PARS) cleaves double-stranded regions with RNase V1 and single-stranded regions with RNase S1, and FragSeq cleaves single-stranded regions with nuclease P1 (Kertesz et al. Reference Kertesz, Wan, Mazor, Rinn, Nutter, Chang and Segal2010; Underwood et al. Reference Underwood, Uzilov, Katzman, Onodera, Mainzer, Mathews, Lowe, Salama and Haussler2010). In PARS, RNA is extracted from cells and aliquots are separately exposed to each nuclease, the digested RNA is converted to cDNA through reverse transcription, and then deep sequenced to map the reverse transcriptase stops to the genome. A PARS score is determined from the log ratio of V1/S1 sequencing reads, where a high PARS score indicates more RNA structure (Kertesz et al. Reference Kertesz, Wan, Mazor, Rinn, Nutter, Chang and Segal2010). In FragSeq, RNA is extracted from cells, and one aliquot is treated with P1 nuclease and a second aliquot is untreated (Underwood et al. Reference Underwood, Uzilov, Katzman, Onodera, Mainzer, Mathews, Lowe, Salama and Haussler2010). RNA-seq is then performed on each aliquot, and a cutting score is determined for each mapped nucleotide that indicates the propensity to be cut by P1 nuclease. The cutting score is then used to annotate RNA secondary structures and/or to restrain RNA secondary structure prediction. Genome-wide studies in several organisms, both in vitro and in vivo, have found that there is significantly more structure in the coding regions than the untranslated regions of RNAs (Ding et al. Reference Ding, Tang, Kwok, Zhang, Bevilacqua and Assmann2014; Kertesz et al. Reference Kertesz, Wan, Mazor, Rinn, Nutter, Chang and Segal2010; Li et al. Reference Li, Zheng, Vandivier, Willmann, Chen and Gregory2012; Wan et al. Reference Wan, Qu, Zhang, Flynn, Manor, Ouyang, Zhang, Spitale, Snyder, Segal and Chang2014; Zheng et al. Reference Zheng, Ryvkin, Li, Dragomir, Valladares, Yang, Cao, Wang and Gregory2010). There is also less structure in the start and stop codons than in the rest of a transcript, which presumably facilitates read-through by the ribosome.

Using a method similar to PARS but differing in that the RNA structure is probed at several temperatures, PARTE (Parallel Analysis of RNA Structures with Temperature Elevation) was used to obtain the folding free energies for yeast transcripts genome-wide in vitro (Wan et al. Reference Wan, Qu, Ouyang, Kertesz, Li, Tibshirani, Makino, Nutter, Segal and Chang2012). RNA from yeast was folded between 30 and 75 °C and exposed to RNase V1 followed by deep sequencing. By examining the melting temperatures (T _m) of RNAs, non-coding and coding RNAs could be distinguished and RNAs with distinct cellular functions could be identified. Functional non-coding RNAs (ncRNAs) were found to have a higher T _m on average than mRNAs.

Three methods that utilize DMS chemistry to determine transcriptome-wide RNA structure were recently published: Structure-seq, DMS-sequencing (DMS-seq), and modification sequencing (Mod-seq) (Ding et al. Reference Ding, Tang, Kwok, Zhang, Bevilacqua and Assmann2014; Rouskin et al. Reference Rouskin, Zubradt, Washietl, Kellis and Weissman2014; Talkish et al. Reference Talkish, May, Lin, Woolford and Mcmanus2014). To date, only DMS-seq has been applied in vitro and all of the methods have been applied in vivo. These methods are described in more detail below.

2.1.3 Benefits and limitations of in vitro studies

Many of the foundational experiments on RNA folding and structure have come from in vitro experiments, and numerous underlying mechanisms of RNA folding and function have been discovered in vitro. Studies in vitro have revealed the folding pathways and structures of RNAs. More recently, methods have been developed to probe the structure of RNAs genome-wide. Major advances include elucidating fast formation of secondary structure and slow formation of the tertiary contacts, understanding of RNA folding energetics, establishment of nearest-neighbor parameters, and determination of structures of functional RNA motifs. The complex structures that RNA adopts enable diverse functions. Experimental techniques, ranging from structure probing to kinetic methods, have been applied to RNA across diverse pH, salt, and temperature conditions.

The major limitation of in vitro experiments is that the solution conditions are very different from the cellular environment and unavoidably lack many of the components present in cells, which can influence RNA folding and function. These limitations necessitate the development of experiments and techniques under in vivo and in vivo-like conditions to determine how RNAs fold and respond to cellular environmental conditions.

2.2 In vivo studies of RNA folding

In the previous section, we provided an overview of RNA folding in vitro. In this section we discuss recent advances made in vivo to understand RNA folding. We note that RNA structure has also been explored to a lesser extent in cellular extracts. Experiments in extracts contain more proteins bound to RNA than in vitro experiments but less than in vivo studies, as supported by recent comparisons of low DMS reactivity assignments amongst in vitro, extract, and in vivo studies (Ding et al. Reference Ding, Kwok, Tang, Bevilacqua and Assmann2015). Studies in extracts for RNAs with high positive predictive value (PPV) between reactivities in vitro and in silico, such as the ribosome, have been shown to be biologically relevant (Ding et al. Reference Ding, Kwok, Tang, Bevilacqua and Assmann2015; Moazed et al. Reference Moazed, Stern and Noller1986a). Likewise, for RNAs with low PPV between reactivities in vitro and in silico, studies in extracts might not provide the full complement of interactions. While experiments in cell extracts share many similarities with in vivo conditions, thermodynamic assays cannot be easily performed in extracts due to the denaturation and signal of other biomolecules.

An ultimate goal of RNA-folding studies is to understand how RNA behaves in the cell. The majority of the methods developed to study RNA in vivo are structure probing, where several chemicals known to penetrate the cell membrane are applied to modify RNA. Structure probing has been used to study the structures of RNAs in vivo on both the single gene and genome-wide levels, and has resulted in a breadth of information regarding structures that RNA forms inside living cells. These studies have revealed novel in vivo RNA folds, RNA–protein interactions, and novel regulatory roles.

2.2.1 Major advances: transcript-specific RNA structure mapping in vivo

Structure probing of RNA in vivo uses small chemicals such as SHAPE reagents, DMS, and CMCT, which penetrate cells and modify solvent-accessible regions of the RNA (Bloomfield et al. Reference Bloomfield, Crothers and Tinoco2000; Ehresmann et al. Reference Ehresmann, Baudin, Mougel, Romby, Ebel and Ehresmann1987). Structure probing methods using chemicals have revealed that for some transcripts there are significant differences between RNA structures formed in vivo and in vitro. We first describe in vivo structure probing experiments on single transcripts, followed by experiments across a genome.

The first in vivo nucleic acid structure probing study was from the Gilbert laboratory, where binding of multiple proteins to their cognate sites was observed using DMS modification (Nick & Gilbert, Reference Nick and Gilbert1985). Structure probing is outlined in Fig. 6. Briefly, DMS methylates adenine and cytosine on the Watson–Crick face and guanine on the Hoogsteen face. The modification on A and C is read out directly by stops in reverse transcription (RT) one position before the methylated base, while the methylated G is treated with aniline to create an abasic site followed by RT read out, which again stops one position before the modified base (Bloomfield et al. Reference Bloomfield, Crothers and Tinoco2000; Ehresmann et al. Reference Ehresmann, Baudin, Mougel, Romby, Ebel and Ehresmann1987). The RT can be read out in a gene-specific fashion by polyacrylamide gel electrophoresis (PAGE) or capillary electrophoresis (CE), and in a library fashion with next-generation sequencing (see the next section) (Kwok et al. Reference Kwok, Ding, Tang, Assmann and Bevilacqua2013).

The first report of RNA structure comparisons in vivo and in vitro came from the Cech laboratory (Zaug & Cech, Reference Zaug and Cech1995). Structure probing with DMS was used to map the structures of two known protein-bound RNAs, telomerase RNA and U2 snRNA, as well as the Tetrahymena ribozyme. Protections from reactivity in vivo compared with in vitro indicate either protein protection or gain of base pairing, while enhancements of reactivity indicate refolding to expose RNA bases. Telomerase RNA and U2 snRNA showed different reactivity patterns in vivo versus in vitro, consistent with the influence of protein binding on DMS reactivity. As expected, the group I ribozyme had very similar nucleotide reactivity in vivo and in vitro, demonstrating that the ribozyme is not protein-bound and self-splices without protein assistance in vivo.

Our group investigated structures of high and low abundance RNAs, also on a gene-specific basis, and compared DMS and SHAPE reactivities in vivo and in vitro. For low abundance RNAs we developed a gene-specific ligation-mediated PCR (LM-PCR) approach (Kwok et al. Reference Kwok, Ding, Tang, Assmann and Bevilacqua2013). These studies, which were in the model plant species Arabidopsis thaliana, revealed in vivo footprinting on high abundance 25S rRNA and 5·8S rRNA, as well as on the low abundance U12 snRNA. We showed that different bases in 5·8S rRNA are methylated in vivo and in vitro, which provided evidence for 5·8S rRNA refolding in vivo. These studies also provided critical control reactions that strongly supported DMS modification of RNA occurring in vivo and DMS being completely quenched prior to workup of the in vivo reaction. These controls apply equally to the genome-wide studies in the next section.

2.2.2 Major advances: genome-wide RNA structure mapping in vivo

Recently, several groups including ours have developed high-throughput methods to probe RNA structure in living cells transcriptome-wide. These studies revealed significant differences in RNA structure in vivo compared to in vitro and in silico predicted (Ding et al. Reference Ding, Tang, Kwok, Zhang, Bevilacqua and Assmann2014; Kwok et al. Reference Kwok, Tang, Assmann and Bevilacqua2015; Rouskin et al. Reference Rouskin, Zubradt, Washietl, Kellis and Weissman2014; Talkish et al. Reference Talkish, May, Lin, Woolford and Mcmanus2014). Three separate methods using DMS to probe RNA structure in vivo were published in 2014: Structure-seq (Ding et al. Reference Ding, Tang, Kwok, Zhang, Bevilacqua and Assmann2014), DMS-seq (Rouskin et al. Reference Rouskin, Zubradt, Washietl, Kellis and Weissman2014), and Mod-seq (Talkish et al. Reference Talkish, May, Lin, Woolford and Mcmanus2014), each of which utilizes the next-generation sequencing to probe RNA structure transcriptome-wide.

Each of these studies revealed novel information on RNA structure and possible regulatory functions of those structures. In Structure-seq, the PPV describes the fraction of base pairs in the in vivo DMS-restrained predicted structure that are also predicted in the unrestrained in silico predicted structure (Ding et al. Reference Ding, Tang, Kwok, Zhang, Bevilacqua and Assmann2014). Of the greater than 10 000 mRNAs evaluated in this fashion, most had a PPV value far from unity, with a maximum PPV of the distribution slightly <0·4. This observation indicates that the in vivo structures of many RNAs cannot be predicted well purely in silico, using only sequence information and thermodynamic parameters originally derived in vitro. We also observed that the mRNAs with the lowest PPV distribution (bottom 5%) were enriched in annotations of biological function of stress and stimulus response, while the mRNAs with the highest PPV distribution (top 5%) were enriched in housekeeping functions (Ding et al. Reference Ding, Tang, Kwok, Zhang, Bevilacqua and Assmann2014).

One possibility is that housekeeping RNAs have well-defined folds, while stress-related RNAs have ill-defined folds or adopt many folds. DMS-seq in yeast found that certain mRNAs are less structured in vivo than naked, protein-free RNA in vitro, and under in vivo ATP depletion the mRNAs on a whole become more structured, with the implication that ATP-dependent processes contribute to RNA unfolding. It is likely that a range of factors in vivo contribute to RNA structure (Rouskin et al. Reference Rouskin, Zubradt, Washietl, Kellis and Weissman2014). Mod-seq was used to reveal the binding location of the L26 protein by deletion in yeast; upon L26 deletion, 58 nucleotides became more reactive to DMS in vivo and most of these nucleotides were located in the 5·8S–25S rRNA interface where L26 is known to bind (Talkish et al. Reference Talkish, May, Lin, Woolford and Mcmanus2014).

Individual copies of a given RNA sequence can adopt different conformations owing to the single-stranded nature of RNA. Indeed, this may be the origin of the low PPV value in the stress-related genes (Ding et al. Reference Ding, Tang, Kwok, Zhang, Bevilacqua and Assmann2014) in that structure probing methods reveal the average of all populated structures at some instant in time. There is experimental evidence that some transcripts appreciably populate multiple structures in vitro. Using the PARS method, ~4% of mRNAs had both high RNase V1 and RNase S1 activity, which cleave paired and unpaired RNA, respectively, under in vitro conditions (Wan et al. Reference Wan, Qu, Zhang, Flynn, Manor, Ouyang, Zhang, Spitale, Snyder, Segal and Chang2014). The high extents of cleavage by both nucleases suggest that populations of those mRNAs adopt multiple conformations simultaneously in vitro, and potentially in vivo.

Genome-wide studies revealed a triplet periodicity in mRNA nucleotide reactivity in yeast, mouse, and humans in vitro (Incarnato et al. Reference Incarnato, Neri, Anselmi and Oliviero2014; Wan et al. Reference Wan, Qu, Zhang, Flynn, Manor, Ouyang, Zhang, Spitale, Snyder, Segal and Chang2014), as well as in Arabidopsis in vivo (Ding et al. Reference Ding, Tang, Kwok, Zhang, Bevilacqua and Assmann2014; Kertesz et al. Reference Kertesz, Wan, Mazor, Rinn, Nutter, Chang and Segal2010). The triplet repeat in reactivity is observed in the coding sequence but not in the untranslated regions. At present the mechanism behind the periodicity is not understood. Observation of the repeat in vitro suggests that occupancy of ribosomes is not necessary. Additional studies under in vitro, in vivo, and in vivo-like conditions will be necessary to attain a molecular-level understanding of the triplet periodicity in mRNA.

High-throughput sequencing has been coupled with CLIP (crosslinking and immunoprecipitation) to probe RNA-binding protein sites transcriptome wide in HITS-CLIP (high-throughput sequencing of RNA isolated by crosslinking immunoprecipitation) and PAR-CLIP (photoactivatable-ribonucleoside-enhanced crosslinking and immunoprecipitation) (Hafner et al. Reference Hafner, Landthaler, Burger, Khorshid, Hausser, Berninger, Rothballer, Ascano, Jungkamp, Munschauer, Ulrich, Wardle, Dewell, Zavolan and Tuschl2010; Licatalosi et al. Reference Licatalosi, Mele, Fak, Ule, Kayikci, Chi, Clark, Schweitzer, Blume, Wang, Darnell and Darnell2008; Weyn-Vanhentenryck et al. Reference Weyn-Vanhentenryck, Mele, Yan, Sun, Farny, Zhang, Xue, Herre, Silver, Zhang, Krainer, Darnell and Zhang2014). Studies using both of these methods on specific proteins have revealed novel sites of protein binding to RNA as well as possible protein regulatory functions (Hafner et al. Reference Hafner, Landthaler, Burger, Khorshid, Hausser, Berninger, Rothballer, Ascano, Jungkamp, Munschauer, Ulrich, Wardle, Dewell, Zavolan and Tuschl2010; Licatalosi et al. Reference Licatalosi, Mele, Fak, Ule, Kayikci, Chi, Clark, Schweitzer, Blume, Wang, Darnell and Darnell2008). Briefly, in HITS-CLIP, RNA is crosslinked to proteins, the protein of interest is isolated through IP, the RNA is reverse transcribed and amplified through PCR, then high-throughput sequencing is performed and reads are mapped to the genome (Licatalosi et al. Reference Licatalosi, Mele, Fak, Ule, Kayikci, Chi, Clark, Schweitzer, Blume, Wang, Darnell and Darnell2008). In PAR-CLIP, cells are grown with a photoactivatable nucleoside (4-thiouridine or 5-bromouridine) in the media to facilitate crosslinking with proteins upon exposure to 365 nm radiation (Hafner et al. Reference Hafner, Landthaler, Burger, Khorshid, Hausser, Berninger, Rothballer, Ascano, Jungkamp, Munschauer, Ulrich, Wardle, Dewell, Zavolan and Tuschl2010).

Genome-wide structure data have recently been used to identify certain sites of RNA–protein interactions. The method icSHAPE was used to probe RNA structure in mouse embryonic stem cells in vivo and in vitro (Spitale et al. Reference Spitale, Flynn, Zhang, Crisalli, Lee, Jung, Kuchelmeister, Batista, Torre, Kool and Chang2015). The difference in nucleotide reactivity in vitro and in vivo matched binding sites of the protein Rbfox2, previously identified with iCLIP experiments. This methodology was tested again and successfully identified RNA-binding sites of another RNA-binding protein, HuR. Using this type of analysis, certain RNA–protein interactions and associated RNA structural rearrangements can be distinguished using bioinformatics with experimental genome-wide mapping data.

2.2.3 Major advances: quantification of cellular factors in vivo

In vivo quantification of all the cellular factors known to affect RNA folding would both allow more accurate interpretation of in vivo RNA structure datasets and allow design of in vivo-like experiments that would more faithfully mimic in vivo conditions. Although such a comprehensive view of the inner workings of living cells has yet to be achieved, tremendous strides have been made in technique development for in vivo monitoring of cellular parameters relevant to RNA structure, including divalent ion concentrations, pH, reactive oxygen species (ROS), certain cosolutes, and RNA molecules themselves. Almost all of these techniques in vivo rely on a fluorescent readout, and thus advances in probe technology have gone hand-in-hand with advances in microscopy, although only the former topic is discussed here.

Fluorescent reporters are of three types: synthetic dyes, genetically encoded reporters, and reporters that incorporate both synthetic dyes and genetically encoded elements. Genetically encoded reporters typically rely on the cellular factor interacting with and altering the readout from a naturally fluorescent protein from jellyfish, green fluorescent protein (GFP), or its engineered variants (Tsien, Reference Tsien2010), the gene for which can be transformed into the system of interest. The ideal sensor will be minimally invasive and will have high specificity, brightness, and signal-to-noise ratios, a dynamic range that can accurately report the range of concentrations observed in vivo, and response kinetics that are as fast as the natural changes in the probed constituent. The best sensors are also ratiometric, which allows signal normalization to take into account such factors as photobleaching and heterogenous dye distribution. It is important to note that the cellular environment differs among various cellular compartments and organelles. For example, the microenvironments of mitochondria (De Michele et al. Reference De Michele, Carimi and Frommer2014) and chloroplasts (Stael et al. Reference Stael, Wurzinger, Mair, Mehlmer, Vothknecht and Teige2011) (both of which have their own genomes and thus local RNA transcription) are quite different from the microenvironment of the nucleus, and both differ from the cytosolic environment. Ideally, a sensor would also have the capacity to be specifically targeted to an organelle or subcellular location where RNA-folding events of interest occur; for example, sensors that are genetically encoded can be fused to sequences that confer organelle-specific targeting (Choi et al. Reference Choi, Swanson and Gilroy2012).

Cations of particular relevance to RNA structure are heavy metals, which tend to destabilize and degrade RNA, Mg²⁺, which tends to promote RNA folding, and H⁺ (pH), which affects RNA catalysis. In addition, K⁺ and Na⁺ promote formation of the special RNA structure, the G-quadruplex. In vivo concentrations of Mg²⁺ (London, Reference London1991; Lusk et al. Reference Lusk, Williams and Kennedy1968; Romani, Reference Romani2007; Truong et al. Reference Truong, Sidote, Russell and Lambowitz2013) and K⁺ as well as pH changes are all within the concentration ranges that can affect RNA structure. Among these cations, sensors based on GFP and its variants are available for Mg²⁺ (Lindenburg et al. Reference Lindenburg, Vinkenborg, Oortwijn, Aper and Merkx2013), Pb²⁺ (Nadarajan et al. Reference Nadarajan, Ravikumar, Deepankumar, Lee and Yun2014), Hg²⁺ (Hu et al. Reference Hu, Hu, Chen and Wang2013), and H⁺ (Tantama et al. Reference Tantama, Hung and Yellen2011). A number of synthetic pH sensors are also available (Yang et al. Reference Yang, Cao, He, Yang, Kim, Peng and Kim2014). Both genetically encoded and synthetic sensors of ROS are also available (Pouvreau, Reference Pouvreau2014; Swanson et al. Reference Swanson, Choi, Chanoca and Gilroy2011), which could be applied to study how ROS are associated with genetic diseases (Fimognari, Reference Fimognari2015) or environmental conditions (Jaspers & Kangasjärvi, Reference Jaspers and Kangasjärvi2010) that affect RNA structure in vivo.

As discussed in Section 3.2.2, synthetic and biological cosolutes typically destabilize RNA structure. In one early report, sucrose, which is the circulating ‘energy currency’ in plants, was reported to destabilize RNAs in vitro (Gao et al. Reference Gao, Gnutt, Orban, Appel, Righetti, Winter, Narberhaus, Müller and Ebbinghaus2016; Lambert & Draper, Reference Lambert and Draper2007). While the in vitro effects occurred at significantly higher concentrations than prevail in the cytosol proper, in microdomains close to the sites of sugar transporters, sucrose, and other sugars could perhaps be present at significantly higher concentrations and consequently affect RNA structure locally; moreover, weakly folded RNAs, such as certain mRNAs, may be more susceptible to such cosolutes. In a possibly analogous situation, while resting Ca²⁺ levels in the cell cytosol are 100–200 nM, Ca²⁺ concentrations as high as 100 mM have been reported at the mouths of Ca²⁺ channels (Tang et al. Reference Tang, Reddish, Zhuo and Yang2015a). Lipid anchoring of recently developed sucrose and glucose sensors (Fehr et al. Reference Fehr, Lalonde, Lager, Wolff and Frommer2003; Lager et al. Reference Lager, Looger, Hilpert, Lalonde and Frommer2006) to probe the near membrane microenvironment of sugar transporters could allow evaluation of this hypothesis.

The physical microenvironment and the localization of RNA, both of which can impact RNA structure, vary across cellular regions and organelles. Accordingly, methods that allow visualization of the spatial location of any specific RNA of interest are also highly desirable (Buxbaum et al. Reference Buxbaum, Haimovich and Singer2015). One of the first technologies developed for RNA visualization was molecular beacons (Santangelo et al. Reference Santangelo, Nitin and Bao2006), which are oligonucleotides tagged with a synthetic fluorophore at one end and a synthetic quencher on the other end. Molecular beacons take on a non-fluorescent stem-loop structure in the absence of a complementary RNA due to the close proximity of the quencher and fluorophore, but exhibit fluorescence upon unfolding and hybridization to the target RNA. Various strategies (Santangelo et al. Reference Santangelo, Nitin and Bao2006) can be employed to introduce molecular beacons into mammalian cells, but they are not genetically encoded. A more widely used strategy for visualization of specific RNAs employs a genetic approach in which an RNA sequence that binds the bacteriophage MS2 protein is inserted into the UTR of the transcript of interest and the organism is engineered to express GFP-tagged MS2, which then binds to the transcript of interest, marking its location (Buxbaum et al. Reference Buxbaum, Haimovich and Singer2015).

A different type of RNA marker has been developed recently based on the GFP fluorophore. GFP is fluorescent because the folded protein immobilizes the 4-hydroxy-benzylidene-imidazolinone (HBI) fluorophore encoded by a cyclized and subsequently oxidized Ser–Tyr–Gly tripeptide. RNA aptamers have been identified that analogously immobilize and thus induce fluorescence of a related synthetic fluorophore, DFHBI [(Z)-4-(3,5-difluoro-4-hydroxybenzylidene)-1,2-dimethyl-1H-imidazol-5(4H)-one]. The sequence of the RNA aptamer is genetically incorporated into the gene of interest and upon RNA expression and administration of the membrane-permeant fluorophore and its immobilization by the RNA aptamer, fluorescence is observed that marks the location of the target RNA (Paige et al. Reference Paige, Wu and Jaffrey2011). The RNA aptamer, dubbed Spinach, as well as the second generation aptamer Spinach2, both require addition of exogenous Mg²⁺ to fold properly; such addition could obviously also affect native RNA structures. The third generation Spinach reporter, Broccoli, eliminates this requirement (You & Jaffrey, Reference You and Jaffrey2015).

Spinach aptamers can be further modified to read out concentrations of cellular metabolites by fusion of the Spinach aptamer with other aptamer sequences (identified by artificial selection) that selectively bind small molecules (Paige et al. Reference Paige, Duc, Song and Jaffrey2012), or by incorporation of the Spinach aptamer into prokaryotic riboswitches (You et al. Reference You, Litke and Jaffrey2015). Riboswitch-based reporters have the advantage of having undergone natural selection that confers high affinity and specificity for the metabolite of interest, but are not currently ratiometric. Ratiometric sensors based on FRET between CFP and YFP, variants of GFP, have been engineered for several metabolites, including those with relevance to RNA structure. For example, FRET-based sensing of ATP concentration (Imamura et al. Reference Imamura, Huynh Nhat, Togawa, Saito, Iino, Kato-Yamada, Nagai and Noji2009) could be relevant to RNA structure because of the ATP requirement for the activity of RNA helicases (Rouskin et al. Reference Rouskin, Zubradt, Washietl, Kellis and Weissman2014). In summary, the future is bright for in vivo quantification of a plethora of the metabolites and physical properties that affect RNA structure. Quantification of cellular factors in vivo will play an important role in designing artificial cytoplasms to conduct in vivo-like studies of RNA folding.

2.2.4 Benefits and limitations of in vivo studies

Studies in vivo have shown that RNA can adopt different structures in vivo and in vitro, and have led to fresh insights on how the cellular environment affects RNA folding across a genome. Novel RNA structure motifs and RNA–protein interactions have been demonstrated through genome-wide in vivo experiments. In addition, novel RNA regulatory pathways have been identified by such studies.

Since some RNAs have been shown to fold and function differently under cellular conditions, the question arises, “Why not study RNA solely in living cells instead of in dilute solution conditions?” The reality is that methods for directly studying RNA folding in vivo are limited, and most current in vivo approaches rely on structure probing methods that do not probe RNA thermodynamics or folding pathways. Experiments done in vivo provide information only on the average RNA structure in a cell or organism and lack information on RNA dynamics, the folding process, and the presence of multiple populated structures of the same transcript. These limitations motivate in vivo-like studies to understand the influence of cellular conditions on RNA folding. Before moving to the in vivo-like section, we consider the important role that in silico studies play in both in vitro and in vivo studies.

2.3 In silico studies of RNA folding

Studies in vitro and in vivo described above yielded insights into RNA folding and structure that were informed by in silico structure prediction tools. Structure probing experiments, for example, typically use in silico prediction tools to model structure that is guided by the experimental data. In the subsections below, we describe advances in predicting RNA structure from one sequence, from multiple sequences, and with experimental data. Limitations of each approach are provided as well.

2.3.1 Major advances: RNA structure prediction from one sequence in silico

The most popular approaches to predict RNA structure use dynamic programming algorithms to efficiently search the set of possible structures (Eddy, Reference Eddy2004) and folding free energy nearest-neighbor rules to estimate folding stability (Turner & Mathews, Reference Turner and Mathews2010). The dynamic programming algorithms guarantee that every structure allowed by the set of folding rules is considered, except for those containing pseudoknots (see below). This means, for example, that the lowest free energy conformation will be found for programs that find lowest free energy structures, i.e. the most probable structure at equilibrium.

The accuracy of RNA structure prediction from sequence alone, in terms of fraction of known pairs correctly predicted, is stubbornly limited to ~70% (Hajiaghayi et al. Reference Hajiaghayi, Condon and Hoos2012; Lu et al. Reference Lu, Gloor and Mathews2009), and accuracy is lower for long sequences (>1000 nucleobases) such as small and large ribosomal RNAs and mRNAs (Doshi et al. Reference Doshi, Cannone, Cobaugh and Gutell2004) or for sequences that fold to more than one conformation at equilibrium. In silico predictions of base pairs presently rely on a parameterization of stabilities determined in vitro rather than in vivo, and these parameters are based on relatively few experiments, as compared to all possible folded sequences.

In response to this moderate success rate, a number of in silico methods have been developed to predict alternative structures, as reviewed previously (Mathews, Reference Mathews2006). Programs generate sets of alternative hypotheses for the structure (suboptimal structures) (Wuchty et al. Reference Wuchty, Fontana, Hofacker and Schuster1999; Zuker, Reference Zuker1989), feasible structures in equilibrium with each other (stochastic samples) (Ding & Lawrence, Reference Ding and Lawrence2003), or estimates for base pairing probabilities (partition function calculations) (McCaskill, Reference Mccaskill1990). Each of these three methods is described in turn. Suboptimal structures are those with similar free energy to the lowest free energy structure. Certain suboptimal structures can sometimes be more representative of the biological structure than the in silico-estimated lowest free energy structure, and can be viewed as alternative models or alternative hypotheses for the in vivo structure. Stochastic samples are rigorous samples from the equilibrium (Boltzmann) ensemble. They are useful for estimating ensemble statistics for the secondary structure of an RNA. Partition function calculations provide pairing probability estimates; more probable pairs in predicted structures are more likely to occur in the accepted structure (Mathews, Reference Mathews2004).

2.3.2 Major advances: RNA structure prediction from multiple sequences in silico

The accuracy of in silico folding can be dramatically improved by using additional information to guide the folding. In this section, we discuss using homologous sequences to guide the folding, while in the next section we discuss applying experimental data. Multiple homologous sequences, commonly called an RNA family, can be used to estimate the common secondary structure (Seetin & Mathews, Reference Seetin and Mathews2012a) because structure is generally conserved to a greater extent than sequence for RNAs. Due to sequence variation, the number of base pairs conserved across a family is smaller than the number of base pairs adopted by each sequence. With enough sequences, conserved pairs stand out as positions of covariation, where compensating base pair changes are observed. Covariation is a change in sequences where one biological species, for example, will have an AU base pair, but another species will have a GC pair at the homologous position. During evolution, two separate changes occurred in sequence (a compensating change) that conserved the base pair.

Three approaches are used to estimate the biologically conserved structure from a set of homologous sequences (Reeder et al. Reference Reeder, Hochsmann, Rehmsmeier, Voss and Giegerich2006; Seetin & Mathews, Reference Seetin and Mathews2012a). In the first approach, the available sequences are aligned, and then used to restrain the in silico prediction. This approach is typically the fastest, but generally works best when the pairwise sequence identity of all the homologs is high (75% or higher). These programs are exemplified by RNAalifold (Bernhart et al. Reference Bernhart, Hofacker, Will, Gruber and Stadler2008) and TurboFold (Harmanci et al. Reference Harmanci, Sharma and Mathews2011). Programs in the second set predict the structures for each sequence first and then compare the predicted structures to find those common to all sequences. This approach works well when the structure is highly conserved and is exemplified by RNAcast (Reeder & Giegerich, Reference Reeder and Giegerich2005). The third approach is to simultaneously align and fold sequences to find the common structure and sequence alignment. This is the best approach to use when the sequences are diverse (pairwise sequence identity for some sequence pairs below 75%) because low pairwise identity makes sequence alignment challenging. Programs in this class include Dynalign/Multilign (Fu et al. Reference Fu, Sharma and Mathews2014; Xu & Mathews, Reference Xu and Mathews2011), Foldalign (Torarinsson et al. Reference Torarinsson, Havgaard and Gorodkin2007), LocARNA (Will et al. Reference Will, Reiche, Hofacker, Stadler and Backofen2007), PARTS (Harmanci et al. Reference Harmanci, Sharma and Mathews2008), and RAF (Do et al. Reference Do, Foo and Batzoglou2008).

The accuracy of in silico prediction of conserved structures from a set of homologous sequences can be much higher, than for predictions from single sequences. For example, often an additional 20% or more of the known base pairs can be correctly predicted using multiple homologs as compared to predictions using a single sequence (Xu & Mathews, Reference Xu and Mathews2011). For a given set of sequences, however, it is not always obvious which approach or program to use, and, therefore, it is probably best to try more than one program to develop hypotheses about the in vivo structure. To date, no program can completely automate comparative sequence analysis. Manual comparison is still required for the most accurate RNA secondary structure determination.

2.3.3 Major advances: RNA structure prediction in silico restrained with experimental data

Another type of information used to guide in silico prediction of RNA structure is experimental structure mapping. Such mapping data can come from in vitro or in vivo experiments and are used to restrain structure prediction (Lorenz et al. Reference Lorenz, Wolfinger, Tanzer and Hofacker2016; Sloma & Mathews, Reference Sloma and Mathews2015). The effects of experimental structure restraints have been well studied using in vitro probing data on structured ncRNAs. Over 85% of known pairs can be correctly predicted using in vitro SHAPE, DMS, or enzymatic cleavage data (Cordero et al. Reference Cordero, Kladwang, Vanlang and Das2012; Deigan et al. Reference Deigan, Li, Mathews and Weeks2009; Eddy, Reference Eddy2014; Hajdin et al. Reference Hajdin, Bellaousov, Huggins, Leonard, Mathews and Weeks2013; Ouyang et al. Reference Ouyang, Snyder and Chang2013; Washietl et al. Reference Washietl, Hofacker, Stadler and Kellis2012; Wu et al. Reference Wu, Shi, Ding, Liu, Hu, Yip, Yang, Mathews and Lu2015; Zarringhalam et al. Reference Zarringhalam, Meyer, Dotu, Chuang and Clote2012) when the extent of accessibility is quantified using capillary/gel electrophoresis or deep sequencing counts. This is a dramatic improvement over the above-mentioned 70% limit in the absence of mapping data. Using in vivo mapping data to improve the accuracy of structure prediction has not yet been well studied, although mapping data overlaid on known structures suggests that, for structured ncRNAs such as rRNAs, the existing methods should improve structure prediction accuracy (Ding et al. Reference Ding, Tang, Kwok, Zhang, Bevilacqua and Assmann2014). We recently developed a pipeline called StructureFold to fold RNAs across a genome using restraints from experimental data, which works with Structure-seq data (Tang et al. Reference Tang, Bouvier, Kwok, Ding, Nekrutenko, Bevilacqua and Assmann2015b), and the RNAstructure program and can accommodate other data and folding algorithms.

2.3.4 Challenges with in silico modeling of RNA secondary structure

Despite its widespread use, RNA secondary structure prediction has known limitations. First, the nearest-neighbor parameters are based on a limited number of experiments measured in vitro in 1 M NaCl rather than in vivo-like conditions, and there are probably many sequences that are not predicted well with those parameters (Andronescu et al. Reference Andronescu, Condon, Turner and Mathews2014; Mathews et al. Reference Mathews, Disney, Childs, Schroeder, Zuker and Turner2004). On the one hand, for a limited number of simple RNAs melted in physiological K⁺ and Mg²⁺ concentrations, the stability is often similar to that in 1 M NaCl (Diamond et al. Reference Diamond, Turner and Mathews2001; Jaeger et al. Reference Jaeger, Zuker and Turner1990; Jiang et al. Reference Jiang, Kennedy, Moss, Kierzek and Turner2014; Schroeder & Turner, Reference Schroeder and Turner2000). However, for a 5S ribosomal RNA loop E motif, for example, an appreciable difference in stabilities was found between buffers with and without Mg²⁺ (Serra et al. Reference Serra, Baird, Dale, Fey, Retatagos and Westhof2002). Second, although enthalpy parameters are available for structure prediction between 10 and 60 °C (Lu et al. Reference Lu, Turner and Mathews2006), predictions are generally made at 37 °C, which is relevant to humans, but not the majority of organisms. Third, finding lowest free energy structures assumes that RNAs fold to equilibrium, i.e. kinetics do not control folding. In favor of this assumption, an in vivo study of ribozymes suggested that RNAs fold to equilibrium to a greater extent in yeast cells than in vitro (Mahen et al. Reference Mahen, Harger, Calderon and Fedor2005). Also, in vitro structure mapping studies of annealed ribosomal RNAs were consistent with in vivo structures (Moazed et al. Reference Moazed, Stern and Noller1986b). However, some sequences are kinetically trapped, such as transcriptional riboswitches (Seetin & Mathews, Reference Seetin and Mathews2012a; Wickiser et al. Reference Wickiser, Cheah, Breaker and Crothers2005a; Wickiser et al. Reference Wickiser, Winkler, Breaker and Crothers2005b). Therefore, it is unclear to what extent factors such as non-physiological ionic conditions and cotranscriptional folding play roles in shaping the folding of RNA.

A fourth limitation of the most popular programs for in silico folding is that they cannot predict pseudoknots (Liu et al. Reference Liu, Mathews and Turner2010a). A pseudoknot occurs when there are base pairs between nucleotides in two different loops. Formally, a pseudoknot is composed of two or more base pairs, defined by indices i base paired to j and i′ base paired to j′, where the order of the nucleotides is i < i′ < j < j′. Pseudoknotted pairs are a small fraction of total base pairs in known structures but often occur in highly structured and functional RNAs. For programs that predict pseudoknots, the accuracy is shockingly low (<5%) (Bellaousov & Mathews, Reference Bellaousov and Mathews2010), although the use of multiple homologous sequences to identify conserved pseudoknots improves the accuracy (Seetin & Mathews, Reference Seetin and Mathews2012b). Recently, it was also shown that in vitro SHAPE mapping data can guide in silico structure prediction, including pseudoknots, and achieve over 90% accuracy at predicting known base pairs (Hajdin et al. Reference Hajdin, Bellaousov, Huggins, Leonard, Mathews and Weeks2013). The program that implements this, ShapeKnots, is limited, however, to sequences of 600 nucleotides or fewer.

Although structure mapping data and sequence comparison are each used to guide in silico modeling of RNA secondary structure, little has been done until recently to combine the two approaches for additional synergy. The secondary structures of three long ncRNAs were modeled with the aid of structure mapping data: HOTAIR (with in vitro SHAPE, DMS, and terbium) (Somarowthu et al. Reference Somarowthu, Legiewicz, Chillón, Marcia, Liu and Pyle2015), SRA (with in vitro SHAPE, DMS, in-line probing, and RNase V1 digestion) (Novikova et al. Reference Novikova, Hennelly and Sanbonmatsu2012), and XIST (with in vivo DMS mapping) (Fang et al. Reference Fang, Moss, Rutenberg-Schoenberg and Simon2015). For each of these studies, sequence comparison, i.e. the verification that the structures are conserved and the identification of compensating base pair changes, was subsequently used to further support the structure model.

Two software programs were enhanced to combine structure mapping data and sequence comparison to improve structure prediction. Sükösd et al. (Reference Sükösd, Knudsen, Kjems and Pedersen2012) reported PPfold, a program that uses a probabilistic approach to predict structure and can be guided by SHAPE mapping data and/or sequence covariation as estimated from a sequence alignment. Recently, SHAPE data were used to inform sequence alignment and then RNAalifold to predict the conserved structure for the aligned sequences (Lavender et al. Reference Lavender, Lorenz, Zhang, Tamayo, Hofacker and Weeks2015b). The key observation is that homologous nucleotides, i.e. those that align, have similar SHAPE reactivities and thus the differences in SHAPE reactivity can be included as an additional metric in the scoring of alignments. This approach demonstrated an improved accuracy of base pair prediction by RNAalifold as compared to consensus structure prediction or SHAPE guided structure prediction alone. Both of these approaches were used to model HIV RNA structure using mapping data and sequence comparison (Lavender et al. Reference Lavender, Gorelick and Weeks2015a; Sükösd et al. Reference Sükösd, Andersen, Seemann, Jensen, Hansen, Gorodkin and Kjems2015).

3. Bridging the gap between in vitro and in vivo RNA folding using in vivo-like studies

3.1 The gap

The previous sections outlined major contributions of RNA-folding studies in vitro and in vivo to our understanding of how RNA behaves, while considering the important roles that in silico approaches play. In vitro studies provide the fundamentals of RNA thermodynamics and kinetics, RNA structural motifs, and genome-wide RNA structure trends. In vivo structure probing methods reveal RNA structural trends related to biological functions and regulatory roles of RNA genome-wide. We discussed how several research teams have used genome-wide in vivo structural probing to uncover that, in general, RNAs do not adopt the same structures in vivo as in vitro. Since structure generally dictates function, understanding differences between RNA folding in vivo and in vitro can illuminate biological function. Toward accomplishing this goal, RNA folding and function studies have been increasingly conducted under conditions that mimic the cellular environment.

The dilute solution conditions traditionally used to study RNA in vitro are vastly different from the cellular environment. The cellular environment is a complex solution containing biopolymers, metabolites, dilute free salts, and organelles, with 20–40% of the cellular volume occupied by macromolecular crowders (Minton, Reference Minton2001; Zimmerman & Trach, Reference Zimmerman and Trach1991). As such, there is no single cellular environment to which RNA is exposed. As an mRNA passes from the nucleus to the cytosol, solution conditions change; in eukaryotes, the cell is compartmentalized and as the RNA is transported to different regions its fold can change.

It is of interest to consider the differences between RNA structure in eukaryotic and prokaryotic organisms. Functional RNAs have intricate structures with tertiary contacts that assemble secondary structures close in space. Cations, typically Mg²⁺, neutralize the negative charge of the phosphate backbone and promote tertiary structures. Free Mg²⁺ concentrations in prokaryotic and eukaryotic cells are different, ~1·5–3·0 and 0·5–1·0 mM, respectively (London, Reference London1991; Lusk et al. Reference Lusk, Williams and Kennedy1968; Romani, Reference Romani2007; Truong et al. Reference Truong, Sidote, Russell and Lambowitz2013). Structured RNAs such as ribozymes, riboswitches, and thermosensors are found frequently in prokaryotes, where free Mg²⁺ levels are higher. Although a few ribozymes and one riboswitch have been identified in eukaryotes, they appear to be rare, and proteins are typically involved in forming requisite tertiary structures (Kubodera et al. Reference Kubodera, Watanabe, Yoshiuchi, Yamashita, Nishimura, Nakai, Gomi and Hanamoto2003; Roth et al. Reference Roth, Weinberg, Chen, Kim, Ames and Breaker2014; Salehi-Ashtiani et al. Reference Salehi-Ashtiani, Luptak, Litovchick and Szostak2006). Lambowitz and co-workers demonstrated that prokaryotic group II introns fold poorly in eukaryotic cells, although they could select variant RNAs that fold into active conformations at eukaryotic low Mg²⁺ concentrations (Truong et al. Reference Truong, Sidote, Russell and Lambowitz2013). Studies in our laboratory indicate that the eukaryotic innate immune sensor PKR is activated by prokaryotic RNAs under eukaryotic low Mg²⁺ conditions, leading to the speculation that riboswitches and ribozymes may be selected against in eukaryotes to aid in discriminating self and non-self at the RNA level (Hull & Bevilacqua, Reference Hull and Bevilacqua2015, Reference Hull and Bevilacqua2016; Hull et al. Reference Hull, Anmangandla and Bevilacqua2016). To date, there are no studies that compare the structures of eukaryotic and prokaryotic RNAs genome-wide, but such information would be valuable.

Historically, in vitro experiments lack many of the components of cellular environments, and, moreover, often have high concentrations of salt to fold RNA for thermodynamic and structural studies (Table 1). Thermodynamic studies cannot, however, readily be performed in vivo. The cell prohibits wide variations of temperature, pH, salt, and ligand concentration, all of which are necessary to obtain thermodynamic information. As a result, RNA is being increasingly studied in artificial cytoplasms that mimic aspects of the cellular environment while allowing biophysical studies. Several recent studies focused on mimicking aspects of the in vivo environment in vitro; conditions referred to herein as ‘in vivo-like’ conditions (Fig. 3). Effects of such conditions as cellular concentrations of monovalent and divalent ions and molecular crowding agents on the folding of RNAs have been a theme in a number of recent studies (Desai et al. Reference Desai, Kilburn, Lee and Woodson2014; Dupuis et al. Reference Dupuis, Holmstrom and Nesbitt2014; Nakano et al. Reference Nakano, Kitagawa, Yamashita, Miyoshi and Sugimoto2015; Paudel & Rueda, Reference Paudel and Rueda2014; Strulson et al. Reference Strulson, Boyer, Whitman and Bevilacqua2014; Tyrrell et al. Reference Tyrrell, Weeks and Pielak2015). Experiments under these in vivo-like conditions have the potential to bridge our understanding of observations made in vitro and in vivo.

3.2 Design of artificial cytoplasms and early experiments

In this section, we discuss various methods of mimicking cytoplasmic conditions, including the use of polymers and cosolutes as crowding agents and the use of protocells and synthetic membranes. We also discuss the outcomes of early experiments under these in vivo-like conditions. Finally, directions in which the field needs to move to understand the fold and function of RNA in vivo are suggested.

3.2.1 Polymers

Synthetic crowding agents such as polyethylene glycol, dextran, and ficoll, and small cosolute additives such as methanol, proline, and trimethylamine oxide (TMAO) have been used to mimic the crowded environment of the living cell. Functional RNAs that are well studied in vitro have been used to test the effects crowding agents have on RNA folding. Various methods, including UV melts, SAXS, kinetic techniques, and smFRET, have been used to study RNA under these in vivo-like conditions. Several studies have shown that synthetic crowding agents affect the thermodynamics and function of several RNAs (Dupuis et al. Reference Dupuis, Holmstrom and Nesbitt2014; Kilburn et al. Reference Kilburn, Roh, Guo, Briber and Woodson2010, Reference Kilburn, Roh, Behrouzi, Briber and Woodson2013; Lambert et al. Reference Lambert, Leipply and Draper2010; Strulson et al. Reference Strulson, Boyer, Whitman and Bevilacqua2014). Findings of these studies are that RNAs fold cooperatively, structure becomes compact, and ribozymes cleave faster under in vivo-like conditions (Kilburn et al. Reference Kilburn, Roh, Behrouzi, Briber and Woodson2013; Nakano et al. Reference Nakano, Karimata, Kitagawa and Sugimoto2009; Strulson et al. Reference Strulson, Yennawar, Rambo and Bevilacqua2013, Reference Strulson, Boyer, Whitman and Bevilacqua2014).

The kinetics of several small and large ribozymes have been probed under in vivo-like conditions and in all reported cases, rates of catalysis have increased in the presence of molecular crowders as compared to dilute solution conditions (Desai et al. Reference Desai, Kilburn, Lee and Woodson2014; Nakano et al. Reference Nakano, Karimata, Kitagawa and Sugimoto2009; Paudel & Rueda, Reference Paudel and Rueda2014; Strulson et al. Reference Strulson, Molden, Keating and Bevilacqua2012, Reference Strulson, Yennawar, Rambo and Bevilacqua2013). For example, the hammerhead ribozyme has higher catalytic activity, between 3·5 and 6·5 faster than in dilute solutions, in the presence of 10–30% (wt %) PEG200 or PEG8000, suggesting a more populated active state in crowded conditions (Nakano et al. Reference Nakano, Karimata, Kitagawa and Sugimoto2009). In addition, in vivo-like solution conditions can stabilize ribozymes even in the presence of denaturants. For example, the rate of catalysis of the CPEB3 ribozyme in the presence of 2·5 M of the denaturant urea was recovered by the addition of 30% (w/v) PEG200, PEG8000, or Dextran10, at a rate higher than in buffer alone (Strulson et al. Reference Strulson, Yennawar, Rambo and Bevilacqua2013). SAXS experiments have provided insight into the structural basis for enhanced catalysis, showing that the natively folded state adopts a more compact structure in the presence of molecular crowders under conditions of biological Mg²⁺ concentrations (Kilburn et al. Reference Kilburn, Roh, Guo, Briber and Woodson2010; Strulson et al. Reference Strulson, Yennawar, Rambo and Bevilacqua2013).

The thermal stability of several functional RNAs has been reported to increase under in vivo-like conditions as compared with in vitro experiments. For instance, in 20% PEG200 or PEG8000 the hammerhead ribozyme retains catalytic activity up to 60 °C, a temperature that thermally denatures the ribozyme in dilute solutions (Nakano et al. Reference Nakano, Karimata, Kitagawa and Sugimoto2009). Observation of increased hammerhead catalytic activity, up to 270-fold, at high temperatures in crowded conditions indicates a more thermostable RNA under in vivo-like conditions. Interestingly, the individual secondary structure elements of the ribozyme were observed, through optical melting experiments, to be thermally destabilized in molecular crowding agents, suggesting that tertiary structure is stabilized and resulting in more cooperative folding of the ribozyme (Nakano et al. Reference Nakano, Karimata, Kitagawa and Sugimoto2009). A thermodynamic study from our laboratory using SHAPE structure probing on tRNA^phe under in vivo-like conditions showed that tRNA folds in a cooperative manner at biological Mg²⁺ concentrations in the presence of molecular crowding (Strulson et al. Reference Strulson, Boyer, Whitman and Bevilacqua2014). The observed increase in folding cooperativity with crowding was accompanied by an increase in the temperature of the melting transition for tertiary structure. When the tertiary interactions were removed by mutation of nucleotides in tertiary contacts to uridine, cooperativity was lost and the RNA folded with multiple transitions under all solution conditions, thus indicating that tertiary interactions are vital to cooperative RNA folding under in vivo-like conditions. This effect is similar to that observed under in vitro conditions mentioned above (Chauhan & Woodson, Reference Chauhan and Woodson2008).

The contribution of molecular crowding agents to RNA catalysis and folding has been found to be largest in a background of physiologically low ionic conditions rather than high ionic conditions (Kilburn et al. Reference Kilburn, Roh, Behrouzi, Briber and Woodson2013; Strulson et al. Reference Strulson, Yennawar, Rambo and Bevilacqua2013). In the absence of crowding, physiological concentrations of Mg²⁺ are not high enough to fold functional RNAs in a two-state manner. This is apparent from the observation of long-lived intermediates and slow folding under these conditions (Banerjee & Turner, Reference Banerjee and Turner1995; Chadalavada et al. Reference Chadalavada, Senchak and Bevilacqua2002; Mitchell et al. Reference Mitchell, Jarmoskaite, Seval, Seifert and Russell2013). However, in the presence of biological crowding conditions and physiological Mg²⁺, functional RNAs tend to fold in a cooperative manner into compact structures (Desai et al. Reference Desai, Kilburn, Lee and Woodson2014; Dupuis et al. Reference Dupuis, Holmstrom and Nesbitt2014; Strulson et al. Reference Strulson, Boyer, Whitman and Bevilacqua2014; Tyrrell et al. Reference Tyrrell, Weeks and Pielak2015), and ribozymes and riboswitches tend to have higher rates of cleavage and higher ligand-binding affinity (Paudel & Rueda, Reference Paudel and Rueda2014). The addition of more Mg²⁺ to these conditions does not result in a further increase in the rate of activity or more cooperative RNA folding, indicating that together physiological crowding and Mg²⁺ conditions fold RNA optimally (Fig. 7).

Fig. 7. Under both in vitro (right, grey) and in vivo-like conditions with molecular crowding (left, pink) RNA fold into their native state that is functional, indicated in this figure by catalysis. High concentrations of Mg²⁺ (10 mM or higher) are needed to achieve the folded state in vitro compared with in vivo-like crowded conditions where low physiological Mg²⁺ (0·5 mM) folds the RNA. Reprinted with permission from Strulson et al. (Reference Strulson, Yennawar, Rambo and Bevilacqua2013). Copyright 2016 American Chemical Society.

A recent study explored the structural effects of the molecular crowding agent PEG (ranging in size from the monomer to 35 000 kDa) on the adenine riboswitch (Tyrrell et al. Reference Tyrrell, Weeks and Pielak2015). Using SHAPE chemistry, the reactivity of the riboswitch under in vitro, in vivo, and in vivo-like conditions was explored. The authors found that in low molecular weight PEG (<3350 kDa) the riboswitch had low correlation between reactivity in vivo and in vivo-like conditions, whereas in higher molecular weight PEG (12 000–35 000 kDa) the RNA had a similar reactivity under in vivo and in vivo-like conditions. While this study was limited to a single molecular crowding agent, it is significant because it showed that certain in vivo-like conditions are not accurate cellular mimics.

Recently, the folding of a model RNA was examined in vivo. The Salmonella fourU RNA thermometer hairpin containing a FRET pair was injected into live mammalian cells and reported to have similar melting temperatures and unfolding free energy in vivo and in vitro (Gao et al. Reference Gao, Gnutt, Orban, Appel, Righetti, Winter, Narberhaus, Müller and Ebbinghaus2016). The addition of 30% (w/v) PEG of varying sizes and Ficoll70 was shown to modify the thermodynamics of the hairpin, and higher molecular weight polymers were found to have similar effects on the RNA as the in vivo environment. The in vivo data had a very broad distribution of melting temperatures and free energy between both different cells and different cellular compartments, leading to some uncertainty about how the cellular environment is affecting RNA folding.

3.2.2 Cosolutes

While molecular crowding agents generally facilitate the folding of functional RNAs, small cosolutes have varying and complicated effects on RNA thermostability and folding cooperativity. This arises in part because the effect on stability depends strongly on the interactions between the particular cosolute and RNA considered. Cosolutes, also known as osmolytes, regulate osmotic pressure in cells (Record et al. Reference Record, Courtenay, Cayley and Guttman1998; Yancey et al. Reference Yancey, Clark, Hand, Bowlus and Somero1982). Effects of cosolutes on RNA folding were not significantly investigated until the last decade. Studies on RNAs with either secondary and/or tertiary structures report that cosolutes such as betaine, proline, and methanol, almost always destabilize secondary structures, while having mixed effects on tertiary structure (Lambert & Draper, Reference Lambert and Draper2007, Reference Lambert and Draper2012; Lambert et al. Reference Lambert, Leipply and Draper2010; Soto et al. Reference Soto, Misra and Draper2007). Several osmolytes have been shown to interact with the nucleobase, sugar, and phosphate of RNAs, with examples of both favorable and unfavorable interactions (Lambert & Draper, Reference Lambert and Draper2007). Stabilizing osmolytes have unfavorable interactions with the unfolded state of RNA, resulting in RNA compaction that buries functional groups and stabilization of the native state, while destabilizing osmolytes have favorable interactions with the unfolded state of RNA, driving unfolding (Holmstrom et al. Reference Holmstrom, Dupuis and Nesbitt2015; Lambert & Draper, Reference Lambert and Draper2007; Lambert et al. Reference Lambert, Leipply and Draper2010).

There are a limited number of studies available for the effect of cosolutes on RNA function. The hammerhead ribozyme was shown to have increased rates of cleavage in 20% cosolutes, such as glycerol and 1,2-dimethoxyethane, in the presence of physiological Mg²⁺, which was attributed to enhanced electrostatic interactions with Mg²⁺ (Nakano et al. Reference Nakano, Kitagawa, Yamashita, Miyoshi and Sugimoto2015). The secondary and tertiary structures of the hammerhead ribozyme were destabilized in the presence of several cosolutes (Nakano et al. Reference Nakano, Karimata, Kitagawa and Sugimoto2009). In crowded conditions, ribozyme activity also increased, while secondary structure was destabilized and tertiary structure was stabilized (Nakano et al. Reference Nakano, Karimata, Kitagawa and Sugimoto2009).

The influence of the cosolute TMAO on RNA secondary and tertiary structure, as well as on the phosphate backbone, has been studied (Denning et al. Reference Denning, Thirumalai and Mackerell2013; Lambert & Draper, Reference Lambert and Draper2007; Lambert et al. Reference Lambert, Leipply and Draper2010). TMAO is unusual in having almost no effect on secondary structure stability, while generally stabilizing tertiary structure. A small 58 mer rRNA was found to exhibit cooperative two-state folding in the presence of TMAO, observed by a single transition in an optical melting experiment (Lambert et al. Reference Lambert, Leipply and Draper2010).

3.2.3 Protocells and synthetic membranes

There are several groups focusing on how to model RNA function and structure in early Earth conditions, which also relate to compartmentalization in modern cells. Coacervates and synthetic membranes are often used to mimic early Earth protocells, and RNA function in these protocells is often studied through ribozyme cleavage. Our laboratory studied the activity of a two-piece hammerhead ribozyme in aqueous two-phase systems (ATPS) made of polyethylene glycol and dextran (Strulson et al. Reference Strulson, Molden, Keating and Bevilacqua2012). The system forms a dextran-rich phase droplet in which the ribozyme preferentially localizes at a concentration up to 3000 times that of the aqueous phase, resulting in a 70-fold increase in the rate of catalysis. This study suggested that RNA catalysis in the early Earth environment could have arisen from compartmentalization increasing the local concentration of RNA, possibly accelerating very slow reactions, so that they could occur on a biologically relevant timescale.

Similar to these droplets, mononucleotides will form microdroplets when mixed with cationic peptides in water (Koga et al. Reference Koga, Williams, Perriman and Mann2011). Inside these droplets, nucleotides and peptides can reach concentrations as high as 1·6 M and 400 mM respectively, which is much more concentrated than in the aqueous phase. Cationic and anionic dyes and certain nanoparticles were shown to partition into the droplets, indicating that the droplets are permeable to charged molecules (Koga et al. Reference Koga, Williams, Perriman and Mann2011). These droplet phases are another indicator that early life could have arisen in non-membranous compartments. More recently we made coacervates from nucleotides and poly(allylamine) that contain molar concentrations of Mg²⁺ and nucleotides, which could facilitate RNA catalysis in an early life scenario (Frankel et al. Reference Frankel, Bevilacqua and Keating2016).

4. Future directions

The majority of what is known about RNA folding and structure comes from studies that were performed in vitro on small model systems and highly structured RNAs. In contrast, little is known about how RNA folds and functions in vivo. Current in vivo methods probe the RNA structure ensemble. While providing a benchmark for new prediction parameters, ensemble methods cannot themselves generate thermodynamic parameters.

The current thermodynamic parameters for RNA structure prediction were established in 1 M NaCl. However, several-transcript-specific and genome-wide studies have shown that certain RNAs do not fold into the same structures in vivo and in vitro (Kwok et al. Reference Kwok, Ding, Tang, Assmann and Bevilacqua2013; Rouskin et al. Reference Rouskin, Zubradt, Washietl, Kellis and Weissman2014; Tyrrell et al. Reference Tyrrell, Mcginnis, Weeks and Pielak2013, Reference Tyrrell, Weeks and Pielak2015), so improved software and prediction parameters are needed to model in vivo structure. In particular, genome-wide in vivo structure probing datasets (Ding et al. Reference Ding, Tang, Kwok, Zhang, Bevilacqua and Assmann2014; Rouskin et al. Reference Rouskin, Zubradt, Washietl, Kellis and Weissman2014) contain a wealth of information that has not yet been completely realized or understood. One barrier to taking full advantage of the data is that most of the available in silico methods assume that RNAs fold to a single structure (Cordero et al. Reference Cordero, Kladwang, Vanlang and Das2012; Deigan et al. Reference Deigan, Li, Mathews and Weeks2009; Hajdin et al. Reference Hajdin, Bellaousov, Huggins, Leonard, Mathews and Weeks2013; Ouyang et al. Reference Ouyang, Snyder and Chang2013; Wu et al. Reference Wu, Shi, Ding, Liu, Hu, Yip, Yang, Mathews and Lu2015), while probing data averages across all structures populated by sequences for the duration of the experiment.

Modeling a single structure works well for ncRNA sequences that function with a single structure, such as ribosomal RNAs, but there are many RNAs for which this assumption is not correct, such as RNA switches and open reading frames. A key challenge is developing methods to use the probing data to model ensembles of relevant structures. Three recent papers highlight work to address this challenge. Cordero and Das report an in silico method (M²-REEFFIT) that models complex mixtures of multiple structures, aided by in vitro SHAPE mapping of the wild-type sequence and also a set of mutant sequences, which reveal nucleotide interactions (Cordero & Das, Reference Cordero and Das2015). Multiple structures for the 5′ UTR of an mRNA were modeled using in vitro SHAPE mapping of a mixture of structures (Kutchko et al. Reference Kutchko, Sanders, Ziehr, Phillips, Solem, Halvorsen, Weeks, Moorman and Laederach2015). The multiple conformations were modeled in silico using stochastic sampling, restrained using the standard SHAPE restraints expressed as free energy terms (Deigan et al. Reference Deigan, Li, Mathews and Weeks2009). A third in vitro approach separated multiple conformations of HIV RNA using native gel electrophoresis, and mapped the structures with SHAPE in the gel (Sherpa et al. Reference Sherpa, Rausch, Le Grice, Hammarskjold and Rekosh2015). This simplified the in silico analysis because the SHAPE mapping data were acquired for each conformation independently.

Another key challenge for mapping studies is determining the best way to discover or model interactions of RNAs with proteins or other RNAs. In vivo, all RNAs can interact with macromolecules and metabolites. These interactions generally result in protection from probing agents. Deconvoluting in silico whether a nucleotide is unreactive because of intramolecular structure or intermolecular interactions is a grand challenge that will likely require new types of experimental information to address. Modeling and predicting 3D RNA structures in vitro is an ongoing challenge. A recent RNA puzzle tested blind 3D folding predictions by providing research teams with RNA sequences and chemical probing data for those RNAs (Miao et al. Reference Miao, Adamiak, Blanchet, Boniecki, Bujnicki, Chen, Cheng, Chojnowski, Chou, Cordero, Cruz, Ferré-D'amaré, Das, Ding, Dokholyan, Dunin-Horkawicz, Kladwang, Krokhotin, Lach, Magnus, Major, Mann, Masquida, Matelska, Meyer, Peselis, Popenda, Purzycka, Serganov, Stasiewicz, Szachniuk, Tandon, Tian, Wang, Xiao, Xu, Zhang, Zhao, Zok and Westhof2015). The structures that the teams modeled were compared with crystal structures, and most teams could predict Watson–Crick base pairs, but struggled in predicting non-canonical WC base pairing and stacking interactions. A long range of goal is to predict relevant RNA 3D structures in vivo to understand the biologically relevant confirmation(s).

The study of RNA under in vivo-like conditions is relatively young. To better mimic the cellular environment, more complex cytoplasm mimics should be developed. To date, artificial cytoplasms have focused on synthetic polymers and cosolutes, but more accurate ionic conditions, biopolymers and even cell extracts need to be applied. In addition, studies under in vivo-like conditions have focused on single transcripts in synthetic crowding and cosolute conditions. Genome-wide comparisons of RNA folding under in vivo and in vivo-like conditions are needed. Lastly, methods that can probe the thermodynamics and kinetics of RNA folding under complex in vivo-like conditions will enhance our understanding of in vivo RNA folding. Overcoming the challenges outlined herein will allow the field to accomplish the ultimate goal, to understand how RNA folds in the cell.

Acknowledgements

The authors would like to thank the NIH for funding under R01-GM110237 and the NSF for funding under IOS-1339282.

References

Alberts, B., Bray, D., Lewis, J., Roberts, K. & Watson, J. D. (1994). Molecular Biology of the Cell, 3rd edn. Garland Publishing, New York and London.Google Scholar

Andronescu, M., Condon, A., Turner, D. H. & Mathews, D. H. (2014). The determination of RNA folding nearest neighbor parameters. Methods in Molecular Biology 1097, 45–70.CrossRef Google Scholar PubMed

Baird, N. J., Westhof, E., Qin, H., Pan, T. & Sosnick, T. R. (2005). Structure of a folding intermediate reveals the interplay between core and peripheral elements in RNA folding. Journal of Molecular Biology 352, 712–722.CrossRef Google Scholar PubMed

Banerjee, A. R., Jaeger, J. A. & Turner, D. H. (1993). Thermal unfolding of a group 1 ribozyme: the low-temperature transition is primarily disruption of the tertiary structure. Biochemistry 32, 153–163.Google Scholar

Banerjee, A. R. & Turner, D. H. (1995). The time dependence of chemical modification reveals slow steps in the folding of a Group I ribozyme. Biochemistry 34, 6504–6512.CrossRef Google Scholar PubMed

Bellaousov, S. & Mathews, D. H. (2010). ProbKnot: fast prediction of RNA secondary structure including pseudoknots. RNA 16, 1870–1880.Google Scholar

Bernhart, S. H., Hofacker, I. L., Will, S., Gruber, A. R. & Stadler, P. F. (2008). RNAalifold: improved consensus structure prediction for RNA alignments. BMC Bioinformatics 9, 474.CrossRef Google Scholar PubMed

Bloomfield, V. A., Crothers, D. M. & Tinoco, I. J. (2000). Nucleic Acids: Structures, Properties, and Functions. Sausalito, California: University Science Books.Google Scholar

Brion, P. & Westhof, E. (1997). Hierarchy and dynamics of RNA folding. Annual Review of Biophysics and Biomolecular Structure 26, 113–137.CrossRef Google Scholar PubMed

Buxbaum, A. R., Haimovich, G. & Singer, R. H. (2015). In the right place at the right time: visualizing and understanding mRNA localization. Nature Reviews. Molecular Cell Biology 16, 95–109.Google Scholar

Cao, Y. & Woodson, S. A. (1998). Destabilizing effect of an rRNA stem-loop on an attenuator hairpin in the 5′ exon of the Tetrahymena pre-rRNA. RNA 4, 901–914.Google Scholar

Chadalavada, D. M., Cerrone-Szakal, A. L. & Bevilacqua, P. C. (2007). Wild-type is the optimal sequence of the HDV ribozyme under cotranscriptional conditions. RNA 13, 2189–2201.Google Scholar

Chadalavada, D. M., Knudsen, S. M., Nakano, S.-I. & Bevilacqua, P. C. (2000). A role for upstream RNA structure in facilitating the catalytic fold of the genomic hepatitis delta virus ribozyme. Journal of Molecular Biology 301, 349–367.Google Scholar

Chadalavada, D. M., Senchak, S. E. & Bevilacqua, P. C. (2002). The folding pathway of the genomic hepatitis delta virus ribozyme is dominated by slow folding of the pseudoknots1. Journal of Molecular Biology 317, 559–575.CrossRef Google Scholar

Chauhan, S. & Woodson, S. A. (2008). Tertiary interactions determine the accuracy of RNA folding. Journal of the American Chemical Society 130, 1296–1303.CrossRef Google Scholar PubMed

Choi, W.-G., Swanson, S. J. & Gilroy, S. (2012). High-resolution imaging of Ca²⁺, redox status, ROS, and pH using GFP biosensors. The Plant Journal 70, 118–128.Google Scholar

Clatterbuck Soper, S. F., Dator, R. P., Limbach, P. A. & Woodson, S. A. (2013). In vivo X-ray footprinting of pre-30S ribosomes reveals chaperone-dependent remodeling of late assembly intermediates. Molecular Cell 52, 506–516.Google Scholar

Cordero, P. & Das, R. (2015). Rich RNA structure landscapes revealed by mutate-and-map analysis. PLoS Computational Biology 11, e1004473.Google Scholar

Cordero, P., Kladwang, W., Vanlang, C. C. & Das, R. (2012). Quantitative dimethyl sulfate mapping for automated RNA secondary structure inference. Biochemistry 51, 7037–7039.CrossRef Google Scholar PubMed

Crothers, D. M., Cole, P. E., Hilbers, C. W. & Shulman, R. G. (1974). The molecular mechanism of thermal unfolding of Escherichia coli formylmethionine transfer RNA. Journal of Molecular Biology 87, 63–88.Google Scholar

Dawson, W. K. & Bujnicki, J. M. (2016). Computational modeling of RNA 3D structures and interactions. Current Opinion in Structural Biology 37, 22–28.CrossRef Google Scholar PubMed

Deigan, K. E., Li, T. W., Mathews, D. H. & Weeks, K. M. (2009). Accurate SHAPE-directed RNA structure determination. Proceedings of the National Academy of Sciences of the United States of America 106, 97–102.Google Scholar

De Michele, R., Carimi, F. & Frommer, W. B. (2014). Mitochondrial biosensors. The International Journal of Biochemistry & Cell Biology 48, 39–44.Google Scholar

Denning, E. J., Thirumalai, D. & Mackerell, A. D. (2013). Protonation of trimethylamine N-oxide (TMAO) is required for stabilization of RNA tertiary structure. Biophysical Chemistry 184, 8–16.Google Scholar

Desai, R., Kilburn, D., Lee, H.-T. & Woodson, S. (2014). Increased ribozyme acitivty in crowded solutions. Journal of Biological Chemistry 289, 2972–2977.CrossRef Google Scholar

Diamond, J. M., Turner, D. H. & Mathews, D. H. (2001). Thermodynamics of three-way multibranch loops in RNA. Biochemistry 40, 6971–6981.Google Scholar

Ding, Y., Kwok, C. K., Tang, Y., Bevilacqua, P. C. & Assmann, S. M. (2015). Genome-wide profiling of in vivo RNA structure at single-nucleotide resolution using Structure-seq. Nature Protocols 10, 1050–1066.Google Scholar

Ding, Y. & Lawrence, C. E. (2003). A statistical sampling algorithm for RNA secondary structure prediction. Nucleic Acids Research 31, 7280–7301.Google Scholar

Ding, Y., Tang, Y., Kwok, C. K., Zhang, Y., Bevilacqua, P. C. & Assmann, S. M. (2014). In vivo genome-wide profiling of RNA secondary structure reveals novel regulatory features. Nature 505, 696–700.Google Scholar

Do, C. B., Foo, C. S. & Batzoglou, S. (2008). A max-margin model for efficient simultaneous alignment and folding of RNA sequences. Bioinformatics 24, i68–i76.Google Scholar

Doshi, K. J., Cannone, J. J., Cobaugh, C. W. & Gutell, R. R. (2004). Evaluation of the suitability of free-energy minimization using nearest-neighbor energy parameters for RNA secondary structure prediction. BMC Bioinformatics 5, 105.Google Scholar

Doudna, J. A. & Cech, T. R. (2002). The chemical repertoire of natural ribozymes. Nature 418, 222–228.Google Scholar

Dupuis, N. F., Holmstrom, E. D. & Nesbitt, D. J. (2014). Molecular-crowding effects on single-molecule RNA folding/unfolding thermodynamics and kinetics. Proceedings of the National Academy of Sciences of the United States of America 111, 8464–8469.Google Scholar

Dyer, R. B. & Brauns, E. B. (2009). Laser-induced temperature jump infrared measurements of RNA folding. Methods in Enzymology 469, 353–372.Google Scholar

Eddy, S. R. (2004). How do RNA folding algorithms work? Nature Biotechnology 22, 1457–1458.Google Scholar

Eddy, S. R. (2014). Computational analysis of conserved RNA secondary structure in transcriptomes and genomes. Annual Review of Biophysics 43, 433–456.Google Scholar

Ehresmann, C., Baudin, F., Mougel, M., Romby, P., Ebel, J. P. & Ehresmann, B. (1987). Probing the structure of RNAs in solution. Nucleic Acids Research 15, 9109–9128.Google Scholar

Fang, R., Moss, W. N., Rutenberg-Schoenberg, M. & Simon, M. D. (2015). Probing Xist RNA structure in cells using targeted structure-seq. PLoS Genetics 11, e1005668.Google Scholar

Fehr, M., Lalonde, S., Lager, I., Wolff, M. W. & Frommer, W. B. (2003). In vivo imaging of the dynamics for glucose uptake in the cytosol of COS-7 cells by fluorescent nanosensors. Journal of Biological Chemistry 278, 19127–19133.Google Scholar

Feig, A. L. & Uhlenbeck, O. C. (1999). The role of metal ions in RNA biochemistry. In The RNA World, 2nd edn (eds. Gesteland, R. F., Cech, T. R. & Atkins, J. F.), pp. 287–320. New York: Cold Spring Harbor Laboratory Press.Google Scholar

Fimognari, C. (2015). Role of oxidative RNA damage in chronic-degenerative diseases. Oxidative Medicine and Cellular Longevity 2015, 8.CrossRef Google Scholar PubMed

Frankel, E. A., Bevilacqua, P. C. & Keating, C. D. (2016). Polyamine/nucleotide coacervates provide strong compartmentalization of Mg²⁺, nucleotides, and RNA. Langmuir 32, 2041–2049.Google Scholar

Freier, S. M., Kierzek, R., Caruthers, M. H., Neilson, T. & Turner, D. H. (1986a). Free energy contributions of G.U. and other terminal mismatches to helix stability. Biochemistry 25, 3209–3223.Google Scholar

Freier, S. M., Kierzek, R., Jaeger, J. A., Sugimoto, N., Caruthers, M. H., Neilson, T. & Turner, D. H. (1986b). Improved free-energy parameters for predictions of RNA duplex stability. Proceedings of the National Academy of Sciences of the United States of America 83, 9373–9377.Google Scholar

Fu, Y., Sharma, G. & Mathews, D. H. (2014). Dynalign II: common secondary structure prediction for RNA homologs with domain insertions. Nucleic Acids Research 42, 13939–13948.CrossRef Google Scholar PubMed

Gao, M., Gnutt, D., Orban, A., Appel, B., Righetti, F., Winter, R., Narberhaus, F., Müller, S. & Ebbinghaus, S. (2016). RNA hairpin folding in the crowded cell. Angewandte Chemie International Edition 55, 3224–3228.CrossRef Google Scholar PubMed

Garst, A. D., Edwards, A. L. & Batey, R. T. (2011). Riboswitches: structures and mechanisms. Cold Spring Harbor Perspectives in Biology 3, a003533.Google Scholar

Gerstberger, S., Hafner, M. & Tuschl, T. (2014). A census of human RNA-binding proteins. Nature Reviews Genetics 15, 829–845.Google Scholar

Guerrier-Takada, C., Gardiner, K., Marsh, T., Pace, N. & Altman, S. (1983). The RNA moiety of ribonuclease P is the catalytic subunit of the enzyme. Cell 35, 849–857.Google Scholar

Hafner, M., Landthaler, M., Burger, L., Khorshid, M., Hausser, J., Berninger, P., Rothballer, A., Ascano, M. Jr., Jungkamp, A.-C., Munschauer, M., Ulrich, A., Wardle, G. S., Dewell, S., Zavolan, M. & Tuschl, T. (2010). Transcriptome-wide identification of RNA-binding protein and MicroRNA target sites by PAR-CLIP. Cell 141, 129–141.Google Scholar

Hajdin, C. E., Bellaousov, S., Huggins, W., Leonard, C. W., Mathews, D. H. & Weeks, K. M. (2013). Accurate SHAPE-directed RNA secondary structure modeling, including pseudoknots. Proceedings of the National Academy of Sciences of the United States of America 110, 5498–5503.Google Scholar

Hajiaghayi, M., Condon, A. & Hoos, H. H. (2012). Analysis of energy-based algorithms for RNA secondary structure prediction. BMC Bioinformatics 13, 22.CrossRef Google Scholar PubMed

Halvorsen, M., Martin, J. S., Broadaway, S. & Laederach, A. (2010). Disease-associated mutations that alter the RNA structural ensemble. PLoS Genetics 6, e1001074.Google Scholar

Harmanci, A. O., Sharma, G. & Mathews, D. H. (2008). PARTS: probabilistic alignment for RNA joinT secondary structure prediction. Nucleic Acids Research 36, 2406–2417.Google Scholar

Harmanci, A. O., Sharma, G. & Mathews, D. H. (2011). TurboFold: iterative probabilistic estimation of secondary structures for multiple RNA sequences. BMC Bioinformatics 12, 108.CrossRef Google Scholar PubMed

Herschlag, D. & Cech, T. R. (1990). Catalysis of RNA cleavage by the Tetrahymena thermophil ribozyme. 1. Kinetic description of the reaction of an RNA substrate complementary to the active site. Biochemistry 29, 10159–10171.CrossRef Google Scholar

Hilbers, C. W., Robillard, G. T., Shulman, R. G., Blake, R. D., Webb, P. K., Fresco, R. & Riesner, D. (1976). Thermal unfolding of yeast glycine transfer RNA. Biochemistry 15, 1874–1882.Google Scholar

Holmstrom, E. D., Dupuis, N. F. & Nesbitt, D. J. (2015). Kinetic and thermodynamic origins of osmolyte-influenced nucleic acid folding. Journal of Physical Chemistry B 119, 3687–3696.Google Scholar

Hoseini, S. S. & Sauer, M. G. (2015). Molecular cloning using polymerase chain reaction, an educational guide for cellular engineering. Journal of Biological Engineering 9, 1–13.CrossRef Google Scholar PubMed

Hu, B., Hu, L.-L., Chen, M.-L. & Wang, J.-H. (2013). A FRET ratiometric fluorescence sensing system for mercury detection and intracellular colorimetric imaging in live Hela cells. Biosensors and Bioelectronics 49, 499–505.Google Scholar

Hull, C. M., Anmangandla, A. & Bevilacqua, P. C. (2016). Bacterial riboswitches and ribozymes potently activate the human innate immune sensor PKR. ACS Chemical Biology 11, 1118–1127.Google Scholar

Hull, C. M. & Bevilacqua, P. C. (2015). Mechanistic analysis of activation of the innate immune sensor PKR by bacterial RNA. Journal of Molecular Biology 427, 3501–3515.Google Scholar

Hull, C. M. & Bevilacqua, P. C. (2016). Discriminating self and non-self by RNA: roles for RNA structure, misfolding, and modification in regulating the innate immune sensor PKR. Accounts of Chemical Research, doi: 10.1021/acs.accounts.6b00151.Google Scholar

Imamura, H., Huynh Nhat, K. P., Togawa, H., Saito, K., Iino, R., Kato-Yamada, Y., Nagai, T. & Noji, H. (2009). Visualization of ATP levels inside single living cells with fluorescence resonance energy transfer-based genetically encoded indicators. Proceedings of the National Academy of Sciences of the United States of America 106, 15651–15656.CrossRef Google Scholar PubMed

Incarnato, D., Neri, F., Anselmi, F. & Oliviero, S. (2014). Genome-wide profiling of mouse RNA secondary structures reveals key features of the mammalian transcriptome. Genome Biology 15, 1–13.Google Scholar

Jaeger, J. A., Zuker, M. & Turner, D. H. (1990). Melting and chemical modification of a cyclized self-splicing group I intron: similarity of structures in 1 M Na⁺, in 10 mM Mg²⁺, and in the presence of substrate. Biochemistry 29, 10147–10158.Google Scholar

Jaspers, P. & Kangasjärvi, J. (2010). Reactive oxygen species in abiotic stress signaling. Physiologia Plantarum 138, 405–413.Google Scholar

Jiang, T., Kennedy, S. D., Moss, W. N., Kierzek, E. & Turner, D. H. (2014). Secondary structure of a conserved domain in an intron of influenza A M1 mRNA. Biochemistry 53, 5236–5248.Google Scholar

Kertesz, M., Wan, Y., Mazor, E., Rinn, J. L., Nutter, R. C., Chang, H. Y. & Segal, E. (2010). Genome-wide measurement of RNA secondary structure in yeast. Nature 467, 103–107.Google Scholar

Kilburn, D., Roh, J. H., Behrouzi, R., Briber, R. M. & Woodson, S. A. (2013). Crowders perturb the entropy of RNA energy landscapes to favor folding. Journal of the American Chemical Society 135, 10055–10063.CrossRef Google Scholar PubMed

Kilburn, D., Roh, J. H., Guo, L., Briber, R. & Woodson, S. (2010). Molecular crowding stabilizes folded RNA structure by the excluded volume efect. Journal of the American Chemical Society 132, 8690–8696.Google Scholar

Kim, S. H., Quigley, G. J., Suddath, F. L., Mcpherson, A., Sneden, D., Kim, J. J., Weinzierl, J. & Rich, A. (1973). Three-dimensional structure of yeast phenylalanine transfer RNA: folding of the polynucleotide chain. Science 179, 285–288.Google Scholar

Klostermeier, D. & Millar, D. P. (2001). RNA conformation and folding studied with fluorescence resonance energy transfer. Methods 23, 240–254.Google Scholar

Koga, S., Williams, D. S., Perriman, A. W. & Mann, S. (2011). Peptide-nucleotide microdroplets as a step towards a membrane-free protocell model. Nature Chemistry 3, 720–724.Google Scholar

Kubodera, T., Watanabe, M., Yoshiuchi, K., Yamashita, N., Nishimura, A., Nakai, S., Gomi, K. & Hanamoto, H. (2003). Thiamine-regulated gene expression of Aspergillus oryzae thiA requires splicing of the intron containing a riboswitch-like domain in the 5′-UTR. FEBS Letters 555, 516–520.Google Scholar

Kutchko, K. M., Sanders, W., Ziehr, B., Phillips, G., Solem, A., Halvorsen, M., Weeks, K. M., Moorman, N. & Laederach, A. (2015). Multiple conformations are a conserved and regulatory feature of the RB1 5′ UTR. RNA 21, 1274–1285.Google Scholar

Kwok, C. K., Ding, Y., Tang, Y., Assmann, S. M. & Bevilacqua, P. C. (2013). Determination of in vivo RNA structure in low-abundance transcripts. Nature Communications 4, doi: 10.1038/ncomms3971.Google Scholar

Kwok, C. K., Tang, Y., Assmann, S. M. & Bevilacqua, J. M. (2015). The RNA structurome: transcriptome-wide structure probing with next-generation sequencing. Trends in Biochemical Sciences 40, 221–232.Google Scholar

Lager, I., Looger, L. L., Hilpert, M., Lalonde, S. & Frommer, W. B. (2006). Conversion of a putative agrobacterium sugar-binding protein into a FRET sensor with high selectivity for sucrose. Journal of Biological Chemistry 281, 30875–30883.Google Scholar

Lambert, D. & Draper, D. E. (2007). Effects of osmolytes on RNA secondary and tertiary structure stabilities and RNA-Mg²⁺ ion interactions. Journal of Molecular Biology 370, 993–1005.Google Scholar

Lambert, D. & Draper, D. E. (2012). Denaturation of RNA secondary and tertiary structure by urea: simple unfolded state models and free energy parameters account for measured m-values. Biochemistry 51, 9014–9026.Google Scholar

Lambert, D., Leipply, D. & Draper, D. E. (2010). The osmolyte TMAO stabilizes native RNA tertiary structures in the absence of Mg²⁺: evidence for a large barrier to folding form phosphate dehydration. Journal of Molecular Biology 404, 138–157.Google Scholar

Lavender, C. A., Gorelick, R. J. & Weeks, K. M. (2015a). Structure-based alignment and consensus secondary structures for three HIV-related RNA genomes. PLoS Computational Biology 11, e1004230.Google Scholar

Lavender, C. A., Lorenz, R., Zhang, G., Tamayo, R., Hofacker, I. L. & Weeks, K. M. (2015b). Model-free RNA sequence and structure alignment informed by SHAPE probing reveals a conserved alternate secondary structure for 16S rRNA. PLoS Computational Biology 11, e1004126.Google Scholar

Levitt, M. (1969). Detailed molecular model for transfer ribonucleic acid. Nature 224, 759–763.Google Scholar

Li, C., Wen, A., Shen, B., Lu, J., Huang, Y. & Chang, Y. (2011). FastCloning: a highly simplified, purification-free, sequence- and ligation-independent PCR cloning method. BMC Biotechnology 11, 1–10.Google Scholar

Li, F., Zheng, Q., Vandivier, L. E., Willmann, M. R., Chen, Y. & Gregory, B. D. (2012). Regulatory impact of RNA secondary structure across the Arabidopsis transcriptome. The Plant Cell 24, 4346–4359.Google Scholar

Licatalosi, D. D., Mele, A., Fak, J. J., Ule, J., Kayikci, M., Chi, S. W., Clark, T. A., Schweitzer, A. C., Blume, J. E., Wang, X., Darnell, J. C. & Darnell, R. B. (2008). HITS-CLIP yields genome-wide insights into brain alternative RNA processing. Nature 456, 464–469.Google Scholar

Lindenburg, L. H., Vinkenborg, J. L., Oortwijn, J., Aper, S. J. A. & Merkx, M. (2013). MagFRET: the first genetically encoded fluorescent Mg⁽²⁺⁾ sensor. PLoS ONE 8, e82009.Google Scholar

Liu, B., Diamond, J. M., Mathews, D. H. & Turner, D. H. (2011). Fluorescence competition and optical melting measurements of RNA three-way multibranch loops provide a revised model for thermodynamic parameters. Biochemistry 50, 640–653.Google Scholar

Liu, B., Mathews, D. H. & Turner, D. H. (2010a). RNA pseudoknots: folding and finding. F1000 Biology Reports 2, 8.Google Scholar

Liu, B., Shankar, N. & Turner, D. H. (2010b). Fluorescence competition assay measurements of free energy changes for RNA pseudoknots. Biochemistry 49, 623–634.Google Scholar

London, R. E. (1991). Methods for measurement of intracellular magnesium: NMR and fluorescence. Annual Reviews of Physiology 53, 241–258.Google Scholar

Lorenz, R., Wolfinger, M. T., Tanzer, A. & Hofacker, I. L. (2016). Predicting RNA secondary structures from sequence and probing data. Methods.Google Scholar

Lu, Z. J., Gloor, J. W. & Mathews, D. H. (2009). Improved RNA secondary structure prediction by maximizing expected pair accuracy. RNA 15, 1805–1813.Google Scholar

Lu, Z. J., Turner, D. H. & Mathews, D. H. (2006). A set of nearest neighbor parameters for predicting the enthalpy change of RNA secondary formation. Nucleic Acids Research 34, 4912–4924.Google Scholar

Lusk, J. E., Williams, R. J. & Kennedy, E. P. (1968). Magnesium and the growth of Escherichia coli . Journal of Biological Chemistry 243, 2618–2624.Google Scholar

Mahen, E. M., Harger, J. W., Calderon, E. M. & Fedor, M. J. (2005). Kinetics and thermodynamics make different contributions to RNA folding in vitro and in yeast. Molecular Cell 19, 27–37.Google Scholar

Mathews, D. H. (2004). Using an RNA secondary structure partition function to determine confidence in base pairs predicted by free energy minimization. RNA 10, 1178–1190.Google Scholar

Mathews, D. H. (2006). Revolutions in RNA secondary structure prediction. Journal of Molecular Biology 359, 526–532.Google Scholar

Mathews, D. H., Disney, M. D., Childs, J. L., Schroeder, S. J., Zuker, M. & Turner, D. H. (2004). Incorporating chemical modification constraints into a dynamic programming algorithm for prediction of RNA secondary structure. Proceedings of the National Academy of Sciences of the United States of America 101, 7287–7292.Google Scholar

Matteucci, M. D. & Caruthers, M. H. (1981). Synthesis of deoxyoligonucleotides on a polymer support. Journal of the American Chemical Society 103, 3185–3191.Google Scholar

Mccaskill, J. S. (1990). The equilibrium partition function and base pair probabilities for RNA secondary structure. Biopolymers 29, 1105–1119.Google Scholar

Merino, E. J., Wilkinson, K. A., Coughlan, J. L. & Weeks, K. M. (2005). RNA structure analysis at single nucleotide resolution by selective 2′-hydoxyl acylation and primer extension (SHAPE). Journal of the American Chemical Society 127, 4223–4231.Google Scholar

Miao, Z., Adamiak, R. W., Blanchet, M.-F., Boniecki, M., Bujnicki, J. M., Chen, S.-J., Cheng, C., Chojnowski, G., Chou, F.-C., Cordero, P., Cruz, J. A., Ferré-D'amaré, A. R., Das, R., Ding, F., Dokholyan, N. V., Dunin-Horkawicz, S., Kladwang, W., Krokhotin, A., Lach, G., Magnus, M., Major, F., Mann, T. H., Masquida, B., Matelska, D., Meyer, M., Peselis, A., Popenda, M., Purzycka, K. J., Serganov, A., Stasiewicz, J., Szachniuk, M., Tandon, A., Tian, S., Wang, J., Xiao, Y., Xu, X., Zhang, J., Zhao, P., Zok, T. & Westhof, E. (2015). RNA-puzzles round II: assessment of RNA structure prediction programs applied to three large RNA structures. RNA 21, 1066–1084.Google Scholar

Milligan, J. F., Groebe, D. R., Witherell, G. W. & Uhlenbeck, O. C. (1987). Oligoribonucleotide synthesis using T7 RNA polymerase and synthetic DNA templates. Nucleic Acids Research 15, 8783–8798.Google Scholar

Minton, A. P. (2001). The influence of macromolecular crowding and macromolecular confinement on biochemical media. Journal of Biological Chemistry 276, 10577–10589.Google Scholar

Mitchell, D. I., Jarmoskaite, I., Seval, N., Seifert, S. & Russell, R. (2013). The long-range P3 helix of the Tetrahymena ribozyme is disrupted during folding between the native and misfolded conformations. Journal of Molecular Biology 425, 2670–2686.Google Scholar

Mitchell, D. I. & Russell, R. (2014). Folding pathways of the Tetrahymena ribozyme. Journal of Molecular Biology 426, 2300–2312.Google Scholar

Moazed, D., Stern, S. & Noller, H. F. (1986a). Rapid chemical probing of conformation in 16S ribosomal RNA and 30S ribosomal subunits using primer extension. Journal of Molecular Biology 187, 399–416.Google Scholar

Moazed, D., Stern, S. & Noller, H. F. (1986b). Rapid chemical probing of conformation in 16S ribosomal RNA and 30S ribosomal subunits using primer extension. Journal of Molecular Biology 187, 399–416.Google Scholar

Moore, M. & Sharp, P. (1992). Site-specific modification of pre-mRNA: the 2′-hydroxyl groups at the splice sites. Science 256, 992–997.Google Scholar

Mullis, K. B. (1990). The unusual origin of the polymerase chain reaction. Scientific American 262, 56–61, 64–65.Google Scholar

Nadarajan, S. P., Ravikumar, Y., Deepankumar, K., Lee, C.-S. & Yun, H. (2014). Engineering lead-sensing GFP through rational designing. Chemical Communications 50, 15979–15982.Google Scholar

Nakano, S.-I., Karimata, H. T., Kitagawa, Y. & Sugimoto, N. (2009). Facilitation of RNA enzyme activity in the molecular crowding media of cosolutes. Journal of the American Chemical Society 131, 16881–16888.Google Scholar

Nakano, S.-I., Kitagawa, Y., Yamashita, H., Miyoshi, D. & Sugimoto, N. (2015). Effects of cosolvents on the folding and catalytic activities of the hammerhead ribozyme. ChemBioChem 16, 1803–1810.Google Scholar

Nakano, S.-I., Miyoshi, D. & Sugimoto, N. (2014). Effects of molecular crowding on the structures, interactions, and functions of nucleic acids. Chemical Reviews 114, 2733–2758.Google Scholar

Nallagatla, S. R., Hwang, J., Toroney, R., Zheng, X., Cameron, C. E. & Bevilacqua, P. C. (2007). 5′-triphosphate-dependent activation of PKR by RNAs with short stem-loops. Science 318, 1455–1458.Google Scholar

Nick, H. & Gilbert, W. (1985). Detection in vivo of protein–DNA interactions within the lac Operon of Escherichia coli . Nature 313, 795–798.Google Scholar

Nissen, P., Hansen, J., Ban, N., Moore, P. B. & Steitz, T. A. (2000). The structural basis of ribosome activity in peptide bond synthesis. Science 289, 920–930.Google Scholar

Novikova, I. V., Hennelly, S. P. & Sanbonmatsu, K. Y. (2012). Structural architecture of the human long non-coding RNA, steroid receptor RNA activator. Nucleic Acids Research 40, 5034–5051.Google Scholar

Osborne, R. J. & Thornton, C. A. (2006). RNA-dominant diseases. Human Molecular Genetics 15, R162–R169.Google Scholar

Ouyang, Z., Snyder, M. P. & Chang, H. Y. (2013). SeqFold: genome-scale reconstruction of RNA secondary structure integrating high-throughput sequencing data. Genome Research 23, 377–387.Google Scholar

Paige, J. S., Duc, T. N., Song, W. & Jaffrey, S. R. (2012). Fluorescence imaging of cellular metabolites with RNA. Science (New York, NY) 335, 1194–1194.Google Scholar

Paige, J. S., Wu, K. & Jaffrey, S. R. (2011). RNA mimics of green fluorescent protein. Science (New York, NY) 333, 642–646.Google Scholar

Paudel, B. P. & Rueda, D. (2014). Molecular crowding accelerates ribozymes docking and catalysis. Journal of the American Chemical Society 136, 16700–16703.Google Scholar

Pollack, L. (2011). Time resolved SAXS and RNA folding. Biopolymers 95, 543–549.Google Scholar

Pouvreau, S. (2014). Genetically encoded reactive oxygen species (ROS) and redox indicators. Biotechnology Journal 9, 282–293.Google Scholar

Rangan, P., Masuida, B., Westhof, E. & Woodson, S. A. (2003). Assembly of core helices and rapid tertiary folding of a small bacterial group I ribozyme. Proceedings of the National Academy of Sciences of the United States of America 100, 1574–1579.CrossRef Google Scholar PubMed

Record, M. T. J., Courtenay, E. S., Cayley, S. D. & Guttman, H. J. (1998). Responses of E. coli to osmotic stress: large changes in amounts of cytoplasmic solutes and water. Trends in Biochemical Sciences 23, 143–148.Google Scholar

Reeder, J. & Giegerich, R. (2005). Consensus shapes: an alternative to the Sankoff algorithm for RNA consensus structure prediction. Bioinformatics 21, 3516–3523.Google Scholar

Reeder, J., Hochsmann, M., Rehmsmeier, M., Voss, B. & Giegerich, R. (2006). Beyond Mfold: recent advances in RNA bioinformatics. Journal of Biotechnology 124, 41–55.Google Scholar

Reyes, F. E., Garst, A. D. & Batey, R. T. (2009). Chapter 6 – strategies in RNA crystallography. Methods in Enzymology 469, 119–139.Google Scholar

Richards, E. G., Flessel, C. P. & Fresco, J. R. (1963). Polynucleotides. IV. Molecular properties and conformations of polyribonucleic acids. Biopolymers 1, 431–446.Google Scholar

Robertus, J. D., Ladner, J. E., Finch, J. T., Rhodes, D., Brown, R. S., Clark, B. F. C. & Klug, A. (1974). Structure of yeast phenylalanine tRNA at 3 Å resolution. Nature 250, 546–551.Google Scholar

Roh, J. H., Guo, L., Kilburn, D., Briber, R., Irving, T. & Woodson, S. (2010). Multistage collapse of a bacterial ribozyme observed by time-resolved small-angle X-ray scattering. Journal of the American Chemical Society 132, 10148–10154.Google Scholar

Romani, A. M. (2007). Magnesium homeostasis in mammalian cells. Frontiers in Bioscience 12, 308–331.Google Scholar

Rook, M. S., Treiber, D. K. & Williamson, J. R. (1998). Fast folding mutants of the Tetrahymena group I ribozyme reveal a rugged folding energy landscape. Journal of Molecular Biology 281, 609–620.Google Scholar

Roth, A., Weinberg, Z., Chen, A. G. Y., Kim, P. B., Ames, T. D. & Breaker, R. R. (2014). A widespread self-cleaving ribozymes class is revealed by bioinformatics. Nature Chemical Biology 10, 56–60.Google Scholar

Rouskin, S., Zubradt, M., Washietl, S., Kellis, M. & Weissman, J. S. (2014). Genome-wide probing of RNA structure reveals active unfolding of mRNA structures in vivo . Nature 505, 701–705.Google Scholar

Roy, R., Hohng, S. & Ha, T. (2008). A practical guide to single-molecule FRET. Nature Methods 5, 507–516.Google Scholar

Salehi-Ashtiani, K., Luptak, A., Litovchick, A. & Szostak, J. W. (2006). A genomewide search for ribozymes reveals an HDV-like sequence in the Human CPEB3 gene. Science 313, 1788–1792.Google Scholar

Santangelo, P., Nitin, N. & Bao, G. (2006). Nanostructured probes for RNA detection in living cells. Annals of Biomedical Engineering 34, 39–50.Google Scholar

Scalvi, B., Woodson, S., Sullivan, M., Chance, M. R. & Brenowitz, M. (1997). Time-resolved synchrotron X-ray “footprinting”, a new approach to the study of nucleic acid structure and function: application to protein–DNA interactions and RNA folding. Journal of Molecular Biology 266, 144–159.Google Scholar

Scaringe, S., Wincott, F. E. & Caruthers, M. H. (1998). Novel RNA synthesis method using 5′-O-silyl-2′-orthoester protecting groups. Journal of the American Chemical Society 120, 11820–11821.CrossRef Google Scholar

Schroeder, S. J. & Turner, D. H. (2000). Factors affecting the thermodynamic stability of small asymmetric internal loops in RNA. Biochemistry 39, 9257–9274.Google Scholar

Schroeder, S. J. & Turner, D. H. (2009). Optical melting measurements of nucleic acid thermodynamics. Methods in Enzymology 468, 371–387.Google Scholar

Sclavi, B., Sullivan, M., Change, M. R., Brenowitz, M. & Woodson, S. (1998). RNA folding at millisecond intervals by synchrotron hydroxyl radical footprinting. Science 279, 1940–1943.Google Scholar

Seetin, M. G. & Mathews, D. H. (2012a). RNA structure prediction: an overview of methods. Methods in Molecular Biology 905, 99–122.Google Scholar

Seetin, M. G. & Mathews, D. H. (2012b). TurboKnot: rapid prediction of conserved RNA secondary structures including pseudoknots. Bioinformatics 28, 792–798.Google Scholar

Serganov, A. & Nudler, E. (2013). A decade of riboswitches. Cell 152, 17–24.Google Scholar

Serganov, A. & Patel, D. (2007). Ribozymes, riboswitches and beyond: regulation of gene expression without proteins. Nature 8, 776–790.Google Scholar

Serra, M. J., Baird, J. D., Dale, T., Fey, B. L., Retatagos, K. & Westhof, E. (2002). Effects of magnesium ions on the stabilization of RNA oligomers of defined structures. RNA 8, 307–323.Google Scholar

Sherpa, C., Rausch, J. W., Le Grice, S. F. J., Hammarskjold, M.-L. & Rekosh, D. (2015). The HIV-1 Rev response element (RRE) adopts alternative conformations that promote different rates of virus replication. Nucleic Acids Research 43, 4676–4686.Google Scholar

Sierzchala, A., Dellinger, D. J., Betley, J. R., Wyrzykiewicz, T. K., Yamada, C. M. & Caruthers, M. H. (2003). Solid-phase oligodeoxynucleotide synthesis: a two-step cycle using peroxy anion deprotection. Journal of the American Chemical Society 125, 13427–13441.Google Scholar

Sloma, M. F. & Mathews, D. H. (2015). Improving RNA secondary structure prediction with structure mapping data. Methods in Enzymology 553, 91–114.Google Scholar

Solomatin, S. V., Greenfeld, M., Chu, S. & Herschlag, D. (2010). Multiple native states reveal persistent ruggedness of an RNA folding landscape. Nature 463, 681–684.Google Scholar

Somarowthu, S., Legiewicz, M., Chillón, I., Marcia, M., Liu, F. & Pyle, A. M. (2015). HOTAIR forms an intricate and modular secondary structure. Molecular Cell 58, 353–361.Google Scholar

Soto, A. M., Misra, V. & Draper, D. E. (2007). Tertiary structure of an RNA pseudoknot is stabilized by “diffuse” Mg²⁺ ions. Biochemistry 46, 2973–2983.Google Scholar

Spitale, R. C., Flynn, R. A., Zhang, Q. C., Crisalli, P., Lee, B., Jung, J.-W., Kuchelmeister, H. Y., Batista, P. J., Torre, E. A., Kool, E. T. & Chang, H. Y. (2015). Structural imprints in vivo decode RNA regulatory mechanisms. Nature 519, 486–490.Google Scholar

Stael, S., Wurzinger, B., Mair, A., Mehlmer, N., Vothknecht, U. C. & Teige, M. (2011). Plant organellar calcium signalling: an emerging field. Journal of Experimental Botany 63, 1525–1542.Google Scholar

Stein, A. & Crothers, D. M. (1976). Conformational changes of transfer RNA. The role of magnesium(II). Biochemistry 15, 160–168.Google Scholar

Strulson, C. A., Boyer, J. A., Whitman, E. E. & Bevilacqua, P. C. (2014). Molecular crowders and cosolutes promote folding cooperativity of RNA under physiological ionic conditions. RNA 20, 331–347.Google Scholar

Strulson, C. A., Molden, R. C., Keating, C. D. & Bevilacqua, P. C. (2012). RNA catalysis through compartmentalization. Nature Chemistry 4, 941–946.Google Scholar

Strulson, C. A., Yennawar, N. H., Rambo, R. P. & Bevilacqua, P. C. (2013). Molecular crowding favors reactivity of a human ribozyme under physiological ionic conditions. Biochemistry 52, 8187–8197.Google Scholar

Sükösd, Z., Andersen, E. S., Seemann, S. E., Jensen, M. K., Hansen, M., Gorodkin, J. & Kjems, J. (2015). Full-length RNA structure prediction of the HIV-1 genome reveals a conserved core domain. Nucleic Acids Research 43, 10168–10179.Google Scholar PubMed

Sükösd, Z., Knudsen, B., Kjems, J. & Pedersen, C. N. S. (2012). PPfold 3·0: fast RNA secondary structure prediction using phylogeny and auxiliary data. Bioinformatics 28, 2691–2692.Google Scholar

Sussman, J. L., Holbrook, S. R., Warrant, W., Church, G. M. & Kim, S. H. (1978). Crystal structure of yeast phenylalanine transfer RNA. 1. Crystallographic refinement. Journal of Molecular Biology 123, 607–630.Google Scholar

Suurkuusk, J., Alvarez, J., Freire, E. & Biltonen, R. (1977). Calorimetric determination of the heat capacity changes associated with the conformational transitions of polyriboadenylic acid and polyribouridylic acid. Biopolymers 16, 2641–2652.Google Scholar

Swanson, S. J., Choi, W.-G., Chanoca, A. & Gilroy, S. (2011). In vivo imaging of Ca²⁺, pH, and reactive oxygen species using fluorescent probes in plants. Annual Review of Plant Biology 62, 273–297.Google Scholar

Swisher, J. F., Su, L. J., Brenowitz, M., Anderson, V. E. & Pyle, A. M. (2002). Productive folding to the native state by a Group II intron ribozyme. Journal of Molecular Biology 315, 297–310.Google Scholar

Talkish, J., May, G., Lin, Y., Woolford, J. L. J. & Mcmanus, C. J. (2014). Mod-seq: high-throughput sequencing for chemical probind of RNA structure. RNA 20, 713–720.Google Scholar

Tang, S., Reddish, F., Zhuo, Y. & Yang, J. J. (2015a). Fast kinetics of calcium signaling and sensor design. Current Opinion in Chemical Biology 27, 90–97.Google Scholar

Tang, Y., Bouvier, E., Kwok, C. K., Ding, Y., Nekrutenko, A., Bevilacqua, P. C. & Assmann, S. M. (2015b). StructureFold: genome-wide RNA secondary structure mapping and reconstruction in vivo . Bioinformatics 31, 2668–2675.Google Scholar

Tanner, M. A. & Cech, T. R. (1996). Activity and thermostability of the small self-splicing group I intron in the pre-tRNAIle of the purple bacterium Azoarcus . RNA 2, 74–83.Google Scholar

Tantama, M., Hung, Y. P. & Yellen, G. (2011). Imaging intracellular pH in live cells with a genetically encoded red fluorescent protein sensor. Journal of the American Chemical Society 133, 10034–10037.Google Scholar

Tinoco, I. J. & Bustamante, C. (1999). How RNA folds. Journal of Molecular Biology 293, 271–281.CrossRef Google Scholar PubMed

Torarinsson, E., Havgaard, J. H. & Gorodkin, J. (2007). Multiple structural alignment and clustering of RNA sequences. Bioinformatics 23, 926–932.Google Scholar

Treiber, D. K., Rook, M. S., Zarrinkar, P. P. & Williamson, J. R. (1998). Kinetic intermediates trapped by native interactions in RNA folding. Science 279, 1943–1946.CrossRef Google Scholar PubMed

Truong, D. M., Sidote, D. J., Russell, R. & Lambowitz, A. M. (2013). Enhanced group II intron retrohoming in magnesium-deficient Escherichia coli via selection of mutations in the ribozyme core. Proceedings of the National Academy of Sciences of the United States of America 110, E3800–E3809.Google Scholar

Tsien, R. Y. (2010). The 2009 Lindau Nobel Laureate Meeting: Roger Y. Tsien, Chemistry 2008. Journal of Visualized Experiments 13, 1575.Google Scholar

Turner, D. H. & Mathews, D. H. (2010). NNDB: the nearest neighbor parameter database for predicting stability of nucleic acidsecondary structure. Nucleic Acids Research 38, D280–D282.Google Scholar

Tyrrell, J., Mcginnis, J. L., Weeks, K. M. & Pielak, G. J. (2013). The cellular environment stabilized adenine riboswitch RNA structure. Biochemistry 52, 8777–8785.Google Scholar

Tyrrell, J., Weeks, K. M. & Pielak, G. J. (2015). Challenge of mimicking the influence of the cellular environment on RNA structure by PEG-induced macromolecular crowding. Biochemistry 54, 6447–6453.Google Scholar

Underwood, J. G., Uzilov, A. V., Katzman, S., Onodera, C. S., Mainzer, J. E., Mathews, D. H., Lowe, T. M., Salama, S. R. & Haussler, D. (2010). FragSeq: transcriptome-wide RNA structure probing using high-throughput sequencing. Nature Methods 7, 995–1001.Google Scholar

Walter, N. G. (2001). Structural dynamics of catalytic RNA highlighted by fluorescence resonance energy transfer. Methods 25, 19–30.Google Scholar

Wan, Y., Kertesz, M., Spitale, R. C., Segal, E. & Chang, H. Y. (2011). Understanding the transcriptome through RNA structure. Nature Reviews Genetics 12, 641–655.Google Scholar

Wan, Y., Qu, K., Ouyang, Z., Kertesz, M., Li, J., Tibshirani, R., Makino, D. L., Nutter, R. C., Segal, E. & Chang, H. Y. (2012). Genome-wide measurement of RNA folding energies. Molecular Cell 48, 169–181.Google Scholar

Wan, Y., Qu, K., Zhang, Q. C., Flynn, R. A., Manor, O., Ouyang, Z., Zhang, J., Spitale, R. C., Snyder, M. P., Segal, E. & Chang, H. Y. (2014). Landscape and variation of RNA secondary structure across the human transcriptome. Nature 505, 706–709.CrossRef Google Scholar PubMed

Wan, Y., Suh, H., Russell, R. & Herschlag, D. (2010). Multiple unfolding events during native folding of the Tetrahymena group I ribozyme. Journal of Molecular Biology 400, 1067–1077.Google Scholar

Washietl, S., Hofacker, I. L., Stadler, P. F. & Kellis, M. (2012). RNA folding with soft constraints: reconciliation of probing data and thermodynamic secondary structure prediction. Nucleic Acids Research 40, 4261–4272.Google Scholar

Weeks, K. M. (2010). Advances in RNA secondary and tertiary structure analysis by chemical probing. Current Opinion in Structural Biology 20, 295–304.Google Scholar

Weyn-Vanhentenryck, S. M., Mele, A., Yan, Q., Sun, S., Farny, N., Zhang, Z., Xue, C., Herre, M., Silver, P. A., Zhang, M. Q., Krainer, A. R., Darnell, R. B. & Zhang, C. (2014). HITS-CLIP and integrative modeling define the Rbfox splicing-regulatory network linked to brain development and autism. Cell Reports 6, 1139–1152.CrossRef Google Scholar PubMed

Wickiser, J. K., Cheah, M. T., Breaker, R. R. & Crothers, D. M. (2005a). The kinetics of ligand binding by an adenine-sensing riboswitch. Biochemistry 44, 13404–13414.Google Scholar

Wickiser, J. K., Winkler, W. C., Breaker, R. R. & Crothers, D. M. (2005b) The speed of RNA transcription and metabolite binding kinetics operate an FMN riboswitch. Molecular Cell 18, 49–60.Google Scholar

Will, S., Reiche, K., Hofacker, I. L., Stadler, P. F. & Backofen, R. (2007). Inferring noncoding RNA families and classes by means of genome-scale structure-based clustering. PLoS Computational Biology 3, e65.Google Scholar

Wu, Y., Shi, B., Ding, X., Liu, T., Hu, X., Yip, K. Y., Yang, Z. R., Mathews, D. H. & Lu, Z. J. (2015). Improved prediction of RNA secondary structure by integrating the free energy model with restraints derived from experimental probing data. Nucleic Acids Research 43, 7247–7259.Google Scholar

Wuchty, S., Fontana, W., Hofacker, I. L. & Schuster, P. (1999). Complete suboptimal folding of RNA and the stability of secondary structures. Biopolymers 49, 145–165.Google Scholar

Xia, T., Santalucia, J., Burkard, M. E., Kierzek, R., Schroeder, S. J., Jiao, X., Cox, C. & Turner, D. H. (1998). Thermodynamic parameters for an expanded nearest-neighbor model for formation of RNA duplexes with Watson–Crick base pairs. Biochemistry 37, 14719–14735.Google Scholar

Xu, Z. & Mathews, D. H. (2011). Multilign: an algorithm to predict secondary structures conserved in multiple RNA sequences. Bioinformatics 27, 626–632.Google Scholar

Yancey, P. H., Clark, M. E., Hand, S. C., Bowlus, R. D. & Somero, G. N. (1982). Living with water stress: evolution of osmolyte systems. Science 217, 1214–1222.Google Scholar

Yang, S. (2014). Methods for SAXS-based structure determination of biomolecular complexes. Advanced Materials 26, 7902–7910.Google Scholar

Yang, Z., Cao, J., He, Y., Yang, J. H., Kim, T., Peng, X. & Kim, J. S. (2014). Macro-/micro-environment-sensitive chemosensing and biological imaging. Chemical Society Reviews 43, 4563–4601.Google Scholar

You, M. & Jaffrey, S. R. (2015). Structure and mechanism of RNA mimics of green fluorescent protein. Annual Review of Biophysics 44, 187–206.Google Scholar

You, M., Litke, J. L. & Jaffrey, S. R. (2015). Imaging metabolite dynamics in living cells using a Spinach-based riboswitch. Proceedings of the National Academy of Sciences of the United States of America 112, E2756–E2765.Google Scholar

Zarringhalam, K., Meyer, M. M., Dotu, I., Chuang, J. H. & Clote, P. (2012). Integrating chemical footprinting data into RNA secondary structure prediction. PLoS ONE 7, e45160.Google Scholar

Zarrinkar, P. P., Wang, J. & Williamson, J. R. (1996). Slow folding kinetics of RNase P RNA. RNA 2, 564–573.Google Scholar

Zaug, A. & Cech, T. R. (1995). Analysis of the structure of Tetrahymena nuclear RNAs in vivo: telomerase RNA, the self-splicing rRNA intron, and U2 snRNA. RNA 1, 363–374.Google Scholar

Zheng, Q., Ryvkin, P., Li, F., Dragomir, I., Valladares, O., Yang, J., Cao, K., Wang, L.-S. & Gregory, B. D. (2010). Genome-wide double-stranded RNA sequencing reveals the functional significance of base-paired RNAs in Arabidopsis . PLoS Genetics 6, e1001141.Google Scholar

Zhuang, X., Bartley, L. E., Babcock, H. P., Russell, R., Ha, T., Herschlag, D. & Chu, S. (2000). A single-molecule study of RNA catalysis and folding. Science 288, 2048–2051.Google Scholar

Zimmerman, S. B. & Trach, S. O. (1991). Estimation of macromolecule concentrations and excluded volume effects for the cytoplasm of Escherichia coli. Journal of Molecular Biology 222, 599–620.Google Scholar

Zuker, M. (1989). On finding all suboptimal foldings of an RNA molecule. Science 244, 48–52.Google Scholar

Fig. 1. The Classical View (top) and the Modern View (bottom) of RNA's role in biology. In the classical view of biology, RNA (top) serves as a messenger molecule between DNA and proteins and proteins have all the main functions in cells. Messenger RNA serves to translate information from DNA to proteins. The modern view of biology (bottom) has emerged in the last 25 years as the field learns more about the many functions of RNA. Non-coding RNA (ncRNA) has vast regulatory functions, some of which include immune responses (‘ppp’ = 5′-triphosphate, which activates PKR) (Nallagatla et al.2007), thermosensors, ribozymes, riboswitches, and genome editing. In the modern view of biology, proteins still have most cellular functions, but RNA plays essential roles in the cell beyond its classical functions.

Fig. 2. RNA interactions with RNA-binding proteins (RBP, left), metal ions (central), and ligands (star, right) can result in structure changes. Unlike typical in vitro conditions, there are other molecules and complex solution conditions in vivo that can interact with RNA and change its structure. These structure changes can result in an RNA with less structure (top left) more structure (top right), or an alternate conformation than the structure that is prevalent in vitro (bottom). Also shown (bottom) are the bacterial expression platforms of riboswitches that switch between two mutually exclusive structures that turn a gene ON (left) or OFF (right) by exposing or sequestering the Shine–Dalgarno sequence (blue).

Fig. 3. Artist's rendition of in vitro conditions (left), in vivo conditions (right) and in vivo-like conditions (center). Typical in vitro solutions are dilute with high monovalent ion concentrations that are very different from cellular conditions. The cellular environment is complex with monovalent and divalent salts, macromolecules, cosolutes, and organelles. In vivo-like conditions (center) bridge in vitro and in vivo conditions and are more complex than in vitro conditions with added synthetic crowding agents and proteins and physiological ion concentrations. However, in vivo-like conditions are still much less complex than those prevailing in vivo.

Table 1. Comparison of in vitro and in vivo solution conditions

Table 2. Common experimental techniques used to study RNA structure and folding

Fig. 4. Depiction of the hierarchical RNA folding pathway and folding funnels for non-cooperative and cooperative folding. (a) RNA folds in a hierarchical manner in which secondary structures form followed by tertiary structure. Hierarchical folding can be (b) rugged and non-cooperative in which the pathway intermediates are populated and the RNA can form misfolds (Mi) before populating the native state (N), or folding can occur in a (c) cooperative manner in which the intermediates do not populate and the RNA folds in a single transition.

Fig. 5. Different RNA structures can be populated under in vitro, in vivo, and in vivo-like conditions. RNA structures induced by the cellular environment, including proteins and crowding, are shown in the two outermost structures. The conditions in vitro favor the population of a structure that may not always be the functional RNA structure (center two structures). Depending on the in vivo-like conditions chosen, specific RNA structures will be populated.

Fig. 7. Under both in vitro (right, grey) and in vivo-like conditions with molecular crowding (left, pink) RNA fold into their native state that is functional, indicated in this figure by catalysis. High concentrations of Mg2+ (10 mM or higher) are needed to achieve the folded state in vitro compared with in vivo-like crowded conditions where low physiological Mg2+ (0·5 mM) folds the RNA. Reprinted with permission from Strulson et al. (2013). Copyright 2016 American Chemical Society.

Article contents

Bridging the gap between in vitro and in vivo RNA folding

Abstract

1. Introduction

2. Setting the stage

2.1 In vitro studies of RNA folding

2.1.1 Major advances: elucidating RNA folding pathways in vitro

2.1.2 Major advances: applying biophysical techniques to study RNA folding in vitro

2.1.3 Benefits and limitations of in vitro studies

2.2 In vivo studies of RNA folding

2.2.1 Major advances: transcript-specific RNA structure mapping in vivo

2.2.2 Major advances: genome-wide RNA structure mapping in vivo

2.2.3 Major advances: quantification of cellular factors in vivo

2.2.4 Benefits and limitations of in vivo studies

2.3 In silico studies of RNA folding

2.3.1 Major advances: RNA structure prediction from one sequence in silico

2.3.2 Major advances: RNA structure prediction from multiple sequences in silico

2.3.3 Major advances: RNA structure prediction in silico restrained with experimental data

2.3.4 Challenges with in silico modeling of RNA secondary structure

3. Bridging the gap between in vitro and in vivo RNA folding using in vivo-like studies

3.1 The gap

3.2 Design of artificial cytoplasms and early experiments

3.2.1 Polymers

3.2.2 Cosolutes

3.2.3 Protocells and synthetic membranes

4. Future directions

Acknowledgements

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests