Introduction
Methods to study single DNA molecules have revolutionized science across disciplines, ranging from fundamental biophysics to clinical diagnostics. The possibility to interrogate single DNA molecules enables information that is intrinsically hidden in bulk experiments to be obtained in a straight-forward fashion. Focusing on DNA molecules with a length in the kilobase pair (kbp) regime, traditional methods include for example optical (Chaurasiya et al., Reference Chaurasiya, Paramanathan, McCauley and Williams2010; Heller et al., Reference Heller, Hoekstra, King, Peterman and Wuite2014) and magnetic (De Vlaminck and Dekker, Reference De Vlaminck and Dekker2012) tweezers, where a single DNA molecule is anchored to two beads or a bead and a surface and then stretched and analyzed, and DNA curtains, where many DNA molecules are anchored to a lipid bilayer and stretched using liquid flow (Collins et al., Reference Collins, Ye, Duzdevich and Greene2014). For genomic analysis, DNA can be efficiently stretched on glass surfaces using molecular combing (Bensimon et al., Reference Bensimon, Simon, Chiffaudel, Croquette, Heslot and Bensimon1994; Neely et al., Reference Neely, Deen and Hofkens2011), allowing high-throughput analysis.
The computer industry has pushed a revolution to miniaturize components, such as transistors, resulting in the development of methods for efficient nanofabrication on length scales of tens of nanometers. The same methods have later been applied to a plethora of other disciplines where miniaturization is desirable, not the least in microfluidic and nanofluidic applications in the life sciences. Nanofluidic devices have many appealing features that make them useful for analyzing single long DNA molecules. Channels of nanoscale dimensions allow stretching of long DNA molecules to an extension close to their full contour length. Importantly, this can be done without anchoring any of the DNA ends, which for example allows analysis of DNA extracted from cells, circular DNA, and interactions occurring at DNA ends.
In this review we will first discuss the basic physics of nanoconfined DNA in detail. We will consider the early theories by de Gennes and Odijk, describing confined polymers, and how they have been gradually improved in order to fully cover the confinement regimes of importance in experiments. In the next section we will discuss how the nanofluidic devices can be fabricated and tailored for the project of interest, for example to obtain maximum stretching of the DNA, vary the degree of stretching, or monitor the response of the DNA upon addition of a solute in real time. Finally, we turn to the two main application areas of studying single DNA molecules in nanofluidic devices: optical DNA mapping (ODM) and DNA–protein interaction studies. For ODM we mainly focus on the technical aspects of the method. Recent years have presented a large increase in literature using ODM for genetic analysis, but these will only be briefly mentioned. For DNA–protein interactions, we will present several different examples that highlight the usefulness of nanofluidic channels for such studies and at the same time present the biological insight they have provided. Figure 1 schematically illustrates the topics covered by this review.
Theory
Length scales characterizing DNA in free solution
The conformations of DNA are characterized by a cascade of length scales: the contour length L, representing the total length of the DNA backbone, the persistence length lp, representing the characteristic length scale for bending that backbone under thermal energy, and the effective width w, representing the length scale for excluded volume interactions between distal segments of the polymer. These fundamental length scales can be combined to produce a number of useful derived quantities, in particular the radius of gyration Rg, which provides a measure of the total size of the polymer, the thermal blob size (correlation length) lb, which separates the highly correlated, short-range interactions between local segments of the polymer from the decorrelated, longer-range interactions, and the excluded volume vex. All of these length scales play a role in understanding the thermodynamics of DNA in confinement.
Fundamental length scales
The smallest of the length scales characterizing the conformations of DNA is the width of the DNA backbone. While the steric dimension of the backbone is approximately 2 nm, electrostatic repulsion arising from the charged phosphate groups makes the DNA backbone appear thicker than would be expected, were the DNA a neutral polymer. The standard approach to treat the electrostatics is to consider an effective width w (Stigter, Reference Stigter1975, Reference Stigter1977), which is obtained from a mapping of the osmotic pressure produced by a suspension of charged cylinders of diameter d to that produced by neutral cylinders of ‘effective width’ w. For the high ionic strength buffers sometimes used in experiments of confined DNA, for example 100 mM of monovalent salt, Stigter's theory (Stigter, Reference Stigter1977) produces an effective width of approximately 5 nm. At lower ionic strength, the effective width is available from Stigter's theory and numerical results have been reported, for example, by Hsieh et al. (Reference Hsieh, Balducci and Doyle2008) or Tree et al. (Reference Tree, Muralidhar, Doyle and Dorfman2013a). When making this mapping, the details of the electrostatic interactions are replaced by an equivalent neutral polymer model.
At the next level up is the DNA persistence length lp, which reflects the decay of the correlations between the tangent vectors along the DNA backbone. A qualitative (but useful) way to think about the persistence length is that it characterizes the ability of the DNA to be bent by thermal energy. For high ionic strength buffers, the persistence length of DNA is approximately 50 nm, a frequently cited result obtained by Bustamante et al. using magnetic tweezers (Bustamante et al., Reference Bustamante, Marko, Siggia and Smith1994). The persistence length is also affected by ionic strength, since the ions in solution screen the charges along the DNA backbone and thus affect its propensity to bend under thermal energy (Odijk, Reference Odijk1977; Skolnick and Fixman, Reference Skolnick and Fixman1977). Similar to the effective width, it is convenient to subsume the details of the electrostatic interactions into a persistence length that captures both the intrinsic contributions, arising from the steric interactions, and an electrostatic contribution. After the mapping is complete, we now have an equivalent neutral polymer model for the DNA that is thicker (due to the effective width) and stiffer (due to the electrostatic contribution to the persistence length) than would be expected based solely on the steric volume of its constituent atoms.
To account for electrostatic contributions to the persistence length, Dobrynin (Reference Dobrynin2005) proposed a semi-empirical formula to correct for the ionic strength dependence, correcting an approximation in the classic Odijk–Skolnick–Fixman (OSF) theory by Odijk (Reference Odijk1977) and Skolnick and Fixman (Reference Skolnick and Fixman1977). Hsieh et al. (Reference Hsieh, Balducci and Doyle2008) showed that the Dobrynin model provides a superior description of DNA confined to a nanoslit when compared to the OSF theory. Subsequently, Chuang et al. (Reference Chuang, Reifenberger, Cao and Dorfman2017) extended Dobrynin's result to further incorporate sequence effects. Their analysis of the stretching of human DNA in 45 nm nanochannels, when combined with Odijk's theory for strong confinement (Odijk, Reference Odijk1983; Burkhardt et al., Reference Burkhardt, Yang and Gompper2010) and Dobrynin's theory (Dobrynin, Reference Dobrynin2005), yielded the useful formula,
where γ is the fraction of G-C base pairs. In the second term, which is the electrostatic contribution attributed to Dobrynin (Reference Dobrynin2005), I is the ionic strength of the buffer. The details surrounding the role of electrostatics and counterion chemistry on the persistence length remain a subject of investigation (Brunet et al., Reference Brunet, Tardin, Salomé, Rousseau, Destainville and Manghi2015; Trizac and Shen, Reference Trizac and Shen2016; Guilbaud et al., Reference Guilbaud, Salome, Destainville, Manghi and Tardin2019), and their impact on nanochannel DNA confinement is also complicated (Chuang et al., Reference Chuang, Reifenberger, Bhandari and Dorfman2019). These additional factors affect the quantitative agreement between theory and experiment.
At the longest length scale, at least for problems of interest to us here, is the DNA contour length L. At first glance, this appears to be a very straightforward quantity to compute; simply multiply the number of base pairs by the 0.34 nm/bp rise of B-DNA. However, fluorescence microscopy experiments typically require labeling of the DNA backbone with an intercalating dye, such as YOYO-1, that increases the extension of the DNA (Perkins et al., Reference Perkins, Smith, Larson and Chu1995; Bakajin et al., Reference Bakajin, Duke, Chou, Chan, Austin and Cox1998; Randall and Doyle, Reference Randall and Doyle2005; Jeffet et al., Reference Jeffet, Kobo, Su, Grunwald, Green, Nilsson, Eisenberg, Ambjörnsson, Westerlund, Weinhold, Shabat, Purohit and Ebenstein2016; Lee et al., Reference Lee, Lee, Kim, Wang, Park, Jung, Chen, Chang, Ikeda, Sugiyama and Jo2018b). At full intercalation, the DNA contour length can increase by circa 20–30% (Dorfman, Reference Dorfman2010; Nyberg et al., Reference Nyberg, Persson, Akerman and Westerlund2013). Kundukad et al. (Reference Kundukad, Yan and Doyle2014) provide a concise review of different reports for DNA contour length increase upon YOYO-1 intercalation; we have found it convenient to assume that each bound dye molecule adds 0.51 nm to the DNA rise (Gupta et al., Reference Gupta, Bhandari and Dorfman2018).
Derived length scales
While the trio of the contour length, persistence length, and effective width describe the length scales of the DNA chain, particular combinations of these parameters are often more useful for describing the configurations of a polymer. For a particularly lucid discussion of polymer conformations in general, consult the Perspectives article by Wang (Reference Wang2017). For the moment, we will focus on the relevant parameters for understanding DNA confinement, adopting typical length scales w = 5 nm and lp = 50 nm for model calculations. We also further focus on the limit of chains where L ≫ l p, which are coils rather than rods (Hiemenz and Lodge, Reference Hiemenz and Lodge2007). For λ-DNA (48.5 kbp), which is a commonly used model system, the stained contour length at full intercalation is close to 20 μm (Tegenfeldt et al., Reference Tegenfeldt, Prinz, Cao, Chou, Reisner, Riehn, Wang, Cox, Sturm, Silberzan and Austin2004), corresponding to 400 persistence lengths assuming no effect of the dye on the persistence length (Gunther et al., Reference Gunther, Mertig and Seidel2010; Kundukad et al., Reference Kundukad, Yan and Doyle2014; Dorfman, Reference Dorfman2018).
The size of the coiled chain is characterized by either the end-to-end distance or the radius of gyration, Rg, of the polymer. While either metric is suitable for our purposes, we will focus here on the radius of gyration. For an ideal polymer, the radius of gyration is given by random walk statistics,
where N is the number of statistical segments and b is the statistical segment length (Hiemenz and Lodge, Reference Hiemenz and Lodge2007); it follows that L = Nb. The exponent characterizing the scaling for the radius of gyration of an ideal chain is thus N 1/2. For a semiflexible chain like DNA, the correspondence between the random walk statistics and the Kratky and Porod (Reference Kratky and Porod1949) result for the end-to-end distance of a long wormlike chain gives b = 2l p.
When excluded volume is important, the chain obeys self-avoiding random walk statistics with a radius of gyration
Comparing Eqs. (2) and (3), the exponent for the scaling of the polymer size changes from N 1/2 for an ideal chain to $N^\nu$ in the presence of excluded volume, where we have made the trivial conversion between contour length and degree of polymerization, L ~ N. The parameter ν is known as the Flory exponent, and accounts for the additional swelling of the polymer coil caused by the excluded volume between distal segments. In Eq. (3), we have used the classical value of the Flory exponent ν = 3/5 to quantify the scaling for the chain size; the most accurate estimate of the scaling exponent by Clisby is ν = 0.587597(7) (Clisby, Reference Clisby2010).
When is excluded volume important? To answer this question, we first need to understand the excluded volume interactions between different segments of the DNA. The cross-over is typically written in terms of an excluded volume parameter (Wang, Reference Wang2017),
The Onsager excluded volume, vex, between two cylindrical rods of width w and length lp is approximately $v_{ex}\cong l_p^2 w$. To determine the length scale over which excluded volume matters, we set z ≅ 1. The number of statistical segments required to satisfy this condition is then $b^6/v_{ex}^2$. Using Eq. (2), the thermal blob size,
is defined as the size of a polymer chain where z ≅ 1, which is conventionally taken as the point at which excluded volume becomes important. In the standard polymer physics interpretation (Wang, Reference Wang2017), a polymer whose radius of gyration exceeds the thermal blob size is affected by excluded volume.
While these scaling arguments are useful for establishing the general phenomena, the neglect of various prefactors makes them less useful for modeling DNA in nanochannel experiments. Indeed, Tree et al. (Reference Tree, Muralidhar, Doyle and Dorfman2013a) pointed out that there are several equally plausible definitions of the thermal blob size and these definitions give remarkably different results for the onset of excluded volume for DNA, ranging from 16.8 to 166 kbp. These values span the range of typical DNA sizes used in experiments. To be more quantitative, Tree et al. (Reference Tree, Muralidhar, Doyle and Dorfman2013a) instead computed the Flory exponent for DNA as a function of the number of base pairs in the relationship R g ~ N υ, leveraging the renormalization group result from Chen and Noolandi (Reference Chen and Noolandi1992). Figure 2 shows that typical molecular size standards, such as λ-DNA, obey neither Eq. (2) nor Eq. (3). Rather, typical DNA sizes used in the experiments discussed here are in a very broad cross-over between ideal chain scaling and self-avoiding random walks.
In what follows, we will define the thermal blob size,
as this definition has proven to be very useful for defining the onset of the de Gennes regime for a confined polymer. For DNA in a high ionic strength buffer, this definition of the thermal blob size is approximately 500 nm, corresponding to approximately 2.5 μm of DNA. This length scale will play an important role in defining the extent of confinement in what follows.
Wall depletion length
When DNA is confined to a nanochannel, there is an additional length scale characterizing the DNA–wall interactions. Both the DNA and the walls are negatively charged and therefore repel one another. In the simplest model, one can capture this effect by introducing a depletion length δ such that the effective channel size is Deff = D−δ. Several models have been proposed for the depletion length, including using the DNA effective width w (Wang et al., Reference Wang, Tree and Dorfman2011), a rigid-rod approximation (Reisner et al., Reference Reisner, Pedersen and Austin2012), and a semiflexible chain model (Reisner et al., Reference Reisner, Pedersen and Austin2012). Bhandari et al. (Reference Bhandari, Reifenberger, Chuang, Cao and Dorfman2018) measured this depletion length at different ionic strengths using a high-throughput genome mapping approach and found that their data were described by the simple phenomenological model δ = 6.5 λD + 0.64, where λD is the Debye length of the buffer solution.
Configuration of DNA in channel confinement
Having reviewed the relevant theory for the configurations of DNA in free solution, we now consider its configuration when confined to a square nanochannel of size D. Much of the literature, in particular for genomics applications in nanochannels, describe the confined DNA as being stretched, and this nomenclature will be adopted for our review. However, this nomenclature is not quite correct, since ‘stretching’ a polymer implies that a force is applied to oppose the entropic elasticity, for example when using tweezers (Bustamante et al., Reference Bustamante, Marko, Siggia and Smith1994). It is more precise to say that the DNA becomes ‘extended’ since it is compressed from its free solution radius of gyration by the walls, which causes the DNA to extend along the channel axis. There are subtle distinctions between compression and stretching (Dai and Doyle, Reference Dai and Doyle2013).
There are two key metrics to consider; the first is the size of the chain. While we could continue to describe the size of the chain by its radius of gyration in confinement (or the end-to-end distance), it is particularly convenient to use the mean span X of the confined DNA because this size can be directly measured by fluorescence microscopy. If we denote by x(s) the axial coordinate of the part of the DNA located at s ∈ [0, L], the span of that configuration is defined as
It is also important to understand the variance about the mean span,
which measures the effects of thermal fluctuations. Note that the mean span of a polymer in free solution is zero, since the random walk has, on average, no net displacement from the origin. (The radius of gyration is related to the variance in the chain size.) When a polymer is confined to a nanochannel, excluded volume effects and its stiffness force it to extend along the channel axis, producing a non-zero value of the mean span.
We focus here on the span of the confined molecule because it is experimentally accessible via fluorescence microscopy. For sufficiently long chains, the mean span and end-to-end distance converge to a single value because end effects make a negligible contribution (Muralidhar et al., Reference Muralidhar, Tree, Wang and Dorfman2014b; Smithe et al., Reference Smithe, Iarko, Muralidhar, Werner, Dorfman and Mehlig2015). However, the contour length at which these various metrics converge can be very long and depends on the channel size.
The different regimes of confinement are summarized by Fig. 3. We discuss each of these models in detail below.
De Gennes regime (weak confinement)
Daoud and De Gennes (Reference Daoud and De Gennes1977) developed the classic treatment of the extension of weakly confined polymers. By weak confinement, we mean that the channel size is smaller than the radius of gyration (so that the DNA needs to extend along the channel axis), but larger than the thermal blob size in Eq. (6). The latter restriction implies that the subchain within a correlation length D is itself a self-avoiding walk that obeys Eq. (2). The contour length of the subchain, which is also referred to as a blob and indicated by the circle in Fig. 3, is given by $L_{{\rm blob}}\cong D^{5/3}l_p^{{-}1/3} w^{{-}1/3}.$ The number of blobs is N blob = L/L blob and the mean span of the chain is X ≅ N blobD. Putting these results together gives the de Gennes scaling for a confined polymer,
The variance in the chain size is σ 2 ≅ N blobD 2, which gives (Dai and Doyle, Reference Dai and Doyle2013)
While these results are derived in the context of blob theory, equivalent results are obtained from a modified Flory theory (Jun et al., Reference Jun, Thirumalai and Ha2008; Wang et al., Reference Wang, Tree and Dorfman2011). The prefactors for the scaling laws can be computed via simulations (see, e.g. Muralidhar et al., Reference Muralidhar, Tree, Wang and Dorfman2014b), but they are expected to be O(1) quantities. For the purpose of understanding experimental data, the exact value of the prefactor is not particularly important.
Odijk regime (strong confinement)
Odijk (Reference Odijk1983) considered the opposite case to Daoud and de Gennes, where the polymer is strongly confined in a channel that is much smaller than its persistence length. Here, rather than forming a series of blobs, Odijk showed that the DNA instead forms a chain of deflection segments of characteristic size $\lambda \cong D^{2/3}l_p^{1/3}$, which is known as the deflection length. In essence, the chain performs a persistent random walk due to its stiffness, but the collisions with the hard wall over distances corresponding to the deflection length lead to decorrelations between different segments of the chain.
In the absence of any hairpin turns, the extension of this chain of deflection segments is
where the prefactor for a square channel was computed by Burkhardt et al. (Reference Burkhardt, Yang and Gompper2010) from an accelerated particle model. These authors further computed the variance in the chain extension, arriving at
Corrections for rectangular channels (Burkhardt et al., Reference Burkhardt, Yang and Gompper2010) and circular channels (Yang et al., Reference Yang, Burkhardt and Gompper2007) are available, and slightly different results have been obtained from different calculations (Chen, Reference Chen2016).
The statistics for strongly confined chains in the Odijk regime and weakly confined chains in the de Gennes regime are markedly different. In the de Gennes regime, the chain is only weakly perturbed from its coil shape and the dependence of the chain extension on channel size is strong. In contrast, the Odijk regime corresponds to almost completely extended chains, since the energetic cost to form a hairpin is extremely high (Odijk, Reference Odijk2006). The extension of the chain in the Odijk regime thus has a very weak dependence on the channel size. The increase in extension as the channel size decreases in the Odijk regime is driven by alignment of the deflection segments along the channel axis, which increases as the channel size decreases. This orientation effect is also the reason why the variance in the Odijk regime decreases with decreasing channel size; as the alignment fluctuations are suppressed, the fluctuations in the chain extension are also suppressed. The fluctuations in the chain extension also decrease with decreasing channel size in the de Gennes regime, albeit more weakly. The reason for the weaker decay in fluctuations is that these fluctuations are driven by the blobs, not by orientation of the chain with the channel axis.
Additional scaling regimes at intermediate confinement
While the de Gennes and Odijk regimes possess an aesthetic beauty as physical models, they are of little use in describing the extension of DNA in confinement. If we take a thermal blob size of 500 nm, then a conservative estimate of the start of the de Gennes regime is a channel that is several microns in size. However, in order to reach de Gennes statistics, the DNA in such a large channel needs to comprise several blobs (Muralidhar et al., Reference Muralidhar, Tree, Wang and Dorfman2014b). If we use an aggressive set of bounds and set D = 1 μm and only one blob, we get a minimum contour length of L = 16 μm. In principle, we would like to be at least an order of magnitude away from this boundary, which would imply we need circa 500 kbp long DNA. While such long molecules have been used in some physics experiments (Tegenfeldt et al., Reference Tegenfeldt, Prinz, Cao, Chou, Reisner, Riehn, Wang, Cox, Sturm, Silberzan and Austin2004; Freitag et al., Reference Freitag, Noble, Fritzsche, Persson, Reiter-Schad, Nilsson, Graneli, Ambjornsson, Mir and Tegenfeldt2015) and are routinely used for genome mapping in nanochannels (Müller and Westerlund, Reference Müller and Westerlund2017), most physical experiments use shorter DNA and/or narrower channels.
At the other end of the spectrum, the Odijk regime requires that the channel size be much smaller than the persistence length. For high ionic strengths, the requisite channel sizes are below the 20 nm lower bound for injecting DNA (Cao et al., Reference Cao, Yu, Wang, Tegenfeldt, Austin, Chen, Wu and Chou2002b) and the smallest channels used for measuring DNA extension (Reisner et al., Reference Reisner, Morton, Riehn, Wang, Yu, Rosen, Sturm, Chou, Frey and Austin2005). It is possible to work with wider channels and remain in the Odijk regime by decreasing the ionic strength (Jo et al., Reference Jo, Dhingra, Odijk, De Pablo, Graham, Runnheim, Forrest and Schwartz2007; Kim et al., Reference Kim, Kim, Kounovsky, Chang, Jung, Depablo, Jo and Schwartz2011; Kounovsky-Shafer et al., Reference Kounovsky-Shafer, Hernandez-Ortiz, Potamousis, Tsvid, Place, Ravindran, Jo, Zhou, Odijk, De Pablo and Schwartz2017; Lee et al., Reference Lee, Lee, Kim, Wang, Park, Jung, Chen, Chang, Ikeda, Sugiyama and Jo2018b), and Odijk's theory seems to be a good description there. However, one does need to be careful in interpreting the data because the electrostatic double-layers from the walls now occupy a substantial fraction of the channel cross-section, complicating the mapping from the polyelectrolyte problem to the neutral polymer model embodied by Odijk's theory (Reisner et al., Reference Reisner, Pedersen and Austin2012; Cheong et al., Reference Cheong, Li and Dorfman2017; Chuang et al., Reference Chuang, Reifenberger, Bhandari and Dorfman2019; Bhandari and Dorfman, Reference Bhandari and Dorfman2020). From a fundamental perspective, the best way to probe the Odijk regime using biopolymers is to switch to stiffer molecules, such as actin (Noding and Koster, Reference Noding and Koster2012) or RecA filaments (Frykholm et al., Reference Frykholm, Alizadehheidari, Fritzsche, Wigenius, Modesti, Persson and Westerlund2014).
Using scaling theory, Odijk (Reference Odijk2008) identified two additional regimes that should exist between what are now referred to as the classic de Gennes regime and the classic Odijk regime. As the channel size decreases below the thermal blob size (~500 nm) but remains above the statistical segment length (~100 nm), the blobs become ellipsoidal, as indicated in Fig. 3, and their statistics are marginal, i.e. the excluded volume corresponds to z ≅ 1. This regime of confinement is often referred to as the ‘extended de Gennes’ regime (Wang et al., Reference Wang, Tree and Dorfman2011) because the scaling for the extension given by Eq. (9) ‘extends’ to even smaller channel sizes. However, the variance of the chain extension is markedly different in the extended de Gennes regime,
The latter result, which can be derived using either blob theory (Dai and Doyle, Reference Dai and Doyle2013) or modified Flory theory (Wang et al., Reference Wang, Tree and Dorfman2011), is remarkable – the fluctuations of the confined, real chain are equal to those of an unconfined, ideal chain of the same contour length. The independence of the variance from the channel size was a key to providing affirmative proof for the existence of the extended de Gennes regime in simulations (Dai and Doyle, Reference Dai and Doyle2013) and experiments (Gupta et al., Reference Gupta, Miller, Muralidhar, Mahshid, Reisner and Dorfman2015; Iarko et al., Reference Iarko, Werner, Nyberg, Muller, Fritzsche, Ambjornsson, Beech, Tegenfeldt, Mehlig, Westerlund and Mehlig2015). However, the best theoretical evidence in support of the extended de Gennes regime lies in the excess free energy of confinement (Dai et al., Reference Dai, van der Maarel and Doyle2014), which put to rest a lingering discussion over the existence (or lack thereof) of this regime (Tree et al., Reference Tree, Wang and Dorfman2013b), but this small change in free energy is not straightforward to measure via experiments.
The extended de Gennes regime has a strong connection with the theory of marginal solutions of semiflexible polyelectrolytes (Schaefer et al., Reference Schaefer, Joanny and Pincus1980; Dai et al., Reference Dai, van der Maarel and Doyle2014; Wang, Reference Wang2017). Werner and Mehlig (Reference Werner and Mehlig2014) also recognized a connection between the statistics of the extended de Gennes regime and weakly self-avoiding random walks. Leveraging exact results for weakly self-avoiding walks in the mathematical literature, they were able to provide exact prefactors for both the extension and variance in the extended de Gennes regime. Data from subsequent experiments (Gupta et al., Reference Gupta, Miller, Muralidhar, Mahshid, Reisner and Dorfman2015; Iarko et al., Reference Iarko, Werner, Nyberg, Muller, Fritzsche, Ambjornsson, Beech, Tegenfeldt, Mehlig, Westerlund and Mehlig2015) were in good but not quantitative agreement with the theoretical predictions. Subsequently, Schotzinger et al. (Reference Schotzinger, Menard and Ramsey2020) provided conclusive evidence for the extension scaling in the extended de Gennes regime, explaining previous disagreements between theory and experiment as the result of inadequate characterization of the channel cross-section that were circumvented in their experiments by using a focused-ion beam approach to channel fabrication (Menard and Ramsey, Reference Menard and Ramsey2013).
When the channel size is below the statistical segment length (~100 nm), Odijk (Reference Odijk2008) predicted a cross-over from blob statistics to deflection segments. However, unless the channel size is much smaller than the persistence length, the chain can still form the hairpins indicated in Fig. 3 because the energy of formation is not particularly high (Polson, Reference Polson2018), a phenomenon identified via simulations by Cifra and Bleha (Reference Cifra and Bleha2012) and Dai et al. (Reference Dai, Ng, Doyle and van der Maarel2012). Odijk (Reference Odijk2006) called the length scale for forming a hairpin the global persistence length g.
The presence of hairpins within the chain invalidates the results in Eqs. (11) and (12). When the chain can form hairpins, Odijk showed that its statistics are described by a different scaling variable (Odijk, Reference Odijk2008; Muralidhar et al., Reference Muralidhar, Tree and Dorfman2014a)
that measures the total excluded volume created by a hairpin to the spatial volume gD 2 occupied by that hairpin. Using a Flory argument, Odijk (Reference Odijk2008) then derived the scaling
for the chain extension when L ≫ g, i.e. when the chain can form many hairpin folds. The concomitant variance in the chain extension is
Comparing Eqs. (16) and (13) shows that the variance in the extended de Gennes regime and backfolded Odijk regimes is of the same form, but the persistence length of the polymer needs to be replaced by the global persistence length to describe the backfolded Odijk regime.
Muralidhar et al. (Muralidhar et al., Reference Muralidhar, Tree and Dorfman2014a, Reference Muralidhar, Quevillon and Dorfman2016; Muralidhar and Dorfman, Reference Muralidhar and Dorfman2016) undertook systematic studies of the backfolded Odijk regime via simulations. In addition to providing a pedagogical introduction (Muralidhar et al., Reference Muralidhar, Tree and Dorfman2014a) to Odijk's scaling theory (Odijk, Reference Odijk2008), they confirmed all predictions of Odijk's theory for square, rectangular, and circular channels. Importantly, the simulation work by Muralidhar et al. (Reference Muralidhar, Tree and Dorfman2014a) identified a shortcoming in Odijk's theory for the global persistence length (Odijk, Reference Odijk2006), which plays a key role in the overall theory. The disagreement between Odijk (Reference Odijk2006) and Muralidhar et al. (Reference Muralidhar, Tree and Dorfman2014a) was resolved by Chen (Reference Chen2017), who provided a complete correction to Odijk's theory.
Owing to the very large increase in the global persistence length as the channel size decreases (Odijk, Reference Odijk2006), Eq. (16) predicts an equally large increase in the fluctuations of the extension in this regime. These large fluctuations were not realized in experiments with DNA because the DNA was too short to form the large hairpins predicted by Odijk's theory (Su et al., Reference Su, Das, Xiao and Purohit2011). Indeed, experiments to probe the backfolded Odijk regime using DNA are infeasible (Muralidhar et al., Reference Muralidhar, Tree and Dorfman2014a).
Telegraph model for intermediate confinement
The schematic illustration of different regimes in Fig. 3 is not entirely correct; the boundaries are listed as inequalities but, strictly speaking, correspond to strong inequalities. The relatively good agreement between theory and experiment for the extended de Gennes regime (Gupta et al., Reference Gupta, Miller, Muralidhar, Mahshid, Reisner and Dorfman2015; Iarko et al., Reference Iarko, Werner, Nyberg, Muller, Fritzsche, Ambjornsson, Beech, Tegenfeldt, Mehlig, Westerlund and Mehlig2015; Schotzinger et al., Reference Schotzinger, Menard and Ramsey2020) suggests that those inequalities are not especially strong. However, there are no experimental data to support the existence of a backfolded Odijk regime for DNA. While it is possible to probe the Odijk excluded volume by a non-equilibrium measurement (Krog et al., Reference Krog, Alizadehheidari, Werner, Bikkarolla, Tegenfeldt, Mehlig, Lomholt, Westerlund and Ambjornsson2018; Reifenberger et al., Reference Reifenberger, Cao and Dorfman2018), we seem to be again stuck in a situation where there is an elegant scaling theory that says little about the problem of DNA confinement.
Werner et al. (Reference Werner, Cheong, Gupta, Dorfman and Mehlig2017) recognized that the exact theory for the extended de Gennes regime (Werner and Mehlig, Reference Werner and Mehlig2014) applies for all confinements below the thermal blob size but outside the Odijk regime. In other words, there is no reason to distinguish between the deflection segments in the backfolded Odijk regime and the blobs in the extended de Gennes regime. Rather, the key to understanding intermediate confinement is to recognize the weakness of excluded volume interactions for$\;D\ll l_p^2 /w$. While the chain can fold back on itself in these smaller channels, the long distance along the contour length causes the chain to ‘forget’ the previous configurations. As a result, the excluded volume interactions can be treated in a mean-field sense (Chen, Reference Chen2018).
Thus, instead of relying on the extended de Gennes or backfolded Odijk models, the DNA statistics for $D\ll l_p^2 /w$ can be described by the weakly-correlated telegraph model. The telegraph model describes a random walk that moves at a fixed velocity with the random changes in direction indicated in Fig. 4. In the weakly correlated telegraph model, there is a small energy penalty $\varepsilon$ when the chain revisits a lattice site. This one-dimensional model is easily simulated to produce predictions for X and σ 2. Importantly, the telegraph model predicts that these parameters should collapse as a function of the scaling parameter
where a quantifies the average alignment of the chain backbone with the channel axis. Remarkably, the limits of large α (narrow channels) and small α (wide channels), respectively, correspond to the parameter ξ for the backfolded Odijk regime in Eq. (14) and the scaling variable (wl p/D 2) in the extended de Gennes regime. As a result, the limits of the telegraph model are the backfolded Odijk and extended de Gennes regime (Werner et al., Reference Werner, Cheong, Gupta, Dorfman and Mehlig2017).
In addition to proposing the new scaling variable in Eq. (17), Werner et al. (Reference Werner, Cheong, Gupta, Dorfman and Mehlig2017) showed how the confined polymer problem maps to the correlated telegraph model. Explicitly, the various quantities appearing in the telegraph model can be computed from the statistics of an ideal semiflexible chain, i.e. when the only excluded volume interactions are between the polymer and the wall, and then the telegraph model provides a way to combine those statistics to predict the behavior of a real polymer. Figure 5 shows that the resulting predictions of the telegraph model are in almost exact agreement with three-dimensional simulation data for a confined wormlike chain (Werner et al., Reference Werner, Cheong, Gupta, Dorfman and Mehlig2017). The only disagreement between theory and simulation arises deep into the backfolded Odijk regime at large α. Here, where hairpin turns are rare, the dominant contribution to the variance becomes the fluctuations in the alignment of the segments with the channel walls, which are not included in the telegraph model. Simply adding the variance of the telegraph model to the result in Eq. (12) for these alignment fluctuations restores the agreement between the theory and simulation. The collapse of the experimental data to the theory in Fig. 5 is also satisfying, especially in light of the possible sources of systematic errors that have been identified in the context of experimental work (Nyberg et al., Reference Nyberg, Persson, Akerman and Westerlund2013; Gupta et al., Reference Gupta, Miller, Muralidhar, Mahshid, Reisner and Dorfman2015; Iarko et al., Reference Iarko, Werner, Nyberg, Muller, Fritzsche, Ambjornsson, Beech, Tegenfeldt, Mehlig, Westerlund and Mehlig2015; Schotzinger et al., Reference Schotzinger, Menard and Ramsey2020).
Chen (Reference Chen2018) arrived at a similar scaling result to Eq. (17) using the self-consistent field theory solution for the wormlike chain propagator. Chen's theoretical approach is more robust and allows considering polymers that are much stiffer than DNA and inaccessible by the simulations needed to compute the parameters appearing in the telegraph model.
The telegraph model can also be solved exactly (Odman et al., Reference Odman, Werner, Dorfman, Doering and Mehlig2018) in the limit of relevance to genome mapping, where the channel cross-section is close to the DNA persistence length, providing a formula for predicting not only the mean span and its variance, but also the distribution for the chain extension. Simulations of semiflexible polymers by Bhandari and Dorfman (Reference Bhandari and Dorfman2019), including situations that mimic DNA-based experiments, showed that the asymptotic solution of Odman et al. (Reference Odman, Werner, Dorfman, Doering and Mehlig2018) is in quantitative agreement with simulations. Experimental data for the DNA extension distribution obtained in genome mapping experiments (Reinhart et al., Reference Reinhart, Reifenberger, Gupta, Muralidhar, Sheats, Cao and Dorfman2015; Sheats et al., Reference Sheats, Reifenberger, Cao and Dorfman2015), however, tend to exhibit increasingly worse agreement with the asymptotic solution to the telegraph model as the channel size decreases. This behavior is in contrast to the expectations of the theory of Odman et al. (Reference Odman, Werner, Dorfman, Doering and Mehlig2018), that infer improved agreement as the channel size decreases. Subsequent experiments and theory by Chuang et al. (Reference Chuang, Reifenberger, Bhandari and Dorfman2019) suggest that this disagreement is due to the simplified approach to DNA–wall electrostatic interactions used to model DNA, not an inherent problem in the telegraph model itself.
Effect of channel geometry
The discussion thus far has focused primarily on square nanochannels of width D. From a theoretical perspective, there are only minor differences between a square channel and a circular channel of diameter D that arise from the entropic depletion of the DNA from the corners of the square channel (Odijk, Reference Odijk2006). While simple to analyze from a theoretical perspective, these geometries are challenging to fabricate in practice. The most commonly encountered situation is rectangular nanochannels with a width D 1 and a height D 2.
The first question to address is whether a rectangular cross-section is actually a nanochannel in the sense that we have discussed so far, or whether it is a slit. To answer this question, let us assume that D 1 > D 2, which is the easier case to fabricate, and recall what would happen in the case D 1 → ∞. This situation corresponds to a nanoslit of height D 2. Daoud and De Gennes (Reference Daoud and De Gennes1977) used scaling theory to demonstrate that the in-plane radius of gyration of a polymer confined to a slit consists of a two-dimensional, self-avoiding random walk of blobs. For a semiflexible polymer, Eq. (3) for a polymer in free solution becomes (Hsieh et al., Reference Hsieh, Balducci and Doyle2008)
when that polymer is confined to a slit. Provided that R slit ≫ D 2, then the system is a nanochannel because the DNA configuration will be influenced by the presence of the side wall. The latter inequality thus defines the cross-over between a nanoslit and a nanochannel.
Once we have confirmed that the system indeed corresponds to a nanochannel, we need to understand how the results for confinement discussed earlier are affected by the aspect ratio. To a first approximation, the theories for DNA confinement emerge from the random walk statistics of the confined chain (Chen, Reference Chen2016). In the random walk statistics, the diffusion in different directions is decoupled. As a result, one can define the equivalent square channel size as D 2 = D 1D 2. For aspect ratios that are not too far from unity, this approximation is sufficient. However, as the aspect ratio grows, additional complications arise because the way in which the confinement causes the DNA to extend along the channel axis depends on the channel size. This effect becomes particularly important for funnel-shaped nanochannels (Persson et al., Reference Persson, Utko, Reisner, Larsen and Kristensen2009). If the length scales D 1 and D 2 correspond to different regimes of confinement, a ‘regime mixing’ effect occurs that complicates the analysis (Gupta et al., Reference Gupta, Sheats, Muralidhar, Miller, Huang, Mahshid, Dorfman and Reisner2014). It is possible to construct scaling theories that account for the regime mixing through the use of superblobs (Werner and Mehlig, Reference Werner and Mehlig2015). However, the resulting theory now requires satisfying two strong inequalities, one for each of the channel dimensions, a situation that is unlikely to be realized in experiments. For rectangular channels, the most powerful approach to understanding the chain statistics is simulation (Gupta et al., Reference Gupta, Sheats, Muralidhar, Miller, Huang, Mahshid, Dorfman and Reisner2014; Muralidhar et al., Reference Muralidhar, Quevillon and Dorfman2016).
In addition to rectangular channels, there is also a small body of literature on simulations of DNA confinement in triangular nanochannels (Manneschi et al., Reference Manneschi, Angeli, Ala-Nissila, Repetto, Firpo and Valbusa2013; Reinhart et al., Reference Reinhart, Tree and Dorfman2013; Han et al., Reference Han, Kim, Matsuoka, Thouless and Takayama2016), which are also relatively easy to fabricate by anisotropic etching of silicon. The entropic depletion effects of acute corners can be very significant, and the collapse of the extension data onto a universal curve is not straightforward (Reinhart et al., Reference Reinhart, Tree and Dorfman2013). Ultimately, triangular nanochannels have found interesting applications as collapsible systems for strongly extending DNA (Huh et al., Reference Huh, Mills, Zhu, Burns, Thouless and Takayama2007) and chromatin (Matsuoka et al., Reference Matsuoka, Kim, Huang, Douville, Thouless and Takayama2012) rather than as a system for fundamental studies.
Effects of crowding agents
The physics of DNA extension in nanochannels changes in the presence of crowding agents, a phenomenon that has been explored experimentally (Zhang et al., Reference Zhang, Shao, Kan and Maarel2009, Reference Zhang, Gong, Guttula, Malar, van Kan, Doyle and van der Maarel2012; Jones et al., Reference Jones, van der Maarel and Doyle2011; Feng et al., Reference Feng, Sosa, Mårtensson, Jiang, Tong, Dorfman, Takahashi, Lincoln, Bustamante, Westerlund and Nordén2019; Scott et al., Reference Scott, Shaheen, McGuinness, Metera, Kouzine, Levens, Benham and Leslie2019) and by simulation (Jones et al., Reference Jones, van der Maarel and Doyle2011; Liao et al., Reference Liao, Chien, Luzhbin and Chen2015; Jeon et al., Reference Jeon, Jung and Ha2016; Chen et al., Reference Chen, Sun, Wang, Tian, Zhu, Zhang and Dai2021). A particularly notable effect of crowding is the impact of confinement on the concentration required to compact the DNA by depletion effects induced on the DNA by the crowders. In experiments in the extended de Gennes regime, compaction was only observed in the nanochannels, not in the loading microchannels (Zhang et al., Reference Zhang, Shao, Kan and Maarel2009, Reference Zhang, Gong, Guttula, Malar, van Kan, Doyle and van der Maarel2012), illustrating the profound effect of the channel confinement on compaction. The effect of crowders is also sensitive to the chemistry of the compacting agent. For dextran, a model system for studying DNA compaction, the DNA initially extends at low concentrations of dextran but then compacts (Fig. 6) (Zhang et al., Reference Zhang, Shao, Kan and Maarel2009). In contrast, addition of bovine serum albumin (BSA) or hemoglobin to DNA in a nanochannel only produces compaction, with no initial stage of extension (Zhang et al., Reference Zhang, Gong, Guttula, Malar, van Kan, Doyle and van der Maarel2012). The addition of poly(ethylene glycol) at concentrations that do not induce compaction is even more complicated, with an extension behavior that cannot be captured by the scaling theories for DNA extension described above (Feng et al., Reference Feng, Sosa, Mårtensson, Jiang, Tong, Dorfman, Takahashi, Lincoln, Bustamante, Westerlund and Nordén2019).
When the crowding agent is provided at over-threshold volume fractions, compaction proceeds in two stages: a fast stage (circa 10 s) where the DNA extension in the channel is dramatically reduced, followed by a longer relaxation of several minutes where the DNA presumably reorganizes itself into an ordered structure (Zhang et al., Reference Zhang, Shao, Kan and Maarel2009). The reorganization and compaction of the blobs is a function of channel size. No compaction is observed for very small (60 nm × 100 nm) channels that are in the backfolded Odijk regime (Zhang et al., Reference Zhang, Gong, Guttula, Malar, van Kan, Doyle and van der Maarel2012), possibly because there are no blobs and bending of the DNA is very unfavorable. The process of compaction changes qualitatively for slit-like confinement because there is room to reorganize the blobs in two unconfined dimensions (Jones et al., Reference Jones, van der Maarel and Doyle2011). As a result, for dextran crowding, there is a gradual transition from the apex of the extension induced by low dextran concentrations to the collapsed form at higher dextran concentrations (Jones et al., Reference Jones, van der Maarel and Doyle2011).
Simulations have explored the compaction process in the context of minimal models that are capable of producing the entropic depletion forces, qualitatively capturing the compaction process (Jones et al., Reference Jones, van der Maarel and Doyle2011), allowing for exploration of the relative effects of polymer–polymer and polymer–wall interactions induced by the depletion effects (Liao et al., Reference Liao, Chien, Luzhbin and Chen2015), and providing insights that have enabled developing scaling-level models for the effect of crowders through an effective solvent picture (Jeon et al., Reference Jeon, Jung and Ha2016). One of the most intriguing results emerging from simulations is the proposition that crowders could push the DNA toward the corners of the channel, thereby effectively increasing the confinement (Chen et al., Reference Chen, Sun, Wang, Tian, Zhu, Zhang and Dai2021). The latter prediction has not yet been explored experimentally.
Outstanding challenges
While the problem of a confined linear DNA molecule has been largely resolved, two significant outstanding challenges remain in the modeling of nanochannel-confined DNA. The first problem emerges when the size of the electric double layer becomes commensurate with the size of the channel, as discussed already in section ‘Additional scaling regimes at intermediate confinement’. The second challenge is describing the configurations and dynamics of knotted DNA molecules (Micheletti, Reference Micheletti, Tezuka and Deguchi2022). In the latter context, significant progress has been achieved by coarse-grained simulations (Micheletti and Orlandini, Reference Micheletti and Orlandini2012, Reference Micheletti and Orlandini2014; Orlandini and Micheletti, Reference Orlandini and Micheletti2013; Suma et al., Reference Suma, Orlandini and Micheletti2015; Dai and Doyle, Reference Dai and Doyle2018; Polson and Hastie, Reference Polson and Hastie2020), at least for relatively small molecules. Experimental data have also started to emerge on knotted DNA in nanochannels (Metzler et al., Reference Metzler, Reisner, Riehn, Austin, Tegenfeldt and Sokolov2006; Amin et al., Reference Amin, Khorshid, Zeng, Zimny and Reisner2018; Ma and Dorfman, Reference Ma and Dorfman2020, Reference Ma and Dorfman2021a, Reference Ma and Dorfman2021b), enabled by the ‘knot factory’ platform (Amin et al., Reference Amin, Khorshid, Zeng, Zimny and Reisner2018) that allows knots to be created at high enough frequency for studies at the single-molecule level. A key challenge in connecting these experimental data to simulations is the unknown and presumably high complexity of the knots created in these experiments when compared to the simpler knots used in simulations (Ma and Dorfman, Reference Ma and Dorfman2020).
Devices
Advances in fabrication of nanofluidic structures as well as design of devices for DNA applications make up important contributions to the progress of this research field. The fabrication and use of nanochannels for DNA studies at the single molecule level started in early 2000s (Cao et al., Reference Cao, Yu, Wang, Tegenfeldt, Austin, Chen, Wu and Chou2002b; Tegenfeldt et al., Reference Tegenfeldt, Prinz, Cao, Chou, Reisner, Riehn, Wang, Cox, Sturm, Silberzan and Austin2004). Since then, different materials and fabrication techniques have been explored (Duan et al., Reference Duan, Wang and Xie2013). In the following sections we will present commonly used fabrication processes in brief, as well as how channel and chip designs have developed in line with the expansion of the field, and we will give examples of designs tailored for different applications.
Fabrication processes
The most reliable and documented method for nanochannel fabrication is the use of standard semiconductor device fabrication processes on silicon (Riehn et al., Reference Riehn, Lu, Wang, Lim, Cox and Austin2005; Lam et al., Reference Lam, Hastie, Lin, Ehrlich, Das, Austin, Deshpande, Cao, Nagarajan, Xiao and Kwok2012) or fused silica (Tegenfeldt et al., Reference Tegenfeldt, Prinz, Cao, Chou, Reisner, Riehn, Wang, Cox, Sturm, Silberzan and Austin2004; Reisner et al., Reference Reisner, Larsen, Silahtaroglu, Kristensen, Tommerup, Tegenfeldt and Flyvbjerg2010; Freitag et al., Reference Freitag, Noble, Fritzsche, Persson, Reiter-Schad, Nilsson, Graneli, Ambjornsson, Mir and Tegenfeldt2015) substrates in a cleanroom environment. A fresh silicon wafer with a 1–2 μm thick thermal oxide layer (based on the depth of the micro- and nanofluidic channels) or a fused silica substrate is preferred. Typically, a 4- or 6-inch wafer is used, designed to contain an array of micro- and nanofluidic systems that subsequently can be diced to yield tens of devices from a single wafer.
The process of fabricating nanofluidic devices for DNA studies can be briefly described in the following steps: (1) Photolithography and reactive ion etching (RIE) of alignment marks that are used to align the micro- and nanofluidic features in the next steps. (2) Electron-beam lithography and RIE of the nanochannels. Focused ion-beam milling (Wang et al., Reference Wang, Tegenfeldt, Reisner, Riehn, Guan, Guo, Golding, Cox, Sturm and Austin2005; Menard and Ramsey, Reference Menard and Ramsey2013), and nanoimprint lithography (Guo et al., Reference Guo, Cheng and Chou2004) are well-established alternate methods that can be used to fabricate the nanochannels. (3) Photolithography and RIE of microchannels connecting the nanochannels. The design can be simply two straight microchannels connecting both nanochannel ends, where one microchannel serves as the inlet and the other as the outlet. Alternately, two U-shaped microchannels connecting the two ends of the nanochannels can be used to provide control in driving DNA into the nanochannels and minimize fouling. (4) Photolithography and deep reactive-ion etching of loading holes. Powder or sand blasting is a suitable alternate process for fused silica substrates to obtain loading holes. The loading holes are used for adding the sample liquid containing DNA to the device, the DNA can then be driven into the microchannels and subsequently into the nanochannels using pressure-driven flow or electrophoresis. (5) High-temperature bonding of a glass wafer to the silicon or fused silica wafer to create the micro-nanofluidic confinement. Anodic bonding (Kutchoukov et al., Reference Kutchoukov, Laugere, Vlist, Pakula, Garini, Alkemade and Bossche2003; Abad et al., Reference Abad, Juarros, Retolaza, Merino, Marie and Kristensen2011) and polymer fusion bonding involving a thin polymer intermediate layer (Gu et al., Reference Gu, Gupta, Chou, Wei and Zenhausern2007; Lesser-Rojas et al., Reference Lesser-Rojas, Sriram, Liao, Lai, Kuo, Chu and Chou2014) are alternate bonding techniques. All the fabrication steps discussed above are shown schematically in Fig. 7.
Fabrication of nanofluidic structures for DNA applications on silicon and fused silica wafers using traditional semiconductor fabrication processes is discussed in more detail in (Westerlund et al., Reference Westerlund, Persson, Fritzsche, Beech, Tegenfeldt and Peterman2018). Further details on tools, chemicals, photoresists, and etching processes used for fabrication of nanofluidic devices on silicon wafers can be found in (Öz et al., Reference Öz, Sriram and Westerlund2019; Kang et al., Reference Kang, Sriram, Jeon, Kim, Chen, Kim, Kim, Lee, Westerlund and Jonsson2022).
Alternative fabrication
For many of the applications foreseen for nanofluidic devices, in particular in diagnostics, disposable chips produced at low cost are desirable. Several different strategies have been proposed for making chips using less demanding techniques and at higher throughput, compared to the traditionally used state-of-the-art nanofabrication processes. Nanofluidic chips for DNA analysis based on thermoplastic polymers can be fabricated by injection molding (Utko et al., Reference Utko, Persson, Kristensen and Larsen2011) or nanoimprinting (Wu et al., Reference Wu, Chantiwas, Amirsadeghi, Soper and Park2011; Uba et al., Reference Uba, Pullagurla, Sirasunthorn, Wu, Park, Chantiwas, Cho, Shin and Soper2015; Esmek et al., Reference Esmek, Bayat, Pérez-Willard, Volkenandt, Blick and Fernandez-Cuesta2019). Devices replicated in PDMS (van Kan et al., Reference van Kan, Zhang, Perumal Malar and van der Maarel2012) from a nanofabricated master is another approach to increase fabrication throughput and decrease cost. Park et al. (Reference Park, Oh and Cho2018) reported how near-field electrospinning can be used to create nanofluidic channels with a circular cross-section in PDMS, suitable for biological applications.
Chip designs
Tapered channels
By varying the channel dimensions along the channel length, it is possible to study the same individual, single DNA molecule at different degrees of confinement (Fig. 8a, b). In analogy to force spectroscopy, Persson et al. coined the concept confinement spectroscopy for probing single DNA molecules in tapered nanochannels (Persson et al., Reference Persson, Utko, Reisner, Larsen and Kristensen2009). Similar funnel-shaped channels were used by Frykholm et al. (Reference Frykholm, Alizadehheidari, Fritzsche, Wigenius, Modesti, Persson and Westerlund2014) and Fornander et al. (Reference Fornander, Frykholm, Fritzsche, Araya, Nevin, Werner, Cakir, Persson, Garcin, Beuning, Mehlig, Modesti and Westerlund2016), when studying the nucleoprotein filaments formed by RecA and Rad51, respectively, on DNA. Channels where the width is varied in discrete steps, rather than continuously tapered, have also been used, for example by Werner et al. (Reference Werner, Jain, Muralidhar, Frykholm, St Clere Smithe, Fritzsche, Westerlund, Dorfman and Mehlig2018) and Gupta et al. (Reference Gupta, Miller, Muralidhar, Mahshid, Reisner and Dorfman2015). The advantage of using discrete steps is that the DNA is not subjected to an entropic force that pushes it toward the larger end of the funnel, allowing equilibrium measurements to be performed at the cost of only doing so at a discrete number of channel dimensions.
Devices for dynamic studies
The nanochannel device can be designed to add functionalities to it, such as the possibility to change the local environment around the DNA molecule under study, i.e. exchange buffers or add reagents to it. Such a set-up enables studies of, for example, DNA–protein interactions in real time.
A chip design presented by Zhang et al. features two arrays of parallel nanochannels in a perpendicular configuration (Zhang et al., Reference Zhang, Jiang, Liu, Doyle, van Kan and van der Maarel2013b) (Fig. 8c). DNA is introduced into the device via electrophoresis through the array of wider (250 × 200 nm2) nanochannels, whereas buffer can be exchanged via diffusion through the intersecting array of narrower (150 × 200 nm2) nanochannels in the perpendicular direction. As a proof of principle, compaction of T4-DNA upon addition of protamine and unpacking of pre-compacted T4-DNA by increasing salt concentration were investigated, demonstrating the functionality of the device (Fig. 9a). Subsequently, the same device was used to study the interaction of the bacterial protein H-NS with DNA (Zhang et al., Reference Zhang, Guttula, Liu, Malar, Ng, Dai, Doyle, van Kan and van der Maarel2013a).
Öz et al. (Reference Öz, Sriram and Westerlund2019) designed a device enabling active exchange of the local environment within the nanofluidic channel (Fig. 8d). The design includes an array of nanochannels (100 × 100 nm2), with a central stretch of slightly larger dimensions (300 × 130 nm2), creating an entropically controlled reaction chamber where single DNA molecules can be confined and stretched. Across the reaction chambers, on top of the array and in an orthogonal fashion, runs a shallow nano-slit (30 μm × 30 nm), through which analytes actively can be introduced via pressure-driven flow to exchange the local buffer environment of the entrapped DNA molecule. The authors demonstrated the functionality of the device by a series of experiments, where analytes, such as SDS, spermidine, ethidium bromide and DNase I, were added to confined DNA molecules and the resulting response could be monitored in situ (Fig. 9b). The same device was later used when studying conformational changes in large DNA upon interaction with DNA-binding proteins (Sharma et al., Reference Sharma, Sriram, Holmstrom and Westerlund2020; Jiang et al., Reference Jiang, Humbert, S, Rouzina, Mely and Westerlund2021a).
Handling ultralong DNA
In the use of nanofluidic channels for ODM applications, discussed in detail below, the size of the DNA of interest can pose experimental challenges. Introducing DNA of genomic size to a nanosized channel is associated with a significant entropic barrier, and coils of large DNA molecules tend to entangle and clog the channel entrance. An approach to promote DNA loading into nanochannels and facilitate DNA entering in an unfolded state with one end first is to include an entropic gradient in the chip design. Physical barriers such as pillar structures, that gradually increase the confinement, provide pre-stretching of the DNA and reduce the thermodynamic cost (Cao et al., Reference Cao, Tegenfeldt, Austin and Chou2002a; Wang et al., Reference Wang, Bruce, Duch, Patel, Smith, Astier, Wunsch, Meshram, Galan, Scerbo, Pereira, Wang, Colgan, Lin and Stolovitzky2015). Lam et al. (Reference Lam, Hastie, Lin, Ehrlich, Das, Austin, Deshpande, Cao, Nagarajan, Xiao and Kwok2012) applied this approach to genome mapping in a nanofluidic device that contained a region with channels of intermediate width and physical barriers between the feeding microfluidic channel and the nanochannel array (Fig. 8e).
Another concern to address when long, genomic DNA is confined to nanofluidic channels and visualized using fluorescence microscopy is that the size of the DNA molecule might exceed the field of view of the camera at sufficient resolution. By designing the channels in a meandering fashion (Fig. 8f) Freitag et al. circumvented this issue (Freitag et al., Reference Freitag, Noble, Fritzsche, Persson, Reiter-Schad, Nilsson, Graneli, Ambjornsson, Mir and Tegenfeldt2015). In this study, a single molecule of an intact 5.7 Mbp Schizosaccharomyces pombe (S. pombe) chromosome was stretched, at an extension of 50% of its contour length to 1 mm, in a meandering nanochannel and visualized in a single field of view.
CLiC (Convex Lens-induced Confinement)
In most nanofluidic devices, the lid is permanently bonded to the bottom of the device. In 2010, Leslie et al. (Reference Leslie, Fields and Cohen2010) introduced the concept of Convex Lens-induced Confinement (CLiC) microscopy, where nanoscale confinement can be repeatedly formed. The principle of CLiC relies on a convex lens pushing the lid of a chamber, forcing the lid to come in close proximity to the bottom of the chamber, thereby confining any molecules present in the solution (Berard et al., Reference Berard, Mcfaul, Leith, Arsenault, Michaud and Leslie2013). In the first version, both top and bottom surfaces of the chamber were unmodified whereas in later studies the bottom surface was patterned with nanoscale features into which long DNA is forced as the chamber is squeezed by the convex lens (Fig. 9c) (Berard et al., Reference Berard, Michaud, Mahshid, Ahamed, McFaul, Leith, Bérubé, Sladek, Reisner and Leslie2014). The nanoscale confinement leads to stretching of long DNA and the degree of stretching can be tailored by varying the distance between the lid and the bottom of the chamber.
The concept is compatible with several nanofluidic assays such as partially confining the DNA in pits, as well as ODM. An extension of the latter was the demonstration of a lab-on-a-chip for single-cell analysis, including cell lysis and DNA stretching in a single device (Mahshid et al., Reference Mahshid, Ahamed, Berard, Amin, Sladek, Leslie and Reisner2015). By combining the CLiC procedure with electrophoresis, an increased number of molecules can be confined in the observation volume and an enhanced degree of stretching can be achieved (Ahamed et al., Reference Ahamed, Mahshid, Berard, Michaud, Sladek, Reisner and Leslie2016). The fact that the chamber can be repeatedly squeezed, and that the distance between the lid and bottom can be adjusted at will, makes it possible to change the solution surrounding the molecules under study. This was demonstrated by changing the ionic strength of the solution, which strongly affects DNA stretching, and further by adding cationic surfactants that condense DNA and restriction enzymes that digest DNA (Henkin et al., Reference Henkin, Berard, Stabile, Shayegan, Leith and Leslie2016). CLiC has in more recent years been implemented in combination with other nanoscale features, such as nanopit arrays, allowing for other types of studies on DNA, in particular supercoiled DNA (Scott et al., Reference Scott, Xu, Kouzine, Berard, Shaheen, Gravel, Saunders, Hofkirchner, Leroux, Laurin, Levens, Benham and Leslie2018, Reference Scott, Shaheen, McGuinness, Metera, Kouzine, Levens, Benham and Leslie2019; Shaheen et al., Reference Shaheen, Hastie, Metera, Scott, Zhang, Chen, Gu, Weber, Munsky, Kouzine, Levens, Benham and Leslie2022).
Passivation
Due to the extreme surface to volume ratio in a nanofluidic device, even minor tendencies of non-specific binding to the channel walls can cause severe difficulties, with molecules getting stuck to the surface or clogging the channels. This is generally not a problem when studying DNA alone and using devices fabricated in hydrophilic materials like fused silica, as the surface becomes negatively charged when the channels are filled with aqueous buffer, thereby repelling the negatively charged DNA. However, when including DNA-binding ligands or proteins, often positively charged, surface interactions become an issue and the literature provides several examples of various strategies to overcome these interactions.
Using an inert protein, like BSA, to saturate the surface before the experiment, or to include it at high concentration in the reaction buffer, is a common strategy for passivation of microfluidic channels that has been used also for nanochannels (Riehn et al., Reference Riehn, Lu, Wang, Lim, Cox and Austin2005; Wang et al., Reference Wang, Tegenfeldt, Reisner, Riehn, Guan, Guo, Golding, Cox, Sturm and Austin2005; Roushan et al., Reference Roushan, Kaur, Karpusenko, Countryman, Ortiz, Fang Lim, Wang and Riehn2014, Reference Roushan, Azad, Movahed, Ray, Livshits, Lim, Weninger and Riehn2018), sometimes in combination with the non-ionic detergent Nonidet P-40 (Streng et al., Reference Streng, Lim, Pan, Karpusenka and Riehn2009; Lim et al., Reference Lim, Karpusenko, Blumers, Streng and Riehn2013). Polyvinylpyrrolidone has been used as a surface passivation agent in the sample buffer (Lim et al., Reference Lim, Karpusenko, Blumers, Streng and Riehn2013; Liu et al., Reference Liu, Movahed, Dangi, Pan, Kaur, Bilinovich, Faison, Leighton, Wang, Williams and Riehn2020) and addition of anionic surfactants like SDS can provide shielding of positive charges of proteins, thereby reducing non-specific binding to the channels (Öz et al., Reference Öz, Howard, Sharma, Törnkvist, Ceppi, Sriram, Kristiansson, Cejka and Westerlund2020). Lipid bilayers can be used to effectively passivate nanofluidic structures, as demonstrated by Persson et al. (Reference Persson, Fritzsche, Mir, Modesti, Westerlund and Tegenfeldt2012) and used in several subsequent studies (Frykholm et al., Reference Frykholm, Alizadehheidari, Fritzsche, Wigenius, Modesti, Persson and Westerlund2014; Fornander et al., Reference Fornander, Frykholm, Fritzsche, Araya, Nevin, Werner, Cakir, Persson, Garcin, Beuning, Mehlig, Modesti and Westerlund2016; Öz et al., Reference Öz, Sriram and Westerlund2019; Sharma et al., Reference Sharma, Sriram, Holmstrom and Westerlund2020). Importantly, the use of a small fraction of fluorescent lipids in the bilayers makes it possible to follow the passivation process and to investigate the integrity of the bilayer before the experiments are started. Recently, the use of a combination of dopamine and methoxypoly(ethylene glycol) for nanochannel passivation was reported (Liu et al., Reference Liu, Movahed, Dangi, Pan, Kaur, Bilinovich, Faison, Leighton, Wang, Williams and Riehn2020).
Optical DNA mapping
ODM is an umbrella term for methods that allow visualization of intact DNA molecules, up to Mbp in size, to obtain sequence information with approximately kbp resolution. The length scales covered by ODM perfectly match the gap between commonly used DNA analysis methods such as fluorescence in-situ hybridization (chromosomal level) and DNA sequencing (base pair level). In ODM the DNA needs to be stretched, which can be done either on a glass surface by molecular combing (Bensimon et al., Reference Bensimon, Simon, Chiffaudel, Croquette, Heslot and Bensimon1994; Michalet et al., Reference Michalet, Ekong, Fougerousse, Rousseaux, Schurra, Hornigold, Slegtenhorst, Wolfe, Povey, Beckmann and Bensimon1997; Neely et al., Reference Neely, Deen and Hofkens2011) or in nanofluidic channels (Müller and Westerlund, Reference Müller and Westerlund2017; Jeffet et al., Reference Jeffet, Margalit, Michaeli and Ebenstein2021b), and visualized using a fluorescence microscope. Stretching the DNA by confinement provides uniform stretching and the extension of the molecule along the channel scales linearly with the contour length of the DNA (Tegenfeldt et al., Reference Tegenfeldt, Prinz, Cao, Chou, Reisner, Riehn, Wang, Cox, Sturm, Silberzan and Austin2004), thus facilitating direct extraction of sequence information from specifically labeled molecules. This makes nanofluidic channels an attractive platform for ODM. Throughput can easily be increased by parallelized systems with arrays of nanofluidic channels (Lam et al., Reference Lam, Hastie, Lin, Ehrlich, Das, Austin, Deshpande, Cao, Nagarajan, Xiao and Kwok2012) and the introduction of a commercialized platform for ODM using nanochannels (see section ‘Recent applications of optical DNA mapping’) has led to an increasingly widespread use of this application. In this chapter we will present ODM with focus primarily on the methodological aspects and only briefly discuss recent studies using the methodologies for high-throughput genomic analysis.
In order to access information about the underlying DNA sequence using ODM, different labeling schemes have been developed to generate a sequence-specific pattern along the extended DNA molecule. Initial efforts in the field were made by Schwartz et al. (Reference Schwartz, Li, Hernandez, Ramnarain, Huff and Wang1993), who used restriction enzymes to generate sequence-specific double-stranded breaks on DNA molecules when trapped in agarose gels. Present labeling approaches that are compatible with nanochannel-based analysis can be divided into two main groups, enzyme-based labeling, and affinity-based labeling (Fig. 10).
Enzyme-based labeling
Sequence-specific labeling of DNA using enzymes can be performed in different ways. In 2007 Jo et al. (Reference Jo, Dhingra, Odijk, De Pablo, Graham, Runnheim, Forrest and Schwartz2007) introduced nick labeling, where nicking enzymes were used to visualize sequence-specific sites along the DNA (Fig. 11a). In short, a nicking enzyme creates a single-stranded break in the DNA at a recognition site, typically 4–7 bp in size and specific for the enzyme used. The nick is then repaired by a processive DNA polymerase that incorporates fluorescently labeled nucleotides and the position(s) of these labels can be mapped as the DNA is stretched. The contour of the DNA is visualized by staining the backbone of the DNA molecule, typically with the bis-intercalating dye YOYO-1. Several additions have been made to expand the nick labeling toolbox, including nick-flap labeling (Das et al., Reference Das, Austin, Akana, Deshpande, Cao and Xiao2010), dual nick labeling (Hastie et al., Reference Hastie, Dong, Smith, Finklestein, Lam, Huo, Cao, Kwok, Deal, Dvorak, Luo, Gu and Xiao2013), and highly specific Cas9-based labeling strategies (McCaffrey et al., Reference McCaffrey, Sibert, Zhang, Zhang, Hu, Riethman and Xiao2016; Zhang et al., Reference Zhang, Chan, Sugerman, Lee, Lam, Bocklandt, Cao and Hastie2018; Abid et al., Reference Abid, Young, McCaffrey, Raseley, Varapula, Wang, Piazza, Mell and Xiao2020; Uppuluri et al., Reference Uppuluri, Jadhav, Wang and Xiao2021).
Nick-flap labeling utilizes a DNA polymerase that lacks exonuclease activity and displaces the nucleotides at the nicking site instead of removing them, generating a single-stranded DNA flap. In this way, dual-labeling can be achieved, with the nicking site labeled by incorporation of fluorescently labeled nucleotides, and the single-stranded DNA flap labeled using a complementary, labeled oligonucleotide (Das et al., Reference Das, Austin, Akana, Deshpande, Cao and Xiao2010). Another approach to obtain dual labels is to run the nick labeling reaction twice, using two different restriction enzymes and an intermediate purification step, as was first demonstrated by Hastie et al. (Reference Hastie, Dong, Smith, Finklestein, Lam, Huo, Cao, Kwok, Deal, Dvorak, Luo, Gu and Xiao2013).
A main limitation of traditional nicking enzymes is the limited specificity (4–7 bp). McCaffrey et al. addressed this issue by using a CRISPR/Cas9-based approach, which provides a 20 bp recognition site that can be designed at will (McCaffrey et al., Reference McCaffrey, Sibert, Zhang, Zhang, Hu, Riethman and Xiao2016). The enzyme used was a mutated version of Cas9, a Cas9 nickase known as Cas9n or Cas9 D10A, that only nicks the DNA at the target site instead of making a double-stranded cut. Cas9-mediated nick labeling can be combined with traditional enzymatic nick labeling, to create multicolor optical maps and enable both global sequence motif mapping and labeling directed specifically to a sequence of interest, which has found applications in structural variation analysis and telomere characterization (McCaffrey et al., Reference McCaffrey, Sibert, Zhang, Zhang, Hu, Riethman and Xiao2016, Reference McCaffrey, Young, Lassahn, Sibert, Pastor, Riethman and Xiao2017; Levy-Sakin et al., Reference Levy-Sakin, Pastor, Mostovoy, Li, Leung, McCaffrey, Young, Lam, Hastie, Wong, Chung, Ma, Sibert, Rajagopalan, Jin, Chow, Chu, Poon, Lin, Naguib, Wang, Cao, Chan, Yip, Xiao and Kwok2019; Young et al., Reference Young, Abid, Kwok, Riethman and Xiao2020). Uppuluri et al. (Reference Uppuluri, Jadhav, Wang and Xiao2021) recently presented another multicolor labeling approach by combining Cas9-mediated nick labeling and the enzymatic direct labeling technology from BioNano Genomics (see section ‘Recent applications of optical DNA mapping’). They validated the usefulness of their methodology by quantifying copy numbers of D4Z4 repeats on chromosome 4q, detecting long non-interspersed elements 1 (LINE-1) insertions, and estimating telomere lengths. The specificity and extended use of Cas9-based labeling was further demonstrated by Abid et al. (Reference Abid, Young, McCaffrey, Raseley, Varapula, Wang, Piazza, Mell and Xiao2020), who identified differences between two genomes at single-base resolution, utilizing the fact that the CRISPR/Cas9 system has a strong requirement for the protospacer adjacent motif following the 20 bp recognition site. They also showed how mapping patterns can be customized by using a set of multiple sgRNAs of custom design in Cas9-mediated nick labeling. In line with the approach of McCaffrey et al. (Reference McCaffrey, Sibert, Zhang, Zhang, Hu, Riethman and Xiao2016), Zhang et al. (Reference Zhang, Chan, Sugerman, Lee, Lam, Bocklandt, Cao and Hastie2018) utilized a dead, fluorescently labeled, Cas9 enzyme which only binds to DNA without cutting any of its strands, to visualize specific sequence motifs with high precision.
Sequence-specific labels can also be introduced using methyltransferases (Klimasauskas and Weinhold, Reference Klimasauskas and Weinhold2007; Lukinavicius et al., Reference Lukinavicius, Lapiene, Stasevskij, Dalhoff, Weinhold and Klimasauskas2007). The methyltransferase recognizes a specific sequence (typically 4 bp) and attaches a synthetic co-factor at the site. The co-factor can either be pre-labeled with a fluorophore, or the fluorophore can be attached in a subsequent reaction. The short recognition site of a methyltransferase has been used to generate dense labels, so-called amplitude modulation profiles, along the DNA backbone (Grunwald et al., Reference Grunwald, Dahan, Giesbertz, Nilsson, Nyberg, Weinhold, Ambjornsson, Westerlund and Ebenstein2015) and methyltransferase-based labeling was utilized to obtain high-resolution optical maps also by Jeffet et al. (Reference Jeffet, Kobo, Su, Grunwald, Green, Nilsson, Eisenberg, Ambjörnsson, Westerlund, Weinhold, Shabat, Purohit and Ebenstein2016). By combining nick labeling protocols with methyltransferase-based labeling hybrid genetic/epigenetic maps, providing both genomic and methylation profiles simultaneously, can be obtained, as illustrated by Sharim et al. (Reference Sharim, Grunwald, Gabrieli, Michaeli, Margalit, Torchinsky, Arielly, Nifker, Juhasz, Gularek, Almalvez, Dufault, Chandra, Liu, Bhattacharya, Chen, Vilain, Wagner, Pevsner, Reifenberger, Lam, Hastie, Cao, Barseghyan, Weinhold and Ebenstein2019) and Gabrieli et al. (Reference Gabrieli, Michaeli, Avraham, Torchinsky, Margalit, Schütz, Juhasz, Coruh, Arbib, Zhou, Law, Weinhold and Ebenstein2022).
Besides nick labeling and methyltransferase-based labeling, there are also other enzymatic reactions that can be used to obtain specific labeling for extracting information beyond the DNA sequence. Examples include labeling of DNA hydroxymethylation (Michaeli et al., Reference Michaeli, Shahal, Torchinsky, Grunwald, Hoch and Ebenstein2013; Gilat et al., Reference Gilat, Tabachnik, Shwartz, Shahal, Torchinsky, Michaeli, Nifker, Zirkin and Ebenstein2017; Gabrieli et al., Reference Gabrieli, Sharim, Nifker, Jeffet, Shahal, Arielly, Levi-Sakin, Hoch, Arbib, Michaeli and Ebenstein2018) and DNA damage (Zirkin et al., Reference Zirkin, Fishman, Sharim, Michaeli, Don and Ebenstein2014; Müller et al., Reference Müller, Dvirnas, Andersson, Singh, Sriram, Johansson, Ebenstein, Ambjörnsson and Westerlund2019; Torchinsky et al., Reference Torchinsky, Michaeli, Gassman and Ebenstein2019).
Affinity-based labeling
In contrast to enzyme-based labeling strategies, affinity-based labeling does not rely on any enzymatic reactions to generate the sequence-specific pattern on DNA. Instead, affinity-based labeling relies either on the difference in melting temperature of AT and GC base pairs, so-called denaturation mapping (Reisner et al., Reference Reisner, Larsen, Silahtaroglu, Kristensen, Tommerup, Tegenfeldt and Flyvbjerg2010), or on molecules with a sequence preference when binding to DNA, such as in competitive binding-based approaches (Nyberg et al., Reference Nyberg, Persson, Berg, Bergström, Fransson, Olsson, Persson, Stålnacke, Wigenius, Tegenfeldt and Westerlund2012).
Developed by Reisner et al. (Reference Reisner, Larsen, Silahtaroglu, Kristensen, Tommerup, Tegenfeldt and Flyvbjerg2010), denaturation mapping utilizes the difference in melting temperature between GC base pairs, connected by three hydrogen bonds, and AT base pairs, connected by two hydrogen bonds. The entire contour of the DNA is stained with YOYO-1 prior to carefully raising the temperature to a level where AT base pairs melt, with subsequent YOYO-1 dissociation, while GC base pairs remain intact, with YOYO-1 still bound. As a result, the emission intensity will be bright in GC-rich regions, and dark in AT-rich regions, along the extended DNA molecule. One advantage of denaturation mapping is that it can be done either on-chip (Reisner et al., Reference Reisner, Larsen, Silahtaroglu, Kristensen, Tommerup, Tegenfeldt and Flyvbjerg2010; Welch et al., Reference Welch, Sladek, Dewar and Reisner2012) or prior to sample loading (Marie et al., Reference Marie, Pedersen, Bauer, Rasmussen, Yusuf, Volpi, Flyvbjerg, Kristensen and Mir2013; Łopacińska-Jørgensen et al., Reference Łopacińska-Jørgensen, Pedersen, Bak, Mehrjouy, Sørensen, Østergaard, Bilenberg, Kristensen, Taboryski, Flyvbjerg, Marie, Tommerup and Silahtaroglu2017). The most recent studies using denaturation mapping have been focused on stretching very long DNA molecules by hydrodynamic flow in a shallow nanoslit. In this way, structural variations in human DNA were identified (Marie et al., Reference Marie, Pedersen, Bærlocher, Koprowska, Pødenphant, Sabatel, Zalkovskij, Mironov, Bilenberg, Ashley, Flyvbjerg, Bodmer, Kristensen and Mir2018).
With competitive binding-based mapping, it is possible to circumvent both the need for enzymatic reactions (enzymatic labeling approaches), as well as precise temperature control (denaturation mapping). DNA is instead stained by addition of a non-selective, fluorescent dye, such as YOYO-1, and a sequence-selective, non-fluorescent molecule, such as the AT-selective netropsin, mixed at a pre-defined ratio (Nyberg et al., Reference Nyberg, Persson, Berg, Bergström, Fransson, Olsson, Persson, Stålnacke, Wigenius, Tegenfeldt and Westerlund2012; Nilsson et al., Reference Nilsson, Emilsson, Nyberg, Noble, Stadler, Fritzsche, Moore, Tegenfeldt, Ambjörnsson and Westerlund2014). Netropsin will block the AT-sites from YOYO-1, rendering an emission intensity profile, which, as for denaturation mapping, will appear dark in AT-rich regions, and bright in GC-rich regions. The use of other fluorescent dyes, such as TOTO-3, and sequence-selective molecules, such as the GC-selective actinomycin D, in competitive binding-based optical mapping has also been reported (Esmek et al., Reference Esmek, Erichlandwehr, Mors, Czech-Sioli, Therre, Günther, Grundhoff, Fischer and Fernandez-Cuesta2021). Moreover, Lee et al. (Reference Lee, Kawamoto, Vaijayanthi, Park, Bae, Kim-Ha, Sugiyama and Jo2018a) demonstrated how a fluorescent TAMRA-polypyrrole could be used for AT sequence visualization on DNA molecules without the need of an intercalating dye, relying on the TAMRA-polypyrrole itself to generate the intensity profile.
Overall, there are advantages and disadvantages of all the labeling approaches described above, and different strategies will be optimal for different applications. As demonstrated by Müller et al. (Reference Müller, Dvirnas, Andersson, Singh, Sriram, Johansson, Ebenstein, Ambjörnsson and Westerlund2019), it is also possible to combine enzymatic labeling with affinity-based labeling, to obtain a dual read-out, which was recently further demonstrated by Torstensson et al. (Reference Torstensson, Goyal, Johnning, Westerlund and Ambjörnsson2021). This principle could in the future be used to increase the specificity of ODM, as well as to combine genomic data with epigenetic information.
Recent applications of optical DNA mapping
ODM was commercialized by BioNano Genomics in 2012. Their instruments Irys® and Saphyr® rely on enzyme-based labeling in combination with nanofluidic chips with automized sample loading, image acquisition, and data analysis. The technology initially relied on nick labeling, but recently an enzymatic protocol for direct labeling of a 6 bp recognition site was introduced, with improved DNA integrity during the labeling procedure that results in longer molecules. The system has been used for a variety of applications including de novo assembly of complex genomes as well as for studying genetically linked diseases.
Levy-Sakin et al. (Reference Levy-Sakin, Pastor, Mostovoy, Li, Leung, McCaffrey, Young, Lam, Hastie, Wong, Chung, Ma, Sibert, Rajagopalan, Jin, Chow, Chu, Poon, Lin, Naguib, Wang, Cao, Chan, Yip, Xiao and Kwok2019) used ODM on the BioNano Genomics platform to investigate large structural variations across the genomes of 154 individuals from 26 different populations. They were able to identify an abundance of large insertions and deletions, in many cases flanked by repetitive elements difficult to resolve by short-read sequencing, and the presence of population-specific structural variations patterns. By including Cas9-mediated nick-labeling in the assay they could also characterize acrocentric chromosome p-arms and the study highlights how ODM can be useful in characterizing complex regions such as subtelomeric and pericentromeric areas. Young et al. (Reference Young, Abid, Kwok, Riethman and Xiao2020) undertook a comprehensive analysis of the subtelomere structure and variation in human genomes, by performing whole-genome ODM of the same sample set. Wu et al. (Reference Wu, Turner, Nguyen, Raviram, Erb, Santini, Luebeck, Rajkumar, Diao, Li, Zhang, Jameson, Corces, Granja, Chen, Coruh, Abnousi, Houston, Ye, Hu, Yu, Kim, Law, Verhaak, Hu, Furnari, Chang, Ren, Bafna and Mischel2019) used ODM in combination with ultrastructural imaging and computational analysis of whole-genome sequencing data to demonstrate the structure of extrachromosomal DNA in humans, DNA which is increasingly associated with oncogenes. The organization of extrachromosomal DNA was also studied by Luebeck et al. (Reference Luebeck, Coruh, Dehkordi, Lange, Turner, Deshpande, Pai, Zhang, Rajkumar, Law, Mischel and Bafna2020), who developed a computational method, AmpliconReconstructor (AR), for integrating ODM from the BioNano Genomics platform with next-generation sequencing to resolve focal copy number amplification of oncogenes. Gabrieli et al. (Reference Gabrieli, Sharim, Nifker, Jeffet, Shahal, Arielly, Levi-Sakin, Hoch, Arbib, Michaeli and Ebenstein2018) combined the enzyme-based ODM technology of BioNano Genomics with labeling of epigenetic 5-hydroxymethylcytosine (5-hmC) marks. The genetic/epigenetic assay allowed sensitive quantification of 5-hmC marks in blood at the single DNA molecule level. Sharim et al. (Reference Sharim, Grunwald, Gabrieli, Michaeli, Margalit, Torchinsky, Arielly, Nifker, Juhasz, Gularek, Almalvez, Dufault, Chandra, Liu, Bhattacharya, Chen, Vilain, Wagner, Pevsner, Reifenberger, Lam, Hastie, Cao, Barseghyan, Weinhold and Ebenstein2019) utilized an enzymatic labeling reaction, distinguishing methylated from non-methylated cytosines, in combination with nick labeling to create hybrid genetic/epigenetic genomic maps, enabling identification of methylation patterns across the genome.
Marie et al. (Reference Marie, Pedersen, Bærlocher, Koprowska, Pødenphant, Sabatel, Zalkovskij, Mironov, Bilenberg, Ashley, Flyvbjerg, Bodmer, Kristensen and Mir2018) demonstrated how denaturation mapping could be combined with on-chip lysis and DNA extraction of single cells, allowing mapping of structural variations at the single DNA molecule level. Competitive binding-based ODM has recently found applications within bacterial diagnostics. Müller et al. (Reference Müller, Nyblom, Johnning, Wrande, Dvirnas, Sriram, Giske, Ambjörnsson, Sandegren, Kristiansson and Westerlund2020) demonstrated how competitive binding-based ODM could be used to identify bacteria with high precision in both complex mixtures as well as from uncultivated urine samples, circumventing the time-consuming step of bacterial cultivation prior to diagnosis. ODM based on competitive binding has also been used for identification and tracing of bacterial plasmids related to antibiotic resistance (Nyberg et al., Reference Nyberg, Quaderi, Emilsson, Karami, Lagerstedt, Müller, Noble, Hammarberg, Nilsson, Sjöberg, Fritzsche, Kristiansson, Sandegren, Ambjörnsson and Westerlund2016; Müller et al., Reference Müller, Karami, Nyberg, Pichler, Torche Pedreschi, Quaderi, Fritzsche, Ambjörnsson, Åhrén and Westerlund2016b). Utilizing the highly specific CRISPR/Cas9 system, the same research group developed an assay to detect antibiotic resistance genes on single plasmid molecules. By targeting the gene of interest, circular plasmids carrying that specific gene can be linearized by Cas9, and by observing where linearization occurred, genes could be detected with high precision and reliability (Muller et al., Reference Muller, Rajer, Frykholm, Nyberg, Quaderi, Fritzsche, Kristiansson, Ambjornsson, Sandegren and Westerlund2016a). This plasmid typing, using competitive binding-based mapping in combination with CRISPR/Cas9 restriction, was applied by Bikkarolla et al. (Reference Bikkarolla, Nordberg, Rajer, Müller, Kabir, Sriram, Dvirnas, Ambjörnsson, Giske, Navér, Sandegren and Westerlund2019) to trace and characterize plasmids from an hospital outbreak of antibiotic multi-resistant bacteria (Fig. 11b), by Lindblom et al. (Reference Lindblom, Sriram, Muller, Oz, Sandstrom, Ahren, Westerlund and Karami2019) to investigate possible transfer of plasmids carrying extended-spectrum β-lactamase genes in sequential Enterobacteriaceae infections, and by Lin et al. (Reference Lin, Sewunet, Sriram, Giske and Westerlund2020) as a tool to detect clonal spread of multidrug-resistant bacteria in a hospital setting.
Outstanding challenges
The commercialization of ODM in nanochannels has led to an increased accessibility of this application. The available platform is capable of very high-throughput mapping of the human genome, resulting in genome maps with very high coverage. The main challenges in the field are therefore rather linked to other aspects of the methodology, such as sample preparation, which is a complex and time-consuming process that remains a bottleneck. An important future direction for DNA mapping would be to combine sample preparation and mapping in a single device, thus minimizing the hands-on manipulation necessary to obtain the optical DNA maps. An ultimate goal of this procedure would be to perform ODM of single cells in such a device, to reveal genetic differences between cells.
Another important challenge is to maximize the information obtained from the DNA map. One approach to this is to increase the genetic information output, by for example combining multiple labeling enzymes or combining enzyme-based and competitive binding-based labeling schemes. The latter has recently been demonstrated (Torstensson et al., Reference Torstensson, Goyal, Johnning, Westerlund and Ambjörnsson2021), as briefly discussed above. Alternatively, labeling to obtain genetic information can be combined with other types of labels, such as epigenetic markers (Sharim et al., Reference Sharim, Grunwald, Gabrieli, Michaeli, Margalit, Torchinsky, Arielly, Nifker, Juhasz, Gularek, Almalvez, Dufault, Chandra, Liu, Bhattacharya, Chen, Vilain, Wagner, Pevsner, Reifenberger, Lam, Hastie, Cao, Barseghyan, Weinhold and Ebenstein2019; Margalit et al., Reference Margalit, Avraham, Shahal, Michaeli, Gilat, Magod, Caspi, Loewenstein, Lahat, Friedmann-Morvinski, Kariv, Rosin-Arbesfeld, Zirkin and Ebenstein2020; Gabrieli et al., Reference Gabrieli, Michaeli, Avraham, Torchinsky, Margalit, Schütz, Juhasz, Coruh, Arbib, Zhou, Law, Weinhold and Ebenstein2022) or labeling of DNA damage (Müller et al., Reference Müller, Dvirnas, Andersson, Singh, Sriram, Johansson, Ebenstein, Ambjörnsson and Westerlund2019), thereby adding an additional layer of information to the optical DNA map. The development of labels that can be used to obtain such additional information that is not yet accessible remains a challenge in ODM.
DNA–protein interactions
To study DNA–protein interactions using nanofluidic channels is an attractive alternative to conventional single molecule manipulation techniques like optical or magnetic tweezers, as the use of channels circumvents the need to tether the DNA molecule to beads or surfaces, which poses limitations in the type and size of DNA that can be studied and/or preclude investigations of events taking place at the ends of the DNA molecule. Nanofluidic channels can also, similarly to other techniques, be used to characterize proteins that affect the physical properties of DNA. The first examples of DNA–protein interactions investigated using nanochannels were reported in 2005, and include the demonstration of restriction mapping in nanochannels (Riehn et al., Reference Riehn, Lu, Wang, Lim, Cox and Austin2005) and a study of sequence-specific binding of the LacI repressor to DNA (Wang et al., Reference Wang, Tegenfeldt, Reisner, Riehn, Guan, Guo, Golding, Cox, Sturm and Austin2005). Although the potential for studying DNA–protein interactions in nanofluidic channels is as great as that of studying DNA itself, the progress in this field has been significantly slower, possibly hampered by the challenge of avoiding non-specific binding to the channel walls, as discussed above (section ‘Passivation’).
In the following sections we will review studies on (i) proteins that compact DNA; (ii) filamentous DNA–protein complexes; (iii) proteins interacting with DNA ends; and (iv) other DNA–protein interactions (Fig. 12). The main goal is to highlight the usefulness of nanochannels for studying DNA–protein interactions, but we will also point out important biophysical insights that the studies have provided.
DNA compacting proteins
In all living cells, compaction of DNA is essential for organization of the genetic material and serves several purposes. By interactions with different proteins the DNA can be densely packed, allowing large amounts of DNA to fit into small environments. A condensed structure also provides protection of DNA from degradation and enables regulation of gene expression. Given the importance of these processes, studying proteins that act to compact DNA and their effect on the physical properties and the structure of DNA is highly interesting and we will discuss below several examples of such studies using nanofluidic channels.
Chromatin
Eukaryotic chromatin is comprised of DNA wrapped up on histone protein complexes, forming a densely packed structure that enables fitting the meter-long DNA in each cell into the cell nucleus. The structure of chromatin DNA plays a crucial role in gene expression and silencing and is therefore subject to heavy regulation, to a large extent governed by epigenetic modifications on the histone tails. To study linearized chromatin fibers and map genetic and/or epigenetic modifications is thus of great interest and nanofluidic channels could potentially be well suited for this purpose.
Streng et al. reported the first study of chromatin stretched in nanochannels (Streng et al., Reference Streng, Lim, Pan, Karpusenka and Riehn2009), where the chromatin used was reconstituted from a commercially available, unfractionated whole histone mixture on blunt-ended λ-DNA that was fluorescently labeled using YOYO-1. The extension of chromatin (in 80 × 80 nm2 channels) was compared to that of bare DNA and chromatin was found to be 2.5 times more compact. However, the authors discuss that the chromatin was probably not in the form of condensed 30 nm fibers but rather a disordered 10 nm fiber with heterogeneous linker lengths, and physical parameters for this non-homogeneous chromatin could be given as rough estimates only.
More recently, Basak et al. (Reference Basak, Rosencrans, Yadav, Yan, Berezhnoy, Chen, van Kan, Nordenskiöld, Zinchenko and van der Maarel2020) investigated the compaction, dynamics, and internal motion of nanoconfined chromatin fibers. They used over-expressed, refolded, and purified histone octamers to reconstitute chromatin on T4-DNA, and nanofluidic devices with two different degrees of confinement (50 × 70 nm2 and 120 × 130 nm2, respectively). The compaction of DNA was followed by monitoring the extension of bare T4-DNA and chromatin fibers formed with an octamer loading of 50 or 100%, respectively, when confined to nanochannels (Fig. 13a). The reconstituted chromatin fibers were found to not achieve a fully compacted state and fluctuations between relatively more and less open configurations were observed. The internal motion of DNA was further investigated through analysis of fluorescence correlation, from which it was deduced that this motion is governed mainly by the dynamics of uncompressed linker DNA.
Identification of histone tail modifications using fluorescent antibodies on chromatin stretched in nanochannels was demonstrated by Lim et al. (Reference Lim, Karpusenko, Blumers, Streng and Riehn2013). In this study, chromatin was reconstituted from calf thymus, HeLa core, and chicken erythrocyte histones and antibodies detecting trimethylation of lysine 4 and acetylation of lysine 9, respectively, on histone 3 (H3) was used to monitor histone modifications. The ratio between these modifications varies among chromatin origins and discrimination between the three histone samples was demonstrated. Channels wider than traditionally used for DNA analysis (200 × 200 nm2), resulting in moderate stretching, were used in this study to avoid stripping of histones from the DNA and antibody aggregation.
Another approach to overcome the risk of stripping the histones off the DNA was presented by Matsuoka et al. (Reference Matsuoka, Kim, Huang, Douville, Thouless and Takayama2012), who used an elastomeric nanochannel device where the channel dimensions can be adjusted post introduction of the chromatin. The chromatin in this study was isolated from HeLa cells and by co-staining with DAPI and antibodies against methylated histone H3 and acetylated histone H4, respectively, the authors were able to visualize chromatin condensation and localization of epigenetic marks on the linearized chromatin. The study points out several advantages of using nanochannels for chromatin analysis: no extra chemistry is required to stretch the chromatin, chromatin extracted from cells can be studied, and multicolor staining enables analysis of several epigenetic markers simultaneously. The latter, together with the single chromatin approach, is crucial for rare samples.
Bacterial nucleoid-associated proteins
The bacterial chromosome is organized into a structure called the nucleoid, where the DNA forms nucleoprotein complexes with several proteins referred to as nucleoid-associated proteins (NAPs) (Azam and Ishihama, Reference Azam and Ishihama1999). Among the most abundant and universally conserved NAPs are the heat-stable nucleoid structuring protein (H-NS) and the heat unstable nucleoid structuring protein (HU) and they have both been subject to studies using nanofluidic channels.
Zhang et al. (Reference Zhang, Guttula, Liu, Malar, Ng, Dai, Doyle, van Kan and van der Maarel2013a) studied the effect of H-NS on DNA conformation using two different set-ups of nanofluidic channels. In steady-state experiments, an array of nanochannels was used to visualize the extension of single DNA molecules, pre-incubated with H-NS, whereas a cross-channel device (Zhang et al., Reference Zhang, Jiang, Liu, Doyle, van Kan and van der Maarel2013b), described in section ‘Devices for dynamic studies’, enabled monitoring the time-dependent response of DNA upon exposure to H-NS. DNA was found to elongate or condense, depending on protein concentration, ionic strength, and presence of divalent ions (Mg2+), and H-NS mediated bridging of DNA was observed.
Later, a similar study on HU was presented (Guttula et al., Reference Guttula, Liu, van Kan, Arluison and van der Maarel2018). Contrary to the findings for H-NS, nanoconfined DNA that had been pre-incubated with HU showed a decrease in extension with increasing protein concentrations, irrespective of ionic strength and buffer composition (presence of Mg2+ ions). Over-threshold concentrations of protein caused DNA to collapse into a condensed conformation.
Another DNA-binding protein within the bacterial nucleoid that has been studied using nanofluidic channels is Hfq, a phylogenetically conserved and abundant bacterial protein with multiple regulatory functions related to nucleic acids metabolism. Jiang et al. (Reference Jiang, Zhang, Guttula, Liu, van Kan, Lavelle, Kubiak, Malabirade, Lapp, Arluison and van der Maarel2015) observed compaction of DNA in the presence of wild-type, full length Hfq. The decrease in molecular extension was, just as for H-NS, explained by the propensity of Hfq to bridge distal DNA segments and at over-threshold concentrations of Hfq the DNA was found to collapse into a condensed form. In a follow-up study, truncated Hfq proteins, representing either the proposed DNA-binding C-terminal domain (Hfq-CTR) or the N-terminal region (Hfq-NTR), were investigated (Malabirade et al., Reference Malabirade, Jiang, Kubiak, Diaz-Mendoza, Liu, van Kan, Berret, Arluison and van der Maarel2017). As a result, a proposed model for DNA binding and bridging by Hfq, where the C-terminal of the protein plays a pivotal role, with possible implications for protein-binding-related gene regulation, was presented. Recently, the effect of intramolecular DNA motion on the mobility of Hfq was investigated using a nanofluidic set-up, demonstrating a dynamic coupling between confinement-induced slowing of DNA internal motion and enhanced protein mobility (Yadav et al., Reference Yadav, Basak, Yan, van Kan, Arluison and van der Maarel2020). Subsequently, a mutant form of Hfq, lacking the C-terminal domain and having a lower DNA binding affinity, was found to have a significantly higher mobility on DNA compared to the wild-type protein (Tan et al., Reference Tan, Basak, Yadav, van Kan, Arluison and van der Maarel2022). The mutant also demonstrated a reverse dependence on the internal motion of DNA, showing reduced protein mobility with slower internal motion.
Other DNA compacting proteins
Condensation and regulation of DNA accessibility are key activities also in virus life cycles. The nucleocapsid protein NC, alone and as part of the polyprotein Gag, from human immunodeficiency virus type 1 (HIV-1) has been investigated using nanochannels in two studies by Jiang et al. (Reference Jiang, Humbert, Sriram, Lequeu, Lin, Mely and Westerlund2019, Reference Jiang, Humbert, S, Rouzina, Mely and Westerlund2021a). Protein concentration-dependent compaction of nanoconfined dsDNA was demonstrated, and Gag, which has a higher binding affinity to DNA than NC alone, was shown to condense DNA at a much lower concentration than NC (Jiang et al., Reference Jiang, Humbert, Sriram, Lequeu, Lin, Mely and Westerlund2019). In the same study, the ability of NC to act as a chaperone and promote annealing of complementary sequences was investigated. By using λ-DNA, that has 12 bp single-stranded overhangs in each end, concatemerization and circularization in the presence of NC could be observed. Recently, a further investigation of the dynamics and kinetics of dsDNA compaction induced by NC was presented (Jiang et al., Reference Jiang, Humbert, S, Rouzina, Mely and Westerlund2021a). A dynamic device, first described by Öz et al. (Reference Öz, Sriram and Westerlund2019) and discussed in section ‘Devices for dynamic studies', enabled real-time observation of single DNA molecules upon addition of NC protein and the dynamics of the condensation process could thereby be followed, from locally compacted regions on the DNA to the fully condensed state (Fig. 13b). The NC-promoted annealing of complementary, single-stranded overhangs of DNA could also be visualized in real time in the dynamic device and the authors concluded that the chaperone activity is not directly correlated with the DNA condensation propensity of NC.
Another virus nucleocapsid protein studied in nanofluidic channels is the hepatitis C virus core protein (HCVcp) (Sharma et al., Reference Sharma, Sriram, Holmstrom and Westerlund2020). HCVcp-induced compaction of DNA was monitored in real time, and repeated compaction and relaxation of DNA could be visualized by iterative flushing of the reaction chamber with either HCVcp (initiating compaction) or Proteinase K (releasing the DNA by degradation of the protein). It was observed that compaction preferentially starts from the ends of linear DNA, whereas for a circular substrate compaction was initiated at random positions along the contour. These observations highlight two major advantages of nanofluidic channels, over many other single-molecule techniques: as the DNA is free in solution in the channels, without any tethers, events occurring at the ends of the molecule and on circular DNA, lacking ends that make it difficult to tether, can be studied.
Stretching and visualizing single DNA–protein complexes in nanofluidic channels can provide complementary information to confirm or add detail to structural predictions from X-ray crystallography and/or models of biomolecular functions. Frykholm et al. (Reference Frykholm, Berntsson, Claesson, De Battice, Odegrip, Stenmark and Westerlund2016) studied the compaction of nanoconfined DNA by the bacteriophage protein Cox, a multifunctional transcriptional regulator in P2-like bacteriophages. Cox forms oligomeric filaments and it has been proposed from the X-ray structure that DNA can be wrapped around these filaments, in a manner similar to how histones condense DNA in eukaryotic cells. The nanofluidic experiments confirmed that Cox compacts DNA and that the binding is highly cooperative, in agreement with the postulated model. By monitoring the DNA extension at different channel widths, it was illustrated how the physical properties of the complex at low protein loads resemble those of naked DNA, whereas they are governed by the stiffer Cox-filament at higher protein coverage.
In a multimodal study by Schmitt et al. (Reference Schmitt, Jiang, Camacho, Jonna, Hofer, Westerlund, Christie and Berntsson2018) the structure, properties, and functions of PrgB, a surface adhesin of Enterococcus faecalis, were investigated. Nanofluidic channels were used to analyze PrgB binding to long DNA fragments. Compaction of DNA was observed, with full condensation of DNA at over-threshold protein concentration and deduced cooperative binding at lower concentrations. The investigation also indicated a higher affinity of PrgB for ends of DNA substrates or, alternatively, for the single-stranded overhangs present on the DNA substrates used.
Filamentous DNA–protein complexes
A special class of DNA-binding proteins are proteins that form filaments on DNA. These filaments often have vastly different physical properties than DNA itself since the properties are governed mainly by the protein. The bacterial protein RecA and its human homolog Rad51 are key proteins in homologous recombination, a process for repairing DNA double-strand breaks (Kowalczykowski, Reference Kowalczykowski2015). Both RecA and Rad51 form stiff filaments on single- and double-stranded DNA. There are, however, significant differences between nucleoprotein filaments formed by the two proteins.
RecA forms homogeneous filaments along DNA and by using tapered channels, Frykholm et al. determined the persistence length of the filaments (Fig. 14) (Frykholm et al., Reference Frykholm, Alizadehheidari, Fritzsche, Wigenius, Modesti, Persson and Westerlund2014). The filaments are stiff enough for the Odijk theory, valid when the persistence length is larger than the channel dimensions (see section ‘Odijk regime (strong confinement)’), to apply. The value obtained from the nanochannel data (1.15 μm) was in good agreement with the value obtained using optical tweezers (0.96 μm) (Hegner et al., Reference Hegner, Smith and Bustamante1999), despite the different experimental set-ups where the nucleoprotein filaments were in solution without any tethering of the DNA when studied in nanochannels. This was a pioneering study as it constituted the first example of using nanofluidic channels for studying the physical properties of DNA–protein complexes.
Fornander et al. performed a similar study on Rad51 (Fornander et al., Reference Fornander, Frykholm, Fritzsche, Araya, Nevin, Werner, Cakir, Persson, Garcin, Beuning, Mehlig, Modesti and Westerlund2016), but as the Rad51 filaments, in contrast to the homogenous nucleoprotein filaments formed by RecA, are heterogeneous, a quantitative analysis using the Odijk theory was not as straightforward. The nucleoprotein filaments formed by Rad51 appeared in patches along the DNA, with stretches of naked DNA in between. By visualizing the filaments in the same type of tapered channels as was used in the study of RecA, Fornander et al. were able to characterize how the stretches of naked DNA affect the physical properties of the filaments. At weaker confinement (~650 × 150 nm2) the filaments form static kinks, suggested to arise when two patches meet. At stronger confinement (~230 × 150 nm2), the protein coverage on the DNA could be determined in more detail and the presence of regions of naked DNA could be confirmed. The potential biological role of the observed kinks was discussed.
Proteins interacting with DNA ends
As discussed previously, a major advantage of using nanofluidic channels for analyzing DNA–protein complexes, compared to traditional methods where DNA is anchored in the ends, is the possibility to study reactions occurring on DNA ends and intermolecular interactions between two or more large DNA molecules. DNA ends are of interest in many different aspects, such as in studies of telomeric DNA and in repair of DNA double-strand breaks. The use of nanofluidic channels for analysis of proteins interacting with DNA ends was first illustrated in the study on the virus nucleocapsid protein NC from HIV-1 discussed above (Jiang et al., Reference Jiang, Humbert, Sriram, Lequeu, Lin, Mely and Westerlund2019), where the protein was shown to promote annealing of the 12 bp long single-stranded overhangs of long DNA molecules to form concatemers of DNA.
A first example of investigating DNA repair using nanochannels was a study of the protein CtIP by Öz et al. (Reference Öz, Howard, Sharma, Törnkvist, Ceppi, Sriram, Kristiansson, Cejka and Westerlund2020). CtIP is a protein particularly known as a co-factor of MRN (Mre11-Rad50-Nbs1), a main player in the repair of double-strand breaks (Reginato and Cejka, Reference Reginato and Cejka2020; Tisi et al., Reference Tisi, Vertemara, Zampella and Longhese2020). Recent studies have however indicated that CtIP on its own can bind to and interact with DNA (Wilkinson et al., Reference Wilkinson, Martín-González, Kang, Northall, Wigley, Moreno-Herrero and Dillingham2019). Öz et al. focused on two effects of CtIP on DNA, the ability to compact DNA and the ability to bring two DNA ends in close proximity to allow annealing of broken ends. The latter was possible in this study since nanochannels enable efficient discrimination between linear and circular DNA (Alizadehheidari et al., Reference Alizadehheidari, Werner, Noble, Reiter-Schad, Nyberg, Fritzsche, Mehlig, Tegenfeldt, Ambjörnsson, Persson and Westerlund2015). Using λ-DNA, that has single-stranded and complementary overhangs, the ability of CtIP to anneal ends was investigated by determining the fraction of circular DNA formed in the presence of the protein. An efficient image-collection pipeline facilitated analysis of thousands of molecules at each condition (Fig. 15). Wild-type CtIP, in tetrameric form, was demonstrated to efficiently promote circle formation and this was suggested to be due to the protein's ability to bridge DNA and force the two DNA ends to come in close proximity. The formation of circular DNA was thus used as a readout for the potential of the protein to bridge DNA for two ends to meet. Enhanced circularization was not observed in the presence of protein mutants lacking tetramer formation ability.
Non-homologous end-joining (NHEJ) is a pathway for DNA double-strand break repair. While NHEJ in eukaryotes is a process that involves many different proteins for efficient repair, bacterial NHEJ uses only two proteins (Bowater and Doherty, Reference Bowater and Doherty2006). The homodimeric Ku finds and binds to DNA ends, and has the ability to bridge DNA ends, while Ligase D (LigD) stabilizes the bridging and ligates the two DNA ends back together. The first detailed study of bacterial NHEJ on the single DNA molecule level was performed by Öz et al. using nanofluidics and magnetic tweezers (Öz et al., Reference Öz, Wang, Guerois, Goyal, Sriram, Ropars, Sharma, Koca, Charbonnier, Modesti, Strick and Westerlund2021). The nanofluidics experiments revealed several important details on the functions of Ku and LigD. The first observation was that Ku alone can stably bridge DNA with as few as four bases complementary overhangs. Furthermore, Ku was found to bind cooperatively to the DNA ends and the number of Ku units in the bridges varied significantly. Another important observation was the fact that Ku was still bound at the ligation site several hours after ligation was completed (Fig. 16). The authors thus suggest that in vivo there must be another system present to remove Ku, and potentially LigD, from the substrate when the repair process is completed.
Other DNA–protein interactions
Nanofluidic channels have been used to investigate the physical characteristics of DNA upon interaction also with other proteins that do not fall under any of the categories discussed above. Three important examples are presented below.
The interaction between α-synuclein, a protein whose assembly process into amyloid fibers is directly related to the neurodegenerative disease Parkinson's disease, and DNA has been studied by Jiang et al. (Reference Jiang, Rocha, Westling, Kesarimangalam, Dorfman, Wittung-Stafshede and Westerlund2018, Reference Jiang, Rocha, Kumar, Westerlund and Wittung-Stafshede2021b). Using nanofluidic channels, in combination with other biophysical methods, binding to DNA by monomeric α-synuclein was confirmed and by studying the DNA–protein complex at different confinements, an increase in DNA persistence length, up to 30%, upon protein binding was observed (Jiang et al., Reference Jiang, Rocha, Westling, Kesarimangalam, Dorfman, Wittung-Stafshede and Westerlund2018). In the follow-up study, it was shown that truncation of the C-terminus of α-synuclein induces differential effects on DNA depending on the extent of the truncation (Jiang et al., Reference Jiang, Rocha, Kumar, Westerlund and Wittung-Stafshede2021b). Upon complete C-terminus removal (residues 1–97 of α-synuclein remaining) a more than 20% decrease in relative DNA extension was observed, in contrast to the increased extension observed for full-length and residue 1–119 α-synuclein variants.
Conformational changes of DNA upon binding of two different methyl-CpG-binding domain (MBD) proteins were investigated, using nanofluidic channels, by Liu et al. (Reference Liu, Movahed, Dangi, Pan, Kaur, Bilinovich, Faison, Leighton, Wang, Williams and Riehn2020). They observed that MeCP2 compacted DNA to less than 50% of its native extension. From atomic force microscopy experiments, the authors deduced that compaction was caused by protein-mediated bridges between distant DNA stretches, creating local loop configurations. MBD2, on the other hand, did not affect DNA configuration and no change in DNA extension was observed upon binding of either unlabeled MBD2 or ATTO-565 labeled MBD2. The authors thus conclude that MBD2, rather than MeCP2, is a promising probe for epigenetic mapping applications, targeting methylated CpG sites.
Roushan et al. investigated the superdiffusive motion of DNA molecules confined to nanochannels, linked to the catalytic activity of DNA-modifying enzymes (Roushan et al., Reference Roushan, Azad, Movahed, Ray, Livshits, Lim, Weninger and Riehn2018). In the presence of T4 DNA ligase, the mismatch DNA repair protein MutS or E. coli DNA ligase, respectively, and the respective energetic cofactor of each enzyme (ATP for T4 DNA ligase and MutS, NAD+ for E. coli DNA ligase), a directed motion of nanoconfined DNA was observed and characterized. Although the displacement resembles that of a motor protein, the authors suggest that a motor-like action is not taking place. Instead, they propose a kinetic model depending on an asymmetry of ionic strengths between the termini of the DNA molecule within the nanochannel, and that this ion concentration polarization is maintained by the enzymatic reaction.
Outstanding challenges
In the majority of the published studies on DNA–protein interactions using nanofluidic channels the experiments were performed at equilibrium, in the sense that the protein was added to DNA outside of the device and the nanofluidic channel was used merely as a tool for visualization of the DNA–protein complexes formed. Moreover, most studies included only one protein binding to DNA. The obvious future direction for DNA–protein interaction studies using nanofluidics is thus to investigate real-time dynamics of complex reactions where more than one protein is involved, either several proteins directly interacting with DNA or indirectly through protein–protein interactions with the DNA-binding protein. Such experiments put increasing demands on device design and fabrication, as well as strategies for device passivation that need to be optimized for each specific study. The examples of device designs allowing real-time observations of DNA–protein interactions (Zhang et al., Reference Zhang, Jiang, Liu, Doyle, van Kan and van der Maarel2013b; Öz et al., Reference Öz, Sriram and Westerlund2019) presented in section ‘Devices for dynamic studies’ and the use of such devices to reveal reaction dynamics of simple DNA–protein systems (Zhang et al., Reference Zhang, Guttula, Liu, Malar, Ng, Dai, Doyle, van Kan and van der Maarel2013a; Sharma et al., Reference Sharma, Sriram, Holmstrom and Westerlund2020; Öz et al., Reference Öz, Wang, Guerois, Goyal, Sriram, Ropars, Sharma, Koca, Charbonnier, Modesti, Strick and Westerlund2021; Jiang et al., Reference Jiang, Humbert, S, Rouzina, Mely and Westerlund2021a) are initial steps taken in this direction. Development of more advanced nanofluidic devices will make it possible to fully explore the benefits of nanofluidics for studies of DNA–protein interactions.
Summary
The intention with this review has been to give a general overview of the usefulness of nanofluidic devices for analyzing long DNA molecules and their interactions with proteins. To do this, a thorough introduction to the polymer physics of confined DNA was initially presented. We then demonstrated how the nanofluidic devices can be tailored for the project at hand, how sequence information can be obtained from long DNA molecules by ODM and how DNA–protein interactions can be analyzed and understood using such devices.
With the development in all aspects of using nanofluidics for DNA analysis, we believe that the methodology now is ready to take a further step into broad use across scientific disciplines. The theoretical comprehension is now at a stage where it can be efficiently used for describing long DNA and its interactions in great detail. The key outstanding questions from the theory side deal with strong confinement, or, equivalently, low ionic strengths, where the neutral polymer model breaks down and the details of the DNA–wall electrostatic interactions become important (Chuang et al., Reference Chuang, Reifenberger, Bhandari and Dorfman2019). For the standard optical mapping analysis, the procedure has already been commercialized. We envision that an important step forward on the optical mapping track is to obtain multiple types of genetic and epigenetic information at the same time, using advanced labeling schemes along with modern imaging platforms, such as CoCoS microscopy (Jeffet et al., Reference Jeffet, Ionescu, Michaeli, Torchinsky, Perlson, Craggs and Ebenstein2021a). Another avenue to pursue is an integrated lab-on-a-chip platform aimed at high-throughput single-cell optical mapping, where initial efforts have already been made by Marie et al. (Reference Marie, Pedersen, Bærlocher, Koprowska, Pødenphant, Sabatel, Zalkovskij, Mironov, Bilenberg, Ashley, Flyvbjerg, Bodmer, Kristensen and Mir2018).
When it comes to using nanofluidic channels for studying DNA–protein interactions the studies presented here have paved the way for more complex analyses, where we foresee the possibility of investigating higher order interactions between DNA and several different proteins in real time with high spatial and temporal resolution, for example using super-resolution fluorescence microscopy. One important note is that the design of the actual nanofluidic device has been underexplored and more sophisticated designs can open up for detailed biophysical analysis. These efforts should in particular focus on studies that utilize the unique features of the nanofluidic channels in that they allow DNA stretching with negligible force, studies at high throughput and studies of events occurring at the ends of long DNA.
We hope that this review will inspire researchers across disciplines to use nanofluidic devices for their research. Options are vast for DNA analysis, but the concepts can also be extended to other important topics, where recent examples in our group include analysis of biological nanoparticles (Friedrich et al., Reference Friedrich, Block, Alizadehheidari, Heider, Fritzsche, Esbjörner, Westerlund and Bally2017) and heterogeneous catalysis in the liquid phase (Levin et al., Reference Levin, Fritzsche, Nilsson, Runemark, Dhokale, Ström, Sundén, Langhammer and Westerlund2019; Levin et al., Reference Levin, Lerch, Boje, Fritzsche, Sriram, Ström, Moth-Poulsen, Sundén, Hellman, Westerlund and Langhammer2022).
Financial support
We acknowledge funding from the European Research Council, in the form of an ERC Consolidator Grant (F.W., nanoDNArepair, no. 866238), the Swedish Research Council (F.W., 2020-03400), and the US National Science Foundation (K.D.D., CBET-2016879).
Conflict of interest
None.