1 Introduction
The importance of finite Larmor radius (FLR) effects in plasma physics is well documented (Braginskii Reference Braginskii1958; Roberts & Taylor Reference Roberts and Taylor1962; Braginskii Reference Braginskii1965; Rosenbluth & Simon Reference Rosenbluth and Simon1965; Liley Reference Liley1972; Callen et al. Reference Callen, Qu, Siebert, Carreras, Shaing and Spong1987; Hazeltine & Meiss Reference Hazeltine and Meiss1992; Mikhailovskii Reference Mikhailovskii1992; Hazeltine & Waelbroeck Reference Hazeltine and Waelbroeck1998; Sulem & Passot Reference Sulem and Passot2008; Hosking & Dewar Reference Hosking and Dewar2016; Goedbloed, Keppens & Poedts Reference Goedbloed, Keppens and Poedts2019). A broad class of models that incorporate FLR effects are those that fall under the fluid category, i.e. the momenta of the underlying particles are integrated out to yield mean field theories that describe the evolution of physical quantities such as density, fluid velocity, etc. The advantage of the fluid formalism stems from the fact that the complex dynamics of a multiparticle system is reduced to a few dynamical equations that are capable of accurately capturing its essential properties.
Fluid models that include FLR effects are often constructed by incorporating kinetic effects, e.g. by moving from particle phase-space coordinates to guiding centre coordinates (Hasegawa & Wakatani Reference Hasegawa and Wakatani1983; Hsu, Hazeltine & Morrison Reference Hsu, Hazeltine and Morrison1986; Brizard Reference Brizard1992; Smolyakov, Pogutse & Hirose Reference Smolyakov, Pogutse and Hirose1995; Belova Reference Belova2001); models with FLR contributions incorporate kinetic effects of importance such as Landau damping and gyroradius averaging (Hammett, Dorland & Perkins Reference Hammett, Dorland and Perkins1992; Beer & Hammett Reference Beer and Hammett1996; Snyder, Hammett & Dorland Reference Snyder, Hammett and Dorland1997; Waltz et al. Reference Waltz, Staebler, Dorland, Hammett, Kotschenreuther and Konings1997; Snyder & Hammett Reference Snyder and Hammett2001; Staebler, Kinsey & Waltz Reference Staebler, Kinsey and Waltz2005; Madsen Reference Madsen2013). A second approach involves expansions in the smallness of the Larmor radius as compared to a characteristic length scale of the system and the imposition of closures for higher-order moments (Macmahon Reference Macmahon1965; Kennel & Greene Reference Kennel and Greene1966; Bowers Reference Bowers1971; Pogutse, Smolyakov & Hirose Reference Pogutse, Smolyakov and Hirose1998; Goswami, Passot & Sulem Reference Goswami, Passot and Sulem2005; Simakov & Catto Reference Simakov, Catto and Sauter2006; Ramos Reference Ramos2005a, Reference Ramos2007; Passot & Sulem Reference Passot and Sulem2007; Ramos Reference Ramos2010, Reference Ramos2011; Passot, Sulem & Hunana Reference Passot, Sulem and Hunana2012; Passot, Sulem & Tassi Reference Passot, Sulem and Tassi2017; Pfefferlé, Hirvijoki & Lingam Reference Pfefferlé, Hirvijoki and Lingam2017). A third method uses the Hamiltonian framework to construct full and reduced magnetohydrodynamic (MHD) models endowed with FLR and other effects (Morrison & Hazeltine Reference Morrison and Hazeltine1984; Morrison, Caldas & Tasso Reference Morrison, Caldas and Tasso1984; Hsu et al. Reference Hsu, Hazeltine and Morrison1986; Hazeltine, Hsu & Morrison Reference Hazeltine, Hsu and Morrison1987; Brizard et al. Reference Brizard, Denton, Rogers and Lotko2008; Tassi et al. Reference Tassi, Morrison, Waelbroeck and Grasso2008; Izacard et al. Reference Izacard, Chandre, Tassi and Ciraolo2011; Waelbroeck & Tassi Reference Waelbroeck and Tassi2012; Comisso et al. Reference Comisso, Grasso, Waelbroeck and Borgogno2013; Lingam & Morrison Reference Lingam and Morrison2014; Lingam Reference Lingam2015b,Reference Lingamc; Passot, Sulem & Tassi Reference Passot, Sulem and Tassi2018). One of the chief advantages of Hamiltonian methods, as explained in the forthcoming sections, is that they are amenable to the extraction of naturally conserved quantities (the Casimirs) and analysing equilibria and stability.
The Hamiltonian formalism is deeply entwined with its twin approach, building models from an action principle – together, we will refer to them as the Hamiltonian and action principle (HAP) approach.Footnote 1 The HAP formalism has a long history in fluid dynamics and plasma physics – examples of seminal publications prior to the 20th century include Lagrange (Reference Lagrange1789), Clebsch (Reference Clebsch1857), von Helmholtz (Reference von Helmholtz1858), Clebsch (Reference Clebsch1859), Hanke (Reference Hanke1861) and Kirchhoff (Reference Kirchhoff1876).Footnote 2 A summary of modern developments in this area can be found in the reviews by Serrin (Reference Serrin and Flügge1959), Truesdell & Toupin (Reference Truesdell and Toupin1960), Seliger & Whitham (Reference Seliger and Whitham1968), Arnold (Reference Arnold1978), Morrison (Reference Morrison1982), Holm et al. (Reference Holm, Marsden, Ratiu and Weinstein1985), Morrison (Reference Morrison1998), Arnold & Khesin (Reference Arnold and Khesin1998), Morrison (Reference Morrison2005), Holm (Reference Holm2008), Morrison (Reference Morrison, Eliasson and Shukla2009), Lingam (Reference Lingam2015d), Sudarshan & Mukunda (Reference Sudarshan and Mukunda2016), Morrison (Reference Morrison2017), Tassi (Reference Tassi2017) and Webb (Reference Webb2018).
Using the action formalism has many advantages. For a starter, each term in the action has a clear physical meaning, which is not always the case when equations of motion have been derived using phenomenological or ad hoc assumptions. Another advantage is that theories derived from action principles are naturally energy conserving. In some cases, equations of motion that had not been derived using the HAP formalism were erroneously believed to conserve energy (see e.g. Scott Reference Scott2005, Reference Scott2007; Kimura & Morrison Reference Kimura and Morrison2014; Tronci et al. Reference Tronci, Tassi, Camporeale and Morrison2014). In addition, by performing an appropriate Legendre transformation, one can recover the Hamiltonian formalism, which is endowed with several advantages of its own. For a review of action principles in MHD models, we refer the reader to Newcomb (Reference Newcomb1962), Holm, Marsden & Ratiu (Reference Holm, Marsden and Ratiu1998), Morrison (Reference Morrison, Eliasson and Shukla2009), Lingam (Reference Lingam2015d), Webb (Reference Webb2018) and for the Hamiltonian formalism to Morrison & Greene (Reference Morrison and Greene1980), Morrison (Reference Morrison1982), Holm et al. (Reference Holm, Marsden, Ratiu and Weinstein1985), Morrison (Reference Morrison1998, Reference Morrison2005) and Tassi (Reference Tassi2017). In particular, we mention its significance in studying symmetric MHD and its properties (Andreussi, Morrison & Pegoraro Reference Andreussi, Morrison and Pegoraro2010, Reference Andreussi, Morrison and Pegoraro2012, Reference Andreussi, Morrison and Pegoraro2013, Reference Andreussi, Morrison and Pegoraro2016) and in constructing and analysing reduced MHD models (Morrison & Hazeltine Reference Morrison and Hazeltine1984; Hazeltine et al. Reference Hazeltine, Hsu and Morrison1987; Kuvshinov, Pegoraro & Schep Reference Kuvshinov, Pegoraro and Schep1994; Krommes & Kolesnikov Reference Krommes and Kolesnikov2004; Waelbroeck, Hazeltine & Morrison Reference Waelbroeck, Hazeltine and Morrison2009; Tassi et al. Reference Tassi, Morrison, Grasso and Pegoraro2010b; Tassi, Grasso & Pegoraro Reference Tassi, Grasso and Pegoraro2010a; Waelbroeck & Tassi Reference Waelbroeck and Tassi2012; Keramidas Charidakos, Waelbroeck & Morrison Reference Keramidas Charidakos, Waelbroeck and Morrison2015; Tassi et al. Reference Tassi, Grasso, Borgogno, Passot and Sulem2018; Tassi Reference Tassi2019).
Earlier we outlined different methods by which FLR effects can be incorporated into fluid models. It is worth noting that the Hamiltonian methods invoke the use of an interesting device – the gyromap, which was discovered in Morrison et al. (Reference Morrison, Caldas and Tasso1984) and subsequently employed in the likes of Hazeltine et al. (Reference Hazeltine, Hsu and Morrison1987) and Izacard et al. (Reference Izacard, Chandre, Tassi and Ciraolo2011). The gyromap is essentially a noncanonical transformation that maps the phase space to itself, and its chief advantage stems from the fact that it renders the noncanonical bracket of the gyroviscous MHD model identical to that of classical ideal MHD bracket (Morrison & Greene Reference Morrison and Greene1980) when expressed in terms of the new set of noncanonical variables; we will elaborate upon this point later in the paper.Footnote 3 The origin of the gyromap was not properly understood until an action principle analysis in Morrison, Lingam & Acevedo (Reference Morrison, Lingam and Acevedo2014) was applied to a specific two-dimensional (2-D) model, which assumed a particular ansatz for the internal energy and the gyromap. In this paper, we generalize the work of Morrison et al. (Reference Morrison, Lingam and Acevedo2014) to three dimensions, and present generic results in terms of freely specifiable functions. Furthermore, when we choose a particular ansatz for our FLR fluid model, we will use the physical principles of Larmor gyration to motivate the choice in detail. We will refer to this magnetofluid model as gyroviscous magnetohydrodynamics (GVMHD).
The paper is organized as follows. In § 2, we outline the necessary tools for carrying out an action formulation of three-dimensional (3-D) GVMHD. Then we proceed to build the action in § 3, where we motivate the reasoning behind the gyroviscous term. In § 4, the relevant equations of motion are presented and a particular choice of the gyroviscous ansatz is constructed. In § 6, we present the equivalent Hamiltonian formalism of this model. In § 6.2, we derive the GVMHD bracket and highlight the differences compared with 3-D ideal MHD. Finally, we summarize our results in § 7. Some of the salient auxiliary calculations are presented in the appendices.
2 The Lagrangian-variable approach to the action principle
In the first part of this section, we briefly describe Hamilton's principle of stationary action. In the second part, we highlight and outline the Lagrangian picture, and present a systematic methodology for moving to the more commonly used Eulerian picture.
2.1 Hamilton's principle of stationary action
The process involved in constructing the action for fluid models has been well known since Lagrange (Reference Lagrange1789). Once the generalized coordinates $q_k(t)$ are chosen, where $k$ runs over all possible degrees of freedom, the action is determined via
with $L$ representing the Lagrangian. It must be noted that $S$ is a ‘functional’, i.e. its domain and range are functions and real numbers, respectively. Hamilton's principle states that that the equations of motion are the extrema of the action, i.e. we require $\delta S[q]/\delta q^k = 0$, where the functional derivative is defined as follows:
The continuum version is very similar to the discrete case since the discrete index $k$ is replaced by a continuous one, which we denote by $a$. The coordinate $q$ is a function of $a$ and $t$, and tracks the location of a fluid particle labelled by $a$. We also note the following important quantities which are used throughout the paper: the deformation matrix $\partial q^i/\partial a^j=:q^{\hspace {1pt}i}_{\,,\hspace {1pt}j}$ and the corresponding determinant, the Jacobian, $\mathcal {J}:= \det (q^{\hspace {1pt}i}_{\,,\hspace {1pt}j})$. The volume evolves in time via
and the area is governed by
where $\mathcal {J} a^{\hspace {1pt}j}_{\,,\hspace {1pt}i}$ is the transpose of the cofactor matrix of $q^{\hspace {1pt}j}_{\,,\hspace {1pt}i}$. The quantities and the relations introduced above can be used to generate a wide range of identities. One can find a detailed discussion of these, for example, in Serrin (Reference Serrin and Flügge1959), Morrison (Reference Morrison1998) and Bennett (Reference Bennett2006).
2.2 Two representations: the Lagrangian and the Eulerian points of view
The Lagrangian position $q$ evolves in time and is entirely characterized by its label $a$. But the fluid parcels are not solely determined by the position alone; they can also carry with them a certain density, entropy and magnetic field. As the fluid moves along its trajectory, these quantities are also transported along with it, and are consequently characterized only by the label $a$ as well. We will refer to these quantities as attributes. As the label $a$ is independent of time, these attributes serve as Lagrangian constants of motion. The subscript $0$ will be used to label the attributes, in order to distinguish them from their Eulerian counterparts.
Let us now consider the Eulerian picture. All Eulerian fields depend on the position $\boldsymbol {r}:=(x^1,x^2,x^3)$ and time $t$, which can both be measured in the laboratory. As a result, we shall refer to these fields as observables. Moving from the Eulerian to Lagrangian viewpoint and vice versa is accomplished with the Lagrange–Euler maps which we describe below in more detail.
The Eulerian velocity field $\boldsymbol {v}(\boldsymbol {r},t)$ is the velocity of the fluid element at a location $\boldsymbol {r}$ and time $t$. If we seek to preserve the equivalence of the Lagrangian and Eulerian pictures, this must also equal $\dot {\boldsymbol {q}}(\boldsymbol {a},t)$. As a result, it is evident that we require $\dot {\boldsymbol {q}}(\boldsymbol {a},t)=\boldsymbol {v}(\boldsymbol {r},t)$, where the dot indicates that the time derivative is obtained at fixed label $\boldsymbol {a}$. However, there is a discrepancy since the left-hand side is a function of $\boldsymbol {a}$ and $t$, while the right-hand side involves $\boldsymbol {r}$ and $t$. This conundrum is resolved by noting that the fluid element is at $\boldsymbol {r}$ in the Eulerian picture, and at $\boldsymbol {q}$ in the Lagrangian one. Hence, we note that $\boldsymbol {r}=\boldsymbol {q}(\boldsymbol {a},t)$, which implies that $\boldsymbol {a}=\boldsymbol {q}^{-1}(\boldsymbol {r},t)=:\boldsymbol {a}(\boldsymbol {r},t)$ upon inversion. As a result, our final Lagrange–Euler map for the velocity is
Now we consider the attributes defined earlier, which we have noted are carried along by the fluid. The first attribute is the entropy of the fluid particle, which we shall label $s_0$. For ideal fluids, one expects the entropy to remain constant along the fluid trajectory. In other words, the Eulerian specific entropy $s(\boldsymbol {r},t)$ must also remain constant throughout, implying that $s=s_0$. Apart from entropy, the magnetic stream function $\psi$ for 2-D GVMHD (Andreussi et al. Reference Andreussi, Morrison and Pegoraro2013; Morrison et al. Reference Morrison, Lingam and Acevedo2014) also obeys this property.
Next, we can consider attributes which obey a conservation law similar to the density. The conservation law in this case is that of mass conservation. The attribute is denoted by $\rho _0(\boldsymbol {a})$ and the observable by $\rho (\boldsymbol {r},t)$. The statement of mass conservation in a given (infinitesimal) volume amounts to $\rho (\boldsymbol {r},t)\,\textrm {d}^3r=\rho _0(\boldsymbol {a})\,\textrm {d}^3a$. Using (2.3) we obtain $\rho _0=\rho \mathcal {J}$. As a result, we have found the Lagrange–Euler map for $\rho$. There exist other attribute-observable pairs in the literature, which also possess similar conservation laws, such as the entropy density.
In the case of magnetofluid models, it is often advantageous to introduce the magnetic field attribute $\boldsymbol {B}_0(\boldsymbol {a})$. In the case of ideal magnetofluid models, the conservation law of frozen-in magnetic flux is applicable. In algebraic terms, this amounts to $\boldsymbol {B}\boldsymbol {\cdot } \boldsymbol {d}^{\boldsymbol {2}}\boldsymbol {r}=\boldsymbol {B}_0\boldsymbol {\cdot } \boldsymbol {d}^{\boldsymbol {2}}{\boldsymbol {a}}$, and from (2.4) we obtain $\mathcal {J} B^i=q^{\hspace {1pt}i}_{\,,\hspace {1pt}j} \,B_0^j$.
In all of the above expressions, the picture is still incomplete since we need to remove the $\boldsymbol {a}$-dependence of the attributes. In a manner similar to that undertaken for the velocity, we evaluate the attributes at $\boldsymbol {a}=\boldsymbol {q}^{-1}(\boldsymbol {r},t)=:\boldsymbol {a}(\boldsymbol {r},t)$. This completes our prescription, and one can fully determine the observables once we are provided the attributes in conjunction with the Lagrangian coordinate $\boldsymbol {q}$.
We may also represent the Lagrange–Euler map in an integral form, which permits a more intuitive interpretation. We shall start with the assumption that the attribute-observable relations are found via appropriate conservation laws. We have stated before that one moves from the Lagrangian to the Eulerian picture by ‘plucking out’ the fluid element that happens to be at the Eulerian observation point $r$ at time $t$. Such a process is accomplished mathematically via the delta function $\delta (\boldsymbol {r}-\boldsymbol {q}(\boldsymbol {a},t))$. For instance, we see that the density can be treated as follows:
Further below, we will also use a new variable, the canonical momentum density $\boldsymbol {M}^c=(M^c_1,M^c_2,M^c_3)$, which is related to its Lagrangian counterpart via
For ideal MHD, the canonical momentum density is $\boldsymbol {\varPi }(a,t)=(\varPi _1,\varPi _2,\varPi _3)=\rho _0 \dot {\boldsymbol {q}}$. It is worth noting that $\boldsymbol {\varPi }(\boldsymbol {a},t)$ can be found from the Lagrangian through $\boldsymbol {\varPi }(\boldsymbol {a},t) = {\delta L}/{\delta \dot {\boldsymbol {q}} }$ and does not necessarily equal $\rho _0 \dot {\boldsymbol {q}}$ in general. One can also construct such integral relations for the entropy and the magnetic field. We refer the reader to Morrison et al. (Reference Morrison, Lingam and Acevedo2014) for a more detailed discussion along these lines.
3 Action principle for a generic magnetofluid
The first part of this section is devoted to a brief description of the procedure outlined in Morrison (Reference Morrison, Eliasson and Shukla2009) and Morrison et al. (Reference Morrison, Lingam and Acevedo2014) for constructing action principles for magnetofluid models. Some of the advantages have been highlighted in the introduction, and others can be found in, for example, Morrison (Reference Morrison, Eliasson and Shukla2009) and Morrison et al. (Reference Morrison, Lingam and Acevedo2014). Then, we proceed to construct our action and motivate our choice of terms along the way.
3.1 The general action
The domain of integration $D$ is chosen to be a subset of $\mathbb {R}^{3}$. Central to our formulation is the Lagrangian coordinate $\boldsymbol {q}\colon D\rightarrow D$, which we shall assume to be a well-behaved function with the required smoothness, invertibility, etc. Next we need to specify our set of observables, or alternatively our set of attributes. For our models, we work with $\mathfrak {E}=\{\boldsymbol {v},\rho ,\sigma ,\boldsymbol {B}\}$ where $\sigma = \rho s$ is the entropy density. Finally, we shall impose the Eulerian closure principle, which is necessary for our model to be ‘Eulerianizable’. Mathematically, this principle amounts to the action being fully expressible in terms of the Eulerian observables. Physically, the principle states that our theory must be solely describable in terms of physically meaningful quantities, the observables, and must also give rise to equations of motion in terms of these observables. As a result, we require our action to be given via
As per the Eulerian closure principle, this amounts to finding an action $\bar {S} = \int _\mathcal {T} \textrm {d}t \int _D \textrm {d}^3r\,\bar {\mathcal {L}}$ in terms of the Eulerian observables. The presence of the bar indicates that the action and the Lagrangian density are expressed solely in terms of the observables.
3.2 Constructing the gyroviscous action
The first step in the process involves the construction of the kinetic energy, which must also satisfy the closure principle. Using the analogy with particle mechanics, we know that it equals
where the last equality is obtained by using relations outlined in § 2.2.
The internal energy per unit mass is a function of the entropy density and the density, and in Eulerian terms it can be represented by $U(\rho ,\sigma )$. Using the inverse Lagrange–Euler maps, we can construct the Lagrangian internal energy density accordingly,
The next step is the construction of the magnetic energy, and we use the same process outlined for the internal energy, viz. we determine the Eulerian term and obtain the Lagrangian version consequently through the Lagrange–Euler map,
The magnetic energy is actually $|\boldsymbol {B}|^2/8{\rm \pi}$ in CGS units but we drop the factor of $4{\rm \pi}$ henceforth by scaling it away through the adoption of Alfvénic units.
Now we are ready to construct the most important term which will be responsible for the gyroviscosity. The gyroviscous term is taken to be linear in $\dot {\boldsymbol {q}}$ and is given by
In other words, we operate under the premise that $\boldsymbol {\varPi }^{\star }$ is solely a functional of $\boldsymbol {q}$ and $t$. As the Eulerian perspective is inherently endowed with physical variables (e.g. density and magnetic field), we will focus on the Eulerian equivalent of $\boldsymbol {\varPi }^{\star }$; from the Eulerian closure principle we obtain the relation
The complete action functional is now given by
The action of (3.7) is general, but not the most general second-order (in $\boldsymbol {v}$) action that satisfies the Eulerian closure principle. For example, the term $S_\textrm {kin}$ could be generalized by replacing its integrand with $\rho _0G |\dot {\boldsymbol {q}}|^2/2|_{\boldsymbol {a}} = \rho G(\rho , \sigma , \boldsymbol {B})|\boldsymbol {v}|^2/2$ and the integrand of $S_\textrm {int}$ could be replaced by $\rho _0 U|_{\boldsymbol {a}}= \rho U(\rho , \sigma , \boldsymbol {B})$, a form that was shown in Morrison (Reference Morrison1982) to allow for anisotropic pressure. Here both $G$ and $U$ could be arbitrary functionals (including derivatives) of their arguments. Similarly the term $S_\textrm {mag}$ could be generalized.
The Eulerian canonical momentum density is defined via (2.7), which can be computed by finding the Lagrangian canonical momentum using $\boldsymbol {\varPi }(\boldsymbol {a},t) = {\delta L}/{\delta \dot {\boldsymbol {q}} }$ and Eulerianizing it. Upon doing so, we arrive at the so-called gyromap, a device introduced in Morrison et al. (Reference Morrison, Caldas and Tasso1984) as follows:
The benefit of employing the gyromap and its natural origin will be discussed in § 6 and further explicated in appendix B.
So far we have only required $\boldsymbol {M}^\star$ to satisfy the closure principle, i.e. that it be expressible in terms of the subset $\{\rho ,\sigma ,\boldsymbol {B}\}\subset \mathfrak {E}$, including all possible Eulerian derivatives. Given that $\boldsymbol {M}^\star$ is a momentum density, arising perhaps from underlying gyration of particles, a natural assumption is that it has the magnetization form
i.e. we assume that $\boldsymbol {M}^\star$ is divergence-free. Since we are interested in a gyroviscosity due to gyromotion, this is a physically reasonable assumption. However, one could replace (3.9) by a Helmholtz decomposition for a more general collisionless viscosity. The present choice is also motivated in part by the realization in Morrison et al. (Reference Morrison, Caldas and Tasso1984) and Morrison et al. (Reference Morrison, Lingam and Acevedo2014) that this choice is consistent with existing 2-D gyroviscous models. Because $\boldsymbol {M}^\star$ has the units of momentum density, from which we see that the quantity $\boldsymbol {J}^\star \propto (q/m) \boldsymbol {M}^\star$ resembles a current density. If one assumes that the fluid ‘particles’ possess a finite magnetic moment, it follows that the fluid must have a finite magnetization. In other words, one may identify $\boldsymbol {J}^\star$ with the magnetization current density, which is divergence-free (Jackson Reference Jackson1998) and the current through an area depends on flux through a bounding curve. Are other choices possible and do any of them conserve angular momentum? Perhaps an even simpler way of envisioning the ansatz for $\boldsymbol {M}^\star$ is that it must emerge from the gyration of particles. In pictorial terms, this gyration is reminiscent of the effect generated by the curl of a vector field, which motivates our choice of $\boldsymbol {M}^\star$. Further grounds for assuming this particular expression are described in Morrison et al. (Reference Morrison, Lingam and Acevedo2014, § 5). With this ansatz, evidently $\boldsymbol {\nabla } \boldsymbol {\cdot } \boldsymbol {M}^c = \boldsymbol {\nabla } \boldsymbol {\cdot } \boldsymbol {M}$, since the second term vanishes. Note that the right-hand side of this expression appears in the continuity equation, and we see that one could also replace it by the left-hand side if we operate with $\boldsymbol {M}^\star = \boldsymbol {\nabla } \times \boldsymbol {L}^\star$. Furthermore, dimensional analysis permits the identification of $\boldsymbol {L}^\star$ with the angular momentum density.
As we have reduced the question of determining $\boldsymbol {L}^\star$, we must ask ourselves as to whether any further simplifications are feasible. Once again, we can resort to physical intuition to gain an idea of what $\boldsymbol {L}^\star$ might look like. Without further special assumptions about the fluid, e.g. it having some intrinsic or extrinsic direction, the vectorial character of $\boldsymbol {L}^\star$ must come from $\boldsymbol {B}$ or from the set of gradients of the observables; these and their cross products are the only vectors available. Thus, for example, a general form for $\boldsymbol {L}^\star$ could be composed of a linear combination of these vectors with coefficients dependent on $\rho , \sigma$ and $|\boldsymbol {B}|$. If we assume $\boldsymbol {L}^\star$ constitutes an internal angular momentum density of some kind associated with particle gyration, then it is reasonable to posit that it would tend to align with the magnetic field $\boldsymbol {B}$. Moreover, in the limit of a large magnetic field, the corresponding gyroradii would become small, owing to which the fluid particle may not be significantly affected by gradients on these scales. Combining the preceding arguments leads to the generic form
In § 4.2 we will argue for further specification of the properties of (3.10).
With the choice of (3.10), the gyroviscous term of the action, expressed in terms of the observables is given by
where the second equality follows from integrating by parts and neglecting the boundary term. We shall use the latter operation consistently throughout the rest of the paper. Now that we have constructed the gyroviscous term, we note that it is still generic since there is considerable freedom in the choice of $\mathcal {F}$.
4 The equations of motion and the choice of ansatz
In this section, we shall present the equations of motion and discuss the origin of the gyroviscous terms, and why a specific choice of the free function $\mathcal {F}$ emerges in a natural manner.
4.1 The equations of motion
The equations for the density, entropy density and the magnetic field can be determined via the attributes/observables relations defined through the appropriate conservation laws and the Lagrange–Euler maps. The entropy density and the density obey similar laws, given by
The equation governing the magnetic field is
which can be recast into the more familiar induction equation if $\boldsymbol {\nabla } \boldsymbol {\cdot } \boldsymbol {B} = 0$ is satisfied. If the constraint is obeyed, then we obtain
The dynamical equation for the momentum is derived from $\delta S = 0$, and is thus equal to
where repeated indices indicate summation (as per the Einstein convention), and we have employed the standard relationship between the internal energy and the scalar pressure $p$. We note that (4.5) can be obtained in two different ways from the action. The first is to follow the conventional variation with respect to $q$ and obtain it accordingly. The second method involves the use of the procedure outlined in Frieman & Rotenberg (Reference Frieman and Rotenberg1960) and Newcomb (Reference Newcomb1962) and is described in appendix A. For our model, (4.1)–(4.3) and (4.5) constitute the complete set of dynamical equations.
Before discussing the ansatz in more detail, a few observations regarding (4.5) are in order. The second term occurring in the first line of this equation represents the ideal MHD momentum flux (enclosed in square brackets), which is seen from the absence of $\mathcal {F}$ in it. The second and third lines contain terms that are purely symmetric under the interchange $k \leftrightarrow j$. The fourth line contains terms that are wholly antisymmetric under $k \leftrightarrow j$. The fifth (and final) line contains terms that are neither purely symmetric nor purely antisymmetric. As a result, we see that the entire momentum flux tensor is not symmetric, as opposed to the ideal MHD tensor, or the 2-D gyroviscous tensor for the specific model considered in Morrison et al. (Reference Morrison, Lingam and Acevedo2014). Note that we refer to the terms from line two onwards as gyroviscous because they are expressed in terms of the velocity shear, akin to viscous hydrodynamics. The gyroviscous tensor thus obtained above can be compared against the general expression(s) presented in Ramos (Reference Ramos2005b). Furthermore, these effects arise from charged particle gyration – the latter aspect is explored below.
4.2 The origin of the gyroviscous ansatz
In § 3.2, we briefly outlined the process involved in constructing a generic gyroviscous term. Now, we shall draw upon further physics to select a specific choice for the ansatz.
First, let us suppose that we start out with the notion of an internal angular momentum $\boldsymbol {L}^\star$. In order to understand where this angular momentum originates, we recall an identity from electromagnetism which relates the angular momentum to the magnetic moment via the gyromagnetic ratio, $(2m)/e$. If we consider a two-species model of ions and electrons, then the ions will play the dominant role, owing to their higher mass. Hence, we know that $\boldsymbol {L}^\star = ({2m}/{e}) \boldsymbol {\mu }$. The magnetic moment $\boldsymbol{\mu}$ is typically an adiabatic invariant in plasmas, and its magnitude is given by $|\boldsymbol {\mu }|={m \boldsymbol {v}_\perp ^2}/{2|\boldsymbol {B}|}$, which is proportional to $P_\perp /|\boldsymbol {B}|$ where $P_\perp$ denotes the perpendicular component of the (anisotropic) pressure. But, the magnetic moment is a vector and the most natural way to construct a vector is through the unit vector of the magnetic field. Putting these results together, we find that a natural ansatz (albeit a specific one) for $\boldsymbol {L}^\star$ is given by
where $\alpha$ is a dimensionless proportionality constant, which can be arbitrarily specified; in the ensuing analysis, we set $\alpha = 1$ for simplicity. By comparison with the more general ansatz outlined in § 3.2, we find that they are identical when $\mathcal {F} = \alpha ({m}/{2e}) P_\perp /|\boldsymbol {B}|^2$.
The function $P_\perp$ is a function of $\sigma$, $\rho$ and $|\boldsymbol {B}|$. For a more detailed discussion of the anisotropic pressure, we refer the reader to Kimura & Morrison (Reference Kimura and Morrison2014). It is defined as
an expression that first appeared in Morrison (Reference Morrison1982), where $U$ is the internal energy that is a function of $\rho$ and $\sigma$, but also of the magnetic field; see also Hazeltine, Mahajan & Morrison (Reference Hazeltine, Mahajan and Morrison2013). If we wish to forgo anisotropy, then we assume that $U$ is independent of $B$, and hence the second term in the above term vanishes. This assumption was used in deriving the equation of motion (4.5) since the internal energy introduced in (3.3) had no $\boldsymbol {B}$-dependence. Such an assumption also leads to the pressure tensor becoming isotropic, given by the first term of (4.7) alone.
In summary, the ansatz constructed was chosen such that the gyroviscosity (and consequently the momentum transport) arises via the gyration of charged particles, thereby lending the term its name. The fact that momentum transport could take place via such gyrations was first noted by Chapman & Cowling (Reference Chapman and Cowling1970) and Kaufman (Reference Kaufman1960) in the 1950s and 1960s. This principle was applied to incompressible gyrofluids in Newcomb (Reference Newcomb1972, Reference Newcomb1973, Reference Newcomb1983) and compressible gyrofluids in Morrison (Reference Morrison, Eliasson and Shukla2009) and Morrison et al. (Reference Morrison, Lingam and Acevedo2014), who showed that this specific ansatz yielded results that were fully compatible with the 2-D version of the Braginskii tensor (Braginskii Reference Braginskii1965).
Lastly, we note that substituting (4.6) in (3.8) after employing $\boldsymbol {M}^\star = \boldsymbol {\nabla } \times \boldsymbol {L}^\star$ will yield a number of extra terms with the same dimensions as $\boldsymbol {M} = \rho \boldsymbol {v}$. Hence, if one divides the expression throughout by $\rho$, the contributions arising from $\boldsymbol {M}^\star$ have the dimensions of velocity and possess physical interpretations. The first term, which is proportional to $(\boldsymbol {B} \times \boldsymbol {\nabla } P_\perp )/|\boldsymbol {B}|^2$, amounts to the diamagnetic drift velocity. The second term, which is proportional to $P/|\boldsymbol {B}|^3 (B \times \boldsymbol {\nabla } |\boldsymbol {B}|)$ is analogous to the $\boldsymbol {\nabla } |\boldsymbol {B}|$ drift velocity for charged particles. This correspondence has been pointed out in Morrison et al. (Reference Morrison, Caldas and Tasso1984, § 6).
5 Angular momentum conservation and its ramifications
In this section, we discuss the chief unusual property of our model – the lack of an ‘orthodox’ angular momentum conservation, and its resolution. We also present a brief illustration of its ramifications in an astrophysical context.
5.1 Constructing a hybrid conserved angular momentum
When we perform the constrained variation of our action, we recover
Additional details can be found in Holm et al. (Reference Holm, Marsden and Ratiu1998, equations (7.6)–(7.8)) and Lingam & Morrison (Reference Lingam and Morrison2014, § 3). Note that the Lagrangian density $\mathcal {L}$ in the above expression refers to the one present in (3.7). A rather unusual fact emerges if one inspects the above energy-momentum tensor: when one considers ideal MHD, or even Hall and extended MHD, the tensor $T_{ij}$ is symmetric. In turn, this ensures that the angular momentum $\boldsymbol {M}={\boldsymbol {r}} \times \rho \boldsymbol {v}$ is conserved. However, this is evidently not the case for the above energy-momentum tensor.
This fact is not unusual because a number of hydrodynamic models are known to possess asymmetric energy-momentum tensors. In particular, if the constituent ‘particles’ (which may be fluid parcels) have an internal degree of freedom (i.e. spin), the energy-momentum tensor of the fluid will manifest a non-symmetric component (Papapetrou Reference Papapetrou1949; Snider & Lewchuk Reference Snider and Lewchuk1967; Olmsted & Snider Reference Olmsted and Snider1976; Dewar Reference Dewar1977; Evans Reference Evans1979; Kopczyński Reference Kopczyński1990; Lingam Reference Lingam2015a). Examples of hydrodynamic models with asymmetric energy-momentum tensors include ferrohydrodynamics (Rosensweig Reference Rosensweig1985; Billig Reference Billig2005) and nematics (de Gennes & Prost Reference de Gennes and Prost1993). Although many core plasma models are characterized by symmetric energy-momentum tensors (Pfirsch & Morrison Reference Pfirsch and Morrison1985; Similon Reference Similon1985), other plasma models feature asymmetric energy-momentum tensors (e.g. Brizard Reference Brizard2010a). In consequence, not all components of the angular momentum will be conserved, although the toroidal component is conserved in such models (Scott & Smirnov Reference Scott and Smirnov2010).
To resolve this, we will adopt the procedure delineated in McLennan (Reference McLennan1966). We begin with the observation that the first expression in (5.1) remains invariant under the transformations $M^c_i \rightarrow M^c_i + \partial _j {\varSigma }_{ij}$ and $T_{ij} \rightarrow T_{ij} - \partial {\varSigma }_{ij}/\partial t$. Let us suppose that we choose $\partial {\varSigma }_{ij}/\partial t$ to be the antisymmetric part of $T_{ij}$, thereby ensuring that $T_{ij} - \partial {\varSigma }_{ij}/\partial t$ is purely symmetric. Hence, by utilizing this choice of ${\varSigma }_{ij}$, we find that
where $\boldsymbol {\tau }$ has the units of torque density and is given by
The first term in the above expression is ${{\boldsymbol {M}}}^c\times {\boldsymbol {v}}$, which can also be expressed as ${{\boldsymbol{M}}}^\star \times {\boldsymbol{v}}$ since $\rho {{\boldsymbol {v}} \times {\boldsymbol {v}}} = 0$. The second and third terms are proportional to $(\boldsymbol {\nabla } \boldsymbol {\cdot } {{\boldsymbol {v}}}) {\boldsymbol {B}}$ and $(\boldsymbol {\nabla } {\boldsymbol {v}}) \boldsymbol {\cdot } {\boldsymbol {B}}$, respectively. Since we know that $\tau$ behaves as a torque density, let us define a dynamical variable $\mathcal {S}$ such that $\partial \mathcal {S}_k/ \partial t = \tau _k$; this constitutes a relation that mirrors the conventional torque-angular momentum relation in classical mechanics. Using this in (5.2), we find that $\varSigma _{ij} = \epsilon _{ijk} \mathcal {S}_k$. With these ingredients, we can now construct a symmetric momentum conservation law as follows:
with $T^S_{ij}$ representing the symmetric energy-momentum tensor and $M^\textrm {tot}_i = M^c_i + \epsilon _{ijk} \partial _j \mathcal {S}_k$. As the resultant energy-momentum tensor is symmetric, it follows that the corresponding angular momentum ${{\boldsymbol {r}}} \times {{\boldsymbol {M}}}^\textrm {tot}$ is conserved.
The ramifications of $\mathcal {S}$ are manifold. It can be interpreted as an intrinsic angular momentum density generated from the torque density (5.3). This is consistent with prior works (Papapetrou Reference Papapetrou1949; Snider & Lewchuk Reference Snider and Lewchuk1967; Olmsted & Snider Reference Olmsted and Snider1976; Dewar Reference Dewar1977; Evans Reference Evans1979; Kopczyński Reference Kopczyński1990) that outlined the connections between intrinsic angular momentum and a non-symmetric energy-momentum tensor. A second justification arises from ${{\boldsymbol {M}}}^\textrm {tot} = {\boldsymbol {M}}^c + \boldsymbol {\nabla } \times \mathcal {S}$, implying by dimensional analysis that $\mathcal {S}$ has the dimensions of angular momentum density. If we define ${\boldsymbol {M}}^\textrm {int} = \boldsymbol {\nabla } \times \mathcal {S}$, we see that $\boldsymbol {\nabla } \boldsymbol {\cdot } {\boldsymbol {M}}^{\textrm {int}} = 0$. The kinship between ${\boldsymbol {M}}^\star$ and ${\boldsymbol {M}}^{\textrm {int}}$ is obvious as they are both generated via an internal angular momentum mechanism and are divergence-free.
Let us now summarize our results. We defined a dynamical variable $\mathcal {S}$ such that it obeys $\partial \mathcal {S}_i/\partial t = \tau _i$ where $\tau$ is given by (5.3), and it emerges from the antisymmetric part of the original energy-momentum tensor. We also find that the new momentum ${\boldsymbol {M}}^{\textrm {tot}} = {\boldsymbol {M}} + ({\boldsymbol {M}}^\star + {\boldsymbol {M}}^{\textrm {int}})$ yields a symmetric momentum tensor (which is the symmetric part of the old one). Using the expressions for ${\boldsymbol {M}}^\star$ and ${\boldsymbol {M}}^{\textrm {int}}$, we have
Hence, we can define a composite intrinsic angular momentum ${\boldsymbol {J}} = {\boldsymbol {L}}^\star + \mathcal {S}$, akin to the total angular momentum in quantum mechanics (Weinberg Reference Weinberg2015). The introduction of ${\boldsymbol {J}}$ yields ${\boldsymbol {M}}^{\textrm {tot}} = {\boldsymbol {M}} + \boldsymbol {\nabla } \times {\boldsymbol {J}}$, which is simple in form and has an immediate physical interpretation. The angular momentum corresponding to ${\boldsymbol {M}}^{\textrm {tot}}$ is conserved, and is given by ${\boldsymbol {r}} \times {\boldsymbol {M}}^{\textrm {tot}}$. Hence, the total angular momentum defined below is an invariant,
Before proceeding further, some major aspects concerning the 2-D GVMHD model described in Morrison et al. (Reference Morrison, Caldas and Tasso1984) and Morrison et al. (Reference Morrison, Lingam and Acevedo2014) merit further explication. To begin with, we can rewrite (5.1) as follows:
where we have introduced the new energy-momentum tensor
The first key point worth highlighting here is that Morrison et al. (Reference Morrison, Caldas and Tasso1984, Reference Morrison, Lingam and Acevedo2014) adopted: (i) a specific equation of state (EOS) for $P_\perp$ wherein $P_\perp /|\boldsymbol {B}|$ was a Lie-dragged scalar density; and (ii) the choice $\boldsymbol {B} = B_z \hat {z}$ for the magnetic field. These two conditions collectively ensured that $\boldsymbol {L}^\star$ had only one component and that the components of $\boldsymbol {M}^\star$ behaved as scalar densities that underwent Lie-dragging; in other words, the term inside the square brackets of (5.7) vanishes identically for the 2-D GVMHD model.
The second essential point is that 2-D GVMHD did not include any variables that were Lie-dragged as vector densities of rank unity. In contrast, the magnetic field in 3-D MHD and GVMHD plays this role (Morrison Reference Morrison1982; Lingam & Morrison Reference Lingam and Morrison2014),Footnote 4 but $B_z$ in 2-D GVMHD is a Lie-dragged scalar density as seen from Morrison et al. (Reference Morrison, Caldas and Tasso1984, equation (3)); to put it differently, $B_z$ in 2-D GVMHD is advected the same way as the plasma density $\rho$. Thus, the terms in (5.8) involving $B_i$ are rendered irrelevant because they were derived under the assumption that the magnetic field is a Lie-dragged vector density. Hence, these two facts collectively ensure that the only potential source of asymmetry in the energy-momentum tensor of 2-D GVMHD is the second term on the right-hand side of (5.8). When one utilizes the particular EOS for this model in conjunction with $M_z = 0$ and $\boldsymbol {B} = B_z \hat {z}$, it can be shown (Morrison et al. Reference Morrison, Caldas and Tasso1984, Reference Morrison, Lingam and Acevedo2014) that the gyroviscous term of 2-D GVMHD yields the contribution
to the energy-momentum tensor, which turns out to be fully symmetric.
The above discussion serves to illustrate how and why the energy-momentum tensor of the simplified 2-D GVMHD model of Morrison et al. (Reference Morrison, Caldas and Tasso1984, Reference Morrison, Lingam and Acevedo2014) is symmetric in nature. However, in order to achieve this symmetry, a number of restrictions on the EOS as well as the magnetic field and momentum density had to be imposed. When all of these constraints are relaxed, which is the case for 3-D GVMHD, one finds that an asymmetric energy-momentum tensor is obtained.
5.2 An illustration of the formalism
We have already noted earlier that the kinetic angular momentum ${r} \times {\boldsymbol {M}}$ is not conserved. However, we have seen that the angular momentum described in (5.6) is conserved. Together, these imply that the rate of loss (or gain) of the kinetic angular momentum ${r} \times {\boldsymbol {M}}$ is precisely equal to the rate of gain (or loss) of the intrinsic angular momentum ${\boldsymbol {J}}$. Let us recall that $\mathcal {S}$ comprises a part of ${\boldsymbol{J}}$, and we know that $\partial \mathcal {S}_i/\partial t = \tau _i$ where $\boldsymbol {\tau }$ is given by (5.3). The first term in (5.3) reduces to ${\boldsymbol {M}}^\star \times {\boldsymbol {v}}$, as noted earlier. It is worth mentioning that the additional two terms are quite different, but exhibit a similar scaling. Hence, we shall use only the first term in our subsequent analysis. The total torque (denoted by $\tilde {\mathcal {T}}$) is found by integrating this term over the volume, and thus gives rise to the scaling
where we have dropped the numerical factors and used a characteristic velocity of $\varOmega R$, with $R$ denoting the radius of the (spherical) object. It is evident that the scaling will be entirely determined by the EOS that is adopted.
Next, let us evaluate the spin-down rate, by using the relation $\tilde {\mathcal {T}} = I \dot {\varOmega }$, from classical mechanics. The moment of inertia, dropping all numerical factors, is approximately $M R^2 \sim \rho R^5$. Using this in (5.10), we find that
The above relation indicates that $\dot {\varOmega } \propto \varOmega$ (holding other quantities fixed). The EOS depends only on $\rho$, $s$ and $|\boldsymbol {B}|$ and hence we can conclude that the relation $\dot {\varOmega } \propto \varOmega$ is likely to be independent of the choice of the EOS. If we treat $\rho$ and $R$ to be independent variables, i.e. by choosing $M$ to be the dependent variable, one can also conclude that $\dot {\varOmega } \propto R^{-2}$ will be independent of the EOS. The characteristic time $t_c = \varOmega /\dot {\varOmega }$, is expected to be independent of $\varOmega$ and is given by
and we see that it is proportional to $R^2$, when the other parameters are held constant. The Chew–Goldberger–Low EOS for $P_\perp$ (Chew, Goldberger & Low Reference Chew, Goldberger and Low1956) is of particular interest since the characteristic time $t_c$ and the rate $\dot {\varOmega }$ are both independent of the density and the magnetic field, thereby demonstrating an unexpected universality. The resulting spin-down corresponds to the dissipation of kinetic angular momentum, which must imply that there is a corresponding increase in the intrinsic angular momentum ${\boldsymbol{J}}$ (which comprises the other fluid variables).
The spin rates of low-mass stars are found to slow down by approximately two orders of magnitude over a span of $10^9$ years (Scholz Reference Scholz and Stempels2009). Modelling stellar spin-down is important for a multitude of reasons, including the fact that the older stars (with lower rotation rates) display lower activity in general, which has numerous ramifications for planetary habitability (Lingam & Loeb Reference Lingam and Loeb2018, Reference Lingam and Loeb2019). We can estimate the characteristic time by choosing solar parameters (i.e. a solar-type star) for an order-of-magnitude calculation. In particular, we substitute $|\boldsymbol {B}| \sim 10^{-4}$ T, $R \sim 7 \times 10^8\ \textrm {m}$ and $T \sim 5.8 \times 10^3$ K (Priest Reference Priest2014) in (5.12), which yields $t_c \sim 3 \times 10^6$ years. The two leading candidates invoked to explain stellar spin-down, star-disk and stellar wind braking, operate on time scales of ${\sim }10^6\text {--}10^7$ years and ${\sim }10^8$ years, respectively (Bouvier et al. Reference Bouvier, Matt, Mohanty, Scholz, Stassun, Zanni, Beuther, Klessen, Dullemond and Henning2014, § 4.1). Hence, we see that our semiquantitative estimate is comparable to these two time scales, and may therefore constitute a viable mechanism for governing angular momentum evolution of solar-mass stars.
The issue of angular momentum losses in protostars is another closely related topic (Bodenheimer Reference Bodenheimer1995; Matt & Pudritz Reference Matt and Pudritz2005; Hartmann, Herczeg & Calvet Reference Hartmann, Herczeg and Calvet2016) which might also be resolvable through the same mechanism. We emphasize that the heuristic treatment in this subsection has primarily relied on simple scaling arguments, and a complete picture can only emerge through the synthesis of rigorous analytical models and numerical simulations. We note that this only represents the tip of the iceberg – other potential applications include pulsar braking, transport in accretion discs and associated phenomena. In the realm of fusion, we note that the formalism developed herein may prove to be useful in explaining intrinsic rotation observed in tokamaks (Gürcan et al. Reference Gürcan, Diamond, Hahm and Singh2007; de Grassie Reference de Grassie2009; Diamond et al. Reference Diamond, Kosuga, Gürcan, McDevitt, Hahm, Fedorczak, Rice, Wang, Ku and Kwon2013; Rice Reference Rice2016).
6 The Hamiltonian description and the origin of the gyromap
In this section, we shall outline some of the basic principles underlying noncanonical Hamiltonian dynamics. The literature on this subject is considerable, and we refer the reader to Morrison (Reference Morrison1998) for a comprehensive introduction.
6.1 The Lagrangian view point and the Lagrange–Euler map
First, note that the Hamiltonian can be obtained from the Lagrangian via a Legendre transform, akin to the usual process in particle mechanics. The Hamiltonian is given by
where
with $L$ defined so that the action of (3.7) is given by $S=\int _T \,\textrm {d}t\, L$. Consequently, the canonical momentum is given by
and we see that we have a field theory counterpart to the finite-dimensional case for particle motion in a magnetic field where the kinetic momentum differs from the canonical momentum, here with the role of the vector potential being played by $\boldsymbol {\varPi }^{\star }$. Thus (6.1) gives the Hamiltonian
This Hamiltonian (6.4) together with the canonical Poisson bracket,
generates the Hamiltonian equations of motion in Lagrangian variables for our class of 3-D GVMHD models as follows:
equations equivalent to the Euler–Lagrange equations obtained via $\delta S = 0$.
Now, one can use the Lagrange–Euler maps to convert both the Hamiltonian and the bracket into Eulerian variables. The procedure is described in the next section. We will see that the origin of the gyromap lies in (6.3) and how this expression relates to different choices of Eulerian variables. The bracket obtained in terms of any of these choices is endowed with Lie algebraic properties (Morrison Reference Morrison1998), most importantly the Jacobi identity, but it does not possess the canonical form of (6.5) because the Eulerian variables are not a set of canonical variables. As a result, one refers to the Hamiltonian and the bracket as being noncanonical in nature, and indeed one version is identical to that originally given in Morrison & Greene (Reference Morrison and Greene1980).
As the Lagrange–Euler maps are not one-to-one, the noncanonical brackets are degenerate in general, which gives rise to the existence of invariants – the Casimirs. The theory of Casimir invariants has been studied quite extensively (Morrison Reference Morrison1998, Reference Morrison2005; Holm Reference Holm2008), but there are still unresolved subtleties regarding their incompleteness, see for example Yoshida, Morrison & Dobarro (Reference Yoshida, Morrison and Dobarro2014) and Yoshida & Morrison (Reference Yoshida and Morrison2014, Reference Yoshida and Morrison2016).
The Casimirs also possess several advantages of their own, such as variational principles for Eulerian equilibria of the form
where $C$ represents any combination of all the known Casimirs. This procedure is known as the energy–Casimir method. Once the equilibria are known, the following symmetric operator can be constructed:
where $F$ is defined in (6.7) and the $\psi$ denote the Eulerian (noncanonical) variables. The energy–Casimir method states that the positive-definiteness of this operator is a sufficient condition for stability, although there are mathematical intricacies involved (Holm et al. Reference Holm, Marsden, Ratiu and Weinstein1985; Rein Reference Rein1994; Batt, Morrison & Rein Reference Batt, Morrison and Rein1995; Morrison Reference Morrison1998; Yoshida et al. Reference Yoshida, Ohsaki, Ito and Mahajan2003). Thus, the Eulerian noncanonical Hamiltonian description we obtain allows for implementation of such energy principles, although we will not pursue this application here.
6.2 The gyro-bracket
We shall choose our new set of observables to be the Eulerian variables $\{\boldsymbol {M}^{c},\rho ,\sigma ,\boldsymbol {B}\}$, where $\boldsymbol {M}^c$ was defined in (3.8). The reason for this choice will soon become obvious. Recall that the Lagrange–Euler maps can be expressed in an integral form, as they were for the density $\rho$ and canonical momentum density $\boldsymbol {M}^c$ in (2.6) and (2.7), respectively. The remaining Eulerian variables are given by
We use these expressions to obtain the noncanonical bracket from the canonical counterpart by the functional chain rule. Any functional of the Eulerian observables can be expressed in terms of $\boldsymbol {\varPi }$ and $\boldsymbol {q}$; hence to delineate, we denote functionals of $\boldsymbol {\varPi }$ and $\boldsymbol {q}$ by $\bar {F}$ and those in terms of the observables by $F$, and note symbolically that $\bar {F}={F}\circ \mathfrak {E}$; consequently,
From (2.6), we can conclude that
and similar identities can be found for (2.6), (2.7), (6.9) and (6.10) as well. We substitute these identities into (6.11) and carry out integrations by parts, followed by a subsequent change in the order of integration. This results in terms that are dotted with $\delta \boldsymbol {q}$ and terms dotted with $\delta \boldsymbol {\varPi }$ on both the left- and right-hand sides of the expression. As $\delta \boldsymbol {q}$ and $\delta \boldsymbol {\varPi }$ are independent, these terms must balance and thereby we obtain relationships between the Eulerian and Lagrangian functional derivatives. The algebra involved is complicated, but quite straightforward, and we refer the reader to Morrison (Reference Morrison, Eliasson and Shukla2009) for a more pedagogical version. The final bracket that we obtain is found to be
By inspection, one notices that the bracket derived above is exactly the same as the 3-D ideal MHD bracket (Morrison & Greene Reference Morrison and Greene1980); however, here the canonical momentum $\boldsymbol {M}^c$ replaces the kinetic momentum $\boldsymbol {M} = \rho \boldsymbol {v}$.
Since the bracket (6.13) uses $\boldsymbol {M}^{c}$ as one of its observables, we must express our Hamiltonian in terms of this observable (and the others) as well. Because of the closure principle we know this is possible; indeed, (6.4) in Eulerian variables becomes
The Hamiltonian of (6.14) with the bracket of (6.13) generates our class of 3-D GVMHD models in the form
On account of the fact that $\boldsymbol {M} = \boldsymbol {M}^c - \boldsymbol {M}^\star$, the energy has the form identical to that of ideal MHD. This is analogous to the fact that the kinetic energy for a charged particle in a magnetic field is identical to that for a free particle. Hence, there is a choice: one can either work with the standard ideal MHD bracket and the more complicated Hamiltonian of (6.14) in terms of the canonical momentum $\boldsymbol {M}^c$, or work with a complicated bracket written in terms of the variable $\boldsymbol {M}$, the conventional variable of magnetofluid theories, and the simpler ideal MHD Hamiltonian. To obtain the bracket in terms of $\boldsymbol {M}$ we can use the gyromap (3.8), $\boldsymbol {M} = \boldsymbol {M}^c - \boldsymbol {M}^\star$, in another chain rule calculation to transform from $\boldsymbol {M}^c$ to the variable $\boldsymbol {M}$. This is worked out in appendix B for the case $\boldsymbol{M}^\star = \boldsymbol {\nabla } \times (\mathcal {F} \boldsymbol {B})$, giving rise to a complicated Poisson bracket.
Given that the noncanonical Poisson bracket in terms of $\boldsymbol {M}^c$ is the same as that of ideal MHD, it possesses the same Casimir invariants as the ideal MHD case if we replace $\boldsymbol {M}$ with $\boldsymbol {M}^c$. This use of the gyromap to obtain Casimirs, which first appeared in Morrison et al. (Reference Morrison, Caldas and Tasso1984) and subsequently in other cases (Hazeltine et al. Reference Hazeltine, Hsu and Morrison1987; Izacard et al. Reference Izacard, Chandre, Tassi and Ciraolo2011; Lingam & Morrison Reference Lingam and Morrison2014; Morrison et al. Reference Morrison, Lingam and Acevedo2014), differs from most of the prior studies that have sought to derive Casimirs and other conserved invariants via the HAP approach using a variety of methods, see for example Morrison (Reference Morrison1982, Reference Morrison1998), Padhye & Morrison (Reference Padhye and Morrison1996a,Reference Padhye and Morrisonb), Hameiri (Reference Hameiri2004) and Webb et al. (Reference Webb, Dasgupta, McKenzie, Hu and Zank2014a, Reference Webb, Dasgupta, McKenzie, Hu and Zankb) for a comprehensive discussion of the same.
So, for our present general gyroviscous models, the gyromap tells us that the $\boldsymbol {M}$-independent Casimirs of ideal MHD will be unchanged, an example being the magnetic helicity $\int \textrm {d}^3r\, \boldsymbol {A}\boldsymbol {\cdot } \boldsymbol {B}$. On the other hand, the cross-helicity and other $\boldsymbol {M}$-dependent invariants are modified by the replacement $\boldsymbol {M}\rightarrow \boldsymbol {M}^c$. Thus, the new cross-helicity Casimir is given by
Equation (6.16) is conserved for any choice of $\boldsymbol {M}^{\star }$ that satisfies the closure principle with a provision similar to that for conservation of the usual helicity of MHD, viz. that the flow be barotropic. In (6.17) we have inserted the special case of $\boldsymbol {M}^{\star }=\boldsymbol {\nabla }\times \boldsymbol {L}^{\star }$ with (3.10). The second term of (6.17) is proportional to the current helicity density, which is encountered regularly in the context of MHD (Moffatt Reference Moffatt1978; Krause & Raedler Reference Krause and Raedler1980; Brandenburg & Subramanian Reference Brandenburg and Subramanian2005; Rincon Reference Rincon2019) and Hall MHD (Mininni, Gómez & Mahajan Reference Mininni, Gómez and Mahajan2003; Lingam & Mahajan Reference Lingam and Mahajan2015; Lingam & Bhattacharjee Reference Lingam and Bhattacharjee2016a,Reference Lingam and Bhattacharjeeb; Mahajan & Lingam Reference Mahajan and Lingam2015, Reference Mahajan and Lingam2020) turbulence and dynamo theory.
7 Conclusions
As we have noted in the introduction, there exist many approaches for constructing FLR models, each with their own advantages and disadvantages. In this paper, we present a HAP formalism that allows us to generate gyroviscous 3-D MHD models.
The action formalism allows us to clearly motivate and introduce the gyroviscous term, which is expressed in terms of a freely specifiable function. However, by using a combination of simple physical reasoning and prior results, we show that there exists a natural choice for this function, the 2-D limit of which exhibits consistency with the Braginskii gyroviscous tensor. We also show that the gyromap – a mathematical construct used to map back and forth between complicated Hamiltonians and easy brackets and vice versa – emerges naturally in this framework. The HAP formalism also has the distinct advantage of generating energy-conserving models from first principles, and all our models presented conserve both energy and momentum. Through the process of reduction, we recover the noncanonical bracket for this model, and a method for finding the Casimirs is elucidated.
One of the central results that emerged in this work was that the 3-D gyroviscous models do not conserve the orthodox angular momentum $\boldsymbol {r} \times \boldsymbol {M}$. We have presented a procedure for symmetrizing the momentum tensor via the construction of a hybrid momentum $\boldsymbol {M}^{\textrm {tot}}$. It is shown that the associated angular momentum $\boldsymbol {r} \times \boldsymbol {M}^{\textrm {tot}}$ is conserved. This procedure leads to the natural introduction of an intrinsic (spin) angular momentum which is likely to possess crucial ramifications in fusion and astrophysical plasmas; an example of the latter is briefly discussed.
The prospects for future work are manifold. The first, and perhaps the most important from a conceptual and mathematical standpoint, is to explore the putative violation of angular momentum conservation on a Lagrangian level. The second entails the application of this framework to astrophysical and fusion systems, and thereby assess whether the ensuing results are consistent with observations. The third involves a detailed comparison with other known gyroviscous tensors, such as those formulated by Braginskii (Reference Braginskii1965), Mikhailovskii & Tsypin (Reference Mikhailovskii and Tsypin1971), Liley (Reference Liley1972), Catto & Simakov (Reference Catto and Simakov2005), Ramos (Reference Ramos2005a,Reference Ramosb, Reference Ramos2010, Reference Ramos2011) and Simakov & Molvig (Reference Simakov and Molvig2016).Footnote 5 This is an ongoing effort, but preliminary results along this direction suggest that the symmetric part of our gyroviscous tensor might be compatible with results obtained by some of these authors, but at present we conclude that the 3-D version of Branginskii's gyroviscosity tensor probably does not emerge from an action principle. A comprehensive analysis is reserved for future publications. The comparison is more tedious (albeit feasible) for the full 3-D case in comparison with the 2-D case considered in Morrison et al. (Reference Morrison, Caldas and Tasso1984), because the latter possessed a simple governing equation for the pressure, and it involved only two components of the momentum density and a single component of the magnetic field.
Our model was centred on the introduction of gyroviscosity into the ideal MHD model. However, given that several variants of extended MHD possess Lagrangian and Hamiltonian formulations (Keramidas Charidakos et al. Reference Keramidas Charidakos, Lingam, Morrison, White and Wurm2014; Abdelhamid, Kawazura & Yoshida Reference Abdelhamid, Kawazura and Yoshida2015; Lingam, Morrison & Miloshevich Reference Lingam, Morrison and Miloshevich2015a; Lingam, Morrison & Tassi Reference Lingam, Morrison and Tassi2015b; D'Avignon, Morrison & Lingam Reference Dvignon, Morrison and Lingam2016; Lingam, Abdelhamid & Hudson Reference Lingam, Abdelhamid and Hudson2016a; Lingam, Miloshevich & Morrison Reference Lingam, Miloshevich and Morrison2016b; Burby Reference Burby2017; Miloshevich, Lingam & Morrison Reference Miloshevich, Lingam and Morrison2017), it would seem natural to utilize the gyromap and thus formulate the gyroviscous contributions for this class of models; after doing so, their equilibria and stability can be obtained by using the HAP approach along the lines of Andreussi et al. (Reference Andreussi, Morrison and Pegoraro2010, Reference Andreussi, Morrison and Pegoraro2012, Reference Andreussi, Morrison and Pegoraro2013, Reference Andreussi, Morrison and Pegoraro2016), Morrison et al. (Reference Morrison, Lingam and Acevedo2014) and Kaltsas, Throumoulopoulos & Morrison (Reference Kaltsas, Throumoulopoulos and Morrison2017, Reference Kaltsas, Throumoulopoulos and Morrison2018, Reference Kaltsas, Throumoulopoulos and Morrison2020) where the stability of a variety of equilibria is analysed using Lagrangian, energy–Casimir and dynamically accessibility methods. Likewise, this approach could also be extended to relativistic MHD and XMHD models with HAP formulations (D'Avignon, Morrison & Pegoraro Reference DÁvignon, Morrison and Pegoraro2015; Grasso et al. Reference Grasso, Tassi, Abdelhamid and Morrison2017; Kawazura, Miloshevich & Morrison Reference Kawazura, Miloshevich and Morrison2017; Coquinot & Morrison Reference Coquinot and Morrison2020; Ludwig Reference Ludwig2020). We mention in passing that it would be interesting to explore how the time-dependent regauging of Andreussi et al. (Reference Andreussi, Morrison and Pegoraro2013) can be used to produce or remove the $\boldsymbol {M}^{\star }$-effects, in a manner analogous to how rotation can produce or remove effects of the magnetic field using Larmor's theorem.
Finally we mention a most basic extension of the present work. Our class of gyroviscous action principles were physically motivated, yet ultimately ad hoc. An alternative would be to start from a more basic model, such as the Vlasov–Maxwell system, and derive a gyroviscous action by asymptotic procedures. A natural starting point would be the Low Lagrangian (Low Reference Low1958) – see also Morrison & Pfirsch (Reference Morrison and Pfirsch1989) and Morrison (Reference Morrison2005) – and then reduce from phase space ‘fluid’ element variables of that theory to the usual fluid element that we have denoted here by $\boldsymbol {q}(\boldsymbol {a},t)$. This would deviate from the usual historical approaches, which encompass most of the early literature, where one proceeds from ordering kinetic equations. Whichever route is taken, one typically uses intuition obtained from finite-dimensional particle orbit dynamics in given strong magnetic fields, and the associated drifts, in order to make approximations, often mixing up discrete particle orbit ideas with field theoretic perturbations. It was argued in Morrison, Vittot & de Guillebon (Reference Morrison, Vittot and de Guillebon2013) that a more consistent approach is to remain within the field theoretic framework, and it would appear prima facie that the Low Lagrangian is a natural framework for doing this. With this approach one could relate $\boldsymbol {M}^\star$ consistently to magnetization and other drifts on the fluid level. We hope to pursue this issue and others in the future.
Acknowledgements
P.J.M. received support from the US Department of Energy Contract no. DE-FG05-80ET-53088 during part of this work. A.W. thanks the Western New England University Research Fund for support. The authors thank the reviewers for their helpful feedback.
Editor Emanuele Tassi thanks the referees for their advice in evaluating this article.
Declaration of interests
The authors report no conflict of interest.
Appendix A. An Euler–Poincaré approach to the 3-D gyromap
The Lagrange–Euler maps, when expressed in an integral form, are given by (2.6) and (2.7). Instead of $\boldsymbol{M}^c$, we can also use the velocity as our observable, and it possesses the following Lagrange–Euler map:
which is equivalent to $\boldsymbol {v} = \dot {\boldsymbol {q}}$, with the right-hand side evaluated at $\boldsymbol {a}=\boldsymbol {q}^{-1}(\boldsymbol {r},t)$. The central idea is to express the Eulerian variations in terms of the Lagrangian ones, and thereby recover the equations of motion conveniently. The approach has classical roots, appeared in the plasma literature in the works of Frieman & Rotenberg (Reference Frieman and Rotenberg1960), Katz (Reference Katz1961), Low (Reference Low1961), Lundgren (Reference Lundgren1963), Calkin (Reference Calkin1963), Merches (Reference Merches1969) and Newcomb (Reference Newcomb1962, Reference Newcomb1972, Reference Newcomb1973, Reference Newcomb1983). The formalism was recast into geometric/group theoretic language in Holm et al. (Reference Holm, Marsden and Ratiu1998), who gave it the title of the ‘Euler–Poincaré’; this paper was motivated to a degree by what the authors called the ‘Arnold program’ (Arnold Reference Arnold1966). It should be pointed out that general variational principles of this form appeared in the early work of Hamel (Reference Hamel1904). The method has subsequently been applied to very many systems, including kinetic theory (Cendra et al. Reference Cendra, Holm, Hoyle and Marsden1998), complex fluids (Gay-Balmaz & Ratiu Reference Gay-Balmaz and Ratiu2009), reduced magnetofluid models (Brizard Reference Brizard2010b) and hybrid fluid-kinetic models (Holm & Tronci Reference Holm and Tronci2012; Tronci & Camporeale Reference Tronci and Camporeale2015; Burby & Tronci Reference Burby and Tronci2017; Close, Burby & Tronci Reference Close, Burby and Tronci2018).
Let us illustrate this procedure by using the magnetic energy density as our example. We shall adopt the notation employed in Andreussi et al. (Reference Andreussi, Morrison and Pegoraro2013) for convenience, where the Lagrangian displacement $\delta \boldsymbol {q}$ is denoted by $\boldsymbol {\xi }$ and its Eulerianized counterpart is denoted by $\boldsymbol {\eta }$. From (3.4), we know that
where we have invoked the Eulerian closure principle. The final step lies in expressing $\delta \boldsymbol {B}$ in terms of $\boldsymbol {\eta }$, which has been undertaken in Frieman & Rotenberg (Reference Frieman and Rotenberg1960) (see also Andreussi et al. Reference Andreussi, Morrison and Pegoraro2013), which we list as follows:
Upon using this in (A 2) and integrating by parts, we recover the $\boldsymbol {J} \times \boldsymbol {B}$ term, which is exactly the term arising in ideal MHD.
Upon applying the Euler–Poincaré method to (3.7), it can be verified that one does indeed recover (4.5) as our final result.
Appendix B. The noncanonical gyroviscous bracket
In (6.13), we presented the gyroviscous bracket in terms of the canonical momenta $M^c$ and the rest of the observables. The correspondence of the gyroviscous bracket with the ideal MHD bracket was also noted.
However, it is much more common to express noncanonical brackets in terms of the kinetic momentum $\boldsymbol{M} = \rho \boldsymbol{v}$, which we shall undertake here. In order to do so, we shall use the gyromap, discussed in § 3.2,
which can be easily rearranged to yield $\boldsymbol {M} = \boldsymbol {M}^c - \boldsymbol {M}^\star$. We shall now use the familiar concept that a given functional can be expressed in any set of (independent) observables. We denote by $F$ the functional in terms of $\boldsymbol {M}^c$ and the rest of the observables, and by $\tilde {F}$, the functional in terms of $\boldsymbol {M}$ and the rest. Since we know that $F \equiv \tilde {F}$, another chain rule calculation starts from
and by using the gyromap, we find that
and by substituting this into (B 2), integrating by parts and eliminating the resultant boundary terms, we finally recover the following relations:
We can now recover the bracket in terms of $\boldsymbol {M}$ from (6.13), by implementing the following two successive steps.
(i) First, replace the ${M}^c_i$ in the first line of (6.13), prior to the functional derivatives, with (B 1). This ensures that only $\boldsymbol {M}$ and the other observables are present.
(ii) Next, the functional derivatives occurring in (6.13) should be replaced with the relations delineated in (B 4).
We shall not list the final bracket in its entirety since its complexity is clearly self-evident.Footnote 6 Hence, this illustrates the advantage of the gyromap in facilitating a much simpler bracket. Simply through the process of inspection, it would have been almost impossible to construct the bracket in terms of $\boldsymbol {M}$ or to find the variable $\boldsymbol {M}^c$ that simplified the bracket.
The Hamiltonian, in terms of $\boldsymbol {M}$, is much simpler as seen from the following expression:
In other words, the resultant Hamiltonian is exactly identical to the total energy associated with ideal MHD (Morrison & Greene Reference Morrison and Greene1980; Freidberg Reference Freidberg2014; Goedbloed et al. Reference Goedbloed, Keppens and Poedts2019).
We note, that any choice for $\boldsymbol {M}^{\star }$ that satisfies the Eulerian closure principle will, under an analogous transformation, yield a complicated bracket in terms of $\boldsymbol {M}$, yet one that reduces to the MHD bracket of Morrison & Greene (Reference Morrison and Greene1980) when the variable $\boldsymbol {M}^c$ is used. Thus, for any choice we have the same trade-off between Hamiltonian and bracket.