Derivation and precision of mean field electrodynamics with mesoscale fluctuations

Hongzhe Zhou; Eric G. Blackman; Luke Chamandy

doi:10.1017/S0022377818000375

Derivation and precision of mean field electrodynamics with mesoscale fluctuations

Part of: 50 years of Mean Field Electrodynamics Featured Articles

Published online by Cambridge University Press: 07 May 2018

Hongzhe Zhou

Eric G. Blackman and

Luke Chamandy

Show author details

Hongzhe Zhou*: Affiliation:
Department of Physics and Astronomy, University of Rochester, Rochester, NY 14627, USA Laboratory for Laser Energetics, University of Rochester, Rochester, NY 14623, USA
Eric G. Blackman*: Affiliation:
Department of Physics and Astronomy, University of Rochester, Rochester, NY 14627, USA Laboratory for Laser Energetics, University of Rochester, Rochester, NY 14623, USA
Luke Chamandy*: Affiliation:
Department of Physics and Astronomy, University of Rochester, Rochester, NY 14627, USA
*: †Email addresses for correspondence: [email protected], [email protected], [email protected]
†Email addresses for correspondence: [email protected], [email protected], [email protected]
†Email addresses for correspondence: [email protected], [email protected], [email protected]

Article contents

Abstract
Introduction
Averaging in MFE using kernels
MFE dynamo equations with correction terms
Precision of mean field theories
MFE precision error in the context of FR
Galactic dynamo and precision for different FR viewing angles
Conclusions
Footnotes
References

Rights & Permissions

Abstract

Mean field electrodynamics (MFE) facilitates practical modelling of secular, large scale properties of astrophysical or laboratory systems with fluctuations. Practitioners commonly assume wide scale separation between mean and fluctuating quantities, to justify equality of ensemble and spatial or temporal averages. Often however, real systems do not exhibit such scale separation. This raises two questions: (I) What are the appropriate generalized equations of MFE in the presence of mesoscale fluctuations? (II) How precise are theoretical predictions from MFE? We address both by first deriving the equations of MFE for different types of averaging, along with mesoscale correction terms that depend on the ratio of averaging scale to variation scale of the mean. We then show that even if these terms are small, predictions of MFE can still have a significant precision error. This error has an intrinsic contribution from the dynamo input parameters and a filtering contribution from differences in the way observations and theory are projected through the measurement kernel. Minimizing the sum of these contributions can produce an optimal scale of averaging that makes the theory maximally precise. The precision error is important to quantify when comparing to observations because it quantifies the resolution of predictive power. We exemplify these principles for galactic dynamos, comment on broader implications, and identify possibilities for further work.

Keywords

astrophysical plasmas plasma dynamics plasma nonlinear phenomena

Type: Research Article
Information: Journal of Plasma Physics , Volume 84 , Issue 3 , June 2018 , 735840302

DOI: https://doi.org/10.1017/S0022377818000375 [Opens in a new window]

NASA ADS Abstract Service [Opens in a new window]
Copyright: © Cambridge University Press 2018

1 Introduction

Mean field electrodynamics (MFE) is a powerful tool for semi-analytical modelling of large scale or secular behaviour of magnetic fields and flows in magnetohydrodynamic and plasma systems with spatial or temporal disorder (e.g. Roberts & Soward Reference Roberts and Soward1975; Krause & Rädler Reference Krause and Rädler1980; Ruzmaikin, Sokoloff & Shukurov Reference Ruzmaikin, Sokoloff and Shukurov1988; Brandenburg & Subramanian Reference Brandenburg and Subramanian2005a ; Kleeorin & Rogachevskii Reference Kleeorin and Rogachevskii2008; Kleeorin et al. Reference Kleeorin, Rogachevskii, Sokoloff and Tomin2009; Blackman Reference Blackman2015). As its name indicates, in MFE physical variables such as the magnetic field $\boldsymbol{B}$ and velocity $\boldsymbol{U}$ are decomposed into mean and fluctuating parts and the equations for the means are derived. The ubiquity of turbulence in astrophysics renders MFE essential for practical comparison between theory and observation. Mean field magnetic dynamo theory is a prominent example of MFE. Standard axisymmetric accretion disk theory with ‘turbulent’ transport is another example, although many practitioners use the theory without recognizing that it is only valid as a mean field theory, and in fact one that should be coupled to mean field dynamo theory (Blackman & Nauman Reference Blackman and Nauman2015). By itself, the term MFE does not specify a single set of approximations or method of averaging. If a system shows large scale field or flow patterns the question is not whether MFE is correct but what is the most appropriate MFE.

Specific averaging methods include the ensemble average (over a very large number of accessible microstates), spatial averages (like box or planar averages) and time averages. Calculations are usually simplified by utilization of Reynolds rules, namely, the linearity of averaging, the interchangeability of differential and average operations, and that averaged quantities behave like constants in averages (e.g. an averaged quantity is invariant if averaged more than once, and the average of the product of a quantity and a mean quantity is equal to the product of the mean of these two quantities.) The ensemble average respects the full Reynolds rules, and is commonly favoured (e.g. Roberts & Soward Reference Roberts and Soward1975; Brandenburg & Subramanian Reference Brandenburg and Subramanian2005a ). In the ensemble average, means are obtained by averaging over an ensemble consisting of a large number of identical systems prepared with different initial states. Fluctuations then have zero means by definition, and statistical properties of all mean physical quantities, such as the turbulent electromotive force (EMF), are determined once the partition function is known. There might seem to be no need to invoke the assumption of large scale separation, but the detailed statistical mechanics and partition function are rarely discussed in the MFE context,Footnote ¹ so it is unclear how to calculate variations of these systems from first principles.

Correlation functions in magnetohydrodynamics (MHD) are usually computed from the equations of motion, either in configuration space or Fourier space (e.g. Pouquet, Frisch & Leorat Reference Pouquet, Frisch and Leorat1976; Ruzmaikin et al. Reference Ruzmaikin, Sokoloff and Shukurov1988; Blackman & Field Reference Blackman and Field2002). Spatial or temporal averages are the most directly relevant choices when analysing simulations, laboratory experiments or astrophysical observations. These averages can however, explicitly break the Reynolds rules in the absence of large scale separation. For example, a planar average in the horizontal plane will destroy any spatial dependence in, say, the $x{-}y$ plane and leave physical quantities solely a function of $z$ , therefore variant when interchanged with $\unicode[STIX]{x2202}_{x}$ or $\unicode[STIX]{x2202}_{y}$ unless the boundary conditions are periodic. Another example is a weighted averaging over a local small volume, which retains full coordinate dependence at the price of a double-averaged quantity which is generally unequal to its single-averaged value as we will later discuss in detail. To avoid these complications, MFE practitioners typically assume that the system to which the theory is being compared has a large scale separation between fluctuating and mean quantities. The Reynolds rules are then quasi-justified for spatial and temporal averages, and are deemed to be good approximations to the ensemble average (Brandenburg & Subramanian Reference Brandenburg and Subramanian2005a ).

Some effects of turbulence on astrophysical observables have been discussed (Burn Reference Burn1966; Spangler Reference Spangler1982; Eilek Reference Eilek1989a ,Reference Eilek b ; Tribble Reference Tribble1991; Sokoloff et al. Reference Sokoloff, Bykov, Shukurov, Berkhuijsen, Beck and Poezd1998), but the mean or ordered fields were typically defined explicitly or implicitly via ensemble averages. Here we focus on the problem that real systems do not typically have a large scale separation between fluctuations and large scale quantities, and thus equating ensemble and spatial averages above can be questioned. If $l_{s}$ and $l_{L}$ are the characteristic lengths of small- and large scale fields, galaxies, for example, may have $l_{L}\simeq 1~\text{kpc}$ and $0.05~\leqslant l_{s}\leqslant 0.1~\text{kpc}$ so that $l_{s}/l_{L}\geqslant 1/20$ which is not infinitesimal. As another example, in one of their solar dynamo models, Moss et al. (Reference Moss, Sokoloff, Usoskin and Tutubalin2008) have introduced a dynamo coefficient with long-term variations and a correlation time of turbulent fields set to be of the same order as the period of the solar magnetic activity. In this case, the ratio of the mean to fluctuating time scales would be ${\sim}1$ . Finite scale separation in time scales is equivalent to $\langle \boldsymbol{B}\rangle \neq \int _{t}^{t+T}\text{d}t\,\boldsymbol{B}(t)$ where $T$ is a time scale much greater than the eddy turnover time, but still much smaller than the time scale of mean fields. This implies the system is non-ergodic. For more detailed discussions about non-ergodicity of MHD systems, see Shebalin (Reference Shebalin1989, Reference Shebalin2010, Reference Shebalin2013) and the references therein. We are thus led to two specific questions: (I) In the presence of intermediate or mesoscale fluctuations what are the correction terms to standard ensemble-averaged MFE? And (II) what precision does this imply when comparing the theory to observations?

To address question (I), we compare the standard MFE equations from ensemble averaging to those formally derived using a spatially local average when the scale of averaging is not arbitrarily smaller than the mean field gradient scales. We define spatial averages as convolutions between the total field and a kernel with a prescribed scale of averaging $l$ such that $l_{s}<l<l_{L}$ (Germano Reference Germano1992). Such ‘coarse-graining’ techniques have been applied to hydrodynamic turbulence (Leonard Reference Leonard1974; Meneveau & Katz Reference Meneveau and Katz2000; Eyink & Aluie Reference Eyink and Aluie2009), as well as MHD turbulence (Aluie & Eyink Reference Aluie and Eyink2010; Aluie Reference Aluie2017). Gent et al. (Reference Gent, Shukurov, Sarson, Fletcher and Mantere2013) used a Gaussian kernel for averaging in simulations to explore scale separation of magnetic fields. Frick et al. (Reference Frick, Beck, Berkhuijsen and Patrickeyev2001) used a mathematically similar method, wavelet transforms, for the analysis of galactic images. Relevant kernels are localized in both configuration and Fourier space to filter out small scales. Here we go beyond previous work and derive corrections to standard MFE which depend on the ratio $(l/l_{L})^{c}$ , where the power $c$ depends on the choice of kernel. For $l/l_{L}\ll 1$ the standard MFE equations are recovered.

Another way to describe the importance of mesoscale fluctuations for MFE is that contributions to averages are non-local, requiring weighing over a kernel of finite spatial or temporal range. In this respect, what we do here differs from Rheinhardt & Brandenburg (Reference Rheinhardt and Brandenburg2012), even though they also motivate their work by recognizing a need to account for non-locality. Their focus is on empirically extracting from simulations the kernel of proportionality relating the turbulent EMF and the mean magnetic field, and constraining an ansatz for that kernel when the mean magnetic field is defined with a planar average. In contrast, we derive corrections that directly arise from the mean field averaging procedure itself, and identify the lowest-order correction terms resulting from distinct choices of the averaging kernel when Reynolds rules are violated. As we discuss later, the approach of Rheinhardt & Brandenburg (Reference Rheinhardt and Brandenburg2012) can actually be viewed as semi-empirically testing the turbulent closure in MFE.

To address question (II) above, the precision of MFE in the presence of mesoscale fluctuations, we identify two types of errors: (i) the ‘intrinsic error’ (IE) of the mean fields that arises from the uncertainties to the input parameters of the mean field equations, and (ii) the ‘filtering error’ (FE) that results if the theoretical averaging procedure does not match that for values extracted from the observational data. As we will see in § 4, when using ensemble averages, the IE vanishes and the FE is finite but unquantifiable if partition functions are unknown. For the IE in our formalism, we identify the importance of the ratio $l/l_{s}$ , where $l_{s}$ is the integral (energy dominating) scale of the turbulent magnetic field. This ratio emerges because contributions to the error about the mean from fluctuations vary as ${\sim}N^{-1/2}$ where $N\simeq (l/l_{s})^{3}$ is the number of eddies contained in an averaged cell. For the FE, the ratio $L/l_{s}$ is most important, where $L$ is the scale of average associated with the observation method, and in general differs from $l$ . Although $l_{s}$ increases with increasing $l$ because $l_{s}$ is roughly the average scale of modes with wavenumbers ${\leqslant}2\unicode[STIX]{x03C0}/l$ , the dependence is weak if the small scale turbulent spectrum of the magnetic field peaks near $l_{s}$ . As a result, the ratio $l/l_{s}$ is roughly proportional to $l$ whereas $L/l_{s}$ decreases as $l$ increases. That the IE and FE have complementary dependences on $l$ implies that their sum may have an optimal scale of averaging that minimizes the total error. We will show that both types of precision errors are quantifiable, and can be significant in galaxies for example.

In § 2 we introduce the local spatial averages using kernels, and formally derive correction terms when the Reynolds rules are not exactly obeyed. In § 3 we apply these results to derive the generalized dynamo equations of MFE and show that the mesoscale correction terms are in fact generally small using order-of-magnitude estimates. We also contrast our method and compare our equations to the dynamo equations of Rheinhardt & Brandenburg (Reference Rheinhardt and Brandenburg2012). In § 4 we present a general discussion on the two types of uncertainties aforementioned. In § 5 we show how to compute the total error in the specific case of comparing MFE to Faraday rotation (FR) measurements and apply this to different galactic viewing angles in § 6. We conclude in § 7.

2 Averaging in MFE using kernels

In this section, we introduce the general formalism for averaging using kernels, preparing for the reformulation of MFE in the next section.

2.1 General formalism

As per standard MFE practice, we separate any vector field $\boldsymbol{A}$ into a mean part $\overline{\boldsymbol{A}}$ and a fluctuation part $\boldsymbol{a}$ ,

(2.1)

$$\begin{eqnarray}\boldsymbol{A}(\boldsymbol{x})=\overline{\boldsymbol{A}}(\boldsymbol{x})+\boldsymbol{a}(\boldsymbol{x}).\end{eqnarray}$$

The mean part is defined via

(2.2)

$$\begin{eqnarray}\overline{\boldsymbol{A}}(\boldsymbol{x})=G_{l}(\boldsymbol{x})\ast \boldsymbol{A}(\boldsymbol{x})=\int \text{d}^{3}x^{\prime }G_{l}(\boldsymbol{x}-\boldsymbol{x}^{\prime })\boldsymbol{A}(\boldsymbol{x}^{\prime }),\end{eqnarray}$$

where ‘ $\ast$ ’ denotes a convolution. The filtering kernel $G_{l}(\boldsymbol{x})$ is a prescribed function with a characteristic scale of averaging $l$ , satisfying $l_{s}<l_{\text{eff}}(l)<l_{L}$ , where $l_{\text{eff}}$ can be viewed as the configuration space dividing scale between large and small scale fields. We define $l$ such that $l=l_{\text{eff}}$ for our analytic derivations.Footnote ² We have assumed that the system under consideration is statistically homogeneous and isotropic on scales ${\leqslant}l$ , so that $l$ is independent of location and $G_{l}(\boldsymbol{x})$ is isotropic. For anisotropic or inhomogeneous systems, $G_{l}(\boldsymbol{x})$ could be anisotropic and $l$ could be a function of spatial coordinates.

We use the following definition of the Fourier transform:

(2.3)

$$\begin{eqnarray}{\mathcal{F}}[f(\boldsymbol{x})](\boldsymbol{k})={\displaystyle\mathop{f}\limits_{{\sim}}}(\boldsymbol{k})=\int \text{d}^{3}x\,f(\boldsymbol{x})\text{e}^{-\text{i}\boldsymbol{k}\boldsymbol{\cdot }\boldsymbol{x}},\end{eqnarray}$$

and

(2.4)

$$\begin{eqnarray}{\mathcal{F}}^{-1}[{\displaystyle\mathop{f}\limits_{{\sim}}}(\boldsymbol{k})](\boldsymbol{x})=f(\boldsymbol{x})=\frac{1}{(2\unicode[STIX]{x03C0})^{3}}\int \text{d}^{3}k{\displaystyle\mathop{f}\limits_{{\sim}}}(\boldsymbol{k})\text{e}^{\text{i}\boldsymbol{k}\boldsymbol{\cdot }\boldsymbol{x}},\end{eqnarray}$$

and therefore the Fourier transform of $\overline{\boldsymbol{A}}$ is given by

(2.5)

$$\begin{eqnarray}\overline{{\displaystyle\mathop{\boldsymbol{A}}\limits_{{\sim}}}}(\boldsymbol{k})={\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(\boldsymbol{k}){\displaystyle\mathop{\boldsymbol{A}}\limits_{{\sim}}}(\boldsymbol{k}).\end{eqnarray}$$

Unlike idealized ensemble averages, equation (2.5) implies $\overline{\overline{\boldsymbol{A}}}\neq \overline{\boldsymbol{A}}$ since ${\displaystyle\mathop{G}\limits_{{\sim}}}_{l}^{2}(\boldsymbol{k})\neq {\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(\boldsymbol{k})$ unless ${\displaystyle\mathop{G}\limits_{{\sim}}}(\boldsymbol{k})=0$ or 1, such as for a step function in Fourier space. However, interchangeability of differential and average operations, as commonly invoked, is manifest in Fourier space since $k_{i}[{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(\boldsymbol{k}){\displaystyle\mathop{A}\limits_{{\sim}}}_{j}(\boldsymbol{k})]={\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(\boldsymbol{k})[k_{i}{\displaystyle\mathop{A}\limits_{{\sim}}}_{j}(\boldsymbol{k})]$ .

The kernel $G_{l}(\boldsymbol{x})$ must meet several requirements for a practical mean field theory. First, it should be a spatially local function that decreases rapidly for $|\boldsymbol{x}|\gtrsim l$ , being that it is used to extract a filtered value at a scale $l$ . Complementarily, its Fourier transform ${\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(\boldsymbol{k})$ should also monotonically decrease and vanish for large $|\boldsymbol{k}|$ . Furthermore, in the limit $l\rightarrow 0$ , ${\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(\boldsymbol{k})$ approaches unity, since no filtering is needed for large scales. Thus, ${\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(\boldsymbol{k})$ can be expanded around $|\boldsymbol{k}|=k=0$ when $|\boldsymbol{k}l|/2\unicode[STIX]{x03C0}=|\boldsymbol{k}|/k_{l}$ is small compared to unity, yielding

(2.6)

$$\begin{eqnarray}{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(\boldsymbol{k})=1-{\displaystyle\mathop{\unicode[STIX]{x1D6FE}}\limits_{{\sim}}}+O({\displaystyle\mathop{\unicode[STIX]{x1D6FE}}\limits_{{\sim}}}^{2}),\end{eqnarray}$$

where ${\displaystyle\mathop{\unicode[STIX]{x1D6FE}}\limits_{{\sim}}}$ is a small parameter related to $|\boldsymbol{k}|/k_{l}$ , and the minus sign is for future convenience. Note that ${\displaystyle\mathop{\unicode[STIX]{x1D6FE}}\limits_{{\sim}}}$ is independent of the direction of $\boldsymbol{k}$ due to isotropy.

The inverse Fourier transform of ${\displaystyle\mathop{\unicode[STIX]{x1D6FE}}\limits_{{\sim}}}$ is an operator $\hat{\unicode[STIX]{x1D6FE}}$ which is determined by

(2.7)

$$\begin{eqnarray}(\hat{\unicode[STIX]{x1D6FE}}f)(\boldsymbol{x})={\mathcal{F}}^{-1}[{\displaystyle\mathop{\unicode[STIX]{x1D6FE}}\limits_{{\sim}}}(\boldsymbol{k}){\displaystyle\mathop{f}\limits_{{\sim}}}(\boldsymbol{k})](\boldsymbol{x}).\end{eqnarray}$$

Hereafter we assume that these fields are either vanishing or periodic at the spatial boundaries, and therefore any ${\displaystyle\mathop{\unicode[STIX]{x1D6FE}}\limits_{{\sim}}}(\boldsymbol{k})$ proportional to a power of $i\boldsymbol{k}$ is simply translated to a $\hat{\unicode[STIX]{x1D6FE}}$ which is a spatial derivative raised to the corresponding power. When applied to a quantity $Q$ with smallest characteristic scale $l_{\text{ch}}>l$ , the order-of-magnitude estimate yields

(2.8)

$$\begin{eqnarray}\hat{\unicode[STIX]{x1D6FE}}Q\sim \left(\frac{l}{l_{\text{ch}}}\right)^{c}Q,\end{eqnarray}$$

with $c$ being a positive number that depends on the specific choice of kernel.

2.2 Expressions for averages of fluctuations and double averages

Here we obtain formulae for averages of fluctuations and double averages, both of which do not strictly obey Reynolds rules in the presence of mesoscale fluctuations. In particular, the averages of fluctuations do not vanish and the double averages will not agree with single-averaged values. We will use the expressions in subsequent sections.

We first derive an expression for the mean of fluctuations, namely $\overline{\boldsymbol{a}}=\overline{\boldsymbol{A}}-\overline{\overline{\boldsymbol{A}}}$ . This vanishes in conventional MFE using the ensemble average, but not for spatial averages. In Fourier space, by definition,

(2.9)

$$\begin{eqnarray}\overline{{\displaystyle\mathop{\boldsymbol{a}}\limits_{{\sim}}}}(\boldsymbol{k})={\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(\boldsymbol{k}){\displaystyle\mathop{\boldsymbol{A}}\limits_{{\sim}}}(\boldsymbol{k})-{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}^{2}(\boldsymbol{k}){\displaystyle\mathop{\boldsymbol{A}}\limits_{{\sim}}}(\boldsymbol{k})=(1-{\displaystyle\mathop{G}\limits_{{\sim}}}_{l})\overline{{\displaystyle\mathop{\boldsymbol{A}}\limits_{{\sim}}}}.\end{eqnarray}$$

Since $\overline{{\displaystyle\mathop{\boldsymbol{A}}\limits_{{\sim}}}}$ is a large scale quantity, it decays rapidly when $|\boldsymbol{k}|/k_{l}\gg 1$ . Therefore we can expand the right-hand side of (2.9) for $|\boldsymbol{k}|/k_{l}\ll 1$ using (2.6) to obtain

(2.10)

$$\begin{eqnarray}\overline{{\displaystyle\mathop{\boldsymbol{a}}\limits_{{\sim}}}}(\boldsymbol{k})=[{\displaystyle\mathop{\unicode[STIX]{x1D6FE}}\limits_{{\sim}}}+O({\displaystyle\mathop{\unicode[STIX]{x1D6FE}}\limits_{{\sim}}}^{2})]\,\overline{{\displaystyle\mathop{\boldsymbol{A}}\limits_{{\sim}}}}.\end{eqnarray}$$

In configuration space, this implies

(2.11)

$$\begin{eqnarray}\overline{\boldsymbol{a}}(\boldsymbol{x})=\overline{\boldsymbol{A}}-\overline{\overline{\boldsymbol{A}}}=[\hat{\unicode[STIX]{x1D6FE}}+O(\hat{\unicode[STIX]{x1D6FE}}^{2})]\,\overline{\boldsymbol{A}}\sim \left(\frac{l}{l_{L}}\right)^{c}\overline{\boldsymbol{A}},\end{eqnarray}$$

if $\overline{\boldsymbol{A}}$ has a characteristic variation scale of $l_{L}$ . Equivalently

(2.12)

$$\begin{eqnarray}\overline{\overline{\boldsymbol{A}}}=[1-\hat{\unicode[STIX]{x1D6FE}}+O(\hat{\unicode[STIX]{x1D6FE}}^{2})]\,\overline{\boldsymbol{A}}.\end{eqnarray}$$

To recover conventional MFE, we simply take the limit $l/l_{L}\rightarrow 0$ and get $\overline{\boldsymbol{a}}=\mathbf{0}$ .

Next, we obtain an expression for the mean of the product of two fields, $\overline{AB}$ , in terms of the mean fields. Here $A$ and $B$ can be either two scalar fields or the components of some vector fields. We adopt a two-scale approach, assuming that the fields have double-peaked spectra and scale separations are large but finite, i.e. we relax the assumption of infinite scale separation in conventional approaches (for details see appendix A where the valid range of scale separation is quantified). Other closures may include a test filtering process like that used in the Smagorinsky model (Smagorinsky Reference Smagorinsky1963; Germano et al. Reference Germano, Piomelli, Moin and Cabot1991; Lilly Reference Lilly1992).

By straightforward expansion we have

(2.13)

$$\begin{eqnarray}\overline{AB}=\overline{\overline{A}~\overline{B}}+\overline{a\overline{B}}+\overline{\overline{A}b}+\overline{ab}.\end{eqnarray}$$

We refer to the terms on the right-hand side of (2.13) as $T_{1}$ , $T_{2}$ , $T_{3}$ and $T_{4}$ , respectively. The calculation of $T_{1}$ involves only mean quantities, but for practical purposes, it is convenient to make some further approximations to avoid integro-differential equations. If $\overline{A}$ and $\overline{B}$ both have a characteristic scale of variation $l_{L}$ , then the spectrum of the Fourier transform of their product will roughly extend to $k=2k_{L}$ . If the scale of average satisfies $2k_{L}l/2\unicode[STIX]{x03C0}=k_{L}l/\unicode[STIX]{x03C0}\ll 1$ , we can use (2.12) to calculate $T_{1}$ , namely,

(2.14)

$$\begin{eqnarray}\overline{\overline{A}~\overline{B}}=[1-\hat{\unicode[STIX]{x1D6FE}}+O(\hat{\unicode[STIX]{x1D6FE}}^{2})](\overline{A}~\overline{B}).\end{eqnarray}$$

The Fourier transform of $T_{2}$ is

(2.15)

$$\begin{eqnarray}{\displaystyle\mathop{T}\limits_{{\sim}}}_{2}(\boldsymbol{k})={\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(\boldsymbol{k})[{\displaystyle\mathop{a}\limits_{{\sim}}}(\boldsymbol{k})\ast \overline{{\displaystyle\mathop{B}\limits_{{\sim}}}}(\boldsymbol{k})]={\displaystyle\mathop{G}\limits_{{\sim}}}_{l}\{[({\displaystyle\mathop{G}\limits_{{\sim}}}_{l}^{-1}-1)\overline{{\displaystyle\mathop{A}\limits_{{\sim}}}}]\ast \overline{{\displaystyle\mathop{B}\limits_{{\sim}}}}\},\end{eqnarray}$$

where we have used the definition ${\displaystyle\mathop{\boldsymbol{a}}\limits_{{\sim}}}=(1-{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}){\displaystyle\mathop{\boldsymbol{A}}\limits_{{\sim}}}$ . The convolution of two quantities with characteristic wavenumbers $k_{1}$ and $k_{2}$ will yield wavenumbers $k_{1}\pm k_{2}$ . Note that $G_{l}$ is outside of the square brackets and so with periodic or vanishing boundary conditions, only the low wavenumber part of the factor $(G^{-1}-1)\overline{A}$ survives on the right-hand side of (2.15). (The validity of this approximation is discussed in more detail in appendix A.) We therefore expand $G_{l}^{-1}$ in a Taylor series, yielding

(2.16)

$$\begin{eqnarray}({\displaystyle\mathop{G}\limits_{{\sim}}}_{l}^{-1}-1)\overline{{\displaystyle\mathop{A}\limits_{{\sim}}}}=[{\displaystyle\mathop{\unicode[STIX]{x1D6FE}}\limits_{{\sim}}}+O({\displaystyle\mathop{\unicode[STIX]{x1D6FE}}\limits_{{\sim}}}^{2})]\,\overline{{\displaystyle\mathop{A}\limits_{{\sim}}}},\end{eqnarray}$$

which upon Fourier inversion then implies

(2.17)

$$\begin{eqnarray}T_{2}=\overline{a\overline{B}}=\overline{\overline{B}[\hat{\unicode[STIX]{x1D6FE}}+O(\hat{\unicode[STIX]{x1D6FE}}^{2})]\,\overline{A}}.\end{eqnarray}$$

Similarly, for $T_{3}$ we obtain

(2.18)

$$\begin{eqnarray}T_{3}=\overline{b\overline{A}}=\overline{\overline{A}[\hat{\unicode[STIX]{x1D6FE}}+O(\hat{\unicode[STIX]{x1D6FE}}^{2})]\overline{B}}.\end{eqnarray}$$

The sum $T_{2}+T_{3}$ , using (2.14), is then

(2.19)

$$\begin{eqnarray}\overline{a\overline{B}}+\overline{b\overline{A}}=[\hat{\unicode[STIX]{x1D6FE}}+O(\hat{\unicode[STIX]{x1D6FE}}^{2})](\overline{\overline{A}~\overline{B}})-\overline{\hat{\unicode[STIX]{x1D6FE}}^{\prime }(\overline{A},\overline{B})}=[\hat{\unicode[STIX]{x1D6FE}}+O(\hat{\unicode[STIX]{x1D6FE}}^{2})](\overline{A}~\overline{B})-\hat{\unicode[STIX]{x1D6FE}}^{\prime }(\overline{A},\overline{B}),\end{eqnarray}$$

where $\hat{\unicode[STIX]{x1D6FE}}^{\prime }$ , a binary operator, is introduced to account for the violation of the distribution rule of $\hat{\unicode[STIX]{x1D6FE}}$ ; that is,

(2.20)

$$\begin{eqnarray}\hat{\unicode[STIX]{x1D6FE}}^{\prime }(A,B)=\hat{\unicode[STIX]{x1D6FE}}(AB)-(A\hat{\unicode[STIX]{x1D6FE}}B+B\hat{\unicode[STIX]{x1D6FE}}A).\end{eqnarray}$$

Note that $\hat{\unicode[STIX]{x1D6FE}}^{\prime }(A,B)$ has the same order of magnitude as $B\hat{\unicode[STIX]{x1D6FE}}A$ or $A\hat{\unicode[STIX]{x1D6FE}}B$ if $A$ and $B$ have the same characteristic length scale.

Combining (2.13), (2.14) and (2.19) we obtain

(2.21)

$$\begin{eqnarray}\overline{AB}=[1+O(\hat{\unicode[STIX]{x1D6FE}}^{2})](\overline{A}~\overline{B})-\hat{\unicode[STIX]{x1D6FE}}^{\prime }(\overline{A},\overline{B})+\overline{ab}.\end{eqnarray}$$

Furthermore, it can be verified using (2.12), (2.17) and (2.21) together that

(2.22)

$$\begin{eqnarray}\displaystyle \overline{A\overline{B}} & = & \displaystyle \overline{A}~\overline{\overline{B}}-\hat{\unicode[STIX]{x1D6FE}}^{\prime }(\overline{A},\overline{\overline{B}})+\overline{a\overline{b}}\nonumber\\ \displaystyle & = & \displaystyle \overline{A}(1-\hat{\unicode[STIX]{x1D6FE}})\overline{B}-\hat{\unicode[STIX]{x1D6FE}}^{\prime }(\overline{A},(1-\hat{\unicode[STIX]{x1D6FE}})\overline{B})+\overline{\overline{b}\hat{\unicode[STIX]{x1D6FE}}\overline{A}}+O(\hat{\unicode[STIX]{x1D6FE}}^{2})\nonumber\\ \displaystyle & = & \displaystyle \overline{A}(1-\hat{\unicode[STIX]{x1D6FE}})\overline{B}-\hat{\unicode[STIX]{x1D6FE}}^{\prime }(\overline{A},\overline{B})+O(\hat{\unicode[STIX]{x1D6FE}}^{2})\nonumber\\ \displaystyle & = & \displaystyle (1-\hat{\unicode[STIX]{x1D6FE}})(\overline{A}~\overline{B})+\overline{B}\hat{\unicode[STIX]{x1D6FE}}\overline{A}+O(\hat{\unicode[STIX]{x1D6FE}}^{2}).\end{eqnarray}$$

2.3 Comparison to previous work

Expressing a turbulent field as ${\displaystyle\mathop{a}\limits_{{\sim}}}=(1-{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}){\displaystyle\mathop{A}\limits_{{\sim}}}$ is equivalent to applying a high-pass filter on $A$ , as has been discussed in Yeo (Reference Yeo1987). In Yeo (Reference Yeo1987) all fields are expressed in terms of mean fields and their derivatives (Yeo–Bedford expansion), including small scale fields, so the approach facilitates a closure in their context of the inertial range for large eddy simulations (LES). In our approach, we focus on the large scale mean fields, not the inertial range. We keep two-point correlations of turbulent fields ( $\overline{ab}$ -like terms in (2.21)) but use a separate closure for triple correlations (compare (5.8) to (5.11) of Yeo (Reference Yeo1987) to our (2.21)). In our formalism, $\hat{\unicode[STIX]{x1D6FE}}$ terms enter as corrections to capture finite scale separation effects, facilitating comparisons to conventional approaches (e.g. ensemble averages), while allowing different closures. Also, Yeo (Reference Yeo1987) use a Gaussian kernel, whereas our discussions in the previous sections apply to any kernel meeting the requirements in § 2.1.

2.4 Unifying different averaging methods using kernels

Here we discuss commonly used averages and their kernel forms (if possible). Recall from above that in order to accurately capture large scale features, a suitable kernel for mean field theories should at least be monotonically decreasing in Fourier space.

2.4.1 Gaussian average

For isotropic and homogeneous turbulence, the Gaussian kernel is defined as

(2.23)

$$\begin{eqnarray}G_{l}(\boldsymbol{x})=\left(\frac{k_{l}^{2}}{2\unicode[STIX]{x03C0}}\right)^{3/2}\text{e}^{-k_{l}^{2}|\boldsymbol{x}|^{2}/2},\end{eqnarray}$$

where $k_{l}=2\unicode[STIX]{x03C0}/l$ . It is then evident that $\overline{\boldsymbol{A}}$ represents the large scale part of $\boldsymbol{A}$ by rewriting it in Fourier space. This gives

(2.24)

Since the kernel decreases rapidly for large $k$ , as long as the spectrum of ${\displaystyle\mathop{\boldsymbol{A}}\limits_{{\sim}}}(\boldsymbol{k})$ does not increase exponentially at large $k$ , the spectrum of $\overline{{\displaystyle\mathop{\boldsymbol{A}}\limits_{{\sim}}}}(\boldsymbol{k})$ has little power for $k>k_{l}$ . For $k<k_{l}$ we can then write

(2.25)

$$\begin{eqnarray}{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(\boldsymbol{k})=1-\frac{k^{2}}{2k_{l}^{2}}+O\left(\frac{k^{4}}{k_{l}^{4}}\right)+\cdots ,\end{eqnarray}$$

so that in configuration space $\hat{\unicode[STIX]{x1D6FE}}=-\unicode[STIX]{x1D6FB}^{2}/2k_{l}^{2}$ (recall the ‘ $-$ ’ sign in the definition of ${\displaystyle\mathop{\unicode[STIX]{x1D6FE}}\limits_{{\sim}}}$ from (2.6)) and $\hat{\unicode[STIX]{x1D6FE}}^{\prime }(A,B)=-\unicode[STIX]{x1D735}A\boldsymbol{\cdot }\unicode[STIX]{x1D735}B/k_{l}^{2}$ for any $A$ and $B$ . Overall, $\hat{\unicode[STIX]{x1D6FE}}$ operating on quantity $Q$ gives $\hat{\unicode[STIX]{x1D6FE}}Q\sim (l/l_{\text{ch}})^{2}Q$ where $l_{\text{ch}}$ is the characteristic variation scale of $Q$ .

2.4.2 Moving box average

Here fields at a point $\boldsymbol{x}$ are averaged in a finite box with sides of length $l$ . Expressing the average using a kernel allows the integral bounds to be taken to infinity, that is

(2.26)

$$\begin{eqnarray}\overline{\boldsymbol{A}}(\boldsymbol{x})=\frac{1}{l^{3}}\int _{-l/2}^{l/2}\,\text{d}x^{\prime }\int _{-l/2}^{l/2}\,\text{d}y^{\prime }\int _{-l/2}^{l/2}\,\text{d}z^{\prime }\boldsymbol{A}(\boldsymbol{x}-\boldsymbol{x}^{\prime })=\int \text{d}^{3}x^{\prime }G_{l}(\boldsymbol{x}^{\prime })\boldsymbol{A}(\boldsymbol{x}-\boldsymbol{x}^{\prime }),\end{eqnarray}$$

where $G_{l}(\boldsymbol{x})=\unicode[STIX]{x1D703}_{l}(x)\unicode[STIX]{x1D703}_{l}(y)\unicode[STIX]{x1D703}_{l}(z)$ is the product of three rectangular functions defined by

(2.27)

$$\begin{eqnarray}\unicode[STIX]{x1D703}_{l}(x)=\left\{\begin{array}{@{}ll@{}}1/l & -l/2\leqslant x\leqslant l/2\\ 0 & \text{otherwise}\end{array}\right\}.\end{eqnarray}$$

We call this a ‘moving’ average because it is not taken on a fixed grid, but centred around each point $\boldsymbol{x}$ . Although suitable for numerical simulation analyses and seemingly benign, this has limitations for applicability to realistic contexts. The reason is evident from the Fourier transform of the kernel of a one-dimensional running box, namely

(2.28)

$$\begin{eqnarray}{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(k)=\text{sinc}\left(\frac{kl}{2}\right).\end{eqnarray}$$

Here $|{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}|$ is a non-monotonic function of $k$ with zero points at $kl=n\unicode[STIX]{x03C0}$ , where $n\in \mathbb{Z}$ . As a result, some modes with large wave numbers may contribute more to the mean than those with small wavenumbers. This contradicts our basic notion of mean field theory and highlights why monotonicity of the kernel is a requirement for a physically motivated kernel. A secondary pathology is that modes with wavelengths $2l/n$ are completely absent from the mean fields calculated using this kernel, although this problem is lessened for large scale separation $l\ll l_{L}$ , since then only a few modes lie near $k=n\unicode[STIX]{x03C0}/l$ .

2.4.3 Moving line segment average

A one-dimensional, or line average over a segment of length $l$ , is a variant of the moving box average but with the averages taken along a single direction $\hat{\boldsymbol{r}}_{0}$ :

(2.29)

$$\begin{eqnarray}\overline{\boldsymbol{A}}(\boldsymbol{x},\hat{\boldsymbol{r}}_{0})=\frac{1}{l}\int _{0}^{l}\text{d}s\,\boldsymbol{A}(\boldsymbol{x}+s\hat{\boldsymbol{r}}_{0})=\int \text{d}s\,\unicode[STIX]{x1D703}_{l}\left(s-\frac{l}{2}\right)\boldsymbol{A}(\boldsymbol{x}+s\hat{\boldsymbol{r}}_{0}).\end{eqnarray}$$

Note that the argument of $\unicode[STIX]{x1D703}_{l}$ is shifted here because the line segment over which the average is taken starts from $\boldsymbol{x}$ , rather than being centred at $\boldsymbol{x}$ . The Fourier transform of the kernel is obtained by directly calculating the Fourier transform of $\overline{\boldsymbol{A}}$ , which gives

(2.30)

$$\begin{eqnarray}{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(\boldsymbol{k},\hat{\boldsymbol{r}}_{0})=\frac{\text{e}^{\text{i}\boldsymbol{k}\boldsymbol{\cdot }\hat{\boldsymbol{r}}_{0}l}-1}{\text{i}\boldsymbol{k}\boldsymbol{\cdot }\hat{\boldsymbol{r}}_{0}l}.\end{eqnarray}$$

When $|\boldsymbol{k}l|\ll 1$ , the expansion of ${\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(\boldsymbol{k},\hat{\boldsymbol{r}}_{0})$ gives ${\displaystyle\mathop{\unicode[STIX]{x1D6FE}}\limits_{{\sim}}}=-(\text{i}/2)\boldsymbol{k}\boldsymbol{\cdot }\hat{\boldsymbol{r}}_{0}l$ and thus $\hat{\unicode[STIX]{x1D6FE}}=(1/2)l\hat{\boldsymbol{r}}_{0}\boldsymbol{\cdot }\unicode[STIX]{x1D735}$ and $\hat{\unicode[STIX]{x1D6FE}}^{\prime }=0$ . The moving line segment average has the same problems of physical applicability as the moving box average for MFE.

2.4.4 Fixed grid averages

We may also average fields inside a set of fixed boxes, i.e. fixed-grid average. For this case, the mean of $\boldsymbol{A}(\boldsymbol{x})$ is given by

(2.31)

$$\begin{eqnarray}\overline{\boldsymbol{A}}(\boldsymbol{x})=\mathop{\sum }_{i=1}^{N_{\text{box}}}\unicode[STIX]{x1D703}_{l}(\boldsymbol{x}-\boldsymbol{x}_{i})\overline{\boldsymbol{A}}_{\text{m.b.}}(\boldsymbol{x}_{i}),\end{eqnarray}$$

where $\{\boldsymbol{x}_{i}\}$ , $i=1,2,\ldots ,N_{\text{box}}$ is the set of points of a grid with side length $l$ and the subscript m.b. denotes that $\overline{\boldsymbol{A}}_{\text{m.b.}}$ is a moving box average. Note that since in each grid cell $\overline{\boldsymbol{A}}(\boldsymbol{x})$ is a constant, $\overline{\overline{\boldsymbol{A}}}=\overline{\boldsymbol{A}}$ , this fixed-grid average results in a mean field valued discretely in space. To recover a mean field which is smooth in space, one may apply a second average using a proper (e.g. Gaussian) kernel. Nevertheless, the averaged field still misses those modes which are resonant to the side length of the grid, and thus again is physically problematic for observational applications of MFE.

2.4.5 Planar average

The planar average is widely used in simulations (e.g. Brandenburg Reference Brandenburg2009; Hubbard & Brandenburg Reference Hubbard and Brandenburg2011; Bhat, Ebrahimi & Blackman Reference Bhat, Ebrahimi and Blackman2016). It is manifestly anisotropic since it integrates out, say, $x$ and $y$ but leaves the full $z$ dependence. We write

(2.32)

$$\begin{eqnarray}\overline{\boldsymbol{A}}(z)=\frac{1}{L^{2}}\int \text{d}x\,\text{d}y\,\boldsymbol{A}(\boldsymbol{x}),\end{eqnarray}$$

if $L$ is the side length of the simulation box. The full Reynolds rules, especially the interchangeability of differential and average operations, are respected if boundary conditions are periodic. That is

(2.33)

$$\begin{eqnarray}\overline{\unicode[STIX]{x2202}_{x}\boldsymbol{A}(\boldsymbol{x})}=\frac{1}{L^{2}}\int \text{d}x\,\text{d}y\unicode[STIX]{x2202}_{x}\boldsymbol{A}(\boldsymbol{x})=\frac{1}{L^{2}}\int \text{d}y\,\boldsymbol{A}(\boldsymbol{x})|_{x=0}^{x=L}=0=\unicode[STIX]{x2202}_{x}\overline{\boldsymbol{A}}(\boldsymbol{x}).\end{eqnarray}$$

The planar average, although fine for simulation boxes, does not remove large $k$ modes in the $z$ direction from the mean and so does not fully filter small scale fields from large scale fields for a real system.

2.4.6 Time average

The time average separates fields into mean and fluctuation components according to their characteristic time variation scales. Mathematically, there is little difference between time average and a one dimensional spatial average if we consider fields to evolve on a four-dimensional space–time manifold, $\boldsymbol{A}=\boldsymbol{A}(t,\boldsymbol{x})$ . The mean quantities are defined via the convolution between the actual fields and a one-dimensional kernel in time, $\overline{\boldsymbol{A}}(t)=G_{l}(t)\ast \boldsymbol{A}(t)$ . For example, a Gaussian kernel is $G_{t_{0}}(t)=\text{e}^{-t^{2}/2t_{0}^{2}}/\sqrt{2\unicode[STIX]{x03C0}t_{0}^{2}}$ where $t_{0}$ is the time scale of average (for applications of space-time filtering, see Dakhoul & Bedford Reference Dakhoul and Bedford1986a ,Reference Dakhoul and Bedford b ). Optimally practical use such as the minimal- $\unicode[STIX]{x1D70F}$ approximation closure (Blackman & Field Reference Blackman and Field2002) still requires a wide spatial scale separation to ensure that temporally averaged quantities decouple from fluctuating ones.

2.5 On averages in simulations versus observations

Planar or box averages used in simulations yield reliable results if compared to a theory based on corresponding averages, and interpreted appropriately. However, it is a different question as to how well lessons learned from box averages apply to observations, for which a differently defined mean is more appropriate.

3 MFE dynamo equations with correction terms

In this section we re-derive mean field dynamo equations using the kernel formalism of local averaging introduced in § 2, keeping track of correction terms that result from (i) the non-vanishing means of fluctuations and (ii) the non-equality of double and single averages. With a finite scale separation, these correction terms can be expressed in terms of mean fields and their spatial derivatives. Here we keep only the lowest order correction terms, but higher-order terms can be derived by the same method.

3.1 Derivation

We average the MHD magnetic induction equation and use (2.21), which yields

(3.1)

$$\begin{eqnarray}\unicode[STIX]{x2202}_{t}\overline{\boldsymbol{B}}=\unicode[STIX]{x1D735}\times \overline{\boldsymbol{U}\times \boldsymbol{B}}+\unicode[STIX]{x1D708}_{m}\unicode[STIX]{x1D6FB}^{2}\overline{\boldsymbol{B}}=\unicode[STIX]{x1D735}\times [(1+O(\hat{\unicode[STIX]{x1D6FE}}^{2}))\overline{\boldsymbol{U}}\times \overline{\boldsymbol{B}}-\hat{\unicode[STIX]{x1D6FE}}^{\prime }(\overline{\boldsymbol{U}},\overline{\boldsymbol{B}})+\boldsymbol{{\mathcal{E}}}]+\unicode[STIX]{x1D708}_{m}\unicode[STIX]{x1D6FB}^{2}\overline{\boldsymbol{B}},\end{eqnarray}$$

where $\unicode[STIX]{x1D708}_{m}$ is the magnetic diffusivity assumed to be a constant, and $\boldsymbol{{\mathcal{E}}}=\overline{\boldsymbol{u}\times \boldsymbol{b}}$ is the turbulent EMF. The relative magnitude of the correction terms to the standard terms that we arrive at in this section are unchanged if $\overline{\boldsymbol{U}}$ is included.

To express $\boldsymbol{{\mathcal{E}}}$ in terms of large scale quantities we adopt the minimal- $\unicode[STIX]{x1D70F}$ approach (MTA Blackman & Field Reference Blackman and Field2002). In deriving $\boldsymbol{{\mathcal{E}}}$ , terms involving $\overline{\boldsymbol{U}}$ come proportional to scalar or pseudoscalar cross-correlations between functions of $\boldsymbol{u}$ and $\boldsymbol{b}$ (e.g. Yoshizawa & Yokoi Reference Yoshizawa and Yokoi1993; Blackman Reference Blackman2000) and for present purposes we ignore these, i.e. we ignore terms linear in $\overline{\boldsymbol{U}}$ or $\overline{\boldsymbol{u}}$ ( $\simeq \hat{\unicode[STIX]{x1D6FE}}\overline{\boldsymbol{U}}=0$ ) in the evolution equations for $\boldsymbol{u}$ and $\boldsymbol{b}$ (but not in that for $\overline{\boldsymbol{B}}$ ). The incompressible momentum equation for velocity fluctuations then reads

(3.2)

$$\begin{eqnarray}\displaystyle \unicode[STIX]{x2202}_{t}u_{l} & = & \displaystyle \hat{P}_{ml} [\overline{B}_{n}\unicode[STIX]{x2202}_{n}b_{m}+b_{n}\unicode[STIX]{x2202}_{n}\overline{B}_{m}+b_{n}\unicode[STIX]{x2202}_{n}b_{m}-\overline{b_{n}\unicode[STIX]{x2202}_{n}b_{m}}\nonumber\\ \displaystyle & & \displaystyle -\,u_{n}\unicode[STIX]{x2202}_{n}u_{m}+\overline{u_{n}\unicode[STIX]{x2202}_{n}u_{m}}+\hat{\unicode[STIX]{x1D6FE}}^{\prime }(\overline{B}_{n},\unicode[STIX]{x2202}_{n}\overline{B}_{m})]+\unicode[STIX]{x1D708}\unicode[STIX]{x1D6FB}^{2}u_{l},\end{eqnarray}$$

where $\hat{P}_{ml}=\unicode[STIX]{x1D6FF}_{ml}-\unicode[STIX]{x2202}_{m}\unicode[STIX]{x2202}_{l}\unicode[STIX]{x1D6FB}^{-2}$ is the projection operator used to eliminate the sum of thermal and magnetic pressures, and $\unicode[STIX]{x1D708}$ is the viscosity. The units are such that the mass density $\unicode[STIX]{x1D70C}_{f}=1$ and the magnetic permeability $\unicode[STIX]{x1D707}=1$ . The induction equation for $\boldsymbol{b}$ is

(3.3)

$$\begin{eqnarray}\unicode[STIX]{x2202}_{t}\boldsymbol{b}=\unicode[STIX]{x1D735}\times (\boldsymbol{u}\times \overline{\boldsymbol{B}}+\boldsymbol{u}\times \boldsymbol{b}-\overline{\boldsymbol{u}\times \boldsymbol{b}})+\unicode[STIX]{x1D708}_{m}\unicode[STIX]{x1D6FB}^{2}\boldsymbol{b}.\end{eqnarray}$$

Using (2.12), and carrying through the algebra keeping only first-order terms in $\hat{\unicode[STIX]{x1D6FE}}$ or $\hat{\unicode[STIX]{x1D6FE}}^{\prime }$ , we have

(3.4)

$$\begin{eqnarray}\displaystyle \{\overline{\boldsymbol{u}\times \unicode[STIX]{x2202}_{t}\boldsymbol{b}}\}_{i} & = & \displaystyle \unicode[STIX]{x1D716}_{ijk} [\hspace{2.39996pt}(1-\hat{\unicode[STIX]{x1D6FE}})(\overline{u_{j}\unicode[STIX]{x2202}_{n}u_{k}}\,\overline{B_{n}})-(1-\hat{\unicode[STIX]{x1D6FE}})(\overline{u_{j}u_{n}}\,\unicode[STIX]{x2202}_{n}\overline{B_{k}})\nonumber\\ \displaystyle & & \displaystyle +\,\overline{B_{n}}\hat{\unicode[STIX]{x1D6FE}}\overline{u_{j}\unicode[STIX]{x2202}_{n}u_{k}}-\unicode[STIX]{x2202}_{n}\overline{B_{k}}\hat{\unicode[STIX]{x1D6FE}}\overline{u_{j}u_{n}}+\overline{u_{j}b_{n}\unicode[STIX]{x2202}_{n}u_{k}}-\overline{u_{j}u_{n}\unicode[STIX]{x2202}_{n}b_{k}} ]+\unicode[STIX]{x1D708}_{m}\unicode[STIX]{x1D716}_{ijk}\overline{u_{j}\unicode[STIX]{x2202}_{nn}b_{k}},\qquad\end{eqnarray}$$

and

(3.5)

$$\begin{eqnarray}\displaystyle \{\overline{\unicode[STIX]{x2202}_{t}\boldsymbol{u}\times \boldsymbol{b}}\}_{i} & = & \displaystyle \unicode[STIX]{x1D716}_{ijk}\overline{b_{k}\hat{P}_{lj}(\overline{B}_{n}\unicode[STIX]{x2202}_{n}b_{l}+b_{n}\unicode[STIX]{x2202}_{n}\overline{B}_{l})}+\unicode[STIX]{x1D716}_{ijk}\overline{b_{k}\hat{P}_{lj}(b_{n}\unicode[STIX]{x2202}_{n}b_{l}-u_{n}\unicode[STIX]{x2202}_{n}u_{l})}\nonumber\\ \displaystyle & & \displaystyle +\,\unicode[STIX]{x1D716}_{ijk}[(1-\hat{\unicode[STIX]{x1D6FE}})(\overline{b}_{k}\overline{C}_{j})+\overline{C}_{j}\hat{\unicode[STIX]{x1D6FE}}\overline{b}_{k}]+\unicode[STIX]{x1D716}_{ijk}\overline{b_{k}\hat{P}_{lj}\hat{\unicode[STIX]{x1D6FE}}^{\prime }(\overline{B}_{n},\unicode[STIX]{x2202}_{n}\overline{B}_{l})}+\unicode[STIX]{x1D708}\unicode[STIX]{x1D716}_{ijk}\overline{b_{k}\unicode[STIX]{x2202}_{nn}u_{j}},\qquad\end{eqnarray}$$

where $C_{j}=\hat{P}_{lj}(u_{n}\unicode[STIX]{x2202}_{n}u_{l}-b_{n}\unicode[STIX]{x2202}_{n}b_{l})$ . Assuming all small scale quantities are isotropic and homogeneous below scale $l$ but could vary on large scales ( ${\sim}l_{L}$ ), the two previous equations become

(3.6)

$$\begin{eqnarray}\displaystyle \overline{\boldsymbol{u}\times \unicode[STIX]{x2202}_{t}\boldsymbol{b}} & = & \displaystyle (1-\hat{\unicode[STIX]{x1D6FE}})\big(-{\textstyle \frac{1}{3}}\overline{\boldsymbol{u}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\times \boldsymbol{u}}\,\overline{\boldsymbol{B}}+{\textstyle \frac{1}{3}}\overline{u^{2}}\unicode[STIX]{x1D735}\times \overline{\boldsymbol{B}}\big)+\unicode[STIX]{x1D708}_{m}\overline{\boldsymbol{u}\times \unicode[STIX]{x1D6FB}^{2}\boldsymbol{b}}+\boldsymbol{T}^{M}\nonumber\\ \displaystyle & & \displaystyle -\,\hat{\unicode[STIX]{x1D6FE}}\left({\textstyle \frac{1}{3}}\overline{\boldsymbol{u}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\times \boldsymbol{u}}\right)\overline{\boldsymbol{B}}+\hat{\unicode[STIX]{x1D6FE}}\big({\textstyle \frac{1}{3}}\overline{u^{2}}\big)\unicode[STIX]{x1D735}\times \overline{\boldsymbol{B}},\end{eqnarray}$$

where $\boldsymbol{T}^{M}=\overline{\boldsymbol{u}\times \unicode[STIX]{x1D735}\times (\boldsymbol{u}\times \boldsymbol{b}-\overline{\boldsymbol{u}\times \boldsymbol{b}})}$ , and

(3.7)

$$\begin{eqnarray}\overline{\unicode[STIX]{x2202}_{t}\boldsymbol{u}\times \boldsymbol{b}}=(1-\hat{\unicode[STIX]{x1D6FE}})\left({\textstyle \frac{1}{3}}\overline{\boldsymbol{b}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\times \boldsymbol{b}}\,\overline{\boldsymbol{B}}\right)+\hat{\unicode[STIX]{x1D6FE}}\left({\textstyle \frac{1}{3}}\overline{\boldsymbol{b}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\times \boldsymbol{b}}\right)\overline{\boldsymbol{B}}+\unicode[STIX]{x1D708}\overline{\unicode[STIX]{x1D6FB}^{2}\boldsymbol{u}\times \boldsymbol{b}}+\boldsymbol{T}^{U},\end{eqnarray}$$

where $T_{i}^{U}=\unicode[STIX]{x1D716}_{ijk}\overline{b_{k}\hat{P}_{lj}(b_{n}\unicode[STIX]{x2202}_{n}b_{l}-u_{n}\unicode[STIX]{x2202}_{n}u_{l})}$ . The derivation of the first and second terms in (3.7) is given in appendix B. Also note that the small scale part in the $\hat{\unicode[STIX]{x1D6FE}}^{\prime }(\overline{B}_{n},\unicode[STIX]{x2202}_{n}\overline{B}_{l})$ term will have its maximum wavenumber at ${\sim}2k_{L}$ . Therefore if the scale separation is large enough such that $2k_{L}\ll k_{s}$ , the $\hat{\unicode[STIX]{x1D6FE}}^{\prime }(\overline{B}_{n},\unicode[STIX]{x2202}_{n}\overline{B}_{l})$ term can be roughly treated as a large scale quantity in (3.5).

Adding (3.6) and (3.7) gives

(3.8)

$$\begin{eqnarray}\displaystyle \unicode[STIX]{x2202}_{t}\boldsymbol{{\mathcal{E}}} & = & \displaystyle (1-\hat{\unicode[STIX]{x1D6FE}})(\tilde{\unicode[STIX]{x1D6FC}}\overline{\boldsymbol{B}}-\tilde{\unicode[STIX]{x1D6FD}}\unicode[STIX]{x1D735}\times \overline{\boldsymbol{B}})+(\hat{\unicode[STIX]{x1D6FE}}\tilde{\unicode[STIX]{x1D6FC}})\overline{\boldsymbol{B}}-(\hat{\unicode[STIX]{x1D6FE}}\tilde{\unicode[STIX]{x1D6FD}})\unicode[STIX]{x1D735}\times \overline{\boldsymbol{B}}\nonumber\\ \displaystyle & & \displaystyle +\,\unicode[STIX]{x1D708}_{m}\overline{\boldsymbol{u}\times \unicode[STIX]{x1D6FB}^{2}\boldsymbol{b}}+\unicode[STIX]{x1D708}\overline{\unicode[STIX]{x1D6FB}^{2}\boldsymbol{u}\times \boldsymbol{b}}+\boldsymbol{T}^{M}+\boldsymbol{T}^{U},\end{eqnarray}$$

where $\tilde{\unicode[STIX]{x1D6FC}}=(\overline{\boldsymbol{b}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\times \boldsymbol{b}}-\overline{\boldsymbol{u}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\times \boldsymbol{u}})/3$ and $\tilde{\unicode[STIX]{x1D6FD}}=\overline{u^{2}}/3$ . In the spirit of the MTA, the sum of the triple correlation terms in (3.8) is equated to a damping term $-\boldsymbol{{\mathcal{E}}}/\unicode[STIX]{x1D70F}$ . For $|\unicode[STIX]{x1D70F}\unicode[STIX]{x2202}_{t}\boldsymbol{{\mathcal{E}}}|\ll |\boldsymbol{{\mathcal{E}}}|$ , equation (3.8) then gives

(3.9)

$$\begin{eqnarray}\boldsymbol{{\mathcal{E}}}=(1-\hat{\unicode[STIX]{x1D6FE}})(\unicode[STIX]{x1D6FC}\overline{\boldsymbol{B}}-\unicode[STIX]{x1D6FD}\unicode[STIX]{x1D735}\times \overline{\boldsymbol{B}})+(\hat{\unicode[STIX]{x1D6FE}}\unicode[STIX]{x1D6FC})\overline{\boldsymbol{B}}-(\hat{\unicode[STIX]{x1D6FE}}\unicode[STIX]{x1D6FD})\unicode[STIX]{x1D735}\times \overline{\boldsymbol{B}}\end{eqnarray}$$

in the ideal MHD limit $\unicode[STIX]{x1D708},\unicode[STIX]{x1D708}_{m}\rightarrow 0$ , where $\unicode[STIX]{x1D6FC}=\unicode[STIX]{x1D70F}\tilde{\unicode[STIX]{x1D6FC}}$ and $\unicode[STIX]{x1D6FD}=\unicode[STIX]{x1D70F}\tilde{\unicode[STIX]{x1D6FD}}$ are the helical and diffusion dynamo coefficients and $\unicode[STIX]{x1D70F}$ is the damping time for the EMF when mean fields are removed. Empirically, this is approximately equal to the turnover time at the turbulent driving scales in forced isotropic simulations (Brandenburg & Subramanian Reference Brandenburg and Subramanian2005b ). We also define $\unicode[STIX]{x1D6FC}_{k}=-\unicode[STIX]{x1D70F}\overline{\boldsymbol{u}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\times \boldsymbol{u}}/3$ and $\unicode[STIX]{x1D6FC}_{m}=\unicode[STIX]{x1D70F}\overline{\boldsymbol{b}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\times \boldsymbol{b}}/3$ being the kinetic and magnetic contributions to the $\unicode[STIX]{x1D6FC}$ -effect, respectively.

When there is large scale separation $\hat{\unicode[STIX]{x1D6FE}},\hat{\unicode[STIX]{x1D6FE}}^{\prime }\rightarrow 0$ , equations (3.1) and (3.9) reduce exactly to the standard dynamo equations derived with ensemble average. This important feature indicates that different kinds of suitable averaging – like local Gaussian average or ensemble average – converge to the same set of equations when scale separation is large.

The turbulent EMF now has routes of expansion: (i) higher gradients of $\overline{\boldsymbol{B}}$ ; (ii) $\hat{\unicode[STIX]{x1D6FE}}$ due to the violation of Reynolds rules. Expanding to every higher-order results in (using order-of-magnitude estimates) an extra factor of $l_{s}/l_{L}$ for the former, and $(l/l_{L})^{c}$ for the later. Interestingly, both of these two ratios are related to the scale separation, and the question of which dominates higher-order terms in $\boldsymbol{{\mathcal{E}}}$ varies for different models. In this work we assume the $\hat{\unicode[STIX]{x1D6FE}}$ corrections dominate.

3.2 Comparison to previous work on non-local EMF kernels

We see from (3.9) that the violation of the Reynolds rules from mesoscale fluctuations is a direct source of contributions to the EMF from terms with higher than linear order in derivatives of $\overline{\boldsymbol{B}}$ . In Fourier space these terms imply that ${\displaystyle\mathop{{\mathcal{E}}}\limits_{{\sim}}}_{i}(\boldsymbol{k})={\displaystyle\mathop{K}\limits_{{\sim}}}_{ij}(\boldsymbol{k}){\displaystyle\mathop{\{}\limits_{{\sim}}}_{j}(\boldsymbol{k})$ where ${\displaystyle\mathop{K}\limits_{{\sim}}}_{ij}$ could contain terms of order higher than linear in $\boldsymbol{k}$ , in contrast to the conventional mean field dynamo theory where ${\displaystyle\mathop{K}\limits_{{\sim}}}_{ij}=\unicode[STIX]{x1D6FC}\unicode[STIX]{x1D6FF}_{ij}-i\unicode[STIX]{x1D716}_{imj}\unicode[STIX]{x1D6FD}k_{m}$ .

Consequently, in configuration space we have $\boldsymbol{{\mathcal{E}}}(\boldsymbol{x})=\mathbf{K}\ast \overline{\boldsymbol{B}}$ , and the turbulent EMF depends on $\overline{\boldsymbol{B}}$ through its weighted average in the vicinity of $\boldsymbol{x}$ , i.e. non-locally. More generally, if we have used a time average in (3.9), $\boldsymbol{K}$ could also be time dependent, and correspondingly $\boldsymbol{{\mathcal{E}}}$ becomes non-local in both space and time.

The EMF kernel $\boldsymbol{K}$ that we derive includes terms caused by violation of Reynolds rules, and varies depending on the choice of our (potentially anisotropic) averaging kernel $\boldsymbol{G}$ . Although previous work has identified the need for an EMF kernel to capture non-locality (Krause & Rädler Reference Krause and Rädler1980; Rädler Reference Rädler, Page and Hirsch2000; Rädler & Rheinhardt Reference Rädler and Rheinhardt2007; Brandenburg, Rädler & Schrinner Reference Brandenburg, Rädler and Schrinner2008; Hubbard & Brandenburg Reference Hubbard and Brandenburg2009; Rheinhardt & Brandenburg Reference Rheinhardt and Brandenburg2012), this previous work did not address the contribution to this kernel from the violation of the Reynolds rules. For example, Rheinhardt & Brandenburg (Reference Rheinhardt and Brandenburg2012) used numerical simulations (DNS) to test an ansatz for the EMF kernel in the case of homogeneous isotropic turbulence with mean fields defined by an average over the $x{-}y$ plane. They assessed whether the EMF kernel takes the form

(3.10)

$$\begin{eqnarray}{\displaystyle\mathop{K}\limits_{{\sim}}}_{ij}(\unicode[STIX]{x1D714},\boldsymbol{k})=\frac{\unicode[STIX]{x1D6FC}\unicode[STIX]{x1D6FF}_{ij}-\text{i}\unicode[STIX]{x1D716}_{imj}\unicode[STIX]{x1D6FD}k_{m}}{1-\text{i}\unicode[STIX]{x1D714}\unicode[STIX]{x1D70F}_{\text{RB}}+l_{\text{RB}}^{2}k^{2}}\end{eqnarray}$$

at low wavenumbers, where $\unicode[STIX]{x1D70F}_{\text{RB}}$ is approximately equal to the eddy turnover time $\unicode[STIX]{x1D70F}$ , and $l_{\text{RB}}$ is a parameter whose value is to be extracted from fits to simulation data. Their resulting evolution equation for the EMF reads

(3.11)

$$\begin{eqnarray}(1+\unicode[STIX]{x1D70F}_{\text{RB}}\unicode[STIX]{x2202}_{t}-l_{\text{RB}}^{2}\unicode[STIX]{x2202}_{z}^{2})\boldsymbol{{\mathcal{E}}}=\unicode[STIX]{x1D6FC}\overline{\boldsymbol{B}}^{(p)}-\unicode[STIX]{x1D6FD}\unicode[STIX]{x1D735}\times \overline{\boldsymbol{B}}^{(p)},\end{eqnarray}$$

where the superscript ‘ $(p)$ ’ distinguishes their planar average from our kernel averages. Rheinhardt & Brandenburg (Reference Rheinhardt and Brandenburg2012) found that (3.10) was at least consistent with simulation data up to $k/k_{1}\approx 3$ where $k_{1}$ is the simulation box wavenumber, for $\unicode[STIX]{x1D70F}_{\text{RB}}\sim \unicode[STIX]{x1D70F}$ and $l_{\text{RB}}\sim l_{s}$ , the energy-dominating eddy scale.

To compare (3.11) with our result equation (3.8), we ignore the second to fifth terms on the right-hand side of the latter (i.e. assuming $\unicode[STIX]{x1D6FC}$ and $\unicode[STIX]{x1D6FD}$ are constants and taking the ideal MHD limit), and identify the sixth and seventh terms (triple correlations) with $\hat{T}\boldsymbol{{\mathcal{E}}}$ where $\hat{T}$ is an operator. This gives

(3.12)

$$\begin{eqnarray}(-\unicode[STIX]{x1D70F}\hat{T}+\unicode[STIX]{x1D70F}\unicode[STIX]{x2202}_{t})\boldsymbol{{\mathcal{E}}}=(1-\hat{\unicode[STIX]{x1D6FE}})(\unicode[STIX]{x1D6FC}\overline{\boldsymbol{B}}-\unicode[STIX]{x1D6FD}\unicode[STIX]{x1D735}\times \overline{\boldsymbol{B}}).\end{eqnarray}$$

The identification of the triple correlations with a damping term, $\hat{T}=-1/\unicode[STIX]{x1D70F}$ , serves as the closure in MTA. Comparison to (3.11) shows that the left sides of the two equations can be made to mutually correspond if we replace the triple correlations by the sum of a damping term and a diffusion term, that is $\hat{T}\boldsymbol{{\mathcal{E}}}=-\boldsymbol{{\mathcal{E}}}/\unicode[STIX]{x1D70F}+\unicode[STIX]{x1D702}_{\text{t.c.}}\unicode[STIX]{x1D6FB}^{2}\boldsymbol{{\mathcal{E}}}$ , where $\unicode[STIX]{x1D702}_{\text{t.c.}}=l_{\text{RB}}^{2}/\unicode[STIX]{x1D70F}$ is a diffusion coefficient determined by statistical properties of turbulent fields. The spatially non-local term in (3.11), $-l_{\text{RB}}^{2}\unicode[STIX]{x2202}_{z}^{2}\boldsymbol{{\mathcal{E}}}$ , can thus be understood as textured specification of the form of terms for which the crude MTA approximates. This additional term plays a similar role to that of the standard MTA term, namely that it depletes the turbulent EMF in the absence of any other mean fields.

We emphasize that the derivation of (3.12) differs from that of (3.11) in that the correction terms appearing on the right of (3.12) are derived from the averaging procedure itself, and represent the lowest-order corrections when Reynolds rules are violated. Higher-order terms can also be derived. The form of $\hat{\unicode[STIX]{x1D6FE}}$ is determined by the scale $l$ and the kernel of average. These terms are not included in the semi-empirical approach of Rheinhardt & Brandenburg (Reference Rheinhardt and Brandenburg2012) that produced (3.11), because they vanish identically due to the planar average.

Finally, we note that in deriving (3.12), we averaged the MHD equations using a kernel that retains a spatial dependence, so that $\overline{\boldsymbol{B}}$ can depict large scale magnetic fields and retain large scale gradients in all directions. In contrast, the $x{-}y$ planar average used in Rheinhardt & Brandenburg (Reference Rheinhardt and Brandenburg2012) does not retain large scale field gradients in $x$ and $y$ directions, which is self-consistent for the simulation boxes but not sufficiently general for investigating mean fields. In addition planar averages do not remove large $k_{z}$ modes from $\overline{\boldsymbol{B}}^{(p)}$ , and hence (3.11) might not be complete even for the simulations in the absence of including higher-order terms since the EMF kernel (3.10) is valid only for small $|\boldsymbol{k}|$ .

4 Precision of mean field theories

The precision error of a mean field theory (MFT) can be classified into two types: (i) intrinsic error (IE) $\unicode[STIX]{x1D70E}_{\text{IE}}^{2}$ from the theory itself and (ii) filtering error (FE) $\unicode[STIX]{x1D70E}_{\text{FE}}^{2}$ associated with comparing the mean field theory values filtered through a measuring kernel (thus double filtered) with the total field filtered through the measuring kernel. Both of these depend on the scale of average $l$ . We now derive these in full.

4.1 Intrinsic error from statistical fluctuations in inputs to mean field equations

The dynamo input parameters in the mean field equations (e.g. $\unicode[STIX]{x1D6FC}$ and $\unicode[STIX]{x1D6FD}$ in (3.9) along with boundary and initial conditions) are themselves random variables (in an ensemble) and so is $\overline{\boldsymbol{B}}=\overline{\boldsymbol{B}}(\boldsymbol{x};\unicode[STIX]{x1D6FC},\unicode[STIX]{x1D6FD},\ldots )$ , because the small scale fields $\boldsymbol{u}$ and $\boldsymbol{b}$ are statistically fluctuating. The intrinsic error is thus defined as the variation of statistical fluctuations of $\overline{\boldsymbol{B}}$ (about its ensemble mean) due to these small scale fluctuations, which we denote by $\unicode[STIX]{x1D70E}_{\text{IE}}^{2}$ :

(4.1)

$$\begin{eqnarray}\unicode[STIX]{x1D70E}_{\text{IE},B_{i}}^{2}=\langle (\overline{B}_{i}-\langle \overline{B}_{i}\rangle )^{2}\rangle ,\quad \text{for}~i=1,2,3.\end{eqnarray}$$

With this definition the IE vanishes if the mean field theory is defined using ensemble average, i.e. $\unicode[STIX]{x1D70E}_{\text{IE},B_{i}}^{2}=\langle (\langle B_{i}\rangle -\langle \langle B_{i}\rangle \rangle )^{2}\rangle =0$ .

The IE can be calculated by propagating the statistical variations of input parameters to the solutions of mean field equations. We consider the IE of the steady-state solutions of MFE dynamo equations for a minimalist model where $\unicode[STIX]{x1D6FC}_{k}$ and $\unicode[STIX]{x1D6FD}$ are the only input parameters: $\overline{\boldsymbol{B}}=\overline{\boldsymbol{B}}(\boldsymbol{x};\unicode[STIX]{x1D6FC}_{k},\unicode[STIX]{x1D6FD})$ . The magnetic $\unicode[STIX]{x1D6FC}$ -effect, $\unicode[STIX]{x1D6FC}_{m}$ , is dynamical in our model, and not an input parameter, since it is governed by the transport equation of the helicity density, equation (6.2).

Let us consider a minimalist model where all turbulent transport coefficients are statistically homogeneous over the whole space. In one such dynamo model that we discuss later, turbulent transport coefficients depend on the radial coordinate, but since its variation scale is greater than $l$ , they remain locally approximately homogeneous.

The deviation of kernel-filtered values from the ensemble averages of turbulent coefficients contributes to the IE of the mean fields. The resulting average (in the sense of an ensemble average) imprecision in $\overline{\boldsymbol{B}}$ can be calculated by propagating the imprecision to turbulent coefficients. For $\unicode[STIX]{x1D6FC}_{k}$ , this is

(4.2)

$$\begin{eqnarray}\unicode[STIX]{x1D70E}_{\unicode[STIX]{x1D6FC}_{k}}^{2}=\langle (\unicode[STIX]{x1D6FC}_{k}-\langle \unicode[STIX]{x1D6FC}_{k}\rangle )^{2}\rangle .\end{eqnarray}$$

Similarly, we have

(4.3)

$$\begin{eqnarray}\unicode[STIX]{x1D70E}_{\unicode[STIX]{x1D6FD}}^{2}=\langle (\unicode[STIX]{x1D6FD}-\langle \unicode[STIX]{x1D6FD}\rangle )^{2}\rangle ,\end{eqnarray}$$

and

(4.4)

$$\begin{eqnarray}\unicode[STIX]{x1D70E}_{\unicode[STIX]{x1D6FC}_{k}\unicode[STIX]{x1D6FD}}^{2}=\langle (\unicode[STIX]{x1D6FC}_{k}-\langle \unicode[STIX]{x1D6FC}_{k}\rangle )(\unicode[STIX]{x1D6FD}-\langle \unicode[STIX]{x1D6FD}\rangle )\rangle .\end{eqnarray}$$

The uncertainty in $\overline{\boldsymbol{B}}$ derives from the uncertainties from $\unicode[STIX]{x1D6FC}_{k}$ and $\unicode[STIX]{x1D6FD}$ as follows:

(4.5)

$$\begin{eqnarray}\unicode[STIX]{x1D70E}_{\text{IE},\overline{B}_{i}}^{2}=(\unicode[STIX]{x2202}_{\unicode[STIX]{x1D6FC}_{k}}\overline{B}_{i})^{2}\unicode[STIX]{x1D70E}_{\unicode[STIX]{x1D6FC}_{k}}^{2}+(\unicode[STIX]{x2202}_{\unicode[STIX]{x1D6FD}}\overline{B}_{i})^{2}\unicode[STIX]{x1D70E}_{\unicode[STIX]{x1D6FD}}^{2}+2(\unicode[STIX]{x2202}_{\unicode[STIX]{x1D6FC}_{k}}\overline{B}_{i})(\unicode[STIX]{x2202}_{\unicode[STIX]{x1D6FD}}\overline{B}_{i})\unicode[STIX]{x1D70E}_{\unicode[STIX]{x1D6FC}_{k}\unicode[STIX]{x1D6FD}}.\end{eqnarray}$$

To estimate magnitudes of (4.2) to (4.4), we decompose filtered quantities into (ensemble averaged) means and random parts. Consequently we have

(4.6)

$$\begin{eqnarray}\unicode[STIX]{x1D6FC}_{k,r}=\unicode[STIX]{x1D6FC}_{k}-\langle \unicode[STIX]{x1D6FC}_{k}\rangle =\frac{\unicode[STIX]{x1D70F}}{3}\overline{\boldsymbol{u}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\times \boldsymbol{u}-\langle \boldsymbol{u}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\times \boldsymbol{u}\rangle },\end{eqnarray}$$

where $\unicode[STIX]{x1D6FC}_{k,r}$ is the random part. Combining (4.2) and (4.6) we have

(4.7)

$$\begin{eqnarray}\unicode[STIX]{x1D70E}_{\unicode[STIX]{x1D6FC}_{k}}^{2}=\langle \unicode[STIX]{x1D6FC}_{k,r}^{2}\rangle .\end{eqnarray}$$

Similarly,

(4.8)

$$\begin{eqnarray}\unicode[STIX]{x1D70E}_{\unicode[STIX]{x1D6FD}}^{2}=\langle \unicode[STIX]{x1D6FD}_{r}^{2}\rangle ,\end{eqnarray}$$

where

(4.9)

$$\begin{eqnarray}\unicode[STIX]{x1D6FD}_{r}=\unicode[STIX]{x1D6FD}-\langle \unicode[STIX]{x1D6FD}\rangle =\frac{\unicode[STIX]{x1D70F}}{3}\overline{u^{2}-\langle u^{2}\rangle }\end{eqnarray}$$

is the random part of $\unicode[STIX]{x1D6FD}$ .

To estimate the quantities in (4.7) and (4.8), we consider the system of study to be divided into cells of typical length $l_{s}$ and crudely assume that in each cell, $\boldsymbol{u}$ is nearly uniform with components drawn from independent Gaussian distributions,

(4.10)

$$\begin{eqnarray}f(u_{i})=\frac{1}{\sqrt{2\unicode[STIX]{x03C0}}u_{0}}\text{e}^{-u_{i}^{2}/2u_{0}^{2}},\quad i=1,2,3.\end{eqnarray}$$

Then for each cell,

(4.11a-c )

$$\begin{eqnarray}\langle u_{i}\rangle =0,\quad \langle u_{i}^{2}\rangle =u_{0}^{2},\quad \langle u_{i}^{4}\rangle =3u_{0}^{4},\end{eqnarray}$$

and

(4.12a,b )

$$\begin{eqnarray}\langle u^{2}\rangle =\mathop{\sum }_{i=1}^{3}\langle u_{i}^{2}\rangle =3u_{0}^{2},\quad \langle u^{4}\rangle =\left\langle \left(\mathop{\sum }_{i=1}^{3}u_{i}^{2}\right)^{2}\right\rangle =15u_{0}^{4}=\frac{5}{3}\langle u^{2}\rangle ^{2},\end{eqnarray}$$

so that

(4.13)

$$\begin{eqnarray}\unicode[STIX]{x1D70E}_{u^{2}}^{2}=\langle u^{4}\rangle -\langle u^{2}\rangle ^{2}={\textstyle \frac{2}{3}}\langle u^{2}\rangle ^{2},\end{eqnarray}$$

which links fluctuations to mean quantities.

The filtering $\overline{(\cdot )}$ in (4.7) and (4.8) can be roughly seen as the algebraic average of the quantity $(\cdot )$ of $N=(2l/l_{s})^{3}$ cells, with the factor of two accounting for the fact that the variation scale of $u^{2}$ will be $l_{s}/2$ if that of $\boldsymbol{u}$ is $l_{s}$ . The central limit theorem (CLT) then yields

(4.14a,b )

$$\begin{eqnarray}\unicode[STIX]{x1D70E}_{\unicode[STIX]{x1D6FC}_{k}}^{2}\simeq \frac{\langle \unicode[STIX]{x1D6FC}_{k,r}^{2}\rangle }{N}=\frac{\unicode[STIX]{x1D70E}_{\unicode[STIX]{x1D6FC}_{k,r}}^{2}}{N},\quad \unicode[STIX]{x1D70E}_{\unicode[STIX]{x1D6FD}}^{2}\simeq \frac{\langle \unicode[STIX]{x1D6FD}_{r}^{2}\rangle }{N}=\frac{\unicode[STIX]{x1D70E}_{\unicode[STIX]{x1D6FD}_{r}}^{2}}{N},\end{eqnarray}$$

where $\unicode[STIX]{x1D70E}_{\unicode[STIX]{x1D6FC}_{k,r}}^{2}$ and $\unicode[STIX]{x1D70E}_{\unicode[STIX]{x1D6FD}_{r}}^{2}$ are the variances of the random parts in each cell.

Since both $\unicode[STIX]{x1D6FC}_{k}$ and $\unicode[STIX]{x1D6FD}$ are quadratic in $\boldsymbol{u}$ , equations (4.13) and (4.14) then yield

(4.15a,b )

$$\begin{eqnarray}\unicode[STIX]{x1D70E}_{\unicode[STIX]{x1D6FC}_{k}}^{2}\simeq \frac{2\unicode[STIX]{x1D6FC}_{k}^{2}/3}{(2l/l_{s})^{3}},\quad \unicode[STIX]{x1D70E}_{\unicode[STIX]{x1D6FD}}^{2}\simeq \frac{2\unicode[STIX]{x1D6FD}^{2}/3}{(2l/l_{s})^{3}}.\end{eqnarray}$$

It then follows from (4.5) that

(4.16)

$$\begin{eqnarray}\unicode[STIX]{x1D70E}_{\text{IE},\overline{B}_{i}}^{2}\simeq \frac{1}{12(l/l_{s})^{3}}[(\unicode[STIX]{x2202}_{\unicode[STIX]{x1D6FC}_{k}}\overline{B}_{i})^{2}\unicode[STIX]{x1D6FC}_{k}^{2}+(\unicode[STIX]{x2202}_{\unicode[STIX]{x1D6FD}}\overline{B}_{i})^{2}\unicode[STIX]{x1D6FD}^{2}]+2(\unicode[STIX]{x2202}_{\unicode[STIX]{x1D6FC}_{k}}\overline{B}_{i})(\unicode[STIX]{x2202}_{\unicode[STIX]{x1D6FD}}\overline{B}_{i})\unicode[STIX]{x1D70E}_{\unicode[STIX]{x1D6FC}_{k}\unicode[STIX]{x1D6FD}}.\end{eqnarray}$$

Note that $\unicode[STIX]{x1D70E}_{\text{IE},\overline{B}_{i}}^{2}$ depends on spatial coordinates $\boldsymbol{x}$ , just as $\overline{\boldsymbol{B}}$ does. In galaxies, a typical value of the variation scale of turbulent fields satisfies $l_{s}\lesssim 0.1~\text{kpc}$ . Hence $(l/l_{s})^{3}\gtrsim 8$ for $l=0.2~\text{kpc}$ and ${\gtrsim}64$ for $l=0.4~\text{kpc}$ .

From the CLT, this IE decreases with increasing $l$ because the average variations from turbulence are inversely proportional to the number of eddy cells in the region being averaged, $(l/l_{s})^{3}$ , provided that $l_{s}$ is rather insensitive to the choice of $l$ .

4.2 Filtering error from mismatch between measurement and theoretical kernels

Measuring physical quantities always results in measuring mean quantities to a certain extent. Detectors have limited sensitivity so measurements represent a convolution between true physical quantities and an instrument kernel. Furthermore, the physical quantity being measured typically involves a superposition of microphysical contributions and an average over many local macroscopic contributions. In particular, the predicted values of observed FR and synchrotron polarization are limited in precision when these predictions are made using MFE.

For a given physical quantity of a real system $Q^{A}$ (e.g. the actual magnetic field of a galaxy) we define the measured value as $(Q^{A})_{{\mathcal{M}}}$ , where the subscript ${\mathcal{M}}$ indicates that quantity subjected to a measuring kernel that the instrument uses to project out the actual measured value. Complementarily, we write $\overline{Q^{A}}$ to indicate the value of $Q^{A}$ subjected to a theoretically chosen mean field theory filter. We will assume these two filters commute, i.e. $\overline{(Q^{A})_{{\mathcal{M}}}}=(\overline{Q^{A}})_{{\mathcal{M}}}$ . We use $Q$ to indicate a theoretically predicted value of $Q^{A}$ . Like $Q^{A}$ we can subject $Q$ mathematically to a theoretical mean field filtering and obtain $\overline{Q}$ or to measurement filtering to obtain $(Q)_{{\mathcal{M}}}$ , or both $(\overline{Q})_{{\mathcal{M}}}$ ( $=\overline{(Q)_{{\mathcal{M}}}}$ by assumption). For the common practice in which observations are not subjected to the theoretical mean filtering but the theory is subjected to the instrument filtering, the difference the measured value and the theoretically predicted mean can be written

(4.17)

$$\begin{eqnarray}\displaystyle (Q^{A})_{{\mathcal{M}}}-(\overline{Q})_{{\mathcal{M}}} & = & \displaystyle [\overline{(Q^{A})}_{{\mathcal{M}}}-(\overline{Q})_{{\mathcal{M}}}+(q^{A})_{{\mathcal{M}}}-(q)_{{\mathcal{M}}}]+(q)_{{\mathcal{M}}}\nonumber\\ \displaystyle & = & \displaystyle [(\overline{Q^{A}}-\overline{Q})_{{\mathcal{M}}}+(q^{A}-q)_{{\mathcal{M}}}]+(q)_{{\mathcal{M}}},\end{eqnarray}$$

where $(q^{A})_{{\mathcal{M}}}=(Q^{A})_{{\mathcal{M}}}-\overline{(Q^{A})}_{{\mathcal{M}}}$ is the difference between the actual quantity and its value using the theoretical mean filter, then both filtered through the measuring kernel. Analogously, $(q)_{{\mathcal{M}}}=(Q)_{{\mathcal{M}}}-(\overline{Q})_{{\mathcal{M}}}$ is the difference between the theoretical quantity and its theoretical mean filter then both filtered through the measuring kernel. The terms in the square brackets on the right of (4.17), measure accuracy of the theoretical model and these terms will be small if the theory provides a good match to the real system. We focus on $(q)_{{\mathcal{M}}}$ , the last term in (4.17), which is a precision error of the theory and the FE that we will quantify. The smaller its magnitude, the more precise the theory.

In principle, one would like to subject $(Q^{A})_{{\mathcal{M}}}$ to the same filtering which corresponds to that of the mean field model, that is, compute $\overline{(Q^{A})_{{\mathcal{M}}}}$ , and compare it to $(\overline{Q})_{{\mathcal{M}}}$ . This would obviate computation of the FE. For simulations this may be possible, but for observations one cannot always compute $\overline{(Q^{A})_{{\mathcal{M}}}}$ , due to limited resolution. Moreover, it is typically not done in practice, and cannot be done if $\overline{(\cdot )}$ represents the ensemble average and the system has finite scale separation. If both $\overline{(\cdot )}$ and $(\cdot )_{{\mathcal{M}}}$ averages were equivalent to ensemble averages due to infinite scale separation, then $(q)_{{\mathcal{M}}}=\langle Q-\langle Q\rangle \rangle \rightarrow 0$ ; but this is not the case with finite scale separation and local spatial averages.

Unlike the IE of the previous subsection, the FE increases with increasing $l$ , since smaller $l$ means including a greater fraction of modes into what comprises the mean field, and so the theoretical predictions from mean field theory would be less coarse grained and thus more capable of characterizing the actual field. If the presumption is made that IE and FE are statistically independent and uncorrelated for a given $l$ , then the total uncertainty of the mean field theory is given by $\unicode[STIX]{x1D70E}^{2}=\unicode[STIX]{x1D70E}_{\text{IE}}^{2}+\unicode[STIX]{x1D70E}_{\text{FE}}^{2}$ . Due to their competitive behaviours when changing $l$ , an optimal scale of average $l_{\text{opt}}$ which minimizes either $\unicode[STIX]{x1D70E}^{2}$ or the relative uncertainty $\unicode[STIX]{x1D70E}^{2}/\overline{B}^{2}$ can arise, satisfying $l_{s}<l_{\text{opt}}<l_{L}$ .

In the next section we combine all of the formalism of this section into a specific example. We discuss implications of a finite precision when comparing observations and MFE theory for measuring galactic fields by FR from extragalactic sources. Our formalism is not restricted to that particular example and the precision of theoretical predictions for other kinds of observations, such as pulsar FRs, or polarized synchrotron emission, can also similarly be worked out.

5 MFE precision error in the context of FR

FR is commonly used to measure strengths and directions of magnetic fields in galaxies. The rotation measure (RM), i.e. the rotation of the polarization plane of light from a distant pulsar or extragalactic radio source is given by Ruzmaikin et al. (Reference Ruzmaikin, Sokoloff and Shukurov1988) to be

(5.1)

$$\begin{eqnarray}\text{RM}=0.81\int \text{d}\boldsymbol{s}\boldsymbol{\cdot }\boldsymbol{B}n_{e}~(\text{rad m}^{-2})\propto \int \text{d}\boldsymbol{s}\boldsymbol{\cdot }\boldsymbol{B},\end{eqnarray}$$

where the integrals are along the line of sight, and the proportionality is valid when the thermal electron density $n_{e}$ varies on scales larger than those of $\boldsymbol{B}$ .

Here we focus on the RMs through a galaxy other than the Milky Way from extragalactic sources, and leave the discussion of pulsar RMs in the Milky way for § 6.5. We also omit any influence of weak intergalactic magnetic fields. The relevant segment of integration is then the segment of each line of sight $L(R,r,h)$ inside the galaxy (see figure 1 with an (a) edge-on view, (b) face-on view, and (c) inclined view), where $L$ is a function of the galactic radius $R$ , the distance $r$ from the line of sight to the galactic centre, and the semi-thickness of the galactic disk $h$ .

Figure 1. Schematic diagrams of line-of-sight averages for calculating the precision of RM in an (a) edge-on view, (b) face-on view, (c) inclined view and (d) inside the galaxy with $R$ being the galactic radius, $L$ the chord length along the line of sight and $h$ the semi-thickness of the galactic disk. $\hat{\unicode[STIX]{x1D746}}$ is the radial direction of the disk.

In what follows, we use the subscript $L$ Footnote ³ for a constant thermal electron density FR-like average along path $L$ . For a given vector field $\boldsymbol{Q}$ and scalar field $f$ this line of sight average gives

(5.2a,b )

$$\begin{eqnarray}(Q)_{L}=\frac{1}{L}\int \text{d}\boldsymbol{s}\boldsymbol{\cdot }\boldsymbol{Q}\quad \text{and}\quad (f)_{L}=\frac{1}{L}\int \text{d}s\,f.\end{eqnarray}$$

For FR measurements the line-of-sight average $(\cdot )_{L}$ will thus correspond to $(\cdot )_{{\mathcal{M}}}$ mentioned above. We denote the theoretical prediction of the line-of-sight mean field from MFE as $(\overline{B})_{L}$ . While $\unicode[STIX]{x1D70E}_{\text{IE}}^{2}$ of $(\overline{B})_{L}$ can be computed by propagating the IE of $\overline{\boldsymbol{B}}$ , the FE $\unicode[STIX]{x1D70E}_{\text{FE}}^{2}$ arises from calculating the difference $(b)_{L}$ , between $(\overline{B})_{L}$ and $(B)_{L}$ , the latter determined by how the RMs are measured. We have

(5.3)

$$\begin{eqnarray}(b)_{L}\equiv (B)_{L}-(\overline{B})_{L}=\frac{1}{L}\int \text{d}\boldsymbol{s}\boldsymbol{\cdot }\boldsymbol{B}-\frac{1}{L}\int \text{d}\boldsymbol{s}\boldsymbol{\cdot }\overline{\boldsymbol{B}}=\frac{1}{L}\int \text{d}\boldsymbol{s}\boldsymbol{\cdot }\boldsymbol{b}.\end{eqnarray}$$

This represents the line-of-sight mean of a fluctuation and is the deviation that results from comparing single average to a mixed double average $(q)_{{\mathcal{M}}}$ , discussed in § 4. Here $(\overline{Q})_{{\mathcal{M}}}=(\overline{B})_{L}=(1/L)\int \text{d}\boldsymbol{s}\boldsymbol{\cdot }\overline{\boldsymbol{B}}$ .

Our mission is to express $\unicode[STIX]{x1D70E}_{\text{IE}}^{2}$ and $\unicode[STIX]{x1D70E}_{\text{FE}}^{2}=\langle (b)_{L}^{2}\rangle$ in terms of known or derivable quantities for a MFT. Equation (4.16) gives the general form of $\unicode[STIX]{x1D70E}_{\text{IE}}^{2}$ for $\overline{\boldsymbol{B}}$ . If fluctuations in different directions are uncorrelated, the intrinsic error of $(\overline{B})_{L}$ can be approximated by

(5.4)

$$\begin{eqnarray}\unicode[STIX]{x1D70E}_{\text{IE}}^{2}\simeq \frac{1}{L}\int \text{d}s[(\unicode[STIX]{x1D70E}_{\overline{B}_{x}}\hat{\boldsymbol{x}}\boldsymbol{\cdot }\hat{\boldsymbol{s}})^{2}+(\unicode[STIX]{x1D70E}_{\overline{B}_{y}}\hat{\boldsymbol{y}}\boldsymbol{\cdot }\hat{\boldsymbol{s}})^{2}+(\unicode[STIX]{x1D70E}_{\overline{B}_{z}}\hat{\boldsymbol{z}}\boldsymbol{\cdot }\hat{\boldsymbol{s}})^{2}],\end{eqnarray}$$

where fluctuation scales are less than $L$ , which is true away from the galactic edge.

To compute $\langle (b)_{L}^{2}\rangle$ we assume a statistically isotropic turbulent field $\boldsymbol{b}(\boldsymbol{x})$ , and therefore the integrand on the right-hand side in (5.3) is insensitive to the direction of the line of sight. We use a scalar $b_{s}(x)$ to represent the component of $\boldsymbol{b}(\boldsymbol{x})$ along the line of sight. Next, we assume $b_{s}(x)$ can be decomposed into different modes with specific wavelengths indicated by a superscript $(m)$ , namely

(5.5)

$$\begin{eqnarray}b_{s}(x)=\mathop{\sum }_{m}b^{(m)}(x),\end{eqnarray}$$

with $k_{m}$ being the characteristic wavenumber of each mode and satisfying

(5.6)

$$\begin{eqnarray}\frac{2\unicode[STIX]{x03C0}}{k_{m}}\leqslant l,\end{eqnarray}$$

since the turbulent scale is smaller than the averaging scale. Correspondingly, for each mode $b^{(m)}$ , we divide $L$ evenly into $n_{m}=k_{m}L/\unicode[STIX]{x03C0}$ cells. For most lines of sight, $n_{m}$ is greater than $L/l$ since roughly the largest mode has a wavelength no larger than $l$ . The length of the line of sight inside the galaxy will typically be of order $l_{L}$ , the characteristic scale of a large scale magnetic field, except when observations are made edge-on and close to the galactic outer edge. Therefore, if we assume that $L/l>1$ , we have $n_{m}>1$ . Large $n_{m}$ will allow more accurate application of the CLT.

In each separate cell of scale $\unicode[STIX]{x03C0}/k_{m}$ , $b^{(m)}$ is nearly coherent in space with the same sign, parallel or anti-parallel to $\text{d}\boldsymbol{s}$ . We can then replace $b^{(m)}$ by its root-mean-square value $b_{m}$ defined by a MFE-appropriate average (§ 2.4), supplemented by a ‘ $+$ ’ sign if parallel to $\text{d}\boldsymbol{r}$ , and a ‘ $-$ ’ sign if anti-parallel. Then (5.3) becomes the sum of $m$ averages, each being the mean of $n_{m}$ random variables $b_{i}^{(m)}$ , taking a value $b_{m}$ or $-b_{m}$ :

(5.7)

$$\begin{eqnarray}b_{L}=\frac{1}{L}\int \text{d}\boldsymbol{s}\boldsymbol{\cdot }\boldsymbol{b}=\frac{1}{L}\mathop{\sum }_{m}\int \text{d}s\,b^{(m)}=\mathop{\sum }_{m}\frac{1}{n_{m}}\mathop{\sum }_{i=1}^{n_{m}}b_{i}^{(m)}.\end{eqnarray}$$

Although $b_{i}^{(m)}$ is likely to be correlated with both its spectral neighbour $b_{i}^{(m+1)}$ and spatial neighbour $b_{i+1}^{(m)}$ because the turbulent fields are entangled locally in both configuration and Fourier space, we assume that every $b_{i}^{(m)}$ varies independently and leave generalizations for future work.

For $n_{m}\gg 1$ the scale separation is large and $\sum _{i}b_{i}^{(m)}/n_{m}$ is close to a normally distributed random variable with zero mean and variance $b_{m}^{2}/n_{m}$ . Then $b_{L}$ is the sum of $m$ independent normally distributed random variables and thus a random variable itself, with variance (Ruzmaikin et al. Reference Ruzmaikin, Sokoloff and Shukurov1988, p. 256)

(5.8)

$$\begin{eqnarray}\unicode[STIX]{x1D70E}_{\text{FE}}^{2}=\mathop{\sum }_{m}\frac{b_{m}^{2}}{n_{m}}=\frac{1}{L}\mathop{\sum }_{m}\frac{\unicode[STIX]{x03C0}b_{m}^{2}}{k_{m}}.\end{eqnarray}$$

The summation on the right-hand side in (5.8) is the energy density-weighted average wavelength up to a constant. The relation to energy density is somewhat of a coincidence arising because both energy and variance are related to $\langle {b^{(m)}}^{2}\rangle$ .

The variance is more useful in its integral form. Let ${\displaystyle\mathop{M}\limits_{{\sim}}}(k)$ be the energy spectrum of the total magnetic field. In general, ${\displaystyle\mathop{M}\limits_{{\sim}}}(k)$ could vary in space, but for line-of-sight measurements, the energy spectrum averaged over the line of sight is a reasonable approximation. The energy spectra of large and small scale fields are then $|{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(\boldsymbol{k})|^{2}{\displaystyle\mathop{ M}\limits_{{\sim}}}(k)$ and $|1-{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(\boldsymbol{k})|^{2}{\displaystyle\mathop{ M}\limits_{{\sim}}}(k)$ , respectively. Hence $b_{m}^{2}$ is related to the energy spectrum through

(5.9)

$$\begin{eqnarray}\frac{b_{m}^{2}}{8\unicode[STIX]{x03C0}}=|1-{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(\boldsymbol{k}_{m})|^{2}{\displaystyle\mathop{M}\limits_{{\sim}}}(k_{m})\,\text{d}k_{m},\end{eqnarray}$$

given that ${\displaystyle\mathop{G}\limits_{{\sim}}}_{l}$ is isotropic. Using this and the integral version of (5.8), we obtain

(5.10)

$$\begin{eqnarray}\unicode[STIX]{x1D70E}_{\text{FE}}^{2}=\frac{8\unicode[STIX]{x03C0}^{2}}{L}\int _{0}^{k_{\unicode[STIX]{x1D708}}}\,\text{d}k\frac{|1-{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(\boldsymbol{k})|^{2}{\displaystyle\mathop{M}\limits_{{\sim}}}(k)}{k}=\frac{8\unicode[STIX]{x03C0}^{2}}{k_{\text{int}}L}\int _{0}^{k_{\unicode[STIX]{x1D708}}}\,\text{d}k|1-{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(\boldsymbol{k})|^{2}{\displaystyle\mathop{M}\limits_{{\sim}}}(k),\end{eqnarray}$$

where $k_{\unicode[STIX]{x1D708}}=2\unicode[STIX]{x03C0}/l_{\unicode[STIX]{x1D708}}$ is the wavenumber of the dissipation scale, and we have defined

(5.11)

$$\begin{eqnarray}k_{\text{int}}\equiv \frac{\displaystyle \int _{0}^{k_{\unicode[STIX]{x1D708}}}\,\text{d}k|1-{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(\boldsymbol{k})|^{2}{\displaystyle\mathop{M}\limits_{{\sim}}}(k)}{\displaystyle \int _{0}^{k_{\unicode[STIX]{x1D708}}}\,\text{d}k|1-{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(\boldsymbol{k})|^{2}{\displaystyle\mathop{M}\limits_{{\sim}}}(k)/k}\end{eqnarray}$$

to be the integral scale of fluctuations which depends weakly on $l$ but roughly equals $\unicode[STIX]{x03C0}/l_{s}$ , since $l_{s}$ is the coherent scale and the wavelength corresponding to it will be $2l_{s}$ .

Equation (5.10) reveals that $\unicode[STIX]{x1D70E}_{\text{FE}}^{2}$ is proportional to the total magnetic energy in fluctuations, and the ratio between $\unicode[STIX]{x03C0}/k_{\text{int}}\simeq l_{s}$ and the segment length $L$ through the source. Equation (5.10) is testable with simulations. The ensemble associated with the standard deviation on its left-hand side could be realized by taking snapshots of the system at different times (which would equate the time average to an ensemble average), whereas the integral on the right-hand side is measurable in Fourier space.

To illustrate the use of (5.10), we assume $\unicode[STIX]{x03C0}/k_{\text{int}}=l_{s}$ and define

(5.12)

$$\begin{eqnarray}q_{l}=\frac{\displaystyle \int _{0}^{k_{\unicode[STIX]{x1D708}}}\,\text{d}k|1-{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}|^{2}{\displaystyle\mathop{ M}\limits_{{\sim}}}}{\displaystyle \int _{0}^{k_{\unicode[STIX]{x1D708}}}\,\text{d}k{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}^{2}{\displaystyle\mathop{ M}\limits_{{\sim}}}}\end{eqnarray}$$

as the proportionality between small and large scale magnetic energies. The $q_{l}$ is independent of location along each line of sight but depends upon how we define large and small scale fields, through $l$ and $G_{l}$ . Hence (5.10) yields

(5.13)

$$\begin{eqnarray}\unicode[STIX]{x1D70E}_{\text{FE}}^{2}=\frac{l_{s}}{L}\left(\frac{\displaystyle \int \text{d}k|1-{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}|^{2}{\displaystyle\mathop{ M}\limits_{{\sim}}}}{\displaystyle \int \text{d}k{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}^{2}{\displaystyle\mathop{ M}\limits_{{\sim}}}}\right)\left(8\unicode[STIX]{x03C0}\int \text{d}k{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}^{2}{\displaystyle\mathop{ M}\limits_{{\sim}}}\right)=\frac{l_{s}}{L}q_{l}(\overline{B}^{2})_{L},\end{eqnarray}$$

where in the last equality, $(\overline{B}^{2})_{L}/8\unicode[STIX]{x03C0}=\int \text{d}k{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}^{2}{\displaystyle\mathop{ M}\limits_{{\sim}}}$ , the line-of-sight average of the large scale field energy. Note that $(\overline{B}^{2})_{L}$ is distinct from $(\overline{B}_{L})^{2}$ as the latter is the square of the line-of-sight average (see (5.2a,b ) of the theoretically predicted mean field $\overline{\boldsymbol{B}}$ ).

To express $q_{l}$ in terms of $l$ , we assume that $l_{s}$ and $l_{L}$ are insensitive to $l$ . We have checked that this is justified if, regardless of shape, ${\displaystyle\mathop{M}\limits_{{\sim}}}(k)$ has two peaks, one near $k=k_{L}=2\unicode[STIX]{x03C0}/l_{L}$ and one near $k=k_{s}=2\unicode[STIX]{x03C0}/l_{s}$ , and is small near $k=k_{l}$ . We also define $q\equiv \langle b^{2}\rangle /\langle B\rangle ^{2}$ as the proportionality between the unfiltered small- and large scale magnetic fields (= ratio of areas under the two peaks of ${\displaystyle\mathop{M}\limits_{{\sim}}}(k)$ ). Observations indicate that $q$ is on average somewhere between 3 and 4 (Fletcher Reference Fletcher2010; Van Eck et al. Reference Van Eck, Brown, Shukurov and Fletcher2015; Beck Reference Beck2016); we adopt a fiducial value $q=4$ . Consequently, we have

(5.14)

$$\begin{eqnarray}\displaystyle q_{l} & = & \displaystyle \frac{\displaystyle \int \text{d}k|1-{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}|^{2}{\displaystyle\mathop{ M}\limits_{{\sim}}}}{\displaystyle \int \text{d}k|{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}|^{2}{\displaystyle\mathop{ M}\limits_{{\sim}}}}\nonumber\\ \displaystyle & \simeq & \displaystyle \frac{|1-{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(k_{L})|^{2}\langle B\rangle ^{2}+|1-{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(k_{s})|^{2}\langle b^{2}\rangle }{|{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(k_{L})|^{2}\langle B\rangle ^{2}+|{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(k_{s})|^{2}\langle b^{2}\rangle }\nonumber\\ \displaystyle & = & \displaystyle \frac{|1-{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(k_{L})|^{2}+|1-{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(k_{s})|^{2}q}{|{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(k_{L})|^{2}+|{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(k_{s})|^{2}q}.\end{eqnarray}$$

Combining (5.13) and (5.14) we have

(5.15)

$$\begin{eqnarray}\unicode[STIX]{x1D70E}_{\text{FE}}^{2}=\frac{l_{s}}{L}q_{l}(\overline{B}^{2})_{L}=\frac{l_{s}}{L}\frac{|1-{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(k_{L})|^{2}+|1-{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(k_{s})|^{2}q}{|{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(k_{L})|^{2}+|{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(k_{s})|^{2}q}(\overline{B}^{2})_{L}.\end{eqnarray}$$

Equation (5.15) highlights that the variance in predicted RM is the product of three factors. First, the inverse of the number of eddy cells along the line of sight, $l_{s}/L$ . Being linear in the length ratio, this can be significant even when the correction terms to the modified MFE equations are small. The MFE corrections are of order $(l/l_{L})^{c}$ (see (2.8)), so for small $l/l_{L}$ ratio or large $c$ , the corrections could be small even if $\unicode[STIX]{x1D70E}_{\text{FE}}^{2}$ is significant. Second, $\unicode[STIX]{x1D70E}_{\text{FE}}^{2}$ depends on how energy is distributed between large and small scale fields through $q_{l}$ . Since a larger $l$ implies more modes are counted as small scale fields ( $k\lesssim 2\unicode[STIX]{x03C0}/l$ ), $q_{l}$ is a monotonic function of $l$ . Finally, equation (5.15) shows that $\unicode[STIX]{x1D70E}_{\text{FE}}^{2}$ is also proportional to the average magnetic energy density along the line of sight.

Some complexities of the true error are not considered in (5.15). First, due to local inhomogeneities (spiral arms for example), cells for each mode along a single line of sight may not be statistically identical nor have the same total amplitude of fluctuating magnetic energy as we have assumed. In (5.9) we have used the line-of-sight-averaged energy spectrum ${\displaystyle\mathop{M}\limits_{{\sim}}}(k)$ as an approximation and ignored spatial variation of $\langle b^{2}\rangle$ . Second, differential rotation makes turbulent magnetic fields anisotropic in an eddy turnover time $\unicode[STIX]{x1D70F}$ in the galactic mid-plane. The azimuthal fluctuation is amplified beyond the radial field such that $b_{\unicode[STIX]{x1D719}}\simeq b_{r}(1+q_{r}\unicode[STIX]{x1D6FA}\unicode[STIX]{x1D70F})\simeq 2b_{r}$ with the $q_{r}\simeq 1$ for a flat rotation curve, and Rossby number $Ro=1/(\unicode[STIX]{x1D6FA}\unicode[STIX]{x1D70F})\approx 1$ in spiral galaxies. Therefore, the two components $b_{r},b_{\unicode[STIX]{x1D719}}$ contribute unequally along different lines of sight, making FR measurements depend not only on $L$ , but also on direction.

6 Galactic dynamo and precision for different FR viewing angles

In this section we consider specific cases to elucidate the application of the calculations of precision of mean field theories given by (5.4) and (5.15) in the context of FR measurements. We calculate $\unicode[STIX]{x1D70E}^{2}=\unicode[STIX]{x1D70E}_{\text{IE}}^{2}+\unicode[STIX]{x1D70E}_{\text{FE}}^{2}$ in terms of mean fields when the measured galaxy is edge-on, face-on and inclined. We also consider the special case of measuring FR from within our own Galaxy. We use a cylindrical coordinate system centred at the galactic centre with coordinates $(r,\unicode[STIX]{x1D719},z)$ and the $z$ axis coinciding with the galactic rotation axis.

6.1 Galactic dynamo model

We augment the simplified galactic dynamo model from § 4.5 of Zhou & Blackman (Reference Zhou and Blackman2017),Footnote ⁴ where the ‘no- $z$ ’ approximation (Subramanian & Mestel Reference Subramanian and Mestel1993; Moss Reference Moss1995; Phillips Reference Phillips2001; Sur, Shukurov & Subramanian Reference Sur, Shukurov and Subramanian2007; Chamandy et al. Reference Chamandy, Shukurov, Subramanian and Stoker2014) is used. The resulting $\boldsymbol{B}$ is $r$ -dependent and cylindrically symmetric (i.e. azimuthally averaged). We include the correction terms of § 3 employing a Gaussian kernel (and thus $\hat{\unicode[STIX]{x1D6FE}}=-l^{2}\unicode[STIX]{x1D6FB}^{2}/8\unicode[STIX]{x03C0}^{2}$ ), which gives

(6.1)

$$\begin{eqnarray}\hat{\unicode[STIX]{x1D6FE}}=-\frac{1}{2}\frac{l^{2}}{4\unicode[STIX]{x03C0}^{2}}\unicode[STIX]{x2202}_{z}^{2}\rightarrow \frac{l^{2}}{32h^{2}},\end{eqnarray}$$

where derivatives in the radial direction are dropped assuming the disk is thin, $h/R\ll 1$ . The last relation in (6.1) follows from the ‘no- $z$ ’ approximation, $\unicode[STIX]{x2202}_{z}^{2}\rightarrow -(k_{h}/4)^{2}$ where $k_{h}=2\unicode[STIX]{x03C0}/h$ (Phillips Reference Phillips2001; Sur et al. Reference Sur, Shukurov and Subramanian2007). To the helicity density evolution equation with flux terms (Brandenburg & Subramanian Reference Brandenburg and Subramanian2005a ; Subramanian & Brandenburg Reference Subramanian and Brandenburg2006; Sur et al. Reference Sur, Shukurov and Subramanian2007) we also add the correction terms resulting from violation of the Reynolds rules and obtain

(6.2)

$$\begin{eqnarray}\unicode[STIX]{x2202}_{t}\unicode[STIX]{x1D6FC}_{m}=-\frac{2\unicode[STIX]{x1D6FD}}{l_{s}^{2}}\left[\frac{(1-\hat{\unicode[STIX]{x1D6FE}})(\boldsymbol{{\mathcal{E}}}\boldsymbol{\cdot }\overline{\boldsymbol{B}})+\overline{\boldsymbol{B}}\hat{\unicode[STIX]{x1D6FE}}\boldsymbol{{\mathcal{E}}}}{B_{eq}^{2}}+\frac{\unicode[STIX]{x1D6FC}_{m}}{R_{m}}\right]-\unicode[STIX]{x1D735}\boldsymbol{\cdot }(\unicode[STIX]{x1D6FC}_{m}\overline{\boldsymbol{U}})-\unicode[STIX]{x1D6FD}_{d}\unicode[STIX]{x1D6FB}^{2}\unicode[STIX]{x1D6FC}_{m}.\end{eqnarray}$$

The last term of (6.2) governs the diffusive flux and we adopt $\unicode[STIX]{x1D6FD}_{d}=\unicode[STIX]{x1D6FD}$ .

With the requirement that $l<h$ , we find that the $\hat{\unicode[STIX]{x1D6FE}}$ correction terms produce only small changes in the dynamo model solutions and we can omit them in the later discussion of the precision error. However, the smallness of the effect on the solutions is a feature of our particular dynamo model that is exacerbated by the aforementioned ‘no- $z$ ’ approximation. To see this note that for our choice of $l$ , ${\displaystyle\mathop{\unicode[STIX]{x1D6FE}}\limits_{{\sim}}}(k_{h})<1$ and the magnitude of $\hat{\unicode[STIX]{x1D6FE}}$ is always less than $1/16$ . The maximum value of $(\overline{B})_{L}(r)$ when $l$ is increased from $0.1h$ to $0.9h$ from the solutions changes by just ${\sim}1\,\%$ . If instead we had used the approximation that $\unicode[STIX]{x2202}_{z}^{2}\sim -k_{h}^{2}$ , there would be a ${\sim}40\,\%$ decrease in the maximum value of $(\overline{B})_{L}(r)$ when $l$ is increased from $0.1h$ to $0.9h$ from the solutions with the correction terms. This highlights that the correction terms are not necessarily small for every model. Moreover, in the absence of any significant scale separation between large and small scale parts of the magnetic energy spectrum, the expansion of ${\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(\boldsymbol{k})$ in (2.6) would itself be invalid, and corrections to the MFE would be non-perturbative.

Numerically, Shapovalov & Vishniac (Reference Shapovalov and Vishniac2011) found, from the (uncorrected) evolution equation of small scale helicity, that the resultant spectra of large scale quantities are insensitive to different filtering methods, for reasonable spectra of relevant total quantities.

The steady state,Footnote ⁵ non-dimensionalized dynamo equations read

(6.3)

$$\begin{eqnarray}\displaystyle & \displaystyle 0=\unicode[STIX]{x2202}_{t}B_{r}=-\frac{2}{\unicode[STIX]{x03C0}}R_{\unicode[STIX]{x1D6FC}}(1+\unicode[STIX]{x1D6FC}_{m})B_{\unicode[STIX]{x1D719}}-\left(R_{U}+\frac{\unicode[STIX]{x03C0}^{2}}{4}\right)B_{r} & \displaystyle\end{eqnarray}$$

(6.4)

$$\begin{eqnarray}\displaystyle & \displaystyle 0=\unicode[STIX]{x2202}_{t}B_{\unicode[STIX]{x1D719}}=R_{\unicode[STIX]{x1D714}}B_{r}-\left(R_{U}+\frac{\unicode[STIX]{x03C0}^{2}}{4}\right)B_{\unicode[STIX]{x1D719}} & \displaystyle\end{eqnarray}$$

(6.5)

$$\begin{eqnarray}\displaystyle \displaystyle 0 & = & \displaystyle \unicode[STIX]{x2202}_{t}\unicode[STIX]{x1D6FC}_{m}=-R_{U}\unicode[STIX]{x1D6FC}_{m}-\frac{\unicode[STIX]{x1D6FD}_{d}}{\unicode[STIX]{x1D6FD}}\frac{\unicode[STIX]{x03C0}}{2}\unicode[STIX]{x1D6FC}_{m}\nonumber\\ \displaystyle & & \displaystyle -\,C\left[(1+\unicode[STIX]{x1D6FC}_{m})(B_{r}^{2}+B_{\unicode[STIX]{x1D719}}^{2})+\frac{3}{8}\sqrt{\frac{-\unicode[STIX]{x03C0}(1+\unicode[STIX]{x1D6FC}_{m})R_{\unicode[STIX]{x1D714}}}{R_{\unicode[STIX]{x1D6FC}}}}B_{r}B_{\unicode[STIX]{x1D719}}+\frac{\unicode[STIX]{x1D6FC}_{m}}{R_{m}}\right],\end{eqnarray}$$

where

(6.6a-d )

$$\begin{eqnarray}R_{\unicode[STIX]{x1D6FC}}=\frac{\unicode[STIX]{x1D6FC}_{k}h}{\unicode[STIX]{x1D6FD}},\quad R_{U}=\frac{|\overline{\boldsymbol{U}}|h}{\unicode[STIX]{x1D6FD}},\quad R_{\unicode[STIX]{x1D714}}=-\frac{h^{2}\unicode[STIX]{x1D6FA}}{\unicode[STIX]{x1D6FD}},\quad C=2\left(\frac{h}{l_{s}}\right)^{2}\end{eqnarray}$$

are dimensionless parameters with a flat rotation curve $\unicode[STIX]{x1D6FA}\propto 1/r$ adopted, and magnetic fields are normalized by the equipartition field strength $B_{\text{eq}}=\sqrt{4\unicode[STIX]{x03C0}\unicode[STIX]{x1D70C}_{f}u^{2}}$ with $\unicode[STIX]{x1D70C}_{f}$ being the fluid density. The $\unicode[STIX]{x1D6FC}$ -coefficients are normalized by $\unicode[STIX]{x1D6FC}_{k}$ . The $r$ -dependence of (6.6) is described in detail in § 2.4 and (41) in Zhou & Blackman (Reference Zhou and Blackman2017). The approximation for the $\boldsymbol{{\mathcal{E}}}\boldsymbol{\cdot }\overline{\boldsymbol{B}}$ term can be found in the appendix of Sur et al. (Reference Sur, Shukurov and Subramanian2007) or that of Chamandy, Subramanian & Shukurov (Reference Chamandy, Subramanian and Shukurov2013).

Analytical expressions of $\overline{B}_{\unicode[STIX]{x1D719}}$ and $\overline{B}_{r}$ are obtainable from (6.3) to (6.5). The intrinsic error of $\overline{\boldsymbol{B}}(\boldsymbol{x})$ is then given by (4.16) in terms of $\unicode[STIX]{x1D70E}_{\unicode[STIX]{x1D6FC}_{k}}^{2}$ , $\unicode[STIX]{x1D70E}_{\unicode[STIX]{x1D6FD}}^{2}$ and $\unicode[STIX]{x1D70E}_{\unicode[STIX]{x1D6FC}_{k}\unicode[STIX]{x1D6FD}}$ . The first two are given in (4.15), whereas for $\unicode[STIX]{x1D70E}_{\unicode[STIX]{x1D6FC}_{k}\unicode[STIX]{x1D6FD}}$ we assume that fluctuations of $\unicode[STIX]{x1D6FC}_{k}$ and $\unicode[STIX]{x1D6FD}$ are uncorrelated, and

(6.7)

$$\begin{eqnarray}\unicode[STIX]{x1D70E}_{\unicode[STIX]{x1D6FC}_{k}\unicode[STIX]{x1D6FD}}\simeq (\unicode[STIX]{x1D70E}_{\unicode[STIX]{x1D6FC}_{k}}^{2}\unicode[STIX]{x1D70E}_{\unicode[STIX]{x1D6FD}}^{2})^{1/2}=\unicode[STIX]{x1D70E}_{\unicode[STIX]{x1D6FD}}^{2}R_{\unicode[STIX]{x1D6FC}}/h.\end{eqnarray}$$

(For galaxies, $R_{\unicode[STIX]{x1D6FC}}\simeq 1$ .) At a fixed location, $\overline{\boldsymbol{B}}$ is a function of $R_{\unicode[STIX]{x1D6FC}}$ , $R_{U}$ and $R_{\unicode[STIX]{x1D714}}$ . Therefore the partial derivatives with respect to $\unicode[STIX]{x1D6FC}_{k}$ and $\unicode[STIX]{x1D6FD}$ can be evaluated using the chain rule,

(6.8a,b )

$$\begin{eqnarray}\unicode[STIX]{x2202}_{\unicode[STIX]{x1D6FC}_{k}}=\frac{h}{\unicode[STIX]{x1D6FD}}\unicode[STIX]{x2202}_{R_{\unicode[STIX]{x1D6FC}}},\quad \unicode[STIX]{x2202}_{\unicode[STIX]{x1D6FD}}=-\frac{1}{\unicode[STIX]{x1D6FD}}(R_{\unicode[STIX]{x1D6FC}}\unicode[STIX]{x2202}_{R_{\unicode[STIX]{x1D6FC}}}+R_{U}\unicode[STIX]{x2202}_{R_{U}}+R_{\unicode[STIX]{x1D714}}\unicode[STIX]{x2202}_{R_{\unicode[STIX]{x1D714}}}).\end{eqnarray}$$

Combining (4.16), (6.7) and (6.8), we have for the intrinsic error of $\overline{B}_{i}$ ,

(6.9)

$$\begin{eqnarray}\displaystyle \unicode[STIX]{x1D70E}_{\text{IE},\overline{B}_{i}}^{2} & = & \displaystyle \frac{1}{12(l/l_{s})^{3}}\{(\unicode[STIX]{x2202}_{R_{\unicode[STIX]{x1D6FC}}}\overline{B}_{i})^{2}R_{\unicode[STIX]{x1D6FC}}^{2}+[(R_{\unicode[STIX]{x1D6FC}}\unicode[STIX]{x2202}_{R_{\unicode[STIX]{x1D6FC}}}+R_{U}\unicode[STIX]{x2202}_{R_{U}}+R_{\unicode[STIX]{x1D714}}\unicode[STIX]{x2202}_{R_{\unicode[STIX]{x1D714}}})\overline{B}_{i}]^{2}\nonumber\\ \displaystyle & & \displaystyle -\,2(R_{\unicode[STIX]{x1D6FC}}\unicode[STIX]{x2202}_{R_{\unicode[STIX]{x1D6FC}}}\overline{B}_{i})[(R_{\unicode[STIX]{x1D6FC}}\unicode[STIX]{x2202}_{R_{\unicode[STIX]{x1D6FC}}}+R_{U}\unicode[STIX]{x2202}_{R_{U}}+R_{\unicode[STIX]{x1D714}}\unicode[STIX]{x2202}_{R_{\unicode[STIX]{x1D714}}})\overline{B}_{i}]\!\},\end{eqnarray}$$

and that of $(\overline{B})_{L}$ is given by substituting (6.9) into (5.4), given the solutions of (6.3) to (6.5).

6.2 Edge-on view

We first consider a special case representing the measurement of FR of a perfectly edge-on disc galaxy with radius $R=12~\text{kpc}$ (see the schematic diagrams of figure 1). Note that the integration path segments along the line of sight within the galaxy form chords with lengths $L(\unicode[STIX]{x1D71B})=2\sqrt{R^{2}-\unicode[STIX]{x1D71B}^{2}}$ , where $\unicode[STIX]{x1D71B}$ is the distance from the galactic centre to the closest point on the chord. From the geometry of the configuration, the line of sight average is

(6.10)

$$\begin{eqnarray}\overline{B}_{L}(\unicode[STIX]{x1D71B})=\frac{2\unicode[STIX]{x1D71B}}{L(\unicode[STIX]{x1D71B})}\int _{0}^{L/2}\,\text{d}y\frac{\overline{B}_{\unicode[STIX]{x1D719}}(r)}{r},\end{eqnarray}$$

and

(6.11)

$$\begin{eqnarray}(\overline{B}^{2})_{L}(\unicode[STIX]{x1D71B})=\frac{2}{L(\unicode[STIX]{x1D71B})}\int _{0}^{L/2}\,\text{d}y\overline{B}^{2}(r),\end{eqnarray}$$

where $r=\sqrt{\unicode[STIX]{x1D71B}^{2}+y^{2}}$ is the radial coordinate from the galactic centre. Only $\overline{B}_{\unicode[STIX]{x1D719}}$ contributes to $(\overline{B})_{L}$ for the edge-on view because $\overline{B}_{r}$ is mirror symmetric about the $x$ -axis and its contributions from the $y>0$ and $y<0$ regions cancel each other. The intrinsic error is given by

(6.12)

$$\begin{eqnarray}\unicode[STIX]{x1D70E}_{\text{IE}}^{2}=\frac{2\unicode[STIX]{x1D71B}^{2}}{L(\unicode[STIX]{x1D71B})}\int _{0}^{L/2}\,\text{d}y\frac{\unicode[STIX]{x1D70E}_{\text{int},\overline{B}_{\unicode[STIX]{x1D719}}}^{2}}{r^{2}}.\end{eqnarray}$$

The imprecision associated with the observation is given by (5.15) and is

(6.13)

$$\begin{eqnarray}\unicode[STIX]{x1D70E}_{\text{FE}}^{2}=\frac{2l_{s}}{L^{2}}\frac{(1-\text{e}^{-l^{2}/2l_{L}^{2}})^{2}+(1-\text{e}^{-l^{2}/2l_{s}^{2}})^{2}q}{\text{e}^{-l^{2}/l_{L}^{2}}+\text{e}^{-l^{2}/l_{s}^{2}}q}\int _{0}^{L/2}\,\text{d}y\overline{B}^{2}(r),\end{eqnarray}$$

where we take $l_{L}\simeq h=0.5~\text{kpc}$ , for galactic disk semi-thickness $h$ , and the variation scale of turbulent fields $l_{s}\simeq 0.1~\text{kpc}$ is assumed to be the same for velocity and magnetic fields. Here $l_{L}\simeq h$ because $\unicode[STIX]{x2202}_{r}\ll \unicode[STIX]{x2202}_{z}$ in a thin disk and $h$ is the smallest natural scale of variation for the mean field. Correspondingly we take an averaging scale $0.12\leqslant l\leqslant 0.48~\text{kpc}$ .

Figure 2. (a) Theoretical predictions of the line-of-sight-averaged magnetic field $\overline{B}_{L}$ with the filtering error $\unicode[STIX]{x1D70E}_{\text{df}}$ shown as error bars in an edge-on view of a disc galaxy, assuming that the mean field has the form $\overline{\boldsymbol{B}}=B_{0}\hat{\unicode[STIX]{x1D753}}$ with $B_{0}=1$ . Lengths are normalized by the galactic radius $R=12~\text{kpc}$ . Two sets of error bars are shown for different choices of $l$ . (b) Fractional error bar values at different radii as a function of the averaging scale $l$ .

Figure 3. Similar to figure 2(a) but using analytic dynamo solutions for $\overline{\boldsymbol{B}}$ from § 4.5 of Zhou & Blackman (Reference Zhou and Blackman2017) by solving equations (6.3) to (6.5). (a) Intrinsic error and (b) filtering error.

The predicted line-of-sight average of the magnetic field, together with the error bars are shown in figures 2 and 3, where two different profiles of $\boldsymbol{B}$ are separately considered: (i) in figure 2(a) $\overline{\boldsymbol{B}}(\boldsymbol{x})=B_{0}\hat{\unicode[STIX]{x1D753}}$ where $B_{0}=1$ is a constant, and (ii) in figure 3 the analytic solution of the mean field dynamo model from § 6.1, normalized by the equipartition field strength $B_{\text{eq}}=\sqrt{4\unicode[STIX]{x03C0}\unicode[STIX]{x1D70C}_{f}u^{2}}$ . The dimensionless parameters we have used for the analytic solution are the same as those in Zhou & Blackman (Reference Zhou and Blackman2017):

(6.14a-d )

$$\begin{eqnarray}R_{\unicode[STIX]{x1D6FC}}=R_{\unicode[STIX]{x1D6FC}0}/2,\quad R_{U}=2R_{U0}/(r/r_{\odot })^{2}F^{5/2},\quad R_{\unicode[STIX]{x1D714}}=2R_{\unicode[STIX]{x1D714}0}/(r/r_{\odot })^{2}F^{3},\quad C=4C_{0}/(r/r_{\odot })^{2}F^{3},\end{eqnarray}$$

where quantities with subscripts 0 are computed using

(6.15)

$$\begin{eqnarray}\left.\begin{array}{@{}c@{}}\unicode[STIX]{x1D70F}_{\text{ed}}=10^{15}~\text{s},\quad u=10~\text{km s}^{-1},\quad r\unicode[STIX]{x1D6FA}=200~\text{km s}^{-1},\\ l_{s}=0.1~\text{kpc},\quad h=0.5~\text{kpc},\quad U_{0}=1~\text{km s}^{-1},\end{array}\right\}\end{eqnarray}$$

which yields

(6.16a-c )

$$\begin{eqnarray}R_{\unicode[STIX]{x1D6FC}0}=1,\quad R_{U0}=0.3,\quad R_{\unicode[STIX]{x1D714}0}=-15,\end{eqnarray}$$

and we use $R_{m}=10^{5}$ . Above $r_{\odot }\equiv 8~\text{kpc}$ is the location of the Sun, and the function $F$ determines the $r$ -dependence of the dimensionless parameters and is described in detail in the appendix in Zhou & Blackman (Reference Zhou and Blackman2017).

The line-of-sight averages of the mean magnetic fields are shown as black solid curves, along with different types of error bars $=\pm \unicode[STIX]{x1D70E}$ about the mean computed from (5.4) and (6.13) for the cases associated with two different choices of the scale of average, $l$ . The blue dashed lines with circular markers give error bars with $l=0.2~\text{kpc}$ , and the yellow solid lines with triangular markers give those with $l=0.4~\text{kpc}$ . In the constant magnetic field case, the intrinsic error does not exist because here $\overline{\boldsymbol{B}}$ is presumed, rather than derived from MFE equations.

Different choices of $l$ conspicuously show different levels of precision in the predictions for measurements, as evidenced by a comparison of the blue versus yellow IE bars in the $r$ -dependent model (figure 3 a). Variations in a data curve beneath the level of the error bars cannot be deemed a disagreement with the MFE theory. That is, whether uncorrelated or weakly correlated deviations with amplitudes below the error bars are systematic (Chamandy, Shukurov & Taylor Reference Chamandy, Shukurov and Taylor2016) or stochastic is beyond the resolution of the theory.

Comparing figure 3(a,b) highlights competing dependences of $\unicode[STIX]{x1D70E}_{\text{IE}}^{2}$ and $\unicode[STIX]{x1D70E}_{\text{FE}}^{2}$ on $l$ , as discussed in § 4: $\unicode[STIX]{x1D70E}_{\text{IE}}^{2}$ grows with $l$ but $\unicode[STIX]{x1D70E}_{\text{FE}}^{2}$ decreases with $l$ . Assuming $\unicode[STIX]{x1D70E}_{\text{IE}}^{2}$ and $\unicode[STIX]{x1D70E}_{\text{FE}}^{2}$ are independent and uncorrelated, adding them in quadrature gives the total uncertainty $\unicode[STIX]{x1D70E}^{2}$ .

In figure 2(b) and in figure 4, we show the relative total errors, $\unicode[STIX]{x1D70E}^{2}/(\overline{B})_{L}^{2}$ , as a function of $0.12~\text{kpc}\leqslant l\leqslant 0.48~\text{kpc}$ at different galactic radii. For figure 2 there is only one uncertainty, namely $\unicode[STIX]{x1D70E}_{\text{FE}}^{2}$ which is a monotonic function of $l$ for all radii shown. More interesting case is figure 4 where both $\unicode[STIX]{x1D70E}_{\text{FE}}^{2}$ and $\unicode[STIX]{x1D70E}_{\text{IE}}^{2}$ are competitive. There is an optimal scale of average, located at $0.15\leqslant r/R\leqslant 0.20$ for all four chosen radii, that minimizes the total error, and thus maximizes the precision of comparing theory and observation. In general. the existence and location of such a ‘sweet spot’ depends on the solution to a given dynamo model, and the observational method used.

Figure 4. The total relative error using the analytic dynamo solutions at different radii as a function of averaging scale $l$ . An optimal scale arises at 0.15–0.20 kpc which minimizes the relative error, and therefore provides the best precision of theoretical predictions.

6.3 Face-on view

A complementary extreme to the edge-on case is a face-on view. Here every line of sight is perpendicular to the galactic disk, taken along the $z$ direction. In this orientation, $B_{\unicode[STIX]{x1D719}}$ and $B_{r}$ do not contribute to $(\overline{B})_{L}$ , and for a weak $\overline{B}_{z}$ , the dominant non-vanishing RM would come from small scale fluctuations. If we assume quasi-equipartition between the total mean and fluctuating small scale magnetic energies, the FR measurements still predict a precision error about which the mean field is indeterminate.

Taking $L=2h$ , the thickness of the galactic disk, and noting that $\overline{\boldsymbol{B}}$ is solely a function of $r$ in (5.15), we have

(6.17)

$$\begin{eqnarray}\unicode[STIX]{x1D70E}_{\text{FE}}^{2}(r)=\frac{l_{s}}{2h}\frac{|1-{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(k_{L})|^{2}+|1-{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(k_{s})|^{2}q}{|{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(k_{L})|^{2}+|{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(k_{s})|^{2}q}(\overline{B}_{\unicode[STIX]{x1D719}}^{2}+\overline{B}_{r}^{2})_{L}.\end{eqnarray}$$

Figure 5 shows $\overline{B}_{L}$ as a function of the galactic radial coordinate $r$ (normalized by the galactic radius $R$ ) from a face-on view of the same $r$ -dependent dynamo model used in the last subsection (Zhou & Blackman Reference Zhou and Blackman2017). The predicted RM is now zero and its filtering error is given in blue dashed lines with circular markers for $l=0.2~\text{kpc}$ , and in yellow solid lines with triangular markers for $l=0.4~\text{kpc}$ . These emerge purely from stochastic fluctuations. The intrinsic error is zero because $\overline{B}_{z}=0$ everywhere.

Figure 5. Similar to figure 3 but for a face-on view of a disc galaxy, using the analytic dynamo solution.

6.4 Views at intermediate inclinations

The formulation becomes a bit more complicated when the line of sight is at an intermediate inclination. We adopt Cartesian coordinates in this subsection, where the $z$ -axis coincides with the galactic rotation axis, $x{-}y$ plane coincides with the galactic mid-plane, the $y$ -axis is parallel to the line of sight. Figure 1 shows a schematic plot. Let the angle between the $z$ axis and the line of sight be $\unicode[STIX]{x1D703}$ , and $0<\unicode[STIX]{x1D703}<\unicode[STIX]{x03C0}/2$ . The line-of-sight averages depend on the location of the intersection point of the line of sight and the galactic mid-plane, $(x,y)$ , and are given by

(6.18)

$$\begin{eqnarray}(\overline{B})_{L}(x,y)=\frac{\sin \unicode[STIX]{x1D703}}{2h\sqrt{x^{2}+y^{2}}}\int _{-h}^{h}\,\text{d}z\,[x\overline{B}_{\unicode[STIX]{x1D719}}(\unicode[STIX]{x1D70C})+y\overline{B}_{r}(\unicode[STIX]{x1D70C})],\end{eqnarray}$$

and

(6.19)

$$\begin{eqnarray}(\overline{B}^{2})_{L}(x,y)=\frac{1}{2h}\int _{-h}^{h}\,\text{d}z\,\overline{B}^{2}(\unicode[STIX]{x1D70C}),\end{eqnarray}$$

where $\unicode[STIX]{x1D70C}=\sqrt{x^{2}+(z\tan \unicode[STIX]{x1D703}+y)^{2}}$ . We include only the region $\{(x,y)|\unicode[STIX]{x1D70C}\leqslant R\}$ . Equation (6.19) can then be used in (5.15) to compute the precision error associated with FR measures, and the intrinsic error is given by

(6.20)

$$\begin{eqnarray}\unicode[STIX]{x1D70E}_{\text{IE}}^{2}(x,y)=\frac{\sin ^{2}\unicode[STIX]{x1D703}}{2h(x^{2}+y^{2})}\int _{-h}^{h}\,\text{d}z\,[x^{2}\unicode[STIX]{x1D70E}_{\text{int},\overline{B}_{\unicode[STIX]{x1D719}}}^{2}(\unicode[STIX]{x1D70C})+y^{2}\unicode[STIX]{x1D70E}_{\text{int},\overline{B}_{r}}^{2}(\unicode[STIX]{x1D70C})],\end{eqnarray}$$

which can be determined once the intrinsic error of $\overline{\boldsymbol{B}}$ is calculated.

6.5 View from within our Galaxy

Finally, we discuss pulsar rotation measures as measured from inside our galaxy. For simplicity, we omit the $z$ -dependence and assume that both the observer and pulsars are in the galactic mid-plane. A schematic plot is shown in figure 1. The distance of the observer to the galactic centre is denoted by $r_{1}$ , and for this simple example, we assume pulsars to have a fixed distance $L=r_{2}<r_{1}$ from the observer and lie in the galactic mid-plane. We use $r_{1}=8~\text{kpc}$ and $r_{2}=3~\text{kpc}$ for typical values in calculations. The line-of-sight average of magnetic fields is also a function of $\unicode[STIX]{x1D703}$ , the azimuthal angle for a polar coordinate system centred at the Earth which denotes the positions of pulsars, and $\unicode[STIX]{x1D703}=0$ points to the galactic centre. The line-of-sight average of the mean field is then

(6.21)

$$\begin{eqnarray}(\overline{B})_{L}(\unicode[STIX]{x1D703})=-\frac{r_{1}\sin \unicode[STIX]{x1D703}}{r_{2}}\int _{0}^{r_{2}}\,\text{d}r\frac{\overline{B}_{\unicode[STIX]{x1D719}}(\unicode[STIX]{x1D70C})}{\unicode[STIX]{x1D70C}}+\frac{1}{r_{2}}\int _{0}^{r_{2}}\,\text{d}r\frac{r-r_{1}\cos \unicode[STIX]{x1D703}}{\unicode[STIX]{x1D70C}}\overline{B}_{r}(\unicode[STIX]{x1D70C}),\end{eqnarray}$$

where $\unicode[STIX]{x1D70C}^{2}=r_{1}^{2}-2r_{1}r\cos \unicode[STIX]{x1D703}+r^{2}$ is the radial coordinate in the galactocentric coordinate system (see figure 1). The line-of-sight-averaged $\overline{B}^{2}$ is given by

(6.22)

$$\begin{eqnarray}(\overline{B}^{2})_{L}(\unicode[STIX]{x1D703})=\frac{1}{r_{2}}\int _{0}^{r_{2}}\,\text{d}r\,\overline{B}^{2}(\unicode[STIX]{x1D70C}).\end{eqnarray}$$

The intrinsic error is given by

(6.23)

$$\begin{eqnarray}\unicode[STIX]{x1D70E}_{\text{IE}}^{2}(\unicode[STIX]{x1D703})=\frac{r_{1}^{2}\sin ^{2}\unicode[STIX]{x1D703}}{r_{2}}\int _{0}^{r_{2}}\,\text{d}r\frac{\unicode[STIX]{x1D70E}_{\text{IE},\overline{B}_{\unicode[STIX]{x1D719}}}^{2}}{\unicode[STIX]{x1D70C}^{2}}+\frac{1}{r_{2}}\int _{0}^{r_{2}}\,\text{d}r\left(\frac{r-r_{1}\cos \unicode[STIX]{x1D703}}{\unicode[STIX]{x1D70C}}\right)^{2}\unicode[STIX]{x1D70E}_{\text{IE},\overline{B}_{r}}^{2}.\end{eqnarray}$$

The resultant curve is shown in figure 6 in the same plot style as those in the previous subsections. In this case, stochastic fluctuations introduce only small $\unicode[STIX]{x1D70E}_{\text{IE}}^{2}$ and moderate $\unicode[STIX]{x1D70E}_{\text{FE}}^{2}$ , the latter being dominant because the line-of-sight average yields a large $(\overline{B})_{L}$ and the number of eddy cells along the line of sight is small as a consequence of small $L$ . Thus $\unicode[STIX]{x1D70E}_{\text{FE}}^{2}$ dominates the total uncertainty $\unicode[STIX]{x1D70E}^{2}=\unicode[STIX]{x1D70E}_{\text{IE}}^{2}+\unicode[STIX]{x1D70E}_{\text{FE}}^{2}$ , and therefore in figure 7 which again shows relative errors at different directions of observation as a function of $l$ , most curves are monotonic and reach their minima when $l\rightarrow l_{s}$ . Since $l$ is physically constrained in the region $[l_{s},l_{L}]$ (otherwise the statistical prescriptions of $\unicode[STIX]{x1D6FC}$ and $\unicode[STIX]{x1D6FD}$ break down), this implies that $l\simeq l_{s}$ is the optimal choice of average scale in this case.

Figure 6. Line-of-sight predictions and error bars of pulsar rotation measures for our view from within our Galaxy based on the analytically solvable dynamo model, equations (6.3) to (6.5), taken from § 4.5 in Zhou & Blackman (Reference Zhou and Blackman2017). (a,b) Show error bars corresponding to the intrinsic error and filtering error, respectively.

Figure 7. The total relative error for pulsar RMs at different azimuthal angle (centred at the Earth) as a function of averaging scale $l$ . Filtering error dominates as a result of short length of the line of sight.

It cannot be excluded that for different parameters, e.g. if $q\equiv \langle b^{2}\rangle /\langle B\rangle ^{2}$ were to exceed some critical value, the errors might dominate mean field variations making it difficult to statistically identify mean field reversals.

7 Conclusions

7.1 Summary

For large scale separation between mean fields and fluctuations, ensemble and spatial averages are approximately equivalent, but this is not guaranteed in many astrophysical circumstances where mesoscale fluctuations are present. With this motivation, we formally derived correction terms to MFE for spatial averaging that result from a finite scale separation. In addition, we have quantified two types of MFE precision errors: (i) the intrinsic error $\unicode[STIX]{x1D70E}_{\text{IE}}^{2}$ , which can be derived by differentiating the solution of the mean field equations with respect to its input parameters and propagating the uncertainty of each parameter to the mean field; and (ii) the filtering error $\unicode[STIX]{x1D70E}_{\text{FE}}^{2}$ that results because the prediction from mean field theory is filtered differently from the observations. Specifically we considered the case where the predicted value is filtered using the kernel for the mean field and then again by the measurement kernel – whereas the observations only singly filter the full field through the measurement kernel.

We derived the MFE corrections and precision errors using convolutions of the full field and kernels, which introduce a prescribed averaging scale $l$ . To realistically depict large scale fields, the kernels must be chosen to be local in both configuration and Fourier space, and monotonically decreasing in Fourier space. We expanded the MFE equations in the ratio $l/l_{L}$ , where $l_{L}$ is the dominant scale of variation of the mean field. The zeroth-order equations have the same form as those from an ensemble average, but new first-order corrections of order $(l/l_{L})^{c}$ arise due to a violation of Reynolds rules, where $c>0$ depends on the kernel. Our approach allows for moderate scale separations.

To exemplify the calculation of the precision errors, we considered contributions to (uniform density) galactic Faraday rotation measures from mesoscale fluctuations where the mean field filter is a local spatial average and the measurement kernel is a line-of-sight average. We applied the formalism to different viewing angles of a disc galaxy and find that the precision error of MFE can be large even when the corrections to the MFE equations themselves are small. This highlights the necessity of quantifying this precision of mean field theories to avoid misconstruing stochastic from systematic deviations between theory and observations. The error quantifies the predictive resolution of the theory.

Since $\unicode[STIX]{x1D70E}_{\text{IE}}^{2}$ decreases with $l$ while $\unicode[STIX]{x1D70E}_{\text{FE}}^{2}$ increases with $l$ , the sum of the two errors may be non-monotonic over the physically allowed range of $l$ , in turn allowing determination of optimal scale of $l$ that maximizes the precision of the theory. For example, we identified the optimal averaging scale for FR that minimizes the error to be about 0.17 kpc in our dynamo model for edge-on galactic viewing.

We also showed how our study differs from that of Rheinhardt & Brandenburg (Reference Rheinhardt and Brandenburg2012) who were also motivated to address corrections to MFE equations for modest spatial scale separation. Our focus is on the influence of the kernel that enters the averaging of fields themselves whereas their focus was on the semi-empirically determined kernel relating the EMF to the mean magnetic field when the latter was defined through a planar average.

7.2 Further work

Our formalism can be tested and developed further. First, using DNS for a system that exhibits a statistically steady large scale dynamo for a specific choice of kernel average, the saturated state from simulations could be sampled at different times and an ensemble constructed. The mean field precision error can then be measured and compared to our predictions. Second, the MFE precision calculations that we exemplified for FR could be generalized for more realistic numerical dynamo models, for comparison to observations. Generalization of the form of the magnetic spectra, allowance for spatial inhomogeneities, or calculation of still higher-order corrections to MFE equations are also possible. Third, there remains analytical and numerical work to study dynamo models in which the linear-order corrections to the MFE equations are not as small as those in the example models we considered with the ‘no- $z$ ’ formalism. For systems in which there is very little scale separation between large and small scale energy spectral peaks, going beyond our perturbative treatment of Reynolds rules violations would be necessary. The resulting generalized MFE equations in this non-perturbative regime, with correction terms that involve the full unexpanded kernel, could be solved numerically.

More broadly, analogous computations of MFE precision are warranted for comparing theory and observations for observables other than RMs such as polarized synchrotron emission in galaxies, or spectral fluxes in turbulent accretion disks. For the latter, the standard axisymmetric theory in common use is also an example of a mean field theory which is a limiting case of MFE and has a finite precision that has not yet been fully quantified (Blackman, Nauman & Edgar Reference Blackman, Nauman and Edgar2010).

Acknowledgements

We are grateful to referee M. Rheinhardt for providing numerous thoroughly perceptive and detailed comments that helped us to very significantly improve the manuscript. We acknowledge support from grants NSF-AST-15156489 and HST-AR-13916 and the Laboratory for Laser Energetics at U. Rochester. E.G.B. also acknowledges the Kavli Institute for Theoretical Physics (KITP) USCB and associated support from grant NSF PHY-1125915.

Appendix A. On the validity of (2.17)

In deriving (2.17), an approximation of $\overline{a\overline{B}}$ , we have only considered the convolution of ${\displaystyle\mathop{a}\limits_{{\sim}}}(\boldsymbol{k})$ and ${\displaystyle\mathop{\{}\limits_{{\sim}}}(\boldsymbol{k}^{\prime })$ assuming that in this combination, only small $k=|\boldsymbol{k}|$ and $k^{\prime }=|\boldsymbol{k}^{\prime }|$ contribute. There are also contributions from other combinations of $k$ and $k^{\prime }$ . In this appendix we discuss and quantify the validity of (2.17), and show that it depends primarily on the scale separation $l_{s}/L$ . Specifically, we show that for a Gaussian kernel with $k_{l}=5$ , equation (2.17) is a good approximation when $k_{s}/k_{L}\gtrsim 20$ , assuming both ${\displaystyle\mathop{A}\limits_{{\sim}}}(k)$ and ${\displaystyle\mathop{B}\limits_{{\sim}}}(k)$ are double peaked, and $k_{s}$ and $k_{L}$ are the characteristic wave numbers of small and large scales, respectively. In this respect, the approximation we use improves the standard theory by relaxing the assumption of infinite scale separation, but is not valid for arbitrarily small separation.

For simplicity, we focus on one-dimensional cases here. Which parts in the spectra of $a$ and $\overline{B}$ contribute most to the quantity $\overline{a\overline{B}}$ depends on the kernel, $k_{l}$ , and $k_{s}/k_{L}$ . We explain these dependencies in turn. The dependence on the kernel can be seen from the following. We will express $\overline{a\overline{B}}$ in $k$ -space in terms of ${\displaystyle\mathop{G}\limits_{{\sim}}}_{l}$ , ${\displaystyle\mathop{A}\limits_{{\sim}}}$ and ${\displaystyle\mathop{B}\limits_{{\sim}}}$ . First we focus on the Fourier transform of $a\overline{B}$ , which is given by the convolution

(A 1)

$$\begin{eqnarray}({\displaystyle\mathop{a}\limits_{{\sim}}}\ast \overline{{\displaystyle\mathop{B}\limits_{{\sim}}}})(k)=\int \text{d}k^{\prime }{\displaystyle\mathop{a}\limits_{{\sim}}}(k^{\prime }){\displaystyle\mathop{\{}\limits_{{\sim}}}(k-k^{\prime })=\int \text{d}k^{\prime }[1-{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(k^{\prime })]{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(k-k^{\prime }){\displaystyle\mathop{A}\limits_{{\sim}}}(k^{\prime }){\displaystyle\mathop{B}\limits_{{\sim}}}(k-k^{\prime }).\end{eqnarray}$$

For fixed $k$ , we can calculate which wavenumber $k^{\prime }$ in the convolution contributes most by differentiating the factor $[1-{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(k^{\prime })]{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(k-k^{\prime })$ with respect to $k^{\prime }$ and setting it to zero. The solution $k_{0}^{\prime }(k)$ depends on the form of the kernel ${\displaystyle\mathop{G}\limits_{{\sim}}}_{l}$ . How $k_{0}^{\prime }(k)$ behaves at small $k$ is of interest because we ultimately need to multiply $({\displaystyle\mathop{\{}\limits_{{\sim}}})(k)$ by ${\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(k)$ to get the Fourier transform of $\overline{a\overline{B}}$ . If $k_{0}^{\prime }$ is small for small $k$ , then we need only consider the low wavenumber parts of $a$ and $\overline{B}$ because both $k^{\prime }$ and $k-k^{\prime }$ would be small in the integrand. But $k_{0}^{\prime }$ could in general be comparable to $k_{l}$ or even larger for small $k$ . For example, figure 8 shows $k_{0}^{\prime }(k)$ for a Gaussian kernel ${\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(k)=\text{e}^{-k^{2}/2k_{l}^{2}}$ with $k_{l}/k_{L}=1$ . For $k\leqslant k_{l}$ , we see that $k_{0}^{\prime }(k)$ is not small, and is of order $k_{l}$ .

Figure 8. $k_{0}^{\prime }(k)$ for a Gaussian kernel $\text{e}^{-k^{2}/2k_{l}^{2}}$ with $k_{l}=1$ .

However, if the spectra ${\displaystyle\mathop{A}\limits_{{\sim}}}(k^{\prime })$ or ${\displaystyle\mathop{B}\limits_{{\sim}}}(k-k^{\prime })$ vanishes near $k^{\prime }=k_{0}^{\prime }(k)$ then the maximum contribution to (A 1) must come from other wave numbers where ${\displaystyle\mathop{A}\limits_{{\sim}}}$ and ${\displaystyle\mathop{B}\limits_{{\sim}}}$ are non-vanishing. In the case of double peaked spectra, with peaks at $k_{L}$ and $k_{s}$ , the scale separation plays an important role in determining the significantly contributing wave numbers. In the aforementioned example of figure 8, it is possible that ${\displaystyle\mathop{A}\limits_{{\sim}}}(k^{\prime }){\displaystyle\mathop{B}\limits_{{\sim}}}(k-k^{\prime })$ in the integrand of (A 1) vanishes at $k^{\prime }=k_{0}^{\prime }(k)\simeq k_{l}$ for small $k$ . That is, although $[1-{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(k^{\prime })]{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(k-k^{\prime })$ reaches its maximum at $k_{0}^{\prime }(k)$ for small $k$ , ${\displaystyle\mathop{A}\limits_{{\sim}}}(k^{\prime }){\displaystyle\mathop{B}\limits_{{\sim}}}(k-k^{\prime })\simeq 0$ there because of large scale separation. Indeed, equation (2.17) is appropriate for cases with large scale separations between peaks, because the factor $[1-{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(k^{\prime })]{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(k-k^{\prime })$ cannot be large at small $k$ and large $k^{\prime }$ . Given a fixed small $k$ , this factor will vanish toward large $k^{\prime }$ and retain some non-zero value at intermediate ( $\simeq k_{l}$ ) and small ( ${\lesssim}k_{l}$ ) $k^{\prime }$ depending on the kernel. Provided there is a large enough scale separation, the intermediate $k^{\prime }$ regime does not contribute since ${\displaystyle\mathop{A}\limits_{{\sim}}}$ and ${\displaystyle\mathop{B}\limits_{{\sim}}}$ vanish there, leaving only the small $k^{\prime }$ part.

Figure 9. Comparisons of exact and approximated results of $\overline{f\overline{F}}$ for different scale separations $k_{s}/k_{L}$ .

Figure 10. (a) Mean relative errors defined in (A 4) and (A 5) from comparing the exact and approximated results of $\overline{f\overline{F}}$ as a function of $k_{s}/k_{L}$ . (b) Mean relative error for $k<k_{l}$ as a function of $k_{s}/k_{L}$ and $k_{l}$ .

We quantify the importance of scale separation for the validity of (2.17) in figures 9 and 10 using the following double-peaked spectrum:

(A 2)

$$\begin{eqnarray}{\displaystyle\mathop{ F}\limits_{{\sim}}}(k)=\frac{1}{\sqrt{2\unicode[STIX]{x03C0}}\unicode[STIX]{x1D70E}_{L}}\text{e}^{-(k-k_{L})^{2}/2\unicode[STIX]{x1D70E}_{L}^{2}}+\frac{q}{\sqrt{2\unicode[STIX]{x03C0}}\unicode[STIX]{x1D70E}_{s}}\text{e}^{-(k-k_{s})^{2}/2\unicode[STIX]{x1D70E}_{s}^{2}},\quad k\geqslant 0;{\displaystyle\mathop{F}\limits_{{\sim}}}(k)={\displaystyle\mathop{ F}\limits_{{\sim}}}(-k),k<0,\end{eqnarray}$$

where $k_{L}=1$ , $\unicode[STIX]{x1D70E}_{L}=1$ , $\unicode[STIX]{x1D70E}_{s}=4$ and $q=4$ are fixed. We use a Gaussian kernel for filtering, namely

(A 3)

$$\begin{eqnarray}{\displaystyle\mathop{G}\limits_{{\sim}}}_{l}(k)=\text{e}^{-k^{2}/2/k_{l}^{2}},\end{eqnarray}$$

where $k_{l}=5$ is fixed. We then test (2.17) by comparing the exact result $P_{e}={\mathcal{F}}[\overline{f\overline{F}}]=G(k)[{\displaystyle\mathop{f}\limits_{{\sim}}}\ast {\displaystyle\mathop{\{}\limits_{{\sim}}}](k)$ and its approximation $P_{a}=G(k)[{\displaystyle\mathop{\{}\limits_{{\sim}}}\ast {\displaystyle\mathop{\unicode[STIX]{x1D6FE}}\limits_{{\sim}}}{\displaystyle\mathop{\{}\limits_{{\sim}}}](k)$ ( ${\displaystyle\mathop{\unicode[STIX]{x1D6FE}}\limits_{{\sim}}}$ is defined through (2.6)) for different scale separations of the peaks, as quantified by $k_{s}/k_{L}$ . The comparison is shown in figure 9 where blue curves are the exact results, and yellow ones are the approximations.

The efficacy of the approximation can be quantified by the mean relative difference between blue and yellow curves in the plots figure 9, that is

(A 4)

$$\begin{eqnarray}\overline{\unicode[STIX]{x1D6E5}}=\frac{\displaystyle \int _{0}^{k_{\unicode[STIX]{x1D708}}}\,\text{d}k\,\frac{P_{e}-P_{a}}{P_{e}}}{\displaystyle \int _{0}^{k_{\unicode[STIX]{x1D708}}}\,\text{d}k},\end{eqnarray}$$

where we set $k_{\unicode[STIX]{x1D708}}=k_{s}+2\unicode[STIX]{x1D70E}_{s}$ . The quantity $\overline{\unicode[STIX]{x1D6E5}}$ as function of $k_{s}/k_{L}$ is shown in blue in figure 10(a). It remains relatively constant over the plot, even when scale separation is large. In that case, even though the approximation agrees with the exact result at small $k$ , the relative deviation from the approximation becomes large at large $k$ . But since we are interested in the net value of the convolution at small $k\leqslant k_{l}$ , a better indicator of the efficacy of the approximation is the mean relative difference at $k\leqslant k_{l}$ ; that is

(A 5)

$$\begin{eqnarray}\overline{\unicode[STIX]{x1D6E5}}_{k\leqslant k_{l}}=\frac{\displaystyle \int _{0}^{k_{l}}\,\text{d}k\,\frac{P_{e}-P_{a}}{P_{e}}}{\displaystyle \int _{0}^{k_{l}}\,\text{d}k}.\end{eqnarray}$$

This is shown in the yellow curve in figure 10(a). $P_{a}$ becomes a good approximation of $P_{e}$ when $k_{l}=5$ and $k_{s}/k_{L}\gtrsim 20$ (noting that $k_{L}=1$ ). In figure 10(b) we plot $\overline{\unicode[STIX]{x1D6E5}}_{k\leqslant k_{l}}$ but now also varying $k_{l}$ in addition to $k_{s}/k_{L}$ . The scale separation required to validate (2.17) increases with increasing $k_{l}$ .

Note that the dependencies of correction terms to the mean field equations of § 3 and the efficacy of the approximation (2.17) on scales are different: The former depends on the ratio $k_{L}/k_{l}$ , whereas the latter depends on $k_{L}/k_{s}$ . In the case of large scale separation, it is therefore possible that the error of the approximations are negligible but the MFE correction terms are still significant.

Appendix B. Derivation of the first two terms in (3.7)

The expansion rule (2.22) cannot be immediately applied to the first term on the right-hand side of (3.5) because $\boldsymbol{b}$ does not commute with the projection operator $\hat{\boldsymbol{P}}$ . Therefore let us write it as

(B 1)

$$\begin{eqnarray}\displaystyle \unicode[STIX]{x1D716}_{ijk}\overline{b_{k}\hat{P}_{lj}(\overline{B}_{n}\unicode[STIX]{x2202}_{n}b_{l}+b_{n}\unicode[STIX]{x2202}_{n}\overline{B}_{l})} & = & \displaystyle \unicode[STIX]{x1D716}_{ijk}\overline{b_{k}(\unicode[STIX]{x1D6FF}_{lj}-\unicode[STIX]{x2202}_{l}\unicode[STIX]{x2202}_{j}\unicode[STIX]{x1D6FB}^{-2})(\overline{B}_{n}\unicode[STIX]{x2202}_{n}b_{l}+b_{n}\unicode[STIX]{x2202}_{n}\overline{B}_{l})}\nonumber\\ \displaystyle & = & \displaystyle \unicode[STIX]{x1D716}_{ijk}\overline{b_{k}[(\overline{B}_{n}\unicode[STIX]{x2202}_{n}b_{j}+b_{n}\unicode[STIX]{x2202}_{n}\overline{B}_{j})-\unicode[STIX]{x2202}_{j}\unicode[STIX]{x1D6FB}^{-2}(\unicode[STIX]{x2202}_{l}\overline{B}_{n}\unicode[STIX]{x2202}_{n}b_{l}+\unicode[STIX]{x2202}_{l}b_{n}\unicode[STIX]{x2202}_{n}\overline{B}_{l})]}\nonumber\\ \displaystyle & = & \displaystyle \unicode[STIX]{x1D716}_{ijk}\overline{\overline{B}_{n}b_{k}\unicode[STIX]{x2202}_{n}b_{j}}+\unicode[STIX]{x1D716}_{ijk}\overline{b_{k}b_{n}\unicode[STIX]{x2202}_{n}\overline{B}_{j}}-2\unicode[STIX]{x1D716}_{ijk}\overline{b_{k}\unicode[STIX]{x2202}_{j}\unicode[STIX]{x1D6FB}^{-2}(\unicode[STIX]{x2202}_{l}\overline{B}_{n}\unicode[STIX]{x2202}_{n}b_{l})}.\qquad\end{eqnarray}$$

The first term can be readily calculated assuming isotropy for turbulent fields, yielding

(B 2)

$$\begin{eqnarray}(1-\hat{\unicode[STIX]{x1D6FE}})\left({\textstyle \frac{1}{3}}\overline{\boldsymbol{b}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\times \boldsymbol{b}}\,\overline{B}_{i}\right)+\overline{B}_{i}\hat{\unicode[STIX]{x1D6FE}}\left({\textstyle \frac{1}{3}}\overline{\boldsymbol{b}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\times \boldsymbol{b}}\right).\end{eqnarray}$$

Denote the third term in (B 1) by $-2X_{i}$ . Then

(B 3)

$$\begin{eqnarray}X_{i}=\unicode[STIX]{x1D716}_{ijk}\overline{b_{k}\unicode[STIX]{x2202}_{j}\unicode[STIX]{x1D6FB}^{-2}(\unicode[STIX]{x2202}_{l}\overline{B}_{n}\unicode[STIX]{x2202}_{n}b_{l})}=\unicode[STIX]{x1D716}_{ijk}\overline{b_{k}\unicode[STIX]{x1D6FB}^{-2}(\unicode[STIX]{x2202}_{jl}\overline{B}_{n}\unicode[STIX]{x2202}_{n}b_{l}+\unicode[STIX]{x2202}_{l}\overline{B}_{n}\unicode[STIX]{x2202}_{jn}b_{l})}.\end{eqnarray}$$

The first term in the parentheses is $k_{s}/k_{L}$ times smaller than the second, and is therefore dropped. In Fourier space, the inverse of the Laplacian operator acting on the second term yields

(B 4)

$$\begin{eqnarray}{\mathcal{F}}[\unicode[STIX]{x1D6FB}^{-2}(\unicode[STIX]{x2202}_{l}\overline{B}_{n}\unicode[STIX]{x2202}_{jn}b_{l})]=\frac{1}{k^{2}}\int \text{d}^{3}k^{\prime }\text{i}(k_{l}-k_{l}^{\prime })\overline{B}_{n}(\boldsymbol{k}-\boldsymbol{k}^{\prime })(k_{j}^{\prime }k_{n}^{\prime })b_{l}(\boldsymbol{k}^{\prime }).\end{eqnarray}$$

$\boldsymbol{k}-\boldsymbol{k}^{\prime }$ is close to zero because of the presence of $\overline{\boldsymbol{B}}(\boldsymbol{k}-\boldsymbol{k}^{\prime })$ . Therefore we expand $1/k^{2}$ as

(B 5)

$$\begin{eqnarray}\frac{1}{k^{2}}=\frac{1}{k^{\prime 2}}+O(|\boldsymbol{k}-\boldsymbol{k}^{\prime }|).\end{eqnarray}$$

Only the zeroth-order term is kept, because terms of higher order yield derivatives of $\overline{\boldsymbol{B}}$ , which makes $X_{i}$ contain second or higher-order derivatives of $\overline{\boldsymbol{B}}$ . Equivalently, this means the $\unicode[STIX]{x1D6FB}^{-2}$ operator will not act on the $\overline{\boldsymbol{B}}$ term to this order. We now have, up to terms linear in $\overline{\boldsymbol{B}}$ or $\unicode[STIX]{x1D735}\overline{\boldsymbol{B}}$ ,

(B 6)

$$\begin{eqnarray}X_{i}\simeq \unicode[STIX]{x1D716}_{ijk}\overline{\unicode[STIX]{x2202}_{l}\overline{B}_{n}b_{k}\unicode[STIX]{x2202}_{jn}\unicode[STIX]{x1D6FB}^{-2}b_{l}}\end{eqnarray}$$

using (2.22).

Now the sum of the last two terms in (B 1) can be written as

(B 7)

$$\begin{eqnarray}\unicode[STIX]{x1D716}_{ijk}\overline{b_{k}b_{n}\unicode[STIX]{x2202}_{n}\overline{B}_{j}}-2\unicode[STIX]{x1D716}_{ijk}\overline{\unicode[STIX]{x2202}_{l}\overline{B}_{n}b_{k}\unicode[STIX]{x2202}_{jn}\unicode[STIX]{x1D6FB}^{-2}b_{l}}=(1-\hat{\unicode[STIX]{x1D6FE}})(\unicode[STIX]{x2202}_{l}\overline{B}_{n}\unicode[STIX]{x1D709}_{jkln})+\unicode[STIX]{x2202}_{l}\overline{B}_{n}\hat{\unicode[STIX]{x1D6FE}}\unicode[STIX]{x1D709}_{jkln},\end{eqnarray}$$

where

(B 8)

$$\begin{eqnarray}\unicode[STIX]{x1D709}_{\text{iln}}=\unicode[STIX]{x1D716}_{ijk}\overline{b_{k}(\unicode[STIX]{x1D6FF}_{jn}-2\unicode[STIX]{x2202}_{jn}\unicode[STIX]{x1D6FB}^{-2})b_{l}}.\end{eqnarray}$$

To evaluate $\unicode[STIX]{x1D709}_{jkln}$ , note that its Fourier transform is proportional to

(B 9)

$$\begin{eqnarray}\unicode[STIX]{x1D716}_{ijk}\int \text{d}^{3}k^{\prime }P_{kl}(k^{\prime })\left(\unicode[STIX]{x1D6FF}_{jn}-2\frac{k_{j}^{\prime }k_{n}^{\prime }}{k^{\prime 2}}\right)\end{eqnarray}$$

since the helical part of $\overline{{\displaystyle\mathop{b}\limits_{{\sim}}}_{k}{\displaystyle\mathop{b}\limits_{{\sim}}}_{l}}$ ( $\propto \unicode[STIX]{x1D716}_{pkl}k_{p}$ ) does not contribute. Equation (B 9) then gives

(B 10)

$$\begin{eqnarray}\unicode[STIX]{x1D716}_{ijk}\int \text{d}^{3}k^{\prime }\left(\unicode[STIX]{x1D6FF}_{kl}\unicode[STIX]{x1D6FF}_{jn}-2\unicode[STIX]{x1D6FF}_{kl}\frac{k_{j}^{\prime }k_{n}^{\prime }}{k^{\prime 2}}-\unicode[STIX]{x1D6FF}_{jn}\frac{k_{k}^{\prime }k_{l}^{\prime }}{k^{\prime 2}}\right)=0\end{eqnarray}$$

using $\int \text{d}\unicode[STIX]{x1D6FA}^{\prime }k_{i}k_{j}/k^{2}=\unicode[STIX]{x1D6FF}_{ij}/3$ . Therefore the right-hand side of (B 7) is zero and altogether we have

(B 11)

$$\begin{eqnarray}\unicode[STIX]{x1D716}_{ijk}\overline{b_{k}\hat{P}_{lj}(\overline{B}_{n}\unicode[STIX]{x2202}_{n}b_{l}+b_{n}\unicode[STIX]{x2202}_{n}\overline{B}_{l})}=(1-\hat{\unicode[STIX]{x1D6FE}})\left({\textstyle \frac{1}{3}}\overline{\boldsymbol{b}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\times \boldsymbol{b}}\overline{B}_{i}\right)+\overline{B}_{i}\hat{\unicode[STIX]{x1D6FE}}\left({\textstyle \frac{1}{3}}\overline{\boldsymbol{b}\boldsymbol{\cdot }\unicode[STIX]{x1D735}\times \boldsymbol{b}}\right).\end{eqnarray}$$

Footnotes

1 For hydrodynamic ensembles, see Kraichnan (Reference Kraichnan1973); Frisch et al. (Reference Frisch, Pouquet, Leorat and Mazure1975) has studied MHD ensembles at absolute statistical equilibrium; more applications of ensembles in MHD systems can be found in Shebalin (Reference Shebalin2013) and the references therein.

2 The choice of amplitude in the filter function that separates mean from fluctuations and defining the relation between $l$ and $l_{\text{eff}}$ is not unique. For a Gaussian average, taking ${\displaystyle\mathop{G}\limits_{{\sim}}}(\boldsymbol{k})=\text{e}^{-k^{2}l^{2}/8\unicode[STIX]{x03C0}^{2}}=1/2$ as the dividing line implies that $l_{\text{eff}}=l/\sqrt{2\ln 2}$ separates large and small scale fields in configuration space (as in Gent et al. Reference Gent, Shukurov, Sarson, Fletcher and Mantere2013). If instead we use ${\displaystyle\mathop{G}\limits_{{\sim}}}(\boldsymbol{k})=1/e$ , then $l_{\text{eff}}=l/\sqrt{2}$ . We adopt $l_{\text{eff}}=l$ , but note that different criteria for the dividing line can lead to a constant multiplicative factor on $l$ . If our averaging scale were based on a real space choice such as telescope beam width for $l_{\text{eff}}$ , leading us to set the exponent in (2.23) to say $1/2$ , then $l_{\text{eff}}=l\sqrt{\ln 2/2}/\unicode[STIX]{x03C0}=1/2$ . Beam width may not however, determine the most appropriate theoretical choice of $l_{\text{eff}}$ for a given magnetic energy spectrum.

3 This shall not be confused with, say, the characteristic length of large scale quantities $l_{L}$ , whose subscript is in roman type.

4 Use of this model is intended to exemplify the method. Other models (e.g. Chamandy Reference Chamandy2016) can also be used.

5 Here we focus on a time-independent field (as a valid and simple solution to the dynamo model) to illustrate the idea of quantifying precisions of a mean field theory. In principle, similar calculations can be done at each instant time for a non-steady state (e.g. oscillatory) solution.

References

Aluie, H. 2017 Coarse-grained incompressible magnetohydrodynamics: analyzing the turbulent cascades. New J. Phys. 19 (2), 025008.Google Scholar

Aluie, H. & Eyink, G. L. 2010 Scale locality of magnetohydrodynamic turbulence. Phys. Rev. Lett. 104 (8), 081101.Google Scholar

Beck, R. 2016 Magnetic fields in spiral galaxies. Astron. Astrophys. Rev. 24, 4.Google Scholar

Bhat, P., Ebrahimi, F. & Blackman, E. G. 2016 Large-scale dynamo action precedes turbulence in shearing box simulations of the magnetorotational instability. Mon. Not. R. Astron. Soc. 462, 818–829.CrossRef Google Scholar

Blackman, E. G. 2000 Mean magnetic field generation in sheared rotators. Astrophys. J. 529, 138–145.Google Scholar

Blackman, E. G. 2015 Magnetic helicity and large scale magnetic fields: a primer. Space Sci. Rev. 188, 59–91.Google Scholar

Blackman, E. G. & Field, G. B. 2002 New dynamical mean-field dynamo theory and closure approach. Phys. Rev. Lett. 89 (26), 265007.CrossRef Google Scholar PubMed

Blackman, E. G. & Nauman, F. 2015 Motivation and challenge to capture both large-scale and local transport in next generation accretion theory. J. Plasma Phys. 81 (5), 395810505.Google Scholar

Blackman, E. G., Nauman, F. & Edgar, R. G.2010 Quantifying the imprecision of accretion theory and implications for multi-epoch observations of protoplanetary discs. ArXiv e-prints.Google Scholar

Brandenburg, A. 2009 The critical role of magnetic helicity in astrophysical large-scale dynamos. Plasma Phys. Control. Fusion 51 (12), 124043.Google Scholar

Brandenburg, A., Rädler, K.-H. & Schrinner, M. 2008 Scale dependence of alpha effect and turbulent diffusivity. Astron. Astrophys. 482, 739–746.Google Scholar

Brandenburg, A. & Subramanian, K. 2005a Astrophysical magnetic fields and nonlinear dynamo theory. Phys. Rep. 417, 1–209.CrossRef Google Scholar

Brandenburg, A. & Subramanian, K. 2005b Minimal tau approximation and simulations of the alpha effect. Astron. Astrophys. 439, 835–843.Google Scholar

Burn, B. J. 1966 On the depolarization of discrete radio sources by Faraday dispersion. Mon. Not. R. Astron. Soc. 133, 67.Google Scholar

Chamandy, L. 2016 An analytical dynamo solution for large-scale magnetic fields of galaxies. Mon. Not. R. Astron. Soc. 462, 4402–4415.Google Scholar

Chamandy, L., Shukurov, A., Subramanian, K. & Stoker, K. 2014 Non-linear galactic dynamos: a toolbox. Mon. Not. R. Astron. Soc. 443, 1867–1880.CrossRef Google Scholar

Chamandy, L., Shukurov, A. & Taylor, A. R. 2016 Statistical tests of galactic dynamo theory. Astrophys. J. 833, 43.Google Scholar

Chamandy, L., Subramanian, K. & Shukurov, A. 2013 Galactic spiral patterns and dynamo action – I. A new twist on magnetic arms. Mon. Not. R. Astron. Soc. 428, 3569–3589.CrossRef Google Scholar

Dakhoul, V. M. & Bedford, K. W. 1986a Improved averaging method for turbulent flow simulation. I – Theoretical development and application to Burgers’ transport equation. II – Calculations and verification. Intl J. Numer. Meth. Fluids 6, 49–82.CrossRef Google Scholar

Dakhoul, Y. M. & Bedford, K. W. 1986b Improved averaging method for turbulent flow simulation. Part II: calculations and verification. Intl J. Numer. Meth. Fluids 6, 65–82.Google Scholar

Eilek, J. A. 1989a Turbulence in extended synchrotron radio sources. I – Polarization of turbulent sources. II – Power-spectral analysis. Astrophys. J. 98, 244–266.Google Scholar

Eilek, J. A. 1989b Turbulence in extended synchrotron radio sources. II. Power-spectral analysis. Astrophys. J. 98, 256.Google Scholar

Eyink, G. L. & Aluie, H. 2009 Localness of energy cascade in hydrodynamic turbulence. I. Smooth coarse graining. Phys. Fluids 21 (11), 115107.CrossRef Google Scholar

Fletcher, A. 2010 Magnetic fields in nearby galaxies. In The Dynamic Interstellar Medium: A Celebration of the Canadian Galactic Plane Survey, Astronomical Society of the Pacific Conference Series, vol. 438, p. 197. Astronomical Society of the Pacific.Google Scholar

Frick, P., Beck, R., Berkhuijsen, E. M. & Patrickeyev, I. 2001 Scaling and correlation analysis of galactic images. Mon. Not. R. Astron. Soc. 327, 1145–1157.CrossRef Google Scholar

Frisch, U., Pouquet, A., Leorat, J. & Mazure, A. 1975 Possibility of an inverse cascade of magnetic helicity in magnetohydrodynamic turbulence. J. Fluid Mech. 68, 769–778.Google Scholar

Gent, F. A., Shukurov, A., Sarson, G. R., Fletcher, A. & Mantere, M. J. 2013 The supernova-regulated ISM – II. The mean magnetic field. Mon. Not. R. Astron. Soc. 430, L40–L44.CrossRef Google Scholar

Germano, M. 1992 Turbulence – the filtering approach. J. Fluid Mech. 238, 325–336.CrossRef Google Scholar

Germano, M., Piomelli, U., Moin, P. & Cabot, W. H. 1991 A dynamic subgrid-scale eddy viscosity model. Phys. Fluids A 3, 1760–1765.CrossRef Google Scholar

Hubbard, A. & Brandenburg, A. 2009 Memory effects in turbulent transport. Astrophys. J. 706, 712–726.Google Scholar

Hubbard, A. & Brandenburg, A. 2011 Magnetic helicity flux in the presence of shear. Astrophys. J. 727, 11.Google Scholar

Kleeorin, N. & Rogachevskii, I. 2008 Mean-field dynamo in a turbulence with shear and kinetic helicity fluctuations. Phys. Rev. E 77 (3), 036307.Google Scholar

Kleeorin, N., Rogachevskii, I., Sokoloff, D. & Tomin, D. 2009 Mean-field dynamos in random Arnold–Beltrami–Childress and Roberts flows. Phys. Rev. E 79 (4), 046302.Google Scholar

Kraichnan, R. H. 1973 Helical turbulence and absolute equilibrium. J. Fluid Mech. 59, 745–752.Google Scholar

Krause, F. & Rädler, K.-H. 1980 Mean-field Magnetohydrodynamics and Dynamo Theory. Elsevier.Google Scholar

Leonard, A. 1974 Energy cascade in large-eddy simulations of turbulent fluid flows. Adv. Geophys. 18, 237.CrossRef Google Scholar

Lilly, D. K. 1992 A proposed modification of the Germano subgrid-scale closure method. Phys. Fluids A 4, 633–635.CrossRef Google Scholar

Meneveau, C. & Katz, J. 2000 Scale-invariance and turbulence models for large-eddy simulation. Annu. Rev. Fluid Mech. 32 (1), 1–32.CrossRef Google Scholar

Moss, D. 1995 On the generation of bisymmetric magnetic field structures in spiral galaxies by tidal interactions. Mon. Not. R. Astron. Soc. 275, 191–194.Google Scholar

Moss, D., Sokoloff, D., Usoskin, I. & Tutubalin, V. 2008 Solar grand minima and random fluctuations in dynamo parameters. Solar Phys. 250, 221–234.CrossRef Google Scholar

Phillips, A. 2001 A comparison of the asymptotic and no-

$z$ approximations for galactic dynamos. Geophys. Astrophys. Fluid Dyn. 94, 135–150.Google Scholar

Pouquet, A., Frisch, U. & Leorat, J. 1976 Strong MHD helical turbulence and the nonlinear dynamo effect. J. Fluid Mech. 77, 321–354.Google Scholar

Rädler, K.-H. 2000 The generation of cosmic magnetic fields. In From the Sun to the Great Attractor (ed. Page, D. & Hirsch, J. G.), Lecture Notes in Physics, vol. 556, p. 101. Springer.CrossRef Google Scholar

Rädler, K.-H. & Rheinhardt, M. 2007 Mean-field electrodynamics: critical analysis of various analytical approaches to the mean electromotive force. Geophys. Astrophys. Fluid Dyn. 101, 117–154.Google Scholar

Rheinhardt, M. & Brandenburg, A. 2012 Modeling spatio-temporal nonlocality in mean-field dynamos. Astron. Nachr. 333, 71–77.CrossRef Google Scholar

Roberts, P. H. & Soward, A. M. 1975 A unified approach to mean field electrodynamics. Astron. Nachr. 296, 49–64.Google Scholar

Ruzmaikin, A. A., Sokoloff, D. D. & Shukurov, A. M.(Eds) 1988 Magnetic Fields of Galaxies, Astrophysics and Space Science Library, vol. 133. Kluwer.Google Scholar

Shapovalov, D. S. & Vishniac, E. T. 2011 Simulations of turbulent dynamos driven by the magnetic helicity flux. Astrophys. J. 738, 66.Google Scholar

Shebalin, J. V. 1989 Broken ergodicity and coherent structure in homogeneous turbulence. Physica D 37, 173–191.Google Scholar

Shebalin, J. V. 2010 Broken ergodicity in two-dimensional homogeneous magnetohydrodynamic turbulence. Phys. Plasmas 17 (9), 092303.Google Scholar

Shebalin, J. V. 2013 Broken ergodicity in magnetohydrodynamic turbulence. Geophys. Astrophys. Fluid Dyn. 107, 411–466.Google Scholar

Smagorinsky, J. 1963 General circulation experiments with the primitive equations. Mon. Weath. Rev. 91, 99.2.3.CO;2>CrossRef Google Scholar

Sokoloff, D. D., Bykov, A. A., Shukurov, A., Berkhuijsen, E. M., Beck, R. & Poezd, A. D. 1998 Depolarization and Faraday effects in galaxies. Mon. Not. R. Astron. Soc. 299, 189–206.Google Scholar

Spangler, S. R. 1982 The transport of polarized synchrotron radiation in a turbulent medium. Astrophys. J. 261, 310–320.Google Scholar

Subramanian, K. & Brandenburg, A. 2006 Magnetic helicity density and its flux in weakly inhomogeneous turbulence. Astrophys. J. Lett. 648, L71–L74.CrossRef Google Scholar

Subramanian, K. & Mestel, L. 1993 Galactic dynamos and density wave theory – Part Two – an alternative treatment for strong non-axisymmetry. Mon. Not. R. Astron. Soc. 265, 649.CrossRef Google Scholar

Sur, S., Shukurov, A. & Subramanian, K. 2007 Galactic dynamos supported by magnetic helicity fluxes. Mon. Not. R. Astron. Soc. 377, 874–882.Google Scholar

Tribble, P. C. 1991 Radio emission in a random magnetic field – Radio haloes and the structure of the magnetic field in the Coma cluster. Mon. Not. R. Astron. Soc. 253, 147–152.Google Scholar

Van Eck, C. L., Brown, J. C., Shukurov, A. & Fletcher, A. 2015 Magnetic fields in a sample of nearby spiral galaxies. Astrophys. J. 799, 35.Google Scholar

Yeo, W. K.1987 A generalized high pass/low pass averaging procedure for deriving and solving turbulent flow equations. PhD thesis, The Ohio State University.Google Scholar

Yoshizawa, A. & Yokoi, N. 1993 Turbulent magnetohydrodynamic dynamo for accretion disks using the cross-helicity effect. Astrophys. J. 407, 540–548.CrossRef Google Scholar

Zhou, H. & Blackman, E. G. 2017 Some consequences of shear on galactic dynamos with helicity fluxes. Mon. Not. R. Astron. Soc. 469, 1466–1475.Google Scholar

Figure 2. (a) Theoretical predictions of the line-of-sight-averaged magnetic field $\overline{B}_{L}$ with the filtering error $\unicode[STIX]{x1D70E}_{\text{df}}$ shown as error bars in an edge-on view of a disc galaxy, assuming that the mean field has the form $\overline{\boldsymbol{B}}=B_{0}\hat{\unicode[STIX]{x1D753}}$ with $B_{0}=1$. Lengths are normalized by the galactic radius $R=12~\text{kpc}$. Two sets of error bars are shown for different choices of $l$. (b) Fractional error bar values at different radii as a function of the averaging scale $l$.

Figure 3. Similar to figure 2(a) but using analytic dynamo solutions for $\overline{\boldsymbol{B}}$ from § 4.5 of Zhou & Blackman (2017) by solving equations (6.3) to (6.5). (a) Intrinsic error and (b) filtering error.

Figure 4. The total relative error using the analytic dynamo solutions at different radii as a function of averaging scale $l$. An optimal scale arises at 0.15–0.20 kpc which minimizes the relative error, and therefore provides the best precision of theoretical predictions.

Figure 5. Similar to figure 3 but for a face-on view of a disc galaxy, using the analytic dynamo solution.

Figure 6. Line-of-sight predictions and error bars of pulsar rotation measures for our view from within our Galaxy based on the analytically solvable dynamo model, equations (6.3) to (6.5), taken from § 4.5 in Zhou & Blackman (2017). (a,b) Show error bars corresponding to the intrinsic error and filtering error, respectively.

Figure 7. The total relative error for pulsar RMs at different azimuthal angle (centred at the Earth) as a function of averaging scale $l$. Filtering error dominates as a result of short length of the line of sight.

Figure 8. $k_{0}^{\prime }(k)$ for a Gaussian kernel $\text{e}^{-k^{2}/2k_{l}^{2}}$ with $k_{l}=1$.

Figure 9. Comparisons of exact and approximated results of $\overline{f\overline{F}}$ for different scale separations $k_{s}/k_{L}$.

Figure 10. (a) Mean relative errors defined in (A 4) and (A 5) from comparing the exact and approximated results of $\overline{f\overline{F}}$ as a function of $k_{s}/k_{L}$. (b) Mean relative error for $k as a function of $k_{s}/k_{L}$ and $k_{l}$.

Article contents

Derivation and precision of mean field electrodynamics with mesoscale fluctuations

Abstract

Keywords

1 Introduction

2 Averaging in MFE using kernels

2.1 General formalism

2.2 Expressions for averages of fluctuations and double averages

2.3 Comparison to previous work

2.4 Unifying different averaging methods using kernels

2.4.1 Gaussian average

2.4.2 Moving box average

2.4.3 Moving line segment average

2.4.4 Fixed grid averages

2.4.5 Planar average

2.4.6 Time average

2.5 On averages in simulations versus observations

3 MFE dynamo equations with correction terms

3.1 Derivation

3.2 Comparison to previous work on non-local EMF kernels

4 Precision of mean field theories

4.1 Intrinsic error from statistical fluctuations in inputs to mean field equations

4.2 Filtering error from mismatch between measurement and theoretical kernels

5 MFE precision error in the context of FR

6 Galactic dynamo and precision for different FR viewing angles

6.1 Galactic dynamo model

6.2 Edge-on view

6.3 Face-on view

6.4 Views at intermediate inclinations

6.5 View from within our Galaxy

7 Conclusions

7.1 Summary

7.2 Further work

Acknowledgements

Appendix A. On the validity of (2.17)

Appendix B. Derivation of the first two terms in (3.7)

Footnotes

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests