Quasilinear theory for inhomogeneous plasma

I.Y. Dodin

doi:10.1017/S0022377822000502

Quasilinear theory for inhomogeneous plasma

Part of: Featured Articles

Published online by Cambridge University Press: 09 August 2022

I.Y. Dodin

Show author details

I.Y. Dodin*: Affiliation:
Princeton Plasma Physics Laboratory, Princeton, NJ 08543, USA Department of Astrophysical Sciences, Princeton University, Princeton, NJ 08544, USA
*: †Email address for correspondence: [email protected]

Article contents

Abstract
Introduction
A math primer
Model
Preliminaries
Interaction with prescribed fields
Interaction with self-consistent fields
Interaction with on-shell waves
Thermal equilibrium
Examples
Summary
Funding
Declaration of interests
Footnotes
References

Rights & Permissions

Abstract

This paper presents quasilinear theory (QLT) for a classical plasma interacting with inhomogeneous turbulence. The particle Hamiltonian is kept general; for example, relativistic, electromagnetic and gravitational effects are subsumed. A Fokker–Planck equation for the dressed ‘oscillation-centre’ distribution is derived from the Klimontovich equation and captures quasilinear diffusion, interaction with the background fields and ponderomotive effects simultaneously. The local diffusion coefficient is manifestly positive-semidefinite. Waves are allowed to be off-shell (i.e. not constrained by a dispersion relation), and a collision integral of the Balescu–Lenard type emerges in a form that is not restricted to any particular Hamiltonian. This operator conserves particles, momentum and energy, and it also satisfies the $\smash {H}$-theorem, as usual. As a spin-off, a general expression for the spectrum of microscopic fluctuations is derived. For on-shell waves, which satisfy a quasilinear wave-kinetic equation, the theory conserves the momentum and energy of the wave–plasma system. The action of non-resonant waves is also conserved, unlike in the standard version of QLT. Dewar's oscillation-centre QLT of electrostatic turbulence (Phys. Fluids, vol. 16, 1973, p. 1102) is proven formally as a particular case and given a concise formulation. Also discussed as examples are relativistic electromagnetic and gravitational interactions, and QLT for gravitational waves is proposed.

Keywords

plasma nonlinear phenomena plasma waves

Type: Research Article
Information: Journal of Plasma Physics , Volume 88 , Issue 4 , August 2022 , 905880407

DOI: https://doi.org/10.1017/S0022377822000502 [Opens in a new window]

NASA ADS Abstract Service [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright: Copyright © The Author(s), 2022. Published by Cambridge University Press

1. Introduction

1.1. Background

Electromagnetic waves are present in plasmas naturally, and they are also launched into plasmas using external antennas, for example, for plasma heating and current drive (Fisch Reference Fisch1987; Stix Reference Stix1992; Pinsker Reference Pinsker2001). Nonlinear effects produced by these waves are often modelled within the quasilinear (QL) approximation, meaning that the nonlinearities are retained in the low-frequency (‘average’) dynamics but neglected in the high-frequency dynamics. Two separate paradigms exist within this approach.

In the first paradigm, commonly known as ‘the’ QL theory (QLT), the focus is on resonant interactions. Non-resonant particles are considered as a background that is homogeneous in spatial (Vedenov, Velikhov & Sagdeev Reference Vedenov, Velikhov and Sagdeev1961; Drummond & Pines Reference Drummond and Pines1962; Kennel & Engelmann Reference Kennel and Engelmann1966; Rogister & Oberman Reference Rogister and Oberman1968, Reference Rogister and Oberman1969) or generalized coordinates (Kaufman Reference Kaufman1972; Eriksson & Helander Reference Eriksson and Helander1994; Catto, Lee & Ram Reference Catto, Lee and Ram2017); then, the oscillating fields can be described in terms of global modes. This approach has the advantage of simplicity, but its applications are limited in that real plasmas are never actually homogeneous in any predefined variables (and, furthermore, tend to exhibit nonlinear instabilities in the presence of intense waves). The ‘ponderomotive’ dynamics determined by the gradients of the wave and plasma parameters is lost in this approach; then, spurious effects can emerge and have to be dealt with (Lee et al. Reference Lee, Smithe, Wright and Bonoli2018).

The second paradigm successfully captures the ponderomotive dynamics by introducing effective Hamiltonians for the particle average motion (Gaponov & Miller Reference Gaponov and Miller1958; Motz & Watson Reference Motz and Watson1967; Cary & Kaufman Reference Cary and Kaufman1981; Kaufman Reference Kaufman1987; Dodin Reference Dodin2014). But, as usual in perturbation theory (Lichtenberg & Lieberman Reference Lichtenberg and Lieberman1992), those Hamiltonians are by default singular for resonant interactions. Thus, such models have limited reach as well, and remarkable subtleties are still found even in basic QL problems. For example, it is still debated (Ochs & Fisch Reference Ochs and Fisch2021a; Ochs Reference Ochs2021) to which extent the QL effects that remove resonant particles while capturing their energy (Fisch & Rax Reference Fisch and Rax1992) also remove charge along with the resonant particles, thereby driving plasma rotation (Fetterman & Fisch Reference Fetterman and Fisch2008). This state of affairs means, arguably, that a clear comprehensive theory of QL wave–plasma interactions remains to be developed – a challenge that must be faced.

The first framework that subsumed both resonant and non-resonant interactions in inhomogeneous plasmas was proposed by Dewar (Reference Dewar1973) for electrostatic turbulence in non-magnetized plasma and is known as ‘oscillation-centre’ (OC) QLT. It was later extended by McDonald, Grebogi & Kaufman (Reference McDonald, Grebogi and Kaufman1985) to non-relativistic magnetized plasma. However, both of these models are partly heuristic and limited in several respects. For example, they are bounded by the limitations of the variational approach used therein, and they separate resonant particles from non-resonant particles somewhat arbitrarily (see also Ye & Kaufman Reference Ye and Kaufman1992). Both models also assume specific particle Hamiltonians and require that waves be governed by a QL wave-kinetic equation (WKE), i.e. be only weakly dissipative, or ‘on-shell’. (Somewhat similar formulations were also proposed, independently and without references to the OC formalism, in Weibel Reference Weibel1981; Yasseen Reference Yasseen1983; Yasseen & Vaclavik Reference Yasseen and Vaclavik1986.) This means that collisions and microscopic fluctuations are automatically excluded. Attempts to merge QLT and the WKE with the theory of plasma collisions were made (Rogister & Oberman Reference Rogister and Oberman1968; Schlickeiser & Yoon Reference Schlickeiser and Yoon2014; Yoon et al. Reference Yoon, Ziebell, Kontar and Schlickeiser2016) but have not yielded a local theory applicable to inhomogeneous plasma. In particular, the existing models rely on global-mode decompositions and treat complex frequencies heuristically. Thus, the challenge stands.

Related problems are also of interest in the context of gravitostatic interactions (Chavanis Reference Chavanis2012; Hamilton Reference Hamilton2020; Magorrian Reference Magorrian2021), where inhomogeneity of the background fields cannot be neglected in principle (Binney & Tremaine Reference Binney and Tremaine2008). (To our knowledge, OC QLT analogues have not been considered in this field.) Similar challenges also arise in QLT of dispersive gravitational waves (Garg & Dodin Reference Garg and Dodin2020, Reference Garg and Dodin2021a). Hence, one cannot help but wonder whether a specific form of the particle Hamiltonian really matters for developing QLT or it is irrelevant and therefore should not be assumed. Since basic theory of linear waves is independent of Maxwell's equations (Dodin & Fisch Reference Dodin and Fisch2012; Tracy et al. Reference Tracy, Brizard, Richardson and Kaufman2014; Dodin, Zhmoginov & Ruiz Reference Dodin, Zhmoginov and Ruiz2017), a general QLT might be possible too, and it might be easier to develop than a zoo of problem-specific models.

1.2. Outline

Here, we propose a general QLT that allows for plasma inhomogeneity and is not restricted to any particular Hamiltonian or interaction field. By starting with the Klimontovich equation, we derive a model that captures QL diffusion, interaction with background fields and ponderomotive effects simultaneously. The local diffusion coefficient in this model is manifestly positive-semidefinite. Waves are allowed to be off-shell, and a collision integral of the Balescu–Lenard type emerges for general Hamiltonian interactions. This operator conserves particles, momentum and energy, and it also satisfies the $\smash {H}$-theorem, as usual. As a spin-off, a general expression for the spectrum of microscopic fluctuations of the interaction field is derived. For on-shell waves governed by the WKE, the theory conserves the momentum and energy of the wave–plasma system. The action of non-resonant waves is also conserved, unlike in the standard version of QLT.Footnote ¹ Dewar's OC QLT of electrostatic turbulence (Dewar Reference Dewar1973) is proven formally as a particular case and given a concise formulation. Also discussed as examples are relativistic electromagnetic and gravitational interactions, and QLT for gravitational waves is proposed. Overall, our formulation interconnects many known results that in the past were derived independently and reproduces them within a unifying framework.

This progress is made by giving up the traditional Fourier–Laplace approach. The author takes the stance that the global-mode language is unnatural for inhomogeneous-plasma problems (i.e. all real-plasma problems). A fundamental theory must be local. Likewise, the variational approach that is used sometimes in QL calculations is not universally advantageous, especially for describing dissipation. Instead of those methods, we use operator analysis and the Weyl symbol calculus, as has also been proven fruitful in other recent studies of ponderomotive effects and turbulence (Ruiz Reference Ruiz2017; Ruiz & Dodin Reference Ruiz and Dodin2017b; Zhu & Dodin Reference Zhu and Dodin2021) and linear-wave theory (Dodin et al. Reference Dodin, Ruiz, Yanagihara, Zhou and Kubo2019). No logical leaps are made in this paper other than assuming the QL approximation per se and a certain ordering.Footnote ² In a nutshell, we treat the commonly known QL-diffusion coefficient as a non-local operator, and we systematically approximate it using the Weyl symbol calculus. It is the non-locality of this operator that gives rise to ponderomotive effects and ensures the proper conservation laws. The existing concept of ‘adiabatic diffusion’ (Galeev & Sagdeev Reference Galeev and Sagdeev1985; Stix Reference Stix1992) captures some of that, but systematic application of operator analysis yields a more general, more accurate and more rigorous theory.

The author hopes not that this paper is an entertaining read. However, the paper was intended as self-contained, maximally structured and easily searchable, so readers interested in specific questions could find and understand answers without having to read the whole paper. The text is organized as follows. In § 2, we present a primer on the Weyl symbol calculus and the associated notation. In § 3, we formulate our general model. In § 4, we introduce the necessary auxiliary theorems. In § 5, we derive a QL model for plasma interacting with prescribed waves. The waves may or may not be on-shell or self-consistent. (Their origin and dynamics are not addressed in § 5.) In § 6, we consider interactions with self-consistent waves. In particular, we separate out microscopic fluctuations, calculate their average distribution and derive the corresponding collision operator. In § 7, we assume that the remaining macroscopic waves are on-shell, rederive the WKE, and show that our QL model combined with the WKE is conservative. In § 8, we discuss the general properties that our model predicts for plasmas in thermal equilibrium. In § 9, we show how to apply our theory to non-relativistic electrostatic interactions, relativistic electromagnetic interactions, Newtonian gravity and relativistic gravity. In § 10, we summarize our results. Auxiliary calculations are presented in Appendices A –D, and Appendix G summarizes our notation. This notation is extensive and may not be particularly intuitive. Thus, readers are encouraged to occasionally scout § 9 for examples even before fully absorbing the preceding sections.

An impatient reader can also skip calculations entirely and consult only the summaries of the individual sections (2.3, 3.4, 4.4, 5.6, 6.9, 7.6 and 8.5; they are mostly self-contained) and then proceed to the examples in § 9. However, the main point of this work is not just the final results per se (surely, some readers will find them obvious) but also that they are derived with minimal assumptions and rigorously, which makes them reliable. A reader may also notice that we rederive some known results along the way, for example, basic linear-wave theory and the WKE. This is done for completeness and, more importantly, with the goal to present all pieces of the story within a unified notation.

2. A math primer

Here, we summarize the machinery to be used in the next sections. This machinery is not new, but a brief overview is in order at least to introduce the necessary notation. A more comprehensive summary, with proofs, can be found in Dodin et al. (Reference Dodin, Ruiz, Yanagihara, Zhou and Kubo2019, supplementary material). For extended discussions, see Tracy et al. (Reference Tracy, Brizard, Richardson and Kaufman2014), Ruiz (Reference Ruiz2017), McDonald (Reference McDonald1988) and Littlejohn (Reference Littlejohn1986).

2.1. Weyl symbol calculus on spacetime

2.1.1. Basic notation

We denote the time variable as $t$, space coordinates as ${\boldsymbol {x}} \equiv (x^1, x^2, \ldots, x^n)$, spacetime coordinates as ${\boldsymbol {\mathsf {x}}} \equiv ({\mathsf {x}}^0, {\mathsf {x}}^1, \ldots, {\mathsf {x}}^n)$, where ${\mathsf {x}}^0 \doteq t$ and ${\mathsf {x}}^i \doteq x^i$. The symbol $\doteq$ denotes definitions, and Latin indices from the middle of the alphabet ($i, j, \ldots$) range from 1 to $n$ unless specified otherwise. We assume the spacetime-coordinate domain to be $\smash {\mathbb {R}^{{\mathsf {n}}}}$.Footnote ³ Functions on ${\boldsymbol {\mathsf {x}}}$ form a Hilbert space $\smash {\mathscr {H}_{{{\mathsf {x}}}}}$ with an inner product that we define as

(2.1)

\begin{equation} {\left\langle {\xi |\psi}\right\rangle} \doteq \int \mathrm{d}{\boldsymbol {\mathsf{x}}}\, \xi^*({\boldsymbol {\mathsf{x}}}) \psi({\boldsymbol {\mathsf{x}}}). \end{equation}

The symbol $^*$ denotes complex conjugate,

(2.2)

\begin{equation} \mathrm{d}{\boldsymbol {\mathsf{x}}} \doteq \mathrm{d}{\mathsf{x}}^0\, \mathrm{d}{\mathsf{x}}^1 \ldots \mathrm{d} {\mathsf{x}}^n = \mathrm{d} t\, \mathrm{d} x^1 \ldots \mathrm{d} x^n \end{equation}

(a similar convention is assumed also for other multi-dimensional variables used below), and integrals in this paper are taken over $(-\infty, \infty )$ unless specified otherwise. Operators on $\mathscr {H}_{{{\mathsf {x}}}}$ will be denoted with carets, and we will use indexes $_\text {H}$ and $_\text {A}$ to denote their Hermitian and anti-Hermitian parts. For a given operator $\smash {\widehat {{\mathsf {A}}}}$, one has $\smash {\widehat {{\mathsf {A}}}=\widehat {{\mathsf {A}}}_\text {H} + \mathrm {i} \widehat {{\mathsf {A}}}_\text {A}}$,

(2.3)

\begin{equation} \widehat{{\mathsf{A}}}_\text{H} = \widehat{{\mathsf{A}}}_\text{H}^{{\dagger}} \doteq \frac{1}{2}\,(\widehat{{\mathsf{A}}} + \widehat{{\mathsf{A}}}^{{\dagger}}), \qquad \widehat{{\mathsf{A}}}_\text{A} = \widehat{{\mathsf{A}}}_\text{A}^{{\dagger}} \doteq \frac{1}{2\mathrm{i}}\,(\widehat{{\mathsf{A}}} - \widehat{{\mathsf{A}}}^{{\dagger}}), \end{equation}

where $^{{\dagger}}$ denotes the Hermitian adjoint with respect to the inner product (2.1). The case of a more general inner product is detailed in Dodin et al. (Reference Dodin, Ruiz, Yanagihara, Zhou and Kubo2019, supplementary material).

2.1.2. Vector fields

For multi-component fields $\smash {{\boldsymbol {\psi }} \equiv (\psi ^1, \psi ^2, \ldots, \psi ^M)}$${}^{\intercal }$ (our $\smash {^\intercal }$ denotes the matrix trans pose), or ‘row vectors’ (actually, tuples), we define the dual ‘column vectors’ $\smash {{\boldsymbol {\psi }}^{{\dagger}} \equiv (\psi _1^*, \psi _2^*, \ldots, \psi _M^*)^\intercal }$ via $\smash {{\boldsymbol {\psi }}^{{\dagger}} \doteq \boldsymbol{\mathsf {g}}{\boldsymbol {\psi }}^*}$. The matrix $\boldsymbol{\mathsf {g}}$ is assumed to be real, diagonal, invertible and constant; other than that, it can be chosen as suits a specific problem. (For example, a unit matrix may suffice.) This induces the standard rule of index manipulation

(2.4)

\begin{equation} \psi_i = {\mathsf{g}}_{ij}\psi^j, \qquad \psi^i = {\mathsf{g}}^{ij}\psi_j, \qquad i, j = 1, 2, \ldots, M, \end{equation}

where $\smash {{\mathsf {g}}_{ij}}$ are elements of $\boldsymbol{\mathsf {g}}$ and $\smash {{\mathsf {g}}^{ij}}$ are elements of $\boldsymbol{\mathsf {g}}^{-1}$. Summation over repeating indices is assumed. The rules of matrix multiplication apply to row and column vectors as usual. Then, for $\smash {{\boldsymbol {\psi }} \equiv (\psi ^1, \psi ^2, \ldots, \psi ^M)}$${}^{\intercal }$ and $\smash {{\boldsymbol {\xi }} \equiv (\xi ^1, \xi ^2, \ldots, \xi ^M)^{\intercal }}$, the quantity $\smash {{\boldsymbol {\psi }}{\boldsymbol {\xi }}}$ is a matrix with elements $\smash {\psi ^i\xi ^j}$, $\smash {{\boldsymbol {\psi }}{\boldsymbol {\xi }}^{{\dagger}} }$ is a matrix with elements $\smash {\psi ^i\xi _j^*}$ and $\smash {{\boldsymbol {\xi }}^{{\dagger}} {\boldsymbol {\psi }}}$ is its scalar trace:Footnote ⁴

(2.5)

\begin{equation} {\boldsymbol{\xi}}^{{\dagger}}{\boldsymbol{\psi}} = \operatorname{tr}({\boldsymbol{\psi}}{\boldsymbol{\xi}}^{{\dagger}}) = \xi_i^* \psi^i = {\mathsf{g}}_{ij} \xi^{i*} \psi^j, \qquad i, j = 1, 2, \ldots, M. \end{equation}

(Similarly, for $\smash {{\boldsymbol {\chi }} \equiv (\chi _1, \chi _2, \ldots, \chi _M)}$ and $\smash {{\boldsymbol {\eta }} \equiv (\eta _1, \eta _2, \ldots, \eta _M)}$, $\smash {{\boldsymbol {\eta }}{\boldsymbol {\chi }}}$ is a matrix with elements $\smash {\eta _i \chi _j}$.) We use (2.5) to define a Hilbert space $\smash {\mathscr {H}_{{{\mathsf {x}}}}^M}$ of $\smash {M}$-dimensional vector fields on ${\boldsymbol {\mathsf {x}}}$, specifically, by adopting the inner product

(2.6)

\begin{equation} \left\langle{\boldsymbol{\xi}|\boldsymbol{\psi}}\right\rangle \doteq \int \mathrm{d}{\boldsymbol {\mathsf{x}}}\, {\boldsymbol{\xi}}^{{\dagger}}({\boldsymbol {\mathsf{x}}}) {\boldsymbol{\psi}}({\boldsymbol {\mathsf{x}}}). \end{equation}

Below, the distinction between $\mathscr {H}_{{{\mathsf {x}}}}^M$ and $\mathscr {H}_{{{\mathsf {x}}}}$ will be assumed but not emphasized. Also note that (2.5) yields

(2.7)

\begin{equation} {\boldsymbol{\xi}}^{{\dagger}}({\boldsymbol{\psi}}{\boldsymbol{\psi}}^{{\dagger}}){\boldsymbol{\xi}} = ({\boldsymbol{\xi}}^{{\dagger}} {\boldsymbol{\psi}})({\boldsymbol{\psi}}^{{\dagger}} {\boldsymbol{\xi}}) = |{\boldsymbol{\xi}}^{{\dagger}} {\boldsymbol{\psi}}|^2 \geqslant 0, \end{equation}

for any $\smash {{\boldsymbol {\xi }}}$ and $\smash {{\boldsymbol {\psi }}}$. Thus, dyadic matrices of the form $\smash {{\boldsymbol {\psi }}{\boldsymbol {\psi }}^{{\dagger}} }$ are positive-semidefinite, even though $\smash {{\boldsymbol {\psi }}^{{\dagger}} {\boldsymbol {\psi }}}$ may be negative (when $\boldsymbol{\mathsf {g}}$ is not positive-definite).

For general matrices, the indices can be raised and lowered using $\boldsymbol{\mathsf {g}}$ and $\boldsymbol{\mathsf {g}}^{-1}$ as usual. The Hermitian adjoint $\smash {{{\boldsymbol {\mathsf {A}}}}^{{\dagger}} }$ for a given matrix $\smash {{\boldsymbol {\mathsf {A}}}}$ is defined such that $\smash {({{\boldsymbol {\mathsf {A}}}}^{{\dagger}} {\boldsymbol {\xi }})^{{\dagger}} {\boldsymbol {\psi }} = {\boldsymbol {\xi }}^{{\dagger}} ({{\boldsymbol {\mathsf {A}}}}{\boldsymbol {\psi }})}$ for any $\smash {{\boldsymbol {\psi }}}$ and $\smash {{\boldsymbol {\xi }}}$, which means

(2.8)

\begin{equation} ({{\boldsymbol {\mathsf{A}}}}^{{\dagger}})_j{}^i = ({{\boldsymbol {\mathsf{A}}}})^{i*}{}_j \equiv {\mathsf{A}}^{i*}{}_j, \qquad i, j = 1, 2, \ldots, M. \end{equation}

The Hermitian and anti-Hermitian parts are defined as

(2.9)

\begin{equation} {{\boldsymbol {\mathsf{A}}}}_\text{H} = {{\boldsymbol {\mathsf{A}}}}_\text{H}^{{\dagger}} \doteq \frac{1}{2}\,({{\boldsymbol {\mathsf{A}}}} + {{\boldsymbol {\mathsf{A}}}}^{{\dagger}}), \qquad {{\boldsymbol {\mathsf{A}}}}_\text{A} = {{\boldsymbol {\mathsf{A}}}}_\text{A}^{{\dagger}} \doteq \frac{1}{2\mathrm{i}}\,({{\boldsymbol {\mathsf{A}}}} - {{\boldsymbol {\mathsf{A}}}}^{{\dagger}}), \end{equation}

so $\smash {{{\boldsymbol {\mathsf {A}}}} = {{\boldsymbol {\mathsf {A}}}}_\text {H} + \mathrm {i} {{\boldsymbol {\mathsf {A}}}}_\text {A}}$. For one-dimensional matrices (scalars), one has ${{\boldsymbol {\mathsf {A}}}} = {\mathsf {A}}$,

(2.10)

\begin{equation} {\mathsf{A}}_\text{H} = {\mathsf{A}}_\text{H}^* = \operatorname{re} {\mathsf{A}}, \qquad {\mathsf{A}}_\text{A} = {\mathsf{A}}_\text{A}^* = \operatorname{im} {\mathsf{A}}, \end{equation}

where $\smash {\operatorname {re}}$ and $\smash {\operatorname {im}}$ denote the real part and the imaginary part, respectively. We also define matrix operators $\smash {\widehat {\boldsymbol {{\boldsymbol {\mathsf {A}}}}}}$ as matrices of the corresponding operators $\smash {\widehat {{\mathsf {A}}}^{\,\,i}_j}$. Because $\boldsymbol{\mathsf {g}}$ is constant, index manipulation applies as usual. Also as usual, one has

(2.11)

\begin{equation} \widehat{\boldsymbol{{\boldsymbol {\mathsf{A}}}}}_\text{H} = \widehat{\boldsymbol{{\boldsymbol {\mathsf{A}}}}}_\text{H}^{{\dagger}} \doteq \frac{1}{2}\,(\widehat{\boldsymbol{{\boldsymbol {\mathsf{A}}}}} + \widehat{\boldsymbol{{\boldsymbol {\mathsf{A}}}}}^{{\dagger}}), \qquad \widehat{\boldsymbol{{\boldsymbol {\mathsf{A}}}}}_\text{A} = \widehat{\boldsymbol{{\boldsymbol {\mathsf{A}}}}}_\text{A}^{{\dagger}} \doteq \frac{1}{2\mathrm{i}}\,(\widehat{\boldsymbol{{\boldsymbol {\mathsf{A}}}}} - \widehat{\boldsymbol{{\boldsymbol {\mathsf{A}}}}}^{{\dagger}}), \end{equation}

and $\,\smash {\widehat {\boldsymbol {{\boldsymbol {\mathsf {A}}}}} = \widehat {\boldsymbol {{\boldsymbol {\mathsf {A}}}}}_\text {H} + \mathrm {i} \widehat {\boldsymbol {{\boldsymbol {\mathsf {A}}}}}_\text {A}}$, where $^{{\dagger}}$ is the Hermitian adjoint with respect to the inner product (2.6).

2.1.3. Bra–ket notation

Let us define the following operators that are Hermitian under the inner product (2.1):

(2.12)

\begin{equation} \widehat{{\mathsf{x}\,}}^0 \equiv \widehat{t} \doteq t, \quad \widehat{{\mathsf{x}\,}}^i \equiv \widehat{x\,}^i \doteq x^i, \quad \widehat{{\mathsf{k}}}_0 \equiv{-}\widehat{\omega} \doteq{-}\mathrm{i}\partial_t, \quad \widehat{k}_i \doteq{-}\mathrm{i}\partial_i, \end{equation}

where $\partial _0 \equiv \partial _t \doteq \partial /\partial x^0$ and $\partial _i \doteq \partial /\partial x^i$. Accordingly,

(2.13)

\begin{equation} \widehat{\boldsymbol{{\boldsymbol {\mathsf{x}}}}} \equiv (\,\widehat{{\mathsf{x}}}^{\kern1.5pt0}, \widehat{{\mathsf{x}}}^{\kern1.5pt1}, \ldots, \widehat{{\mathsf{x}}}^{\kern1.5pt n}) = (\widehat{\,t}, \widehat{\boldsymbol{x}}), \qquad \widehat{\boldsymbol{{\boldsymbol {\mathsf{k}}}}} \equiv (\widehat{{\mathsf{k}}}_0, \widehat{{\mathsf{k}}}_1, \ldots, \widehat{{\mathsf{k}}}_{n}) = (-\widehat{\omega}, \widehat{\boldsymbol{k}}) \end{equation}

are understood as the spacetime-position operator and the corresponding wavevector operator, which will also be expressed as follows:

(2.14)

\begin{equation} \widehat{\boldsymbol{{\boldsymbol {\mathsf{x}}}}} = {\boldsymbol {\mathsf{x}}}, \qquad \widehat{\boldsymbol{{\boldsymbol {\mathsf{k}}}}} ={-} \mathrm{i}\partial_{\boldsymbol {\mathsf{x}}}. \end{equation}

Also note the commutation property, where $\smash {\delta _i^j}$ is the Kronecker symbol:Footnote ⁵

(2.15)

\begin{equation} [\,\widehat{{\mathsf{x}}}^{\kern1.5pt i}, \widehat{{\mathsf{k}}}_j] = \mathrm{i} \delta^i_j, \qquad i, j = 0, 1, \ldots, n. \end{equation}

The eigenvectors of the operators (2.14) will be denoted as ‘kets’ $\smash {\left. {\boldsymbol {\mathsf {|x}}} \right\rangle}$ and $\smash {\left. {\boldsymbol {\mathsf {|k}}} \right\rangle}$:Footnote ⁶

(2.16)

\begin{equation} \widehat{\boldsymbol{{\boldsymbol {\mathsf{x}}}}} {\left. {\boldsymbol {\mathsf {|x}}} \right\rangle} = {\boldsymbol {\mathsf{x}}} {\left. {\boldsymbol {\mathsf {|x}}} \right\rangle}, \qquad \widehat{\boldsymbol{{\boldsymbol {\mathsf{k}}}}} {\left. {\boldsymbol {\mathsf {|k}}} \right\rangle} = {\boldsymbol {\mathsf{k}}} {\left. {\boldsymbol {\mathsf {|k}}} \right\rangle}, \end{equation}

and we assume the usual normalization

(2.17)

\begin{equation} {\left\langle{\boldsymbol {\mathsf{x}}}_1|{{\boldsymbol {\mathsf{x}}}_2}\right\rangle} = \delta({\boldsymbol {\mathsf{x}}}_1 - {\boldsymbol {\mathsf{x}}}_2), \qquad {\left\langle{\boldsymbol {\mathsf{k}}}_1|{\boldsymbol {\mathsf{k}}}_2 \right\rangle} = \delta({\boldsymbol {\mathsf{k}}}_1 - {\boldsymbol {\mathsf{k}}}_2), \end{equation}

where $\delta$ is the Dirac delta function. Both sets $\lbrace {\left. {\boldsymbol {\mathsf {|x}}} \right\rangle}, {\boldsymbol {\mathsf {x}}} \in \mathbb {R}^{{\mathsf {n}}} \rbrace$ and $\lbrace {\left. {\boldsymbol {\mathsf {|k}}} \right\rangle}, {\boldsymbol {\mathsf {k}}} \in \mathbb {R}^{{\mathsf {n}}} \rbrace$, where ${{\mathsf {n}}} \doteq n + 1$, form a complete basis on $\mathscr {H}_{{{\mathsf {x}}}}$, and the eigenvalues of these operators form an extended real phase space $({\boldsymbol {\mathsf {x}}}, {\boldsymbol {\mathsf {k}}})$, where

(2.18)

\begin{equation} {\boldsymbol {\mathsf{x}}} \equiv (t, {\boldsymbol{x}}), \qquad {\boldsymbol {\mathsf{k}}} \equiv (-\omega, {\boldsymbol{k}}). \end{equation}

The notation $\smash {{\boldsymbol {\mathsf {k}}} \cdot {\boldsymbol {\mathsf {s}}} \doteq -\omega \tau + {\boldsymbol {k}} \cdot {\boldsymbol {s}}}$ will be assumed for any ${\boldsymbol {\mathsf {s}}} \equiv (\tau, {\boldsymbol {s}})$ and $\smash {{\boldsymbol {k}} \cdot {\boldsymbol {s}} \doteq k_i s^i}$. In particular, for any $\psi$ and constant ${\boldsymbol {\mathsf {s}}}$, one has

(2.19)

\begin{equation} \exp(\mathrm{i} \widehat{\boldsymbol{{\boldsymbol {\mathsf{k}}}}} \cdot {\boldsymbol {\mathsf{s}}}) \psi({\boldsymbol {\mathsf{x}}}) = \exp({\boldsymbol {\mathsf{s}}} \cdot \partial_{\boldsymbol {\mathsf{x}}}) \psi({\boldsymbol {\mathsf{x}}}) = \psi({\boldsymbol {\mathsf{x}}} + {\boldsymbol {\mathsf{s}}}), \end{equation}

as seen from comparing the Taylor expansions in ${\boldsymbol {\mathsf{s}}}$ of the latter two expressions. (A generalization of this formula is discussed in § 4.1.) Also,

(2.20)

\begin{equation} \left\langle{{\boldsymbol {\mathsf{x}}}|{\boldsymbol {\mathsf{k}}}}\right\rangle = \left\langle{{\boldsymbol {\mathsf{k}}}|{\boldsymbol {\mathsf{x}}}}\right\rangle^* = (2{\rm \pi})^{-{{\mathsf{n}}}/2} \exp(\mathrm{i} {\boldsymbol {\mathsf{k}}} \cdot {\boldsymbol {\mathsf{x}}}), \end{equation}

and

(2.21)

\begin{equation} \int \mathrm{d}{\boldsymbol {\mathsf{x}}}\,{\left. {\boldsymbol {\mathsf {|x}}} \right\rangle}{\left\langle {{\boldsymbol {\mathsf {{{\boldsymbol {\mathsf{x|}}}}}}}} \right.} = \widehat{{\mathsf{1}}}, \qquad \int \mathrm{d}{\boldsymbol {\mathsf{k}}}\,{\left. {\boldsymbol {\mathsf {|k}}} \right\rangle}{\left\langle {{\boldsymbol {\mathsf {{\boldsymbol {\mathsf{k|}}}}}}} \right.} = \widehat{{\mathsf{1}}}. \end{equation}

Here, ‘bra’ ${\left\langle {{\boldsymbol {\mathsf {{\boldsymbol {\mathsf{x|}}}}}}} \right.}$ is the one-form dual to ${\left. {\boldsymbol {\mathsf {|x}}} \right\rangle}$, ${\left\langle {{\boldsymbol {\mathsf {{\boldsymbol {\mathsf{k|}}}}}}} \right.}$ is the one-form dual to ${\left. {\boldsymbol {\mathsf {|k}}} \right\rangle}$, and $\widehat {{\mathsf {1}}}$ is the unit operator. Any field $\psi$ on ${\boldsymbol {\mathsf {x}}}$ can be viewed as the ${\boldsymbol {\mathsf {x}}}$ representation (‘coordinate representation’) of ${\left. {\boldsymbol {\mathsf {|\psi}}} \right\rangle}$, i.e. the projection of an abstract ket vector ${\left. {\boldsymbol {\mathsf {|\psi}}} \right\rangle} \in \mathscr {H}_{{{\mathsf {x}}}}$ on $\smash {\left. {\boldsymbol {\mathsf {|x}}} \right\rangle}$:

(2.22)

\begin{equation} \psi({\boldsymbol {\mathsf{x}}}) = \left\langle{{\boldsymbol {\mathsf{x}}}|\psi}\right\rangle. \end{equation}

Similarly, $\left\langle {{\boldsymbol {\mathsf {k}}}|\psi }\right\rangle$ is the ${\boldsymbol {\mathsf {k}}}$ representation (‘spectral representation’) of ${\left. {\boldsymbol {\mathsf {|\psi}}} \right\rangle}$, or the Fourier image of $\psi$:

(2.23)

\begin{equation} \mathring{\psi}({\boldsymbol {\mathsf{k}}}) \doteq \left\langle{{\boldsymbol {\mathsf{k}}}|\psi}\right\rangle = \frac{1}{(2{\rm \pi})^{{{\mathsf{n}}}/2}}\int \mathrm{d}{\boldsymbol {\mathsf{x}}}\, \mathrm{e}^{-\mathrm{i} {\boldsymbol {\mathsf{k}}} \cdot {\boldsymbol {\mathsf{x}}}}\psi({\boldsymbol {\mathsf{x}}}). \end{equation}

2.1.4. Wigner–Weyl transform

For a given operator $\widehat {{\mathsf {A}}}$ and a given field $\psi$, $\widehat {{\mathsf {A}}}\psi$ can be expressed in the integral form

(2.24)

\begin{equation} \widehat{{\mathsf{A}}}\psi({\boldsymbol {\mathsf{x}}}) = \int \mathrm{d} {\boldsymbol {\mathsf{x}}}' \left\langle{{\boldsymbol {\mathsf{x}}}|\widehat{{\mathsf{A}}}|{\boldsymbol {\mathsf{x}}}'}\right\rangle\psi({\boldsymbol {\mathsf{x}}}'), \end{equation}

where ${\left\langle {\boldsymbol {\mathsf {x}}}|\widehat {{\mathsf {A}}}|{\boldsymbol {\mathsf {x}}}'\right\rangle}$ is a function of $({\boldsymbol {\mathsf {x}}}, {\boldsymbol {\mathsf {x}}}')$. This is called the ${\boldsymbol {\mathsf {x}}}$ representation (‘coordinate representation’) of $\smash {\widehat {{\mathsf {A}}}}$. Equivalently, $\widehat {{\mathsf {A}}}$ can be given a phase-space, or Weyl, representation, i.e. expressed through a function of the phase-space coordinates, ${\mathsf {A}}({\boldsymbol {\mathsf {x}}}, {\boldsymbol {\mathsf {k}}})$Footnote ⁷:

(2.25)

\begin{equation} \widehat{{\mathsf{A}}} = \frac{1}{(2{\rm \pi})^{{{\mathsf{n}}}}} \int \mathrm{d} {\boldsymbol {\mathsf{x}}} \,\mathrm{d}{\boldsymbol {\mathsf{k}}}\,\mathrm{d} {\boldsymbol {\mathsf{s}}}\, {{\boldsymbol {\mathsf{|x}}} + {\boldsymbol {\mathsf{s}}}/2} {\mathsf{A}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}})\,\mathrm{e}^{\mathrm{i} {\boldsymbol {\mathsf{k}}} \cdot {\boldsymbol {\mathsf{s}}}} {\left\langle {{\boldsymbol {\mathsf {{\boldsymbol {\mathsf{k|}}}}}}} \right.} {{\left\langle {{\boldsymbol {\mathsf {{\boldsymbol {\mathsf{x|}}}}}}} \right.}-{\boldsymbol {\mathsf{s}}}/2|} \equiv \text{oper}_{{\mathsf{x}}} {\mathsf{A}}. \end{equation}

The function ${\mathsf {A}}({\boldsymbol {\mathsf {x}}}, {\boldsymbol {\mathsf {k}}})$, called the Weyl symbol (or just ‘symbol’) of $\widehat {{\mathsf {A}}}$, is given by

(2.26)

\begin{equation} {\mathsf{A}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}) \doteq \int \mathrm{d}{\boldsymbol {\mathsf{s}}} \left\langle{{\boldsymbol {\mathsf{x}}}+{\boldsymbol {\mathsf{s}}}/2 | \widehat{{\mathsf{A}}} | {\boldsymbol {\mathsf{x}}}-{\boldsymbol {\mathsf{s}}}/2}\right\rangle \mathrm{e}^{-\mathrm{i} {\boldsymbol {\mathsf{k}}} \cdot {\boldsymbol {\mathsf{s}}}} \equiv \text{symb}_{{\mathsf{x}}} \widehat{{\mathsf{A}}}. \end{equation}

The ${\boldsymbol {\mathsf {x}}}$ and phase-space representations are connected by the Fourier transform:

(2.27)

\begin{equation} \left\langle{{\boldsymbol {\mathsf{x}}}| \widehat{{\mathsf{A}}} | {\boldsymbol {\mathsf{x}}}'}\right\rangle = \frac{1}{(2{\rm \pi})^{{\mathsf{n}}}}\int\mathrm{d} {\boldsymbol {\mathsf{k}}}\, \mathrm{e}^{\mathrm{i}{\boldsymbol {\mathsf{k}}} \cdot({\boldsymbol {\mathsf{x}}} - {\boldsymbol {\mathsf{x}}}')} \, {\mathsf{A}}\left(\frac{{\boldsymbol {\mathsf{x}}} + {\boldsymbol {\mathsf{x}}}'}{2}, {\boldsymbol {\mathsf{k}}}\right). \end{equation}

This also leads to the following notable properties of Weyl symbols:

(2.28)

\begin{equation} \left\langle{{\boldsymbol {\mathsf{x}}}| \widehat{{\mathsf{A}}} | {\boldsymbol {\mathsf{x}}}}\right\rangle = \frac{1}{(2{\rm \pi})^{{{\mathsf{n}}}}}\int \mathrm{d} {\boldsymbol {\mathsf{k}}}\,{\mathsf{A}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}), \qquad \left\langle{{\boldsymbol {\mathsf{k}}}| \widehat{{\mathsf{A}}} | {\boldsymbol {\mathsf{k}}}}\right\rangle = \frac{1}{(2{\rm \pi})^{{{\mathsf{n}}}}}\int \mathrm{d} {\boldsymbol {\mathsf{x}}}\,{\mathsf{A}}({\boldsymbol {\mathsf{x}}},{\boldsymbol {\mathsf{k}}}). \end{equation}

An operator unambiguously determines its symbol, and vice versa. We denote this isomorphism as $\smash {\widehat {{\mathsf {A}}} \leftrightarrow {\mathsf {A}}}$. The mapping $\smash {\widehat {{\mathsf {A}}} \mapsto {\mathsf {A}}}$ is called the Wigner transform, and $\smash {{\mathsf {A}} \mapsto \widehat {{\mathsf {A}}}}$ is called the Weyl transform. For uniformity, we call them the direct and inverse Wigner–Weyl transform. The isomorphism $\smash {\leftrightarrow }$ is natural in that it has the following properties:

(2.29)

\begin{equation} \widehat{{\mathsf{1}}} \leftrightarrow 1, \quad \widehat{\boldsymbol{{\boldsymbol {\mathsf{x}}}}} \leftrightarrow {\boldsymbol {\mathsf{x}}}, \quad \widehat{\boldsymbol{{\boldsymbol {\mathsf{k}}}}} \leftrightarrow {\boldsymbol {\mathsf{k}}}, \quad h(\,\widehat{\boldsymbol{{\boldsymbol {\mathsf{x}}}}}) \leftrightarrow h({\boldsymbol {\mathsf{x}}}), \quad h(\widehat{\boldsymbol{{\boldsymbol {\mathsf{k}}}}}) \leftrightarrow h({\boldsymbol {\mathsf{k}}}), \quad \widehat{{\mathsf{A}}}^{{\dagger}} \leftrightarrow {\mathsf{A}}^*, \end{equation}

where $h$ is any function and $\smash {\widehat {{\mathsf {A}}}}$ is any operator. The product of two operators maps to the so-called Moyal product, or star product, of their symbols (Moyal Reference Moyal1949):

(2.30)

\begin{equation} \widehat{{\mathsf{A}}}\,\widehat{{\mathsf{B}}}\,\leftrightarrow \, {\mathsf{A}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}) \star {\mathsf{B}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}) \doteq {\mathsf{A}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}})\mathrm{e}^{\mathrm{i}\widehat{\mathcal{L}}_{{\mathsf{x}}}/2}{\mathsf{B}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}), \end{equation}

which is associative:

(2.31)

\begin{equation} \widehat{{\mathsf{A}}}\,\widehat{{\mathsf{B}}}\,\widehat{{\mathsf{C}}}\, \leftrightarrow \, ({\mathsf{A}} \star {\mathsf{B}}) \star {\mathsf{C}} = {\mathsf{A}} \star ({\mathsf{B}} \star {\mathsf{C}}) \equiv {\mathsf{A}} \star {\mathsf{B}} \star {\mathsf{C}}. \end{equation}

Here, $\smash {\widehat {\mathcal {L}}_{{\mathsf {x}}} \doteq \overset {{\scriptscriptstyle \leftarrow }}{\partial }_{\boldsymbol {\mathsf {x}}} \cdot \overset {{\scriptscriptstyle \rightarrow }}{\partial }_{\boldsymbol {\mathsf {k}}} - \overset {{\scriptscriptstyle \leftarrow }}{\partial }_{\boldsymbol {\mathsf {k}}} \cdot \overset {{\scriptscriptstyle \rightarrow }}{\partial }_{\boldsymbol {\mathsf {x}}}}$, and the arrows indicate the directions in which the derivatives act. For example, $\smash {{\mathsf {A}}\widehat {\mathcal {L}}_{{\mathsf {x}}}{\mathsf {B}}}$ is just the canonical Poisson bracket on $\smash {({\boldsymbol {\mathsf {x}}}, {\boldsymbol {\mathsf {k}}})}$:

(2.32)

\begin{equation} {\mathsf{A}} \widehat{\mathcal{L}}_{{\mathsf{x}}} {\mathsf{B}} = \lbrace {\mathsf{A}}, {\mathsf{B}} \rbrace_{{{\mathsf{x}}}} \doteq{-} \frac{\partial {\mathsf{A}}}{\partial t}\frac{\partial {\mathsf{B}}}{\partial \omega} + \frac{\partial {\mathsf{A}}}{\partial \omega}\frac{\partial {\mathsf{B}}}{\partial t} + \frac{\partial {\mathsf{A}}}{\partial x^i}\frac{\partial {\mathsf{B}}}{\partial k_i} - \frac{\partial {\mathsf{A}}}{\partial k_i}\frac{\partial {\mathsf{B}}}{\partial x^i}. \end{equation}

These formulas readily yield

(2.33)

\begin{equation} h(\widehat{\,\boldsymbol{{\boldsymbol {\mathsf{x}}}}\,})\widehat{{\mathsf{k}}}_\alpha \leftrightarrow {\mathsf{k}}_\alpha h({\boldsymbol {\mathsf{x}}}) + \frac{\mathrm{i}}{2}\,\partial_\alpha h({\boldsymbol {\mathsf{x}}}), \qquad \widehat{{\mathsf{k}}}_\alpha h(\widehat{\,\boldsymbol{{\boldsymbol {\mathsf{x}}}}\,}) \leftrightarrow {\mathsf{k}}_\alpha h({\boldsymbol {\mathsf{x}}}) - \frac{\mathrm{i}}{2}\,\partial_\alpha h({\boldsymbol {\mathsf{x}}}), \end{equation}

also $h(\widehat {\boldsymbol {{\boldsymbol {\mathsf {k}}}}}) \mathrm {e}^{\mathrm {i}{\boldsymbol {\mathsf {K}}} \cdot \widehat {\boldsymbol {{\boldsymbol {\mathsf {x}}}}}} \leftrightarrow h({\boldsymbol {\mathsf {k}}})\star \mathrm {e}^{\mathrm {i}{\boldsymbol {\mathsf {K}}} \cdot {\boldsymbol {\mathsf {x}}}} = h({\boldsymbol {\mathsf {k}}} + {\boldsymbol {\mathsf {K}}}/2) \mathrm {e}^{\mathrm {i}{\boldsymbol {\mathsf {K}}} \cdot {\boldsymbol {\mathsf {x}}}}$, etc. Another notable formula to be used below, which flows from (2.28) and (2.31), is

(2.34)

\begin{equation} \left\langle{{\boldsymbol {\mathsf{x}}}| \widehat{{\mathsf{A}}}\,\widehat{{\mathsf{B}}}\,\widehat{{\mathsf{C}}} | {\boldsymbol {\mathsf{x}}}}\right\rangle = \frac{1}{(2{\rm \pi})^{{{\mathsf{n}}}}}\int \mathrm{d} {\boldsymbol {\mathsf{k}}}\,({\mathsf{A}} \star {\mathsf{B}} \star {\mathsf{C}})({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}). \end{equation}

The Moyal product is particularly handy when $\partial _{{\boldsymbol {\mathsf {x}}}} \partial _{{\boldsymbol {\mathsf {k}}}} \sim \epsilon \ll 1$. Such $\epsilon$ is often called the geometrical-optics parameter. Since $\widehat {\mathcal {L}}_{{\mathsf {x}}} = \mathcal {O}(\epsilon )$, one can express the Moyal product as an asymptotic series in powers of $\epsilon$:

(2.35)

\begin{equation} \star = \widehat{{\mathsf{1}}} + \mathrm{i} \widehat{\mathcal{L}}_{{\mathsf{x}}}/2 - \widehat{\mathcal{L}\,}_{{\mathsf{x}}}^2/8 + \ldots . \end{equation}

2.1.5. Weyl expansion of operators

Operators can be approximated by approximating their symbols (McDonald Reference McDonald1988; Dodin et al. Reference Dodin, Ruiz, Yanagihara, Zhou and Kubo2019). If $\smash {\widehat {{\mathsf {A}}}}$ is approximately local in ${\boldsymbol {\mathsf {x}}}$ (i.e. if $\smash {\widehat {{\mathsf {A}}}\psi ({\boldsymbol {\mathsf {x}}})}$ is determined by values $\psi ({\boldsymbol {\mathsf {x}}} + {\boldsymbol {\mathsf {s}}})$ only with small enough ${\boldsymbol {\mathsf {s}}}$), its symbol can be adequately represented by the first few terms of the Taylor expansion in ${\boldsymbol {\mathsf {k}}}$:

(2.36)

\begin{equation} {\mathsf{A}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}) = {\mathsf{A}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{0}}}) + {\boldsymbol{\Theta}}_0({\boldsymbol {\mathsf{x}}}) \cdot {\boldsymbol {\mathsf{k}}} + \ldots, \qquad {\boldsymbol{\Theta}}_0({\boldsymbol {\mathsf{x}}}) \doteq (\partial_{\boldsymbol {\mathsf{k}}} {\mathsf{A}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}))_{{\boldsymbol {\mathsf{k}}}={\boldsymbol {\mathsf{0}}}}. \end{equation}

Application of $\smash {\text {oper}_{{\mathsf {x}}} }$ to this formula leads to

(2.37)

\begin{equation} \widehat{{\mathsf{A}}} \approx {\mathsf{A}}(\widehat{\,\boldsymbol{{\boldsymbol {\mathsf{x}}}}}, {\boldsymbol {\mathsf{0}}}) + \frac{1}{2}\,(\widehat{\boldsymbol{\Theta}}_0 \cdot \widehat{\boldsymbol{{\boldsymbol {\mathsf{k}}}}} + \widehat{\boldsymbol{{\boldsymbol {\mathsf{k}}}}} \cdot \widehat{\boldsymbol{\Theta}}_0) + \ldots, \end{equation}

where $\widehat {\boldsymbol {\Theta }}_0 \doteq {\boldsymbol {\Theta }}_0(\widehat {\,\boldsymbol {{\boldsymbol {\mathsf {x}}}}\,})$. One can also rewrite (2.37) using the commutation property

(2.38)

\begin{equation} [\widehat{\boldsymbol{{\boldsymbol {\mathsf{k}}}}}, \widehat{\boldsymbol{\Theta}}_0] ={-}\mathrm{i}(\partial_{{\boldsymbol {\mathsf{x}}}} \cdot {\boldsymbol{\Theta}}_0)(\widehat{\,\boldsymbol{{\boldsymbol {\mathsf{x}}}}\,}). \end{equation}

In the ${\boldsymbol {\mathsf {x}}}$ representation, this leads to

(2.39)

\begin{equation} \widehat{{\mathsf{A}}} = {\mathsf{A}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{0}}}) - \mathrm{i} {\boldsymbol{\Theta}}_0({\boldsymbol {\mathsf{x}}}) \cdot \partial_{{\boldsymbol {\mathsf{x}}}} - \frac{\mathrm{i}}{2}\,(\partial_{{\boldsymbol {\mathsf{x}}}} \cdot {\boldsymbol{\Theta}}_0({\boldsymbol {\mathsf{x}}})) + \ldots . \end{equation}

The effect of a non-local operator on eikonal (monochromatic or quasimonochromatic) fields can be approximated similarly. Suppose $\psi = \mathrm {e}^{\mathrm {i}\theta }{\breve {\psi }}$, where the dependence of $\overline {{\boldsymbol {\mathsf {k}}}} \doteq \partial _{{\boldsymbol {\mathsf {x}}}}\theta$ and ${\breve {\psi }}$ on ${\boldsymbol {\mathsf {x}}}$ is slower than that of $\theta$ by factor $\epsilon \ll 1$. Then, $\widehat {{\mathsf {A}}}\psi = \mathrm {e}^{\mathrm {i}\theta }\widehat {{\mathsf {A}}}' {\breve {\psi }}$, where $\widehat {{\mathsf {A}}}' \doteq \mathrm {e}^{-\mathrm {i}\theta (\widehat {{\mathsf {x}}})}\widehat {{\mathsf {A}}}\mathrm {e}^{\mathrm {i}\theta (\widehat {{\mathsf {x}}})}$, and the symbol of $\widehat {{\mathsf {A}}}'$ can be approximated as follows:

(2.40)

\begin{equation} {\mathsf{A}}'({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}) = {\mathsf{A}}({\boldsymbol {\mathsf{x}}}, \overline{{\boldsymbol {\mathsf{k}}}}({\boldsymbol {\mathsf{x}}}) + {\boldsymbol {\mathsf{k}}}) + \mathcal{O}(\epsilon^2). \end{equation}

By expanding this in ${\boldsymbol {\mathsf {k}}}$ and applying $\smash {\text {oper}_{{\mathsf {x}}} }$, one obtains

(2.41)

\begin{equation} \widehat{{\mathsf{A}}}' = {\mathsf{A}}({\boldsymbol {\mathsf{x}}}, \overline{{\boldsymbol {\mathsf{k}}}}({\boldsymbol {\mathsf{x}}})) - \mathrm{i} {\boldsymbol{\Theta}}({\boldsymbol {\mathsf{x}}}) \cdot \partial_{{\boldsymbol {\mathsf{x}}}} - \frac{\mathrm{i}}{2}\,(\partial_{{\boldsymbol {\mathsf{x}}}} \cdot {\boldsymbol{\Theta}}({\boldsymbol {\mathsf{x}}})) + \mathcal{O}(\epsilon^2), \end{equation}

where ${\boldsymbol {\Theta }}({\boldsymbol {\mathsf {x}}}) \doteq (\partial _{\boldsymbol {\mathsf {k}}} {\mathsf {A}}({\boldsymbol {\mathsf {x}}}, {\boldsymbol {\mathsf {k}}}))_{{\boldsymbol {\mathsf {k}}}=\overline {{\boldsymbol {\mathsf {k}}}}({\boldsymbol {\mathsf {x}}})}$. Neglecting the $\mathcal {O}(\epsilon ^2)$ corrections in this formula leads to what is commonly known as the geometrical-optics approximation (Dodin et al. Reference Dodin, Ruiz, Yanagihara, Zhou and Kubo2019).

2.1.6. Wigner functions

Any ket ${\left. {\boldsymbol {\mathsf {|\psi}}} \right\rangle}$ generates a dyadic ${\left. {\boldsymbol {\mathsf {|\psi}}} \right\rangle} {\left\langle {{\boldsymbol {\mathsf {{\boldsymbol {\mathsf{\psi|}}}}}}} \right.}$. In quantum mechanics, such dyadics are known as density operators (of pure states). For our purposes, though, it is more convenient to define the density operator in a slightly different form, namely, as

(2.42)

\begin{equation} \widehat{{\mathsf{W}}}_\psi \doteq (2{\rm \pi})^{-{{\mathsf{n}}}} {\left. {\boldsymbol {\mathsf {|x}}} \right\rangle}{\left\langle {{\boldsymbol {\mathsf {x|}}}} \right.}. \end{equation}

The symbol of this operator, $\smash {{\mathsf {W}}_{\psi } = \text {symb}_{{\mathsf {x}}} \widehat {{\mathsf {W}}}_\psi }$, is a real function called the Wigner function. It is given by

(2.43)

\begin{align} {\mathsf{W}}_\psi({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}) & = \frac{1}{(2{\rm \pi})^{{\mathsf{n}}}} \int \mathrm{d}{\boldsymbol {\mathsf{s}}} \left\langle{{\boldsymbol {\mathsf{x}}} + {\boldsymbol {\mathsf{s}}}/2 | \psi}\right\rangle \left\langle{\psi | {\boldsymbol {\mathsf{x}}} - {\boldsymbol {\mathsf{s}}}/2}\right\rangle \mathrm{e}^{-\mathrm{i} {\boldsymbol {\mathsf{k}}} \cdot {\boldsymbol {\mathsf{s}}}} \nonumber\\ & = \frac{1}{(2{\rm \pi})^{{\mathsf{n}}}} \int \mathrm{d}{\boldsymbol {\mathsf{s}}}\, \psi({\boldsymbol {\mathsf{x}}} + {\boldsymbol {\mathsf{s}}}/2) \psi^*({\boldsymbol {\mathsf{x}}} - {\boldsymbol {\mathsf{s}}}/2)\,\mathrm{e}^{-\mathrm{i} {\boldsymbol {\mathsf{k}}} \cdot {\boldsymbol {\mathsf{s}}}}, \end{align}

which is manifestly real and can be understood as the (inverse) Fourier image of

(2.44)

\begin{equation} {\mathsf{C}}_\psi({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{s}}}) \doteq \psi({\boldsymbol {\mathsf{x}}} + {\boldsymbol {\mathsf{s}}}/2) \psi^*({\boldsymbol {\mathsf{x}}} - {\boldsymbol {\mathsf{s}}}/2) = \int \mathrm{d}{\boldsymbol {\mathsf{k}}}\,{\mathsf{W}}_\psi({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}})\,\mathrm{e}^{\mathrm{i} {\boldsymbol {\mathsf{k}}} \cdot {\boldsymbol {\mathsf{s}}}}. \end{equation}

Any function bilinear in $\smash {\psi }$ and $\smash {\psi ^*}$ can be expressed through $\smash {{\mathsf {W}}_\psi }$. Specifically, for any operators $\smash {\widehat {{\mathsf {L}}}}$ and $\smash {\widehat {{\mathsf {R}}}}$, one has

(2.45)

\begin{align} (\widehat{{\mathsf{L}}}\psi({\boldsymbol {\mathsf{x}}}))(\widehat{{\mathsf{R}}}\psi({\boldsymbol {\mathsf{x}}}))^* & = \left\langle{{\boldsymbol {\mathsf{x}}}|\widehat{{\mathsf{L}}}|\psi}\right\rangle \left\langle{\psi|\widehat{{\mathsf{R}}}^{{\dagger}} |{\boldsymbol {\mathsf{x}}}}\right\rangle \nonumber\\ & = (2{\rm \pi})^{{\mathsf{n}}} \left\langle{{\boldsymbol {\mathsf{x}}}|\widehat{{\mathsf{L}}}\,\widehat{{\mathsf{W}}}_\psi\widehat{{\mathsf{R}}}^{{\dagger}}|{\boldsymbol {\mathsf{x}}}}\right\rangle \nonumber\\ & = \textstyle \int \mathrm{d}{\boldsymbol {\mathsf{k}}}\,{\mathsf{L}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}) \star {\mathsf{W}}_\psi({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}) \star {\mathsf{R}}^*({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}), \end{align}

where ${\mathsf {L}}$ and ${\mathsf {R}}$ are the corresponding symbols and (2.28) was used along with (2.31). As a corollary, and as also seen from (2.28), one has

(2.46)

\begin{equation} |\psi({\boldsymbol {\mathsf{x}}})|^2 = \int \mathrm{d} {\boldsymbol {\mathsf{k}}}\,{\mathsf{W}}_\psi({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}), \qquad |\mathring{\psi}({\boldsymbol {\mathsf{k}}})|^2 = \int \mathrm{d} {\boldsymbol {\mathsf{x}}}\,{\mathsf{W}}_\psi({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}). \end{equation}

As a reminder, $\psi ({\boldsymbol {\mathsf {x}}}) = \left\langle {{\boldsymbol {\mathsf {x}}}|\psi }\right\rangle$ and $\mathring {\psi }({\boldsymbol {\mathsf {k}}}) \doteq \left\langle {{\boldsymbol {\mathsf {k}}}|\psi }\right\rangle$ is the Fourier image of $\psi$ (2.23), so $\smash {|\psi ({\boldsymbol {\mathsf {x}}})|^2}$ and $\smash {|\mathring {\psi }({\boldsymbol {\mathsf {k}}})|^2}$ can be loosely understood as the densities of quanta (associated with the field $\smash {\psi }$) in the $\smash {{\boldsymbol {\mathsf {x}}}}$-space and the $\smash {{\boldsymbol {\mathsf {k}}}}$-space, respectively. Because of (2.46), $\smash {{\mathsf {W}}_\psi }$ is commonly attributed as a quasiprobability distribution of wave quanta in phase space. (The prefix ‘quasi’ is added because $\smash {{\mathsf {W}}_\psi }$ can be negative.) In case of real fields, which satisfy $\left\langle {{\boldsymbol {\mathsf {x}}}|\psi }\right\rangle = \left\langle {\psi |{\boldsymbol {\mathsf {x}}}}\right\rangle$, one also has

(2.47)

\begin{equation} {\mathsf{W}}_\psi({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}) = {\mathsf{W}}_\psi({\boldsymbol {\mathsf{x}}}, -{\boldsymbol {\mathsf{k}}}). \end{equation}

Of particular importance are Wigner functions averaged over a sufficiently large phase-space volume $\varDelta {\boldsymbol {\mathsf {x}}}\,\varDelta {\boldsymbol {\mathsf {k}}} \gtrsim 1$. The average Wigner function $\smash {\overline {{\mathsf {W}}}_\psi }$ is a local property of the field (as opposed to, say, the field's global Fourier spectrum) and satisfies (Appendix A)

(2.48)

\begin{equation} \overline{{\mathsf{W}}}_\psi \geqslant 0. \end{equation}

2.1.7. Generalization to vector fields

In case of vector (tuple) fields ${\boldsymbol {\psi }} = (\psi ^1, \psi ^2, \ldots, \psi ^M)^{\intercal }$, kets are column vectors, ${\left. {\boldsymbol {\mathsf {|\psi}}} \right\rangle} = ({\left. {\boldsymbol {\mathsf {|\psi ^1}}} \right\rangle} , {\left. {\boldsymbol {\mathsf {|\psi ^2}}} \right\rangle}, \ldots, {\left. {\boldsymbol {\mathsf {|\psi ^M}}} \right\rangle})^{\intercal }$, and bras are row vectors, ${\left\langle {{\boldsymbol {\mathsf {{\boldsymbol {\psi |}}}}}} \right.} = ({\left\langle {{\boldsymbol {\mathsf {{\psi ^1}|}}}} \right.}, {\left\langle {{\boldsymbol {\mathsf {{\psi ^2}|}}}} \right.}, \ldots, {\left\langle {{\boldsymbol {\mathsf {{\psi ^M}|}}}} \right.})^\intercal$. The operators acting on such kets and bras are matrices of operators. The Weyl symbol of a matrix operator is defined as the matrix of the corresponding symbols. As a result, the symbol of a Hermitian adjoint of a given operator is the Hermitian adjoint of the symbol of that operator:

(2.49)

\begin{equation} \smash{\widehat{\boldsymbol{{\boldsymbol {\mathsf{A}}}}}}^{{\dagger}} \leftrightarrow \smash{{{\boldsymbol {\mathsf{A}}}}}^{{\dagger}}, \end{equation}

and as a corollary, the symbol of a Hermitian matrix operator is a Hermitian matrix.

In particular, the density operator of a given vector field ${\boldsymbol {\psi }}$ is a matrix operator

(2.50)

\begin{equation} \widehat{\boldsymbol{{\boldsymbol {\mathsf{W}}}}}_{{\boldsymbol{\psi}}} \doteq (2{\rm \pi})^{-{{\mathsf{n}}}} {\left. {\boldsymbol {\mathsf {|\psi}}} \right\rangle}{\left\langle {{\boldsymbol {\mathsf {\psi |}}}} \right.}. \end{equation}

The symbol of this operator, $\smash {{{\boldsymbol {\mathsf {W}}}}_{{\boldsymbol {\psi }}} = \text {symb}_{{\mathsf {x}}} \widehat {\boldsymbol {{\boldsymbol {\mathsf {W}}}}}_{{\boldsymbol {\psi }}}}$, is a Hermitian matrix functionFootnote ⁸

(2.51)

\begin{equation} {{\boldsymbol {\mathsf{W}}}}_{{\boldsymbol{\psi}}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}) = \frac{1}{(2{\rm \pi})^{{\mathsf{n}}}} \int \mathrm{d}{\boldsymbol {\mathsf{s}}}\, {\boldsymbol{\psi}}({\boldsymbol {\mathsf{x}}} + {\boldsymbol {\mathsf{s}}}/2) {\boldsymbol{\psi}}^{{\dagger}}({\boldsymbol {\mathsf{x}}} - {\boldsymbol {\mathsf{s}}}/2)\,\mathrm{e}^{-\mathrm{i} {\boldsymbol {\mathsf{k}}} \cdot {\boldsymbol {\mathsf{s}}}}\end{equation}

called the Wigner matrix. (It is also called the ‘Wigner tensor’ when $\smash {{\boldsymbol {\psi }}}$ is a true vector rather than a tuple.) It can be understood as the (inverse) Fourier image of

(2.52)

\begin{equation} {{\boldsymbol {\mathsf{C}}}}_{{\boldsymbol{\psi}}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{s}}}) \doteq {\boldsymbol{\psi}}({\boldsymbol {\mathsf{x}}} + {\boldsymbol {\mathsf{s}}}/2) {\boldsymbol{\psi}}^{{\dagger}}({\boldsymbol {\mathsf{x}}} - {\boldsymbol {\mathsf{s}}}/2) = \int \mathrm{d}{\boldsymbol {\mathsf{k}}}\,{{\boldsymbol {\mathsf{W}}}}_{{\boldsymbol{\psi}}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}})\,\mathrm{e}^{\mathrm{i} {\boldsymbol {\mathsf{k}}} \cdot {\boldsymbol {\mathsf{s}}}}. \end{equation}

The analogue of (2.45) is (Appendix B.1)

(2.53a)

\begin{equation} (\widehat{\boldsymbol{{\boldsymbol {\mathsf{L}}}}}{\boldsymbol{\psi}}({\boldsymbol {\mathsf{x}}}))(\widehat{\boldsymbol{{\boldsymbol {\mathsf{R}}}}}{\boldsymbol{\psi}}({\boldsymbol {\mathsf{x}}}))^{{\dagger}} = \textstyle \int \mathrm{d}{\boldsymbol {\mathsf{k}}}\,{{\boldsymbol {\mathsf{L}}}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}) \star {{\boldsymbol {\mathsf{W}}}}_{{\boldsymbol{\psi}}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}) \star {{\boldsymbol {\mathsf{R}}}}^{{\dagger}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}). \end{equation}

The Wigner matrix averaged over a sufficiently large phase-space volume $\varDelta {\boldsymbol {\mathsf {x}}}\,\varDelta {\boldsymbol {\mathsf {k}}} \gtrsim 1$ is a local property of the field, and it is positive-semidefinite (Appendix A).

For real fields, one also has

(2.53b)

\begin{equation} {{\boldsymbol {\mathsf{W}}}}_{{\boldsymbol{\psi}}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}) = {{\boldsymbol {\mathsf{W}}}}_{{\boldsymbol{\psi}}}^\intercal({\boldsymbol {\mathsf{x}}}, -{\boldsymbol {\mathsf{k}}}) = {{\boldsymbol {\mathsf{W}}}}_{{\boldsymbol{\psi}}}^*({\boldsymbol {\mathsf{x}}}, -{\boldsymbol {\mathsf{k}}}), \end{equation}

and (2.53) yields the following corollary at $\smash {\epsilon \to 0}$, when $\smash {\star }$ becomes the usual product (Appendix B.1):

(2.53c)

\begin{equation} (\widehat{\boldsymbol{{\boldsymbol {\mathsf{L}}}}}{\boldsymbol{\psi}})^{{\dagger}} \widehat{\boldsymbol{{\boldsymbol {\mathsf{R}}}}}{\boldsymbol{\psi}} = \int \mathrm{d}{\boldsymbol {\mathsf{k}}}\,\operatorname{tr}\big({{\boldsymbol {\mathsf{W}}}}_{{\boldsymbol{\psi}}}({{\boldsymbol {\mathsf{L}}}}^{{\dagger}}{{\boldsymbol {\mathsf{R}}}})_\text{H}\big). \end{equation}

The generalizations of the other formulas from the previous sections are obvious.

2.2. Weyl symbol calculus on phase space

2.2.1. Notation

Consider a Hamiltonian system with coordinates ${\boldsymbol {x}} \equiv (x^1, x^2, \ldots, x^n)$ and canonical momenta ${\boldsymbol {p}} \equiv (p_1, p_2, \ldots, p_n)$. Together, these variables comprise the phase-space coordinates ${\boldsymbol {z}} \equiv ({\boldsymbol {x}}, {\boldsymbol {p}})$, i.e.

(2.54)

\begin{equation} {\boldsymbol{z}} \equiv (z^1, \ldots, z^{2n}) = (x^1, \ldots, x^n, p_1, \ldots, p_n). \end{equation}

Components of ${\boldsymbol {z}}$ will be denoted with Greek indices ranging from 1 to $2n$.Footnote ⁹

Hamilton's equations for $z^\alpha$ can be written as $\dot {z}^{\alpha } = \lbrace z^{\alpha }, H \rbrace$, or equivalently, as

(2.55)

\begin{equation} \dot{z}^{\alpha} = J^{\alpha\beta}\,\partial_\beta H. \end{equation}

Here, $H = H(t, {\boldsymbol {z}})$ is a Hamiltonian, $\partial _\beta \doteq \partial /\partial z^\beta$,

(2.56)

\begin{equation} \lbrace A, B \rbrace \doteq J^{\alpha\beta}\, (\partial_\alpha A) (\partial_\beta B) \end{equation}

is the Poisson bracket on $\smash {{\boldsymbol {z}}}$, and $J^{\alpha \beta }$ is the canonical Poisson structure:

(2.57)

\begin{equation} {\boldsymbol{J}} ={-}{\boldsymbol{J}}^\intercal = \left( \begin{array}{cc} {\boldsymbol{0}}_n & {\boldsymbol{1}}_n \\ -{\boldsymbol{1}}_n & {\boldsymbol{0}}_n \\ \end{array} \right), \end{equation}

where ${\boldsymbol {0}}_n$ is an $n$-dimensional zero matrix, and ${\boldsymbol {1}}_n$ is an $n$-dimensional unit matrix. The corresponding equation for the probability distribution $f(t, {\boldsymbol {z}})$ is

(2.58)

\begin{equation} \partial_t f = \lbrace H, f \rbrace. \end{equation}

Solutions of (2.58) and other functions of the extended-phase-space coordinates ${\boldsymbol {X}} \equiv (t, {\boldsymbol {z}})$ can be considered as vectors in the Hilbert space $\mathscr {H}_X$ with the usual inner productFootnote ¹⁰

(2.59)

\begin{equation} {\left\langle{\xi|\psi}\right\rangle} \doteq \int \mathrm{d}{\boldsymbol{X}}\, \xi^*({\boldsymbol{X}}) \psi({\boldsymbol{X}}). \end{equation}

Assuming the notation $N \doteq \dim {\boldsymbol {X}} = 2n + 1$, one has

(2.60)

\begin{equation} \mathrm{d}{\boldsymbol{X}} \doteq \mathrm{d} X^1\, \mathrm{d} X^2 \ldots \mathrm{d} X^N = \mathrm{d} t\, \mathrm{d} x^1 \ldots \mathrm{d} x^n\,\mathrm{d} p_1, \ldots , \mathrm{d} p_n. \end{equation}

Let us introduce the position operator on ${\boldsymbol {z}}$,

(2.61)

\begin{equation} \widehat{\boldsymbol{z}} \doteq ( \underbrace{x^1, \ldots, x^n}_{\widehat{\boldsymbol{x}}}, \underbrace{p_1, \ldots, p_n}_{\widehat{\boldsymbol{p}}} ), \end{equation}

and the momentum operator on ${\boldsymbol {z}}$,

(2.62)

\begin{equation} \widehat{\boldsymbol{q}} \equiv ( \underbrace{-\mathrm{i} \partial_1, \ldots, -\mathrm{i}\partial_n}_{\widehat{\boldsymbol{k}}}, \underbrace{-\mathrm{i}\partial^1, \ldots, -\mathrm{i}\partial^n}_{\widehat{\boldsymbol{r}}} ), \end{equation}

where $\partial _i \doteq \partial /\partial x^i$ but $\partial ^i \doteq \partial /\partial p_i$; that is, $\widehat {\boldsymbol {z}} = (\widehat {\boldsymbol {x}}, \widehat {\boldsymbol {p}})$, $\widehat {\boldsymbol {q}} = (\widehat {\boldsymbol {k}}, \widehat {\boldsymbol {r}}\,)$ and

(2.63)

\begin{equation} \widehat{z}^{\kern1.5pt\alpha}\doteq z^{\alpha}, \qquad \widehat{q}_{\alpha}\doteq{-}\mathrm{i}\partial_{\alpha}. \end{equation}

Then, much like in § 2.1, one can also introduce the position and momentum operators on the extended phase space ${\boldsymbol {X}}$:

(2.64)

\begin{equation} \widehat{\boldsymbol{X}} = (\;\widehat{t}, \widehat{\boldsymbol{z}\,}) = (\;\widehat{t}, \widehat{\boldsymbol{x}}, \widehat{\boldsymbol{p}\,}), \qquad \widehat{\boldsymbol{K}} = (-\widehat{\omega}, \widehat{\boldsymbol{q}}\,) = (-\widehat{\omega}, \widehat{\boldsymbol{k}}, \widehat{\boldsymbol{r}\,}). \end{equation}

Assuming the convention that Latin indices from the beginning of the alphabet $(a, b, c, \ldots )$ range from 0 to $2n$, and $\partial _a \doteq \partial /\partial X^a$, one can compactly express this as

(2.65)

\begin{equation} \widehat{X}^a = X^a, \qquad \widehat{K}_a ={-} \mathrm{i}\partial_a. \end{equation}

The eigenvectors of these operators will be denoted ${\left. {\boldsymbol {\mathsf {|X}}} \right\rangle}$ and ${\left. {\boldsymbol {\mathsf {|K}}} \right\rangle}$:

(2.66)

\begin{equation} \widehat{\boldsymbol{X}} {\left. {\boldsymbol {\mathsf {|X}}} \right\rangle} = {\boldsymbol{X}} {\left. {\boldsymbol {\mathsf {|X}}} \right\rangle}, \qquad \widehat{\boldsymbol{K}} {\left. {\boldsymbol {\mathsf {|K}}} \right\rangle} = {\boldsymbol{K}} {\left. {\boldsymbol {\mathsf {|K}}} \right\rangle}, \end{equation}

and we assume the usual normalization

(2.67)

\begin{equation} \left\langle{{\boldsymbol{X}}_1|{\boldsymbol{X}}_2}\right\rangle = \delta({\boldsymbol{X}}_1 - {\boldsymbol{X}}_2), \qquad \left\langle{{\boldsymbol{K}}_1|{\boldsymbol{K}}_2}\right\rangle = \delta({\boldsymbol{K}}_1 - {\boldsymbol{K}}_2). \end{equation}

Both sets $\lbrace {\left. {\boldsymbol {\mathsf {|X}}} \right\rangle}, {\boldsymbol {X}} \in \mathbb {R}^N \rbrace$ and $\lbrace {\left. {\boldsymbol {\mathsf {|K}}} \right\rangle}, {\boldsymbol {K}} \in \mathbb {R}^N \rbrace$ form a complete basis on $\mathscr {H}_X$, and the eigenvalues of these operators form a real extended phase space $({\boldsymbol {X}}, {\boldsymbol {K}})$, where

(2.68)

\begin{equation} {\boldsymbol{X}} \equiv (t, {\boldsymbol{z}}), \qquad {\boldsymbol{K}} \equiv (-\omega, {\boldsymbol{q}}). \end{equation}

Particularly note the following formula, which will be used below:

(2.69)

\begin{equation} J^{\alpha\beta}\widehat{q}_\alpha q_\beta =\left( \begin{array}{cc} \widehat{\boldsymbol{k}} & \widehat{\boldsymbol{r}} \end{array} \right) \left( \begin{array}{cc} {\boldsymbol{0}}_n & {\boldsymbol{1}}_n \\ -{\boldsymbol{1}}_n & {\boldsymbol{0}}_n \\ \end{array} \right) \left( \begin{array}{c} {\boldsymbol{k}}\\ {\boldsymbol{r}}\\ \end{array} \right) = \widehat{\boldsymbol{k}} \cdot {\boldsymbol{r}} - \widehat{\boldsymbol{r}} \cdot {\boldsymbol{k}}. \end{equation}

2.2.2. Wigner–Weyl transform

One can construct the Weyl symbol calculus on the extended phase space ${\boldsymbol {X}}$ just like it is done on spacetime ${\boldsymbol {\mathsf {x}}}$ in § 2.1, with an obvious modification of the notation. The Wigner–Weyl transform is defined as

(2.70)

\begin{gather} \displaystyle A({\boldsymbol{X}}, {\boldsymbol{K}}) = \int \mathrm{d}{\boldsymbol{S}} \left\langle{{\boldsymbol{X}} + {\boldsymbol{S}}/2 | \widehat{A} | {\boldsymbol{X}} - {\boldsymbol{S}}/2}\right\rangle \mathrm{e}^{-\mathrm{i} {\boldsymbol{K}} \cdot {\boldsymbol{S}}} \equiv \text{symb}_X \widehat{A}, \end{gather}

(2.71)

\begin{gather}\displaystyle \widehat{A} = \frac{1}{(2{\rm \pi})^N} \int \mathrm{d}{\boldsymbol{X}}\,\mathrm{d}{\boldsymbol{K}}\,\mathrm{d}{\boldsymbol{S}} {\left. {\boldsymbol {\mathsf {|X}}} \right\rangle}+{{\boldsymbol{S}}/2} A({\boldsymbol{X}}, {\boldsymbol{K}}) {\left\langle {{\boldsymbol {\mathsf {X|}}}} \right.} - {{\boldsymbol{S}}/2} \mathrm{e}^{\mathrm{i} {\boldsymbol{K}} \cdot {\boldsymbol{S}}} \equiv \text{oper}_X A. \end{gather}

(Notice the change in the font and in the index compared with (2.26) and (2.25).) The corresponding Moyal product is denoted $\bigstar$ (as opposed to $\star$ introduced earlier) and is given by

(2.72)

\begin{equation} A \,\bigstar\, B = A({\boldsymbol{X}}, {\boldsymbol{K}})\,\mathrm{e}^{\mathrm{i}\widehat{\mathcal{L}}_X/2}B({\boldsymbol{X}}, {\boldsymbol{K}}), \end{equation}

where $\widehat {\mathcal {L}}_X \doteq \overset {{\scriptscriptstyle \leftarrow }}{\partial }_{\boldsymbol {X}} \cdot \overset {{\scriptscriptstyle \rightarrow }}{\partial }_{\boldsymbol {K}} - \overset {{\scriptscriptstyle \leftarrow }}{\partial }_{\boldsymbol {K}} \cdot \overset {{\scriptscriptstyle \rightarrow }}{\partial }_{\boldsymbol {X}}$ can be expressed as follows:

(2.73)

\begin{equation} A \widehat{\mathcal{L}}_X B \doteq{-} \frac{\partial A}{\partial t}\frac{\partial B}{\partial \omega} + \frac{\partial A}{\partial \omega}\frac{\partial B}{\partial t} + \frac{\partial A}{\partial x^i}\frac{\partial B}{\partial k_i} - \frac{\partial A}{\partial k_i}\frac{\partial B}{\partial x^i} + \frac{\partial A}{\partial p^i}\frac{\partial B}{\partial r_i} - \frac{\partial A}{\partial r_i}\frac{\partial B}{\partial p^i}. \end{equation}

If an operator $\smash {\widehat {A}}$ is local in $\smash {{\boldsymbol {p}}}$, its $\smash {{\boldsymbol {X}}}$ representation and $\smash {{\boldsymbol {\mathsf {x}}}}$ representation satisfy

(2.74)

\begin{equation} \left\langle{t, {\boldsymbol{x}}, {\boldsymbol{p}}|\widehat{\boldsymbol{A}}|t', {\boldsymbol{x}}', {\boldsymbol{p}}'}\right\rangle = \left\langle{t, {\boldsymbol{x}}|\widehat{\boldsymbol{A}}|t', {\boldsymbol{x}}'}\right\rangle \delta({\boldsymbol{p}} - {\boldsymbol{p}}'), \end{equation}

and therefore the Weyl symbol of $\smash {\widehat {A}}$ is the same irrespective of whether the operator is considered on $\smash {\mathscr {H}_X}$ or on $\smash {\mathscr {H}_{{{\mathsf {x}}}}}$. In this case, we will use a unifying notation $\smash {\text {symb}\,\widehat {A}}$ instead of $\smash {\text {symb}_X \widehat {A}}$ and $\smash {\text {symb}_{{\mathsf {x}}} \widehat {A}}$.

2.2.3. Wigner functions and Wigner matrices

The density operator of a given scalar field $\psi$ is given by

(2.75)

\begin{equation} \widehat{W}_{\psi} \doteq (2{\rm \pi})^{{-}N} {\left. {\boldsymbol {\mathsf {|\psi}}} \right\rangle}{\left\langle {{\boldsymbol {\mathsf {\psi |}}}} \right.}. \end{equation}

The symbol of this operator, $\smash {W_{\psi } = \text {symb}_X \widehat {W}_\psi }$, is a real function called the Wigner function. It is given by

(2.76)

\begin{equation} W_{\psi}({\boldsymbol{X}}, {\boldsymbol{K}}) = \frac{1}{(2{\rm \pi})^N} \int \mathrm{d}{\boldsymbol{S}}\, \psi({\boldsymbol{X}} + {\boldsymbol{S}}/2) \psi^{*}({\boldsymbol{X}} - {\boldsymbol{S}}/2)\,\mathrm{e}^{-\mathrm{i} {\boldsymbol{K}} \cdot {\boldsymbol{S}}}, \end{equation}

which can be understood as the (inverse) Fourier image of

(2.77)

\begin{equation} C_\psi({\boldsymbol{X}}, {\boldsymbol{S}}) \doteq \psi({\boldsymbol{X}} + {\boldsymbol{S}}/2) \psi^*({\boldsymbol{X}} - {\boldsymbol{S}}/2) = \int \mathrm{d}{\boldsymbol{K}}\,W_\psi({\boldsymbol{X}}, {\boldsymbol{K}})\,\mathrm{e}^{\mathrm{i} {\boldsymbol{K}} \cdot {\boldsymbol{S}}}. \end{equation}

In particular, one has

(2.78)

\begin{equation} \int \mathrm{d}{\boldsymbol{r}}\,\text{symb}_X \widehat{W}_\psi = \text{symb}_{{\mathsf{x}}} \widehat{{\mathsf{W}}}_\psi({\boldsymbol{p}}), \end{equation}

where the right-hand side is $\smash {{\mathsf {W}}_\psi }$ given by (2.43), with ${\boldsymbol {p}}$ treated as a parameter. Also, for real fields,

(2.79)

\begin{equation} W_\psi({\boldsymbol{X}}, {\boldsymbol{K}}) = W_\psi({\boldsymbol{X}}, -{\boldsymbol{K}}). \end{equation}

The density operator of a given vector field ${\boldsymbol {\psi }} = (\psi ^1, \psi ^2, \ldots, \psi ^M)$ is a matrix operator

(2.80)

\begin{equation} \widehat{\boldsymbol{W}}_{{\boldsymbol{\psi}}} \doteq (2{\rm \pi})^{{-}N} {\left. {\boldsymbol {\mathsf {|\psi}}} \right\rangle}{\left\langle {{\boldsymbol {\mathsf {\psi |}}}} \right.}. \end{equation}

The symbol of this operator, or the Wigner matrix, is a Hermitian matrix function

(2.81)

\begin{equation} {\boldsymbol{W}}_{{\boldsymbol{\psi}}}({\boldsymbol{X}}, {\boldsymbol{K}}) = \frac{1}{(2{\rm \pi})^N} \int \mathrm{d}{\boldsymbol{S}}\, {\boldsymbol{\psi}}({\boldsymbol{X}} + {\boldsymbol{S}}/2) {\boldsymbol{\psi}}^{{\dagger}}({\boldsymbol{X}} - {\boldsymbol{S}}/2)\,\mathrm{e}^{-\mathrm{i} {\boldsymbol{K}} \cdot {\boldsymbol{S}}}, \end{equation}

which can be understood as the (inverse) Fourier image of

(2.82)

\begin{equation} {\boldsymbol{C}}_{{\boldsymbol{\psi}}}({\boldsymbol{X}}, {\boldsymbol{S}}) \doteq {\boldsymbol{\psi}}({\boldsymbol{X}} + {\boldsymbol{S}}/2) {\boldsymbol{\psi}}^{{\dagger}}({\boldsymbol{X}} - {\boldsymbol{S}}/2) = \int \mathrm{d}{\boldsymbol{K}}\,{\boldsymbol{W}}_{{\boldsymbol{\psi}}}({\boldsymbol{X}}, {\boldsymbol{K}})\,\mathrm{e}^{\mathrm{i} {\boldsymbol{K}} \cdot {\boldsymbol{S}}}. \end{equation}

In particular, one has

(2.83)

\begin{equation} \int \mathrm{d}{\boldsymbol{r}}\,\text{symb}_X \widehat{\boldsymbol{W}}_{{\boldsymbol{\psi}}} = \text{symb}_{{\mathsf{x}}} \widehat{\boldsymbol{{\boldsymbol {\mathsf{W}}}}}_{{\boldsymbol{\psi}}}({\boldsymbol{p}}), \end{equation}

where the right-hand side is $\smash {{{\boldsymbol {\mathsf {W}}}}_{{\boldsymbol {\psi }}}}$ given by (2.51), with ${\boldsymbol {p}}$ treated as a parameter. Also, for real fields,

(2.84)

\begin{equation} {\boldsymbol{W}}_{{\boldsymbol{\psi}}}({\boldsymbol{X}}, {\boldsymbol{K}}) = {\boldsymbol{W}}_{{\boldsymbol{\psi}}}^\intercal({\boldsymbol{X}}, -{\boldsymbol{K}}) = {\boldsymbol{W}}_{{\boldsymbol{\psi}}}^*({\boldsymbol{X}}, -{\boldsymbol{K}}). \end{equation}

Like those on $\smash {({\boldsymbol {\mathsf {x}}}, {\boldsymbol {\mathsf {k}}})}$, the Wigner matrices (Wigner functions) on $\smash {({\boldsymbol {X}}, {\boldsymbol {K}})}$ become positive-semidefinite (non-negative), and characterize local properties of the corresponding fields, when averaged over a sufficiently large phase-space volume $\varDelta {\boldsymbol {X}}\,\varDelta {\boldsymbol {K}} \gtrsim 1$.

2.3. Summary of § 2

In summary, we have introduced a generic $\smash {n}$-dimensional physical space $\smash {{\boldsymbol {x}}}$, the dual $\smash {n}$-dimensional wavevector space $\smash {{\boldsymbol {k}}}$, the corresponding $\smash {{{\mathsf {n}}}}$-dimensional ($\smash {{{\mathsf {n}}} = n + 1}$) spacetime $\smash {{\boldsymbol {\mathsf {x}}} \equiv (t, {\boldsymbol {x}})}$ and the dual $\smash {{{\mathsf {n}}}}$-dimensional wavevector space $\smash {{\boldsymbol {\mathsf {k}}} \equiv (-\omega, {\boldsymbol {k}})}$. We have also introduced an $\smash {n}$-dimensional momentum space $\smash {{\boldsymbol {p}}}$, the corresponding $\smash {2n}$-dimensional phase space $\smash {{\boldsymbol {z}} \equiv ({\boldsymbol {x}}, {\boldsymbol {p}})}$, the $\smash {N}$-dimensional ($\smash {N = 2n + 1}$) extended space $\smash {{\boldsymbol {X}} \equiv (t, {\boldsymbol {z}}) \equiv (t, {\boldsymbol {x}}, {\boldsymbol {p}})}$ and the dual $\smash {N}$-dimensional wavevector space $\smash {{\boldsymbol {K}} \equiv (-\omega, {\boldsymbol {q}}) \equiv (-\omega, {\boldsymbol {k}}, {\boldsymbol {r}})}$, where $\smash {{\boldsymbol {r}}}$ is the $\smash {n}$-dimensional wavevector space dual to $\smash {{\boldsymbol {p}}}$. We have also introduced the $\smash {2N}$-dimensional phase space $\smash {({\boldsymbol {X}}, {\boldsymbol {K}})}$. Each of the said variables has a corresponding operator associated with it, which is denoted with a caret. For example, $\smash {\widehat {\boldsymbol {x}}}$ is the operator of position in the $\smash {{\boldsymbol {x}}}$ space, and $\smash {\widehat {\boldsymbol {k}} = -\mathrm {i}\partial _{{\boldsymbol {x}}}}$ is the corresponding wavevector operator.

Functions on $\smash {{\boldsymbol {\mathsf {x}}}}$ form a Hilbert space $\smash {\mathscr {H}_{{{\mathsf {x}}}}}$, and the corresponding bra–ket notation is introduced as usual. Any operator $\smash {\widehat {{\mathsf {A}}}}$ on $\smash {\mathscr {H}_{{{\mathsf {x}}}}}$ can be represented by its Weyl symbol $\smash {{\mathsf {A}}({\boldsymbol {\mathsf {x}}}, {\boldsymbol {\mathsf {k}}})}$. The correspondence between operators and their symbols, $\smash {\widehat {{\mathsf {A}}} \leftrightarrow {\mathsf {A}}}$, is determined by the Wigner–Weyl transform and is natural in the sense that (2.29) is satisfied. In particular, $\smash {\widehat {{\mathsf {A}}}\,\widehat {{\mathsf {B}}} \leftrightarrow {\mathsf {A}} \star {\mathsf {B}}}$, where $\smash {\star }$ is the Moyal product on $\smash {({\boldsymbol {\mathsf {x}}}, {\boldsymbol {\mathsf {k}}})}$. When the geometrical-optics parameter is negligible ($\smash {\epsilon \to 0}$), one has $\smash {\widehat {{\mathsf {A}}} = {\mathsf {A}}(\,\widehat {\boldsymbol {{\boldsymbol {\mathsf {x}}}}}, \widehat {\boldsymbol {{\boldsymbol {\mathsf {k}}}}})}$ and the Moyal product becomes the usual product. Similarly, functions on $\smash {{\boldsymbol {X}}}$ form a Hilbert space $\smash {\mathscr {H}_X}$, the corresponding bra–ket notation is also introduced as usual, any operator $\smash {\widehat {A}}$ on $\smash {\mathscr {H}_X}$ can be represented by its Weyl symbol $\smash {A({\boldsymbol {X}}, {\boldsymbol {K}})}$, and $\smash {\widehat {A\,}\widehat {B} \leftrightarrow A \,\bigstar \, B}$. An operator that is local in $\smash {{\boldsymbol {p}}}$ has the same symbol irrespective of whether it is considered on $\smash {\mathscr {H}_{{{\mathsf {x}}}}}$ or on $\smash {\mathscr {H}_X}$.

Any given field $\smash {\psi }$ generates the corresponding density operator and its symbol called the Wigner function (Wigner matrix, if the field is a vector). If the density operator is considered on $\smash {\mathscr {H}_{{{\mathsf {x}}}}}$, it is denoted $\smash {\widehat {{\mathsf {W}}}_\psi }$ and the corresponding Wigner function is denoted $\smash {{\mathsf {W}}_\psi ({\boldsymbol {\mathsf {x}}}, {\boldsymbol {\mathsf {k}}})}$. If the density operator is considered on $\smash {\mathscr {H}_X}$, it is denoted $\smash {\widehat {W}_\psi }$ and the corresponding Wigner function is denoted $\smash {W_\psi ({\boldsymbol {X}}, {\boldsymbol {K}})}$. The two Wigner functions are related via $\smash {\int \mathrm {d}{\boldsymbol {r}}\,W_\psi (t, {\boldsymbol {x}}, {\boldsymbol {p}}, \omega, {\boldsymbol {k}}, {\boldsymbol {r}}) = {\mathsf {W}}_\psi (t, {\boldsymbol {x}}, \omega, {\boldsymbol {k}}; {\boldsymbol {p}})}$, where $\smash {{\boldsymbol {p}}}$ enters $\smash {{\mathsf {W}}_\psi }$ as a parameter, if at all. If averaged over a sufficiently large phase-space volume, the Wigner functions (matrices) are non-negative (positive-semidefinite) and characterize local properties of the corresponding fields.

3. Model

Here, we introduce the general assumptions and the key ingredients of our theory.

3.1. Basic assumptions

3.1.1. Ordering

Let us consider particles governed by a Hamiltonian $H = \overline {H} + \widetilde {H}$ such that

(3.1)

\begin{equation} \widetilde{H} = \mathcal{O}(\varepsilon) \ll \overline{H} = \mathcal{O}(1). \end{equation}

In other words, $\widetilde {H}$ serves as a small perturbation to the leading-order Hamiltonian $\overline {H}$. The system will be described in canonical variables $\smash {{\boldsymbol {z}} \equiv ({\boldsymbol {x}}, {\boldsymbol {p}}) \in \mathbb {R}^{2n}}$. Let us also assume that the system is close to being homogeneous in ${\boldsymbol {x}}$. This includes two conditions. First, we require that the external fields are weak (yet see § 3.1.2), meaning

(3.2)

\begin{equation} \partial_{{\boldsymbol{x}}}\overline{H} \sim \kappa_x\overline{H} = \mathcal{O}(\epsilon), \qquad \partial_{{\boldsymbol{p}}}\overline{H} \sim \kappa_p\overline{H} = \mathcal{O}(1), \end{equation}

where $\epsilon \ll 1$ is a small parameter, $\kappa _x$ and $\kappa _p$ are the characteristic inverse scales in the ${\boldsymbol {x}}$ and ${\boldsymbol {p}}$ spaces, respectively, and the bar denotes local averaging.Footnote ¹¹ Hence, the particle momenta ${\boldsymbol {p}}$ are close to being local invariants. Second, the statistical properties of $\smash {\widetilde {H}}$ are also assumed to vary in ${\boldsymbol {x}}$ slowly. These properties can be characterized using the density operator of the perturbation Hamiltonian,

(3.3)

\begin{equation} \widehat{W} \doteq (2{\rm \pi})^{{-}N} {\left. {\boldsymbol {\mathsf {|\widetilde{H}}}} \right\rangle}{\left\langle {{\boldsymbol {\mathsf {\widetilde{H} |}}}} \right.}, \end{equation}

and its symbol, the (real) Wigner function, as in (2.43):

(3.4)

\begin{equation} W({\boldsymbol{X}}, {\boldsymbol{K}}) = \frac{1}{(2{\rm \pi})^N} \int \mathrm{d}{\boldsymbol{S}}\,\widetilde{H}({\boldsymbol{X}} + {\boldsymbol{S}}/2) \widetilde{H}({\boldsymbol{X}} - {\boldsymbol{S}}/2)\,\mathrm{e}^{-\mathrm{i} {\boldsymbol{K}} \cdot {\boldsymbol{S}}}. \end{equation}

Specifically, we will use the average Wigner function, $\overline {W}$, which represents the Fourier spectrum of the symmetrized autocorrelation function of $\widetilde {H}$:

(3.5)

\begin{equation} \overline{C}({\boldsymbol{X}}, {\boldsymbol{S}}) \doteq \overline{\widetilde{H}({\boldsymbol{X}} + {\boldsymbol{S}}/2) \widetilde{H}({\boldsymbol{X}} - {\boldsymbol{S}}/2)} = \int \mathrm{d}{\boldsymbol{K}}\,\overline{W}({\boldsymbol{X}}, {\boldsymbol{K}})\,\mathrm{e}^{\mathrm{i} {\boldsymbol{K}} \cdot {\boldsymbol{S}}}. \end{equation}

The averaging is performed over sufficiently large volume of ${\boldsymbol {x}}$ to eliminate rapid oscillations and also over phase-space volumes $\varDelta {\boldsymbol {X}}\,\varDelta {\boldsymbol {K}} \gtrsim 1$, which guarantees $\overline {W}$ to be non-negative and local (§ 2.2.3). The function $\overline {W}$ can be understood as a measure of the phase-space density of wave quanta when the latter is well defined (§ 7).

We will assumeFootnote ¹²

(3.6)

\begin{equation} \partial_t \overline{W} = \mathcal{O}(\epsilon), \qquad \partial_{\boldsymbol{x}} \overline{W} = \mathcal{O}(\epsilon), \qquad \partial_{\boldsymbol{p}} \overline{W} = \mathcal{O}(1). \end{equation}

That said, we will also allow (albeit not require) for oscillations to be constrained by a dispersion relation. In this case, $\overline {W}\, \propto \, \delta (\omega -\overline {\omega }(t, {\boldsymbol {x}}))$, so (3.6) per se is not satisfied; then we assume a similar ordering for $\int \mathrm {d}\omega \,\overline {W}$ instead. Also note that in application to the standard QLT of homogeneous turbulence (Stix Reference Stix1992, chapter 16), $\epsilon$ is understood as the geometrical-optics parameter characterizing the smallness of the linear-instability growth rates. (We discuss the ordering further in the end of § 3.3.)

3.1.2. Quasilinear approximation

The particle-motion equations can be written as

(3.7)

\begin{equation} \dot{z}^{\alpha} = \lbrace z^{\alpha}, \overline{H} + \widetilde{H} \rbrace = v^{\alpha} + u^{\alpha}, \end{equation}

where $v^{\alpha }$ and $u^{\alpha }$ are understood as the unperturbed phase-space velocity and the perturbation to the phase-space velocity, respectively:

(3.8)

\begin{equation} v^{\alpha} \doteq J^{\alpha\beta} \partial_{\beta}\overline{H}, \qquad u^{\alpha} \doteq J^{\alpha\beta}\partial_{\beta}\widetilde{H}. \end{equation}

The notation $v^i$ (with $i = 1, 2, \ldots n$) will also be used for the spatial part of the phase-space velocity $v^{\alpha }$, i.e. for the true velocity per se. Likewise, ${\boldsymbol {v}}$ will be used to denote either the phase-space velocity vector or the spatial velocity vector depending on the context. Also note that a slightly different definition of ${\boldsymbol {v}}$ will be used starting from § 5.6.

The corresponding Klimontovich equation for the particle distribution $f(t, {\boldsymbol {z}})$ is

(3.9)

\begin{equation} \partial_t f = \lbrace \overline{H} + \widetilde{H}, f \rbrace. \end{equation}

(If collisions are not of interest, (3.9) can as well be understood as the Vlasov equation. Also, a small collision term can be included ad hoc; see the comment in the end of § 3.3.) Let us search for $f$ in the form

(3.10)

\begin{equation} f = \overline{f} + \widetilde{f}, \qquad \overline{\widetilde{f}} = 0. \end{equation}

The equations for $\smash {\overline {f}}$ and $\smash {\widetilde {f}}$ are obtained as the average and oscillating parts of (3.9), and we neglect the nonlinearity in the equation for $\smash {\widetilde {f}}$, following the standard QL approximation (Stix Reference Stix1992, chapter 16). Then, one obtains

(3.11)

\begin{gather} \partial_t\overline{f} = \lbrace \overline{H}, \overline{f} \rbrace + \overline{\lbrace \widetilde{H}, \widetilde{f} \rbrace}, \end{gather}

(3.12)

\begin{gather}\partial_t\widetilde{f} = \lbrace \overline{H}, \widetilde{f} \rbrace + \lbrace \widetilde{H}, \overline{f} \rbrace. \end{gather}

A comment is due here regarding plasmas in strong fields and magnetized plasmas in particular. Our formulation can be applied to such plasmas in canonical angle–action variables $\smash {({{\phi }}, \boldsymbol{\mathsf {J}})}$. For fast angle variables, the ordering (3.2) is not satisfied and the Weyl symbol calculus is inapplicable as is (see the footnote in § 2.1.3). Such systems can be accommodated by representing the distribution function as a Fourier series in $\smash {{{\phi }}}$ and treating the individual-harmonic amplitudes separately as slow functions of the remaining coordinates. Then, our averaging procedure subsumes averaging over $\smash {{{\phi }}}$, so the averaged quantities are $\smash {{{\phi }}}$-independent and (3.2) is reinstated. In particular, magnetized plasmas can be described using guiding-centre variables. Although not canonical by default (Littlejohn Reference Littlejohn1983), they can always be cast in a canonical form, at least in principle (Littlejohn Reference Littlejohn1979). Examples of canonical guiding-centre variables are reviewed in Cary & Brizard (Reference Cary and Brizard2009). To make the connection with the homogeneous-plasma theory, one can also order the canonical pairs of guiding-centre variables such that they would describe the gyromotion, the parallel motion, and the drifts separately (Wong Reference Wong2000). This readily leads to results similar to those in Catto et al. (Reference Catto, Lee and Ram2017). Further discussions on this topic are left to future papers.

3.2. Equation for ${\widetilde {f}}$

Let us consider solutions of (3.12) as a subclass of solutions of the more general equation

(3.13)

\begin{equation} \partial_\tau\widetilde{f} = \widehat{L}\widetilde{f} + \mathscr{F}, \qquad \mathscr{F}({\boldsymbol{X}}) \doteq \lbrace \widetilde{H}, \overline{f} \rbrace. \end{equation}

Here, we have introduced an auxiliary second ‘time’ $\smash {\tau }$, the operator

(3.14)

\begin{equation} \widehat{L} \doteq{-}\partial_t + \lbrace \overline{H}, {\unicode{x25AA}} \rbrace ={-}\partial_t + J^{\alpha\beta}(\partial_{\alpha}\overline{H})\partial_{\beta} ={-}\partial_t - v^{\lambda}\partial_{\lambda} ={-}V^a \partial_a \end{equation}

(here and further, ${\unicode{x25AA}}$ denotes a placeholder), and ${\boldsymbol {V}}({\boldsymbol {X}}) \equiv (1, {\boldsymbol {v}}(t, {\boldsymbol {z}}))$ is the unperturbed velocity in the ${\boldsymbol {X}}$ space. Note that

(3.15)

\begin{equation} \partial_a V^a = \partial_{\lambda}v^{\lambda} = 0 \end{equation}

due to the incompressibility of the phase flow. Hence, $\smash {[\partial _a, V^a] = 0}$, so $\smash {\widehat {L}}$ is anti-Hermitian.

Let us search for a solution of (3.13) in the formFootnote ¹³

(3.16)

\begin{equation} \widetilde{f}(\tau, {\boldsymbol{X}}) = \mathrm{e}^{\widehat{L} \tau} \xi (\tau, {\boldsymbol{X}}). \end{equation}

Then, $\partial _\tau \widetilde {f} = \widehat {L\,}\,\widetilde {f} + \mathrm {e}^{\widehat {L} \tau }\partial _\tau \xi$, so $\partial _\tau \xi = \mathrm {e}^{-\widehat {L} \tau }\mathscr {F}({\boldsymbol {X}})$ and therefore

(3.17)

\begin{equation} \xi(\tau, {\boldsymbol{X}}) = \mathrm{e}^{-\widehat{L} \tau_0} \xi_0({\boldsymbol{X}}) + \int_{\tau_0}^\tau \mathrm{d} \tau'\,\mathrm{e}^{-\widehat{L} \tau'} \mathscr{F}({\boldsymbol{X}}), \end{equation}

where $\smash {\xi _0({\boldsymbol {X}}) \doteq \widetilde {f}(\tau _0, {\boldsymbol {X}})}$. Hence, one obtains

(3.18)

\begin{equation} \widetilde{f}(\tau, {\boldsymbol{X}}) = \mathrm{e}^{\widehat{L}(\tau - \tau_0)} \xi_0({\boldsymbol{X}}) + \int_{\tau_0}^\tau \mathrm{d} \tau'\,\mathrm{e}^{-\widehat{L} (\tau' - \tau)} \mathscr{F}({\boldsymbol{X}}), \end{equation}

or equivalently, using $\tau '' \doteq \tau - \tau '$,

(3.19)

\begin{equation} \widetilde{f}(\tau, {\boldsymbol{X}}) = g_0(\tau, {\boldsymbol{X}}) + \int_0^{\tau - \tau_0} \mathrm{d}\tau''\,\widehat{T}_{\tau''} \mathscr{F}({\boldsymbol{X}}). \end{equation}

Here, $\smash {g_0}$ is a solution of $\partial _\tau g_0 = \widehat {L} g_0$, specifically,

(3.20)

\begin{equation} g_0(\tau, {\boldsymbol{X}}) \doteq \widehat{T}_{\tau - \tau_0} \xi_0({\boldsymbol{X}}), \qquad g_0(\tau_0, {\boldsymbol{X}}) = \widetilde{f}(\tau_0, {\boldsymbol{X}}), \end{equation}

and we have also introduced

(3.21)

\begin{equation} \widehat{T}_\tau \doteq \mathrm{e}^{\widehat{L} \tau} = \mathrm{e}^{-\tau V^a\partial_a}. \end{equation}

Because $\smash {\widehat {L}}$ is anti-Hermitian, the operator $\smash {\widehat {T}_\tau }$ is unitary, and comparison with (2.19) shows that it can be recognized as a shift operator. For further details, see § 4.1.

Using $\smash {\widehat {T}_\tau }$, one can express (3.19) as

(3.22)

\begin{equation} \widetilde{f} = g_0 + \widehat{\mathscr{G}}\mathscr{F}, \qquad \textstyle \widehat{\mathscr{G}} \doteq \int_0^{\tau-\tau_0} \mathrm{d}\tau'\,\widehat{T}_{\tau'}, \end{equation}

where $\widehat {\mathscr {G}}$ is the Green's operator understood as the right inverse of the operator $\smash {\partial _\tau - \widehat {L}}$, or on the space of $\smash {\tau }$-independent functions, $\smash {\partial _t - \lbrace \overline {H}, {\unicode{x25AA}} \rbrace }$. Let us rewrite this operator as $\smash {\widehat {\mathscr {G}} = \widehat {\mathscr {G}}_< + \widehat {\mathscr {G}}_>}$, where

(3.23)

\begin{equation} \widehat{\mathscr{G}}_<{=} \int_0^{\tau-\tau_0} \mathrm{d}\tau'\,\mathrm{e}^{-\nu \tau'}\widehat{T}_{\tau'}, \qquad \widehat{\mathscr{G}}_>{=} \int_0^{\tau-\tau_0} \mathrm{d}\tau'\,(1 - \mathrm{e}^{-\nu \tau'})\widehat{T}_{\tau'}, \end{equation}

and $\nu$ is a positive constant. Note that $\widehat {\mathscr {G}}_<$ is well defined at $\tau _0 \to -\infty$, meaning that $\widehat {\mathscr {G}}_<\mathscr {F}$ is well defined for any physical (bounded) field $\mathscr {F}$.Footnote ¹⁴ Thus, so is $\smash {g_0 + \widehat {\mathscr {G}}_>\mathscr {F}}$. Let us take $\tau _0 \to -\infty$ and then take $\nu \to 0+$. (Here, $0+$ denotes that $\nu$ must remain positive, i.e. the upper limit is taken.) Then, (3.22) can be expressed as

(3.24)

\begin{equation} \widetilde{f} = g + \widehat{G}\mathscr{F}, \qquad g \doteq \lim_{\nu \to 0+} \lim_{\tau_0 \to -\infty} (g_0 + \widehat{\mathscr{G}}_>\mathscr{F}). \end{equation}

Here, we introduced an ‘effective’ Green's operator $\smash {\widehat {G} \doteq \lim _{\nu \to 0+} \lim _{\tau _0 \to -\infty } \widehat {\mathscr {G}}_<}$, i.e.

(3.25)

\begin{equation} \widehat{G} \doteq \lim_{\nu \to 0+} \int_0^\infty \mathrm{d}\tau\,\mathrm{e}^{-\nu \tau}\widehat{T}_{\tau}. \end{equation}

This operator will be discussed in § 4.2, and $g$ will be discussed in § 4.3. Meanwhile, note that because $\smash {\tau }$ is just an auxiliary variable, we will be interested in solutions independent of $\tau$. In particular, this means that $\smash {\widetilde {f}(\tau _0, {\boldsymbol {X}}) = \widetilde {f}({\boldsymbol {X}})}$, so $\smash {\xi _0({\boldsymbol {X}}) = \widetilde {f}({\boldsymbol {X}})}$, so (3.20) leads to

(3.26)

\begin{equation} g_0(\tau, {\boldsymbol{X}}) = \widehat{T}_{\tau - \tau_0} \widetilde{f}({\boldsymbol{X}}). \end{equation}

3.3. Equation for ${\overline {f}}$

Using (3.22), one can rewrite (3.12) for $\overline {f}$ as follows:

(3.27)

\begin{equation} \partial_t\overline{f} = \lbrace \overline{H}, \overline{f} \rbrace + \overline{\lbrace \widetilde{H}, g \rbrace} + \overline{\lbrace \widetilde{H}, \widehat{G}\lbrace \widetilde{H}, \overline{f} \rbrace \rbrace} .\end{equation}

Notice that

(3.28)

\begin{equation} \lbrace \widetilde{H}, g \rbrace ={-} \lbrace g, \widetilde{H} \rbrace ={-}\partial_{\alpha} (J^{\alpha\beta}\, g \partial_{\beta} \widetilde{H}) ={-}\partial_{\alpha} (u^\alpha g), \end{equation}

and also

(3.29)

\begin{align} \lbrace \widetilde{H}, \widehat{G}\lbrace \widetilde{H}, \overline{f} \rbrace \rbrace & = \partial_{\beta}( J^{\alpha\beta} (\partial_{\alpha}\widetilde{H}) \widehat{G}\lbrace \widetilde{H}, \overline{f} \rbrace )\nonumber\\ & = \partial_{\beta}(J^{\alpha\beta} (\partial_{\alpha}\widetilde{H}) \widehat{G} (J^{\mu \nu} (\partial_{\mu}\widetilde{H})(\partial_{\nu}\overline{f})))\nonumber\\ & = \partial_{\beta}(u^{\beta}\widehat{G}(u^{\nu}\partial_{\nu}\overline{f})). \end{align}

The field $u^\alpha$ enters here as a multiplication factor and can be considered as an operator:

(3.30)

\begin{equation} \widehat{u\,\,}^\alpha\psi({\boldsymbol{X}}) \doteq u^\alpha({\boldsymbol{X}})\psi({\boldsymbol{X}}). \end{equation}

Then, (3.29) can be compactly represented as

(3.31)

\begin{equation} \lbrace \widetilde{H}, \widehat{G}\lbrace \widetilde{H}, \overline{f} \rbrace \rbrace = \partial_{\alpha} (\,\widehat{u}^{\kern1.5pt\alpha} \widehat{G}\, \widehat{u}^{\kern1.5pt\beta} \partial_{\beta} \overline{f}). \end{equation}

We will also use the notation

(3.32)

\begin{equation} \mathrm{d}_t \doteq \partial_t + v^{\gamma}\partial_{\gamma} = \partial_t - \lbrace \overline{H}, {\unicode{x25AA}} \rbrace. \end{equation}

This leads to the following equation for $\overline {f}$:

(3.33)

\begin{equation} \mathrm{d}_t \overline{f} = \partial_{\alpha}(\widehat{D}^{\alpha\beta} \partial_{\beta}\overline{f}) + \varGamma, \end{equation}

where we introduced the following average quantities:

(3.34)

\begin{equation} \widehat{D}^{\alpha\beta} \doteq \overline{\widehat{u\,}^{\alpha}\widehat{G\,}\widehat{u}^{\beta}}, \qquad \varGamma \doteq{-} \partial_\alpha(\overline{u^{\alpha} g}). \end{equation}

Our goal is to derive explicit approximate expressions for the quantities (3.34) and to rewrite (3.33) in a more tractable form using the assumptions introduced in § 3.1. We will useFootnote ¹⁵

(3.35)

\begin{equation} \partial_t\overline{f} \sim \lbrace \overline{H}, \overline{f} \rbrace = \mathcal{O}(\epsilon), \qquad \mathrm{d}_t\overline{f} = \mathcal{O}(\varepsilon^2), \end{equation}

and we will keep terms of order $\epsilon$, $\varepsilon ^2$ and $\smash {\epsilon \varepsilon ^2}$ in the equation for $\smash {\overline {f}}$, while terms of order $\smash {\varepsilon ^4}$, $\smash {\epsilon ^2 \varepsilon ^2}$ and higher will be neglected. This implies the ordering

(3.36)

\begin{equation} \varepsilon^2 \ll \epsilon \ll \varepsilon \ll 1. \end{equation}

As a reminder, $\smash {\varepsilon }$ is a linear measure of the characteristic amplitude of oscillations, and $\smash {\epsilon }$ is the geometrical-optics parameter, which is proportional to the inverse scale of the plasma inhomogeneity in spacetime. As usual then, linear dissipation is assumed to be of order $\smash {\epsilon }$. This model implies the assumption that collisionless dissipation is much stronger than collisional dissipation, which is to emerge as an effect quadratic in $\smash {\widetilde {f}}$ (§ 6). Furthermore, the inverse plasma parameterFootnote ¹⁶ will be assumed to be of order $\smash {\epsilon }$, so the collision operator for $\smash {\overline {f}}$ (§ 6.8) will be of order $\smash {\epsilon \varepsilon ^2}$. Within the assumed accuracy, this operator must be retained, while the dynamics of $\smash {\widetilde {f}}$ is considered linear and therefore collisionless. Alternatively, one can switch from the Klimontovich description to the Vlasov–Boltzmann description and introduce an ad hoc order-$\smash {\epsilon }$ collision operator directly in (3.9). This will alter the Green's operator, but the conceptual formulation would remain the same, so it will not be considered separately in detail.

3.4. Summary of § 3

Our QL model is defined as usual, except: (i) we allow for a general particle Hamiltonian $\smash {H}$; (ii) we use the Klimontovich equation rather than the Vlasov equation to retain collisions; (iii) we use local averaging (denoted with overbar) and allow for weak inhomogeneity of all averaged quantities; (iv) we retain the initial conditions $\smash {g}$ for the oscillating part of the distribution function (defined as in (3.24) but yet to be calculated explicitly). Then, the average part of the distribution function satisfies

(3.37)

\begin{equation} \partial_t\overline{f} - \lbrace \overline{H}, \overline{f} \rbrace = \partial_{\alpha}(\widehat{D}^{\alpha\beta} \partial_{\beta}\overline{f}) + \varGamma, \end{equation}

where $\smash {\widehat {D}^{\alpha \beta } \doteq \overline {\widehat {u}^{\alpha }\widehat {G\,}\widehat {u}^{\beta }}}$, $\smash {\vphantom{\frac{f^l}{f^l}}\varGamma \doteq - \partial _\alpha (\overline {u^{\alpha } g})}$, $\smash {u^{\alpha }}$ is the wave-driven perturbation of the phase-space velocity (see (3.8)), $\smash {\widehat {u\,\,}^{\alpha }}$ is the same quantity considered as an operator on $\smash {\mathscr {H}_X}$ (see (3.30)) and $\smash {\widehat {G} }$ is the ‘effective’ Green's operator given by

(3.38)

\begin{equation} \widehat{G} \doteq \lim_{\nu \to 0+} \int_0^\infty \mathrm{d}\tau\,\mathrm{e}^{-\nu \tau-\tau V^a\partial_a}. \end{equation}

Also, $\smash {\partial _{\alpha } \equiv \partial /\partial z^\alpha }$, and $\smash {\lbrace {\unicode{x25AA}}, {\unicode{x25AA}} \rbrace }$ is the Poisson bracket on the particle phase space $\smash {{\boldsymbol {z}}}$. The equation for $\smash {\overline {f}}$ used in the standard QLT is recovered from (3.37) by neglecting $\smash {\varGamma }$ and the spatial gradients (in particular, the whole Poisson bracket) and also by approximating the operator $\smash {\widehat {D}^{\alpha \beta }}$ with a local function of $\smash {{\boldsymbol {z}}}$.

4. Preliminaries

Before we start calculating the functions in (3.37) explicitly, let us get some preliminaries out of the way. In this section, we discuss the shift operators $\smash {\widehat {T}_\tau }$ (§ 4.1), approximate the operator $\smash {\widehat {G}}$ (§ 4.2) and develop a model for the function $g$ that encodes the initial conditions for $\smash {\widetilde {f}}$ (§ 4.3).

4.1. Shift operator

Here, we derive some properties of the shift operator $\smash {\widehat {T}_\tau }$ introduced in § 3.2.

4.1.1. ${\widehat {T}_\tau }$ as a shift

Here, we formally prove (an admittedly obvious fact) that

(4.1)

\begin{equation} \widehat{T}_\tau \psi({\boldsymbol{X}}) = \psi({\boldsymbol{X}} - {\boldsymbol{\ell}}_\tau({\boldsymbol{X}})), \qquad \textstyle \ell_{\tau}^{a}({\boldsymbol{X}}) \doteq \int_0^{\tau} \mathrm{d} t\, V^{a}({\boldsymbol{Y}}(t, {\boldsymbol{X}})), \end{equation}

where the ‘characteristics’ $\smash {Y^a}$ solveFootnote ¹⁷

(4.2)

\begin{equation} \frac{\mathrm{d} Y^{a}}{\mathrm{d}\tau} ={-}V^{a}({\boldsymbol{Y}}), \qquad Y^a(\tau = 0) = X^a, \end{equation}

and thus $\ell _{\tau }^{a}$ can be Taylor-expanded in $\tau$ as

(4.3)

\begin{equation} \ell_{\tau}^{a}({\boldsymbol{X}}) = \tau V^{a} - \frac{1}{2}\,\tau^2 V^{b} \partial_{b} V^{a} + \ldots, \qquad V^{a} \equiv V^{a}({\boldsymbol{X}}). \end{equation}

As the first step to proving (4.1), let us Taylor-expand $V^{a}$ around a fixed point ${\boldsymbol {X}}_1$:

(4.4)

\begin{equation} V^{a} = V^{a}_1 + (\partial_{b} V^{a}_1)\,\delta X^{b} + \ldots, \qquad \delta X^{a}\doteq X^{a} - X_1^{a}, \end{equation}

where $V^a_1 \equiv V^a({\boldsymbol {X}}_1)$. If one neglects the first and higher derivatives of $V^{a}$, one obtains

(4.5)

\begin{equation} \widehat{T}_{\tau}\psi({\boldsymbol{X}}) \approx \mathrm{e}^{-\tau V^{a}_1\partial_{a}}\psi({\boldsymbol{X}}) = \psi({\boldsymbol{X}} - \tau {\boldsymbol{V}}_1). \end{equation}

By taking the limit ${\boldsymbol {X}}_1 \to {\boldsymbol {X}}$, which corresponds to ${\boldsymbol {V}}_1 \to {\boldsymbol {V}}$, one obtains

(4.6)

\begin{equation} \widehat{T}_{\tau}\psi({\boldsymbol{X}}) = \psi({\boldsymbol{X}} - \tau {\boldsymbol{V}}) + \mathcal{O}(\tau^2). \end{equation}

Similarly, if one neglects the second and higher derivatives of $V^{a}$, one obtainsFootnote ¹⁸

(4.7)

\begin{align} \widehat{T}_{\tau}\psi({\boldsymbol{X}}) & = \mathrm{e}^{ - \tau (V^{a}_1 + (\partial_{b}V^{a}_1) \delta X^{b}+\ldots) \partial_{a} } \psi({\boldsymbol{X}}) \nonumber\\ & \approx \mathrm{e}^{- \tau (\partial_{b} V^{a}_1) \delta X^{b} \partial_{a}} \,\mathrm{e}^{-\tau V^{a}_1 \partial_{a}} \,\mathrm{e}^{-\frac{1}{2}[ -\tau(\partial_{b}V^{a}_1) \delta X^{b}\partial_{a}, -\tau V^{c}_1\partial_{c} ]} \psi({\boldsymbol{X}}) \nonumber\\ & \approx \mathrm{e}^{ -\tau (\partial_{b} V^{a}_1) \delta X^{b} \partial_{a}} \,\mathrm{e}^{-\tau V^{a}_1 \partial_{a}} \,\mathrm{e}^{ \frac{1}{2}\tau^2 V^{c}_1(\partial_{b}V^{a}_1) [\partial_{c}, \delta X^{b}\partial_{a}] }\psi({\boldsymbol{X}}) \nonumber\\ & \approx \mathrm{e}^{-\tau (\partial_{b}V^{a}_1)\delta X^{b}\partial_{a}} \,\mathrm{e}^{-\tau V^{a}_1 \partial_{a}} \,\mathrm{e}^{\frac{1}{2}\tau^2 V^{b}_1 (\partial_{b}V^{a}_1) \partial_{a}} \psi({\boldsymbol{X}}) \nonumber\\ & \approx \mathrm{e}^{-\tau (\partial_{b}V^{a}_1) \delta X^{b} \partial_{a}} \,\mathrm{e}^{-\tau V^{a}_1\partial_{a} + \frac{1}{2}\tau^2 V^{b}_1(\partial_{b}V^{a}_1)\partial_{a}}\psi({\boldsymbol{X}}) \nonumber\\ & \approx \mathrm{e}^{-\tau (\partial_{b}V^{a}_1)\delta X^{b}\partial_{a}} \psi({\boldsymbol{X}} - \tau {\boldsymbol{V}}_1 + \textstyle\frac{1}{2}\,\tau^2 V^{b}_1 \partial_{b}{\boldsymbol{V}}_1). \end{align}

In the limit ${\boldsymbol {X}}_1 \to {\boldsymbol {X}}$, when $\smash {\mathrm {e}^{-\tau (\partial _{b}V^{a}_1)\delta X^{b} \partial _{a}} \to 1}$ and ${\boldsymbol {V}}_1 \to {\boldsymbol {V}}$, one obtains

(4.8)

\begin{equation} \widehat{T}_{\tau}\psi({\boldsymbol{X}}) = \psi\left({\boldsymbol{X}} - \tau {\boldsymbol{V}}({\boldsymbol{X}}) + \frac{1}{2}\,\tau^2 ({\boldsymbol{V}} \cdot \partial_{{\boldsymbol{X}}}) {\boldsymbol{V}}\right) + \mathcal{O}(\tau^3). \end{equation}

In conjunction with (4.3), equations (4.6) and (4.8) show agreement with the sought result (4.3) within the assumed accuracy. One can also retain ${\mathsf {m}}$ derivatives of ${\boldsymbol {V}}$ and derive the corresponding approximations similarly. Then the error will be $\smash {\mathcal {O}(\tau ^{{\mathsf {m}} + 2})}$.

For an order-one time interval $\tau$, one can split this interval on $N_\tau \gg 1$ subintervals of small duration $\tau /N_\tau$ and apply finite-${\mathsf {m}}$ formulas (for example, (4.6) or (4.8)) to those. Then the total error scales as $\smash {\mathcal {O}(N_\tau ^{-{\mathsf {m}} - 1})}$ and the exact formula (4.1) is obtained at $N_\tau \to \infty$.

4.1.2. Symbol of ${\widehat {T}_\tau }$

Using the bra–ket notation, (4.1) can be written as

(4.9)

\begin{equation} \left\langle{{\boldsymbol{X}} |\widehat{T}_{\tau} | \psi}\right\rangle = \left\langle{{\boldsymbol{X}} - {\boldsymbol{\ell}}_{\tau}({\boldsymbol{X}})|\psi}\right\rangle. \end{equation}

Thus, $ {{\left\langle {{\boldsymbol {\mathsf {X|}}}} \right.}} \widehat {T}_{\tau } = {\left\langle {{\boldsymbol {\mathsf {X|}}}} \right.} - {\boldsymbol {\ell }}_{\tau }({\boldsymbol {X}})$, so

(4.10)

\begin{equation} \left\langle{{\boldsymbol{X}}_1|\widehat{T}_{\tau}|{\boldsymbol{X}}_2}\right\rangle = \left\langle{{\boldsymbol{X}}_1 - {\boldsymbol{\ell}}_{\tau}({\boldsymbol{X}}_1)|{\boldsymbol{X}}_2}\right\rangle = \delta ({\boldsymbol{X}}_1 - {\boldsymbol{X}}_2 - {\boldsymbol{\ell}}_{\tau}({\boldsymbol{X}}_1)). \end{equation}

Using (2.70), one obtains the Weyl symbol of $\smash {\widehat {T}_{\tau }}$ in the form

(4.11)

\begin{equation} T_{\tau}({\boldsymbol{X}}, {\boldsymbol{K}}) = \int \mathrm{d}{\boldsymbol{S}}\, \mathrm{e}^{-\mathrm{i} {\boldsymbol{K}} \cdot {\boldsymbol{S}}}\, \delta ({\boldsymbol{S}} - {\boldsymbol{\ell}}_{\tau}({\boldsymbol{X}} + {\boldsymbol{S}}/2)). \end{equation}

From (4.3), one has

(4.12)

\begin{align} \ell^a_{\tau}({\boldsymbol{X}} + {\boldsymbol{S}}/2) & = \tau V^{a}({\boldsymbol{X}} + {\boldsymbol{S}}/2) - (\tau^2/2)\,V^{b} \partial_{b} V^{a} + \mathcal{O}(\epsilon^2) \nonumber\\ & = \tau V^{a} + (\tau/2) (\partial_b V^{a}) S^b - (\tau^2/2)\,V^{b} \partial_{b} V^{a} + \mathcal{O}(\epsilon^2) \nonumber\\ & = {M^a}_b V^{b}\tau + {m^a}_b S^b + \mathcal{O}(\epsilon^2), \end{align}

where we introduced a matrix $\smash {{\boldsymbol {M}} \doteq {\boldsymbol {1}} - {\boldsymbol {m}}}$, or explicitly,

(4.13)

\begin{equation} {M^a}_b \doteq \delta^a_b - {m^a}_b, \qquad {m^a}_b \doteq (\tau/2)(\partial_b V^{a}). \end{equation}

Let us express the term $\smash {\mathcal {O}(\epsilon ^2)}$ in (4.12) as $\smash {-{M^a}_b\mu ^b}$. Then,

(4.14)

\begin{align} \delta ({\boldsymbol{S}} - {\boldsymbol{\ell}}_{\tau}({\boldsymbol{X}} + {\boldsymbol{S}}/2)) & = \delta ({\boldsymbol{S}} - {\boldsymbol{M}}{\boldsymbol{V}}\tau - {\boldsymbol{m}}{\boldsymbol{S}} + {\boldsymbol{M}}{\boldsymbol{\mu}}) \nonumber\\ & = \delta ({\boldsymbol{M}}({\boldsymbol{S}} - {\boldsymbol{V}}\tau + {\boldsymbol{\mu}})) \nonumber\\ & = \delta ({\boldsymbol{S}} - {\boldsymbol{V}}\tau + {\boldsymbol{\mu}})/|\det {\boldsymbol{M}}|. \end{align}

Because $\smash {{\boldsymbol {m}} = \mathcal {O}(\epsilon )}$, the well-known formula yields $\smash {\det {\boldsymbol {M}} = 1 + \operatorname {tr}{\boldsymbol {m}} + \mathcal {O}(\epsilon ^2)}$. But $\operatorname {tr}{\boldsymbol {m}} = 0$ by (3.15), so

(4.15)

\begin{equation} \delta ({\boldsymbol{S}} - {\boldsymbol{\ell}}_{\tau}({\boldsymbol{X}} + {\boldsymbol{S}}/2)) = \delta ({\boldsymbol{S}} - {\boldsymbol{V}}\tau + {\boldsymbol{\mu}}) + \mathcal{O}(\epsilon^2). \end{equation}

The last term $\smash {\mathcal {O}(\epsilon ^2)}$ is insignificant and can be neglected right away, so (4.11) leads to

(4.16)

\begin{equation} T_{\tau}({\boldsymbol{X}}, {\boldsymbol{K}}) \approx \exp(\mathrm{i} \tau \varOmega({\boldsymbol{X}}, {\boldsymbol{K}}) + \mathrm{i} {\boldsymbol{K}} \cdot {\boldsymbol{\mu}}), \end{equation}

where we have introduced the following notation:

(4.17)

\begin{equation} \varOmega({\boldsymbol{X}}, {\boldsymbol{K}}) \doteq{-} {\boldsymbol{K}} \cdot {\boldsymbol{V}}({\boldsymbol{X}}) = \omega - q_{\alpha} v^{\alpha} = \omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}} + \mathcal{O}(\epsilon). \end{equation}

By definition, $\smash {{\boldsymbol {\mu }}}$ is a polynomial of $\smash {\tau }$ with coefficients that are of order $\smash {\epsilon ^2}$ and therefore small. But because $\smash {\tau }$ can be large, and because $\smash {{\boldsymbol {\mu }}}$ is under the exponent, this makes $\smash {T_{\tau }}$ potentially sensitive to this term, so we retain it (for now).

4.2. Effective Green's operator

The effective Green's operator (3.25) can be understood as the right inverse of the operator (cf. § 3.2)

(4.18)

\begin{equation} \widehat{L}_\text{eff} \doteq \lim_{\nu \to 0+} (\partial_t - \lbrace \overline{H}, {\unicode{x25AA}} \rbrace + \nu), \end{equation}

so we denote it also as $\widehat {G} = \widehat {L}^{-1}_\text {eff}$ (which is admittedly abuse of notation). Because $\smash {\widehat {L}_\text {eff}}$ has real ${\boldsymbol {X}}$ representation by definition, the ${\boldsymbol {X}}$ representation of $\smash {\widehat {G}}$ is real too. In particular, $\smash {\left\langle {{\boldsymbol {X}} + {\boldsymbol {S}}/2 | \widehat {G} | {\boldsymbol {X}} - {\boldsymbol {S}}/2}\right\rangle}$ is real, hence

(4.19)

\begin{equation} G({\boldsymbol{X}}, -{\boldsymbol{K}}) = G^*({\boldsymbol{X}}, {\boldsymbol{K}}) \end{equation}

by definition of the Weyl symbol (2.70). As a corollary, the derivative of $G({\boldsymbol {X}}, {\boldsymbol {K}})$ with respect to the $a$th component of the whole second argument, denoted $G^{|a}$, satisfies

(4.20)

\begin{equation} (G^{|a}({\boldsymbol{X}}, {\boldsymbol{K}}))^* ={-}G^{|a}({\boldsymbol{X}}, -{\boldsymbol{K}}). \end{equation}

Also note that $\widehat {G}$ can be expressed as

(4.21)

\begin{equation} \widehat{G} = \lim_{\nu \to 0+} \mathrm{i} (\widehat{\omega} - \dot{x}^i \widehat{k}_i - \dot{p}_i \widehat{r\,}^i + \mathrm{i} \nu)^{{-}1} \end{equation}

(the notation ‘$\smash {\lim _{\nu \to 0+} A(\omega + \mathrm {i} \nu )}$’ will also be shortened as ‘$\smash {A(\omega + \mathrm {i} 0)}$’), whence

(4.22)

\begin{equation} \frac{\partial G}{\partial r^i} = \mathcal{O}(\dot{p}_i) = \mathcal{O}(\epsilon). \end{equation}

Due to (4.16), the leading-order approximation of the symbol of the operator (3.25) is $G({\boldsymbol {X}}, {\boldsymbol {K}}) = G_0(\varOmega ({\boldsymbol {X}}, {\boldsymbol {K}}))$, where

(4.23)

\begin{equation} G_0(\varOmega) \doteq \lim_{\nu \to 0+}\int_0^{\infty}\mathrm{d}\tau\, \mathrm{e}^{-\nu \tau + \mathrm{i}\varOmega \tau} = {\rm \pi}\,\delta(\varOmega) + \mathrm{i}\,\operatorname{pv}\frac{1}{\varOmega} \end{equation}

and the (standard) notation $\operatorname {pv} (1/\varOmega )$ is defined as follows:

(4.24)

\begin{equation} \operatorname{pv}\frac{1}{\varOmega} \doteq \lim_{\nu \to 0+} \frac{\varOmega}{\nu^2 + \varOmega^2}. \end{equation}

This means, in particular, that for any $A$, one has

(4.25)

\begin{align} \mathcal{J}[A, G_0] & \doteq \int \mathrm{d}{\boldsymbol{K}}\,A({\boldsymbol{X}}, {\boldsymbol{K}}) G_0(\varOmega({\boldsymbol{X}}, {\boldsymbol{K}})) \nonumber\\ & ={\rm \pi} \int \mathrm{d}{\boldsymbol{K}}\,A({\boldsymbol{X}}, {\boldsymbol{K}}) \delta(\varOmega ({\boldsymbol{X}}, {\boldsymbol{K}})) + \mathrm{i} {\unicode{x2A0F}} \mathrm{d}{\boldsymbol{K}}\,\frac{A({\boldsymbol{X}}, {\boldsymbol{K}})}{\varOmega({\boldsymbol{X}}, {\boldsymbol{K}})}, \end{align}

where ${\unicode{x2A0F}}$ is a principal-value integral. Also usefully, $\smash {\overline {G}_0 = G_0}$ and

(4.26)

\begin{align} \partial_a \mathcal{J}[A, G_0] & = \int \mathrm{d}{\boldsymbol{K}}\,A({\boldsymbol{X}}, {\boldsymbol{K}}) G_0'(\varOmega({\boldsymbol{X}}, {\boldsymbol{K}}))\,\partial_a\varOmega({\boldsymbol{X}}, {\boldsymbol{K}}) \nonumber\\ & ={-}(\partial_a V^b({\boldsymbol{X}})) \int \mathrm{d}{\boldsymbol{K}}\,K_b A({\boldsymbol{X}}, {\boldsymbol{K}}) G_0'(\varOmega({\boldsymbol{X}}, {\boldsymbol{K}})) \nonumber\\ & ={-}(\partial_a V^b({\boldsymbol{X}}))\,\frac{\eth}{\partial \varOmega} \int \mathrm{d}{\boldsymbol{K}}\,K_b A({\boldsymbol{X}}, {\boldsymbol{K}}) G_0(\varOmega ({\boldsymbol{X}}, {\boldsymbol{K}})), \end{align}

where the notation $\eth /\partial \lambda \equiv \eth _\lambda$ is defined, for any $\smash {\lambda }$ and $Q$, as follows:

(4.27)

\begin{equation} \frac{\eth}{\partial \lambda} \int Q(\lambda) \doteq \left(\frac{\partial}{\partial \vartheta}\,\int Q(\lambda + \vartheta)\right)_{\vartheta = 0}. \end{equation}

Now let us reinstate the term ${\boldsymbol {\mu }}$ in (4.16). It is readily seen (Appendix B.2) that, although ${\boldsymbol {\mu }}$ may significantly affect $\smash {T_{\tau }}$ per se, its effect on $\mathcal {J}[A, G]$ is small, namely,

(4.28)

\begin{equation} \mathcal{J}[A, G] - \mathcal{J}[A, G_0] = \mathcal{O}(\epsilon^2). \end{equation}

Below, we apply this formulation to $\smash {A = \mathcal {O}(\varepsilon ^2)}$, in which case (4.28) becomes $\smash {\mathcal {J}[A, G] - \mathcal {J}[A, G_0] = \mathcal {O}(\epsilon ^2\varepsilon ^2)}$. Such corrections are negligible within our model, so from now on we adopt

(4.29)

\begin{equation} G({\boldsymbol{X}}, {\boldsymbol{K}}) \approx \overline{G}({\boldsymbol{X}}, {\boldsymbol{K}}) \approx G_0(\varOmega({\boldsymbol{X}}, {\boldsymbol{K}})). \end{equation}

4.3. Initial conditions

Consider the function $g$ from (3.24). Using (3.26), the latter can be written as follows:

(4.30)

\begin{equation} g = \lim_{\nu \to 0+} \lim_{\tau_0 \to -\infty} \left(\widehat{T}_{\tau - \tau_0} \widetilde{f}({\boldsymbol{X}}) + \int_0^{\tau-\tau_0} \mathrm{d}\tau'\,(1 - \mathrm{e}^{-\nu \tau'})\widehat{T}_{\tau'}\mathscr{F}({\boldsymbol{X}})\right). \end{equation}

Because $(1 - \mathrm {e}^{-\nu \tau })$ is smooth and $\smash {\widehat {T}_{\tau }\mathscr {F}}$ is rapidly oscillating, the second term in the external parenthesis is an oscillatory function of $\tau _0$ with the average negligible at $\smash {\nu \to 0}$. But the whole expression in these parenthesis is independent of $\tau _0$ at large $\tau _0$ (§ 3.2). Thus, it can be replaced with its own average over $\tau _0$, denoted $\smash {\langle \ldots \rangle _{\tau _0}}$. Because there is no $\smash {\nu }$-dependence left in this case, one can also omit $\smash {\lim _{\nu \to 0+}}$. That gives

(4.31)

\begin{equation} g = \lim_{\tau_0 \to -\infty} \langle \widehat{T}_{\tau - \tau_0} \widetilde{f}({\boldsymbol{X}}) \rangle_{\tau_0}. \end{equation}

Using

(4.32)

\begin{equation} f({\boldsymbol{X}}) = \sum_\sigma \delta({\boldsymbol{z}} - {\boldsymbol{z}}_\sigma(t)), \qquad \mathfrak{f}({\boldsymbol{X}}) \doteq \sum_\sigma \delta({\boldsymbol{z}} - \overline{{\boldsymbol{z}}}_\sigma(t)), \end{equation}

where the sum is taken over individual particles, one can writeFootnote ¹⁹

(4.33)

\begin{equation} \textstyle f({\boldsymbol{X}}) \approx \mathfrak{f}({\boldsymbol{X}}) - \sum_\sigma \widetilde{{\boldsymbol{z}}}_\sigma({\boldsymbol{X}}) \partial_{{\boldsymbol{z}}}\delta({\boldsymbol{z}} - \overline{{\boldsymbol{z}}}_\sigma({\boldsymbol{X}})), \end{equation}

where $\smash {{\boldsymbol {z}}_\sigma \doteq {\boldsymbol {z}}_\sigma - \overline {{\boldsymbol {z}}}_\sigma }$ are the $\smash {\widetilde {H}}$-driven small deviations from the particle unperturbed trajectories $\smash {\overline {{\boldsymbol {z}}}_\sigma }$. Then, $\smash {\overline {f} = \overline {\mathfrak {f}}({\boldsymbol {X}})}$, and the linearized perturbation $\smash {\widetilde {f} \doteq f - \overline {f}}$ is given by

(4.34)

\begin{equation} \textstyle \widetilde{f}({\boldsymbol{X}}) = \underbrace{\, \mathfrak{f}({\boldsymbol{X}}) -\overline{\mathfrak{f}}({\boldsymbol{X}}) }_{\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle\thicksim}$}}{\widetilde{f}}} \underbrace{\textstyle - \sum_\sigma \widetilde{{\boldsymbol{z}}}_\sigma({\boldsymbol{X}}) \partial_{{\boldsymbol{z}}}\delta({\boldsymbol{z}} - \overline{{\boldsymbol{z}}}_\sigma({\boldsymbol{X}})) }_{\underline{\widetilde{f}}}. \end{equation}

By definition, the unperturbed trajectories $\smash {\overline {{\boldsymbol {z}}}_\sigma }$ satisfy $\smash {\widehat {L} \delta ({\boldsymbol {z}} - \overline {{\boldsymbol {z}}}_\sigma ({\boldsymbol {X}})) = 0}$, where $\smash {\widehat {L}}$ as in (3.14); thus,

(4.35)

\begin{equation} \widehat{T}_{\tau - \tau_0} \underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle\thicksim}$}}{\widetilde{f}} = \mathrm{e}^{\widehat{L} (\tau - \tau_0)} \underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle\thicksim}$}}{\widetilde{f}} = \underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle\thicksim}$}}{\widetilde{f}}. \end{equation}

Also, $\smash {\langle \widehat {T}_{\tau - \tau _0} \underline {\widetilde {f}} \rangle _{\tau _0} = 0}$, because $\smash {\widetilde {{\boldsymbol {z}}}_\sigma }$ are oscillatory functions of ${\boldsymbol {X}}$ that is slowly evolved by $\smash {\widehat {T}_{\tau - \tau _0}}$. Hence, $\smash {g}$ is the microscopic part of the unperturbed distribution function:

(4.36)

\begin{equation} g = \underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle\thicksim}$}}{\widetilde{f}} = \mathfrak{f}({\boldsymbol{X}}) - \overline{\mathfrak{f}}({\boldsymbol{X}}). \end{equation}

This indicates that the term $\smash {\varGamma }$ defined in (3.34) is due to collisional effects. We postpone discussing these effects until § 6, so $\smash {\varGamma }$ will be ignored for now.

4.4. Summary of § 4

The main result of this section is that the Weyl symbol of the effective Green's operator $\smash {\widehat {G}}$ can be approximated within the assumed accuracy as follows:

(4.37)

\begin{equation} G({\boldsymbol{X}}, {\boldsymbol{K}}) \approx G_0(\varOmega({\boldsymbol{X}}, {\boldsymbol{K}})), \qquad \varOmega({\boldsymbol{X}}, {\boldsymbol{K}}) \doteq{-} {\boldsymbol{K}} \cdot {\boldsymbol{V}}({\boldsymbol{X}}). \end{equation}

Here, $\smash {{\boldsymbol {V}}}$ is the unperturbed velocity in the $\smash {{\boldsymbol {X}}}$ space, so $\smash {\varOmega ({\boldsymbol {X}}, {\boldsymbol {K}}) = \omega - {\boldsymbol {k}} \cdot {\boldsymbol {v}} + \mathcal {O}(\epsilon )}$, where $\smash {{\boldsymbol {v}}}$ is the unperturbed velocity in the $\smash {{\boldsymbol {x}}}$ space, and

(4.38)

\begin{equation} G_0(\varOmega) = {\rm \pi}\,\delta(\varOmega) + \mathrm{i}\,\operatorname{pv}\frac{1}{\varOmega}, \qquad \operatorname{pv}\frac{1}{\varOmega} \doteq \lim_{\nu \to 0+} \frac{\varOmega}{\nu^2 + \varOmega^2}. \end{equation}

We also show that the term $\smash {\varGamma }$ defined in (3.34) is due to collisional effects. We postpone discussing these effects until § 6, so $\smash {\varGamma }$ will be ignored for now.

5. Interaction with prescribed fields

In this section, we explore the effect of the diffusion operator $\smash {\widehat {D}^{\alpha \beta }}$. The oscillations will be described by $\smash {\overline {W}}$ as a prescribed function, so they are allowed (yet not required) to be ‘off-shell’, i.e. do not have to be constrained by a dispersion relation. Examples of off-shell fluctuations include driven near-field oscillations, evanescent waves and microscopic fluctuations (see also § 6). We will first derive the symbol of $\smash {\widehat {D}^{\alpha \beta }}$ and, using this symbol, approximate the diffusion operator with a differential operator (§ 5.1). Then, we will calculate the coefficients in the approximate expression for $\smash {\widehat {D}^{\alpha \beta }}$ (§§ 5.2 and 5.3). Finally, we will introduce the concept of the OC distribution (§ 5.4) and summarize and simplify the resulting equations (§ 5.6).

5.1. Expansion of the dispersion operator

The (effective) Green's operator can be represented through its symbol $G$ using (2.71):

(5.1)

\begin{equation} \widehat{G} = \frac{1}{(2{\rm \pi})^N} \int \mathrm{d}{\boldsymbol{X}}\,\mathrm{d}{\boldsymbol{K}}\,\mathrm{d}{\boldsymbol{S}} {\left. {\boldsymbol {\mathsf {|X}}} \right\rangle} + {{\boldsymbol{S}}/2} G({\boldsymbol{X}}, {\boldsymbol{K}}) {\left\langle {{\boldsymbol {\mathsf {X|}}}} \right.} - {{\boldsymbol{S}}/2} \mathrm{e}^{\mathrm{i} {\boldsymbol{K}} \cdot {\boldsymbol{S}}}. \end{equation}

The corresponding representation of $\widehat {u\,\,}^\alpha$ is even simpler, because the symbol of $\widehat {u\,\,}^\alpha$ is independent of ${\boldsymbol {K}}$:Footnote ²⁰

(5.2)

\begin{equation} \widehat{u\,}^\alpha = \int \mathrm{d}{\boldsymbol{X}} {\left. {\boldsymbol {\mathsf {|X}}} \right\rangle} u^\alpha({\boldsymbol{X}}) {\left\langle {{\boldsymbol {\mathsf {X|}}}} \right.}. \end{equation}

Let us also introduce the Wigner matrix of $u^\alpha$, denoted $\smash {W_{{\boldsymbol {u}}}^{\alpha \beta }}$, and its inverse Fourier transform $\smash {C_{{\boldsymbol {u}}}^{\alpha \beta }}$ as in § 2.2.3. Using these together with (2.67), one obtains

(5.3)

Then, by taking $\smash {\text {symb}_X }$ of (

5.3

), one finds that the symbol of $\smash {\widehat {D}^{\alpha \beta }}$ is a convolution of $\smash {\overline {W}_{{\boldsymbol {u}}}^{\alpha \beta }}$ and $\smash {G}$ (Appendix B.3):

(5.4)

\begin{equation} D^{\alpha\beta}({\boldsymbol{X}}, {\boldsymbol{K}}) = \int \mathrm{d}{\boldsymbol{K}}'\, \overline{W}_{{\boldsymbol{u}}}^{\alpha\beta}({\boldsymbol{X}}, {\boldsymbol{K}}') G({\boldsymbol{X}}, {\boldsymbol{K}} - {\boldsymbol{K}}'). \end{equation}

Let us Taylor-expand the symbol (5.4) in ${\boldsymbol {K}}$:

(5.5)

\begin{align} D^{\alpha\beta}({\boldsymbol{X}}, {\boldsymbol{K}}) &\approx \int \mathrm{d}{\boldsymbol{K}}'\, \overline{W}_{{\boldsymbol{u}}}^{\alpha\beta}({\boldsymbol{X}}, {\boldsymbol{K}}') G({\boldsymbol{X}}, -{\boldsymbol{K}}')\nonumber\\ &\quad + K_c \int \mathrm{d}{\boldsymbol{K}}'\,\overline{W}_{{\boldsymbol{u}}}^{\alpha\beta}({\boldsymbol{X}}, {\boldsymbol{K}}') G^{|c}({\boldsymbol{X}}, -{\boldsymbol{K}}') +\mathcal{O}(K_a K_b G^{|ab}). \end{align}

As a reminder, $G^{|a}({\boldsymbol {X}}, -{\boldsymbol {K}}) = - \partial ^a G({\boldsymbol {X}}, -{\boldsymbol {K}})$ denotes the derivative of $G$ with respect to (the $a$th component of) the whole second argument, $-K_a$, and

(5.6)

\begin{equation} K_a\,\frac{\partial G}{\partial K_a} = \omega\,\frac{\partial G}{\partial \omega} + k_i\,\frac{\partial G}{\partial k_i} + r^i\,\frac{\partial G}{\partial r^i}. \end{equation}

Upon application of $\smash {\text {oper}_X }$, $\omega$ gets replaced (roughly) with $\mathrm {i} \partial _t = \mathcal {O}(\epsilon )$ and $k_i$ gets replaced (also roughly) with $-\mathrm {i}\partial _i = \mathcal {O}(\epsilon )$. By (4.22), the last term in (5.6) is of order $\epsilon$ too. This means that the contribution of the whole $\smash {K_a \partial ^a G}$ term to the equation for $\smash {\overline {f}}$ is of order $\epsilon$. The standard QLT neglects this contribution entirely, i.e. adopts $\smash {D^{\alpha \beta }({\boldsymbol {X}}, {\boldsymbol {K}}) \approx D^{\alpha \beta }({\boldsymbol {X}}, {\boldsymbol {0}})}$, in which case the diffusion operator becomes just a local function of phase-space variables, $\smash {\widehat {D}^{\alpha \beta } \approx D^{\alpha \beta }({\boldsymbol {X}}, {\boldsymbol {0}})}$. In this work, we retain corrections to the first order in ${\boldsymbol {X}}$, i.e. keep the second term in (5.5) as well, while neglecting the higher-order terms as usual.

Within this model, one can rewrite (5.5) as follows:

(5.7)

\begin{equation} D^{\alpha\beta}({\boldsymbol{X}}, {\boldsymbol{K}}) \approx D_0^{\alpha\beta}({\boldsymbol{X}}) + K_c \varTheta^{\alpha \beta c}({\boldsymbol{X}}). \end{equation}

Here, we used (4.19) and introduced

(5.8)

\begin{gather} \displaystyle D_0^{\alpha\beta}({\boldsymbol{X}}) \doteq \int \mathrm{d}{\boldsymbol{K}}\,\overline{W}_{{\boldsymbol{u}}}^{\alpha\beta}({\boldsymbol{X}}, {\boldsymbol{K}})G^*({\boldsymbol{X}}, {\boldsymbol{K}}), \end{gather}

(5.9)

\begin{gather}\displaystyle \varTheta^{\alpha \beta c}({\boldsymbol{X}}) \doteq{-}\int \mathrm{d}{\boldsymbol{K}}\, \overline{W}_{{\boldsymbol{u}}}^{\alpha\beta}({\boldsymbol{X}}, {\boldsymbol{K}}) (G^{|c}({\boldsymbol{X}}, {\boldsymbol{K}}))^*, \end{gather}

which satisfy (Appendix B.4)

(5.10)

\begin{equation} D_0^{\alpha\beta}({\boldsymbol{X}}) = (D_0^{\alpha\beta}({\boldsymbol{X}}))^*, \qquad \varTheta^{\alpha\beta c}({\boldsymbol{X}}) ={-} (\varTheta^{\alpha\beta c}({\boldsymbol{X}}))^*. \end{equation}

The first-order Weyl expansion of $\smash {\widehat {D}^{\alpha \beta }}$ is obtained by applying $\smash {\text {oper}_X }$ to (5.7). Namely, for any $\psi$, one has (cf. § 2.1.5)

(5.11)

\begin{equation} \widehat{D}^{\alpha\beta}\psi \approx D_0^{\alpha\beta} \psi - \mathrm{i} \varTheta^{\alpha \beta c} \partial_c\psi - \frac{\mathrm{i}}{2}\,(\partial_c\varTheta^{\alpha \beta c})\psi. \end{equation}

What remains now is to calculate the functions $\smash {D_0^{\alpha \beta }}$ and $\smash {\varTheta ^{\alpha \beta c}}$ explicitly.

5.2. Wigner matrix of the velocity oscillations

To express $\smash {\widehat {D}^{\alpha \beta }}$ through the Wigner function $W$ of the perturbation Hamiltonian (§ 3.1.1), we need to express $\smash {W_{{\boldsymbol {u}}}^{\alpha \beta }}$ through $W$. Recall that $\smash {W_{{\boldsymbol {u}}}^{\alpha \beta }}$ is the symbol of the density operator (§ 2.2.3). By definition (3.8), one has $\smash {u^{\alpha } = \mathrm {i} J^{\alpha \mu } \widehat {q}_{\mu } \widetilde {H}}$, where $\smash {\widehat {q}_\alpha \doteq - \mathrm {i}\partial _\alpha }$ (§ 2.2.1). Then,

(5.12)

\begin{equation} \widehat{W}_{{\boldsymbol{u}}}^{\alpha\beta} = (2{\rm \pi})^{{-}N} J^{\alpha \mu}\widehat{q}_{\mu}{\left. {\boldsymbol {\mathsf {|{\widetilde{H}}}}} \right\rangle}{\left\langle {{\boldsymbol {\mathsf {{\widetilde{H}}|}}}} \right.}\widehat{q}_{\nu} J^{\beta \nu} = J^{\alpha\mu} J^{\beta \nu}\widehat{q}_{\mu} \widehat{W} \widehat{q}_{\nu}, \end{equation}

where $\widehat {W}$ is the density operator whose symbol is $W$. By applying $\smash {\text {symb}_X }$, one obtains

(5.13)

\begin{equation} W_{{\boldsymbol{u}}}^{\alpha\beta} = J^{\alpha \mu}J^{\beta \nu} (q_{\mu} \,\bigstar \,W \,\bigstar\, q_{\nu}), \end{equation}

where $\bigstar$ is the Moyal product (2.72). Using formulas analogous to (2.33) in the $\smash {({\boldsymbol {X}}, {\boldsymbol {K}})}$ space, one obtains

(5.14)

\begin{align} q_{\mu} \,\bigstar\, W \,\bigstar \,q_{\nu} & = \left(q_{\mu} W -\frac{\mathrm{i}}{2} \frac{\partial W}{\partial z^{\mu}} \right) \,\bigstar \,q_{\nu} \nonumber\\ & = q_{\mu}q_{\nu} W - \frac{\mathrm{i}}{2}\,q_{\nu}\,\frac{\partial W}{\partial z^{\mu}} + \frac{\mathrm{i}}{2} \frac{\partial}{\partial z^{\nu}} \left(q_{\mu} W_h - \frac{\mathrm{i}}{2}\frac{\partial W}{\partial z^{\mu}}\right) \nonumber\\ & = q_{\mu}q_{\nu} W_h + \frac{\mathrm{i}}{2} \left(q_{\mu}\frac{\partial W}{\partial z^{\nu}} - q_{\nu}\frac{\partial W}{\partial z^{\mu}}\right) + \frac{1}{4}\frac{\partial^2 W}{\partial z^{\mu}\partial z^{\nu}}. \end{align}

Hence, $W_{{\boldsymbol {u}}}^{\alpha \beta }$ and $W$ are connected via the following exact formula:

(5.15)

\begin{equation} W_{{\boldsymbol{u}}}^{\alpha\beta} = J^{\alpha \mu}J^{\beta \nu} \left( q_{\mu} q_{\nu} W - \frac{\mathrm{i}}{2} \left( q_{\nu}\,\frac{\partial W}{\partial z^{\mu}} -q_{\mu}\,\frac{\partial W}{\partial z^{\nu}} \right) + \frac{1}{4} \frac{\partial^2 W}{\partial z^{\mu} \partial z^{\nu}} \right). \end{equation}

5.3. Nonlinear potentials

Due to (5.10), one has $D_0^{\alpha \beta } = \operatorname {re} D_0^{\alpha \beta }$. Using this together with (5.8), (5.15), (4.29), and (4.23), one obtains

(5.16)

\begin{align} D_0^{\alpha\beta} = J^{\alpha \mu}J^{\beta \nu} \operatorname{re} &\int \mathrm{d}{\boldsymbol{K}} \left({\rm \pi}\,\delta(\varOmega) - \mathrm{i}\, \operatorname{pv}\frac{1}{\varOmega}\right). \nonumber\\ & \qquad \times\left( q_{\mu} q_{\nu} \overline{W} - \frac{\mathrm{i}}{2}\left( q_{\nu}\,\frac{\partial\overline{W}}{\partial z^{\mu}} - q_{\mu}\,\frac{\partial\overline{W}}{\partial z^{\nu}} \right) + \frac{1}{4} \frac{\partial^2\overline{W}}{\partial z^{\mu}\partial z^{\nu}} \right), \end{align}

with notation as in (2.10). This can be written as $\smash {D_0^{\alpha \beta } = {\mathsf {D}}^{\alpha \beta } + \varrho ^{\alpha \beta } + \varsigma ^{\alpha \beta }}$, where

(5.17)

\begin{equation} {\mathsf{D}}^{\alpha\beta} \doteq J^{\alpha\mu}J^{\beta\nu} \int \mathrm{d}{\boldsymbol{K}}\, {\rm \pi}\,\delta(\varOmega)\, q_{\mu}q_{\nu}\overline{W}, \end{equation}

and we also introduced

(5.18)

\begin{gather} \displaystyle \varrho^{\alpha\beta} \doteq{-} \frac{1}{2}\,J^{\alpha\mu}J^{\beta\nu} {\unicode{x2A0F}} \mathrm{d}{\boldsymbol{K}} \left( q_{\nu}\,\frac{\partial \overline{W}}{\partial z^{\mu}} - q_{\mu}\,\frac{\partial \overline{W}}{\partial z^{\nu}} \right) \frac{1}{\varOmega}, \end{gather}

(5.19)

\begin{gather}\displaystyle \varsigma^{\alpha\beta} \doteq \frac{1}{4}\, J^{\alpha\mu} J^{\beta\nu} \int \mathrm{d}{\boldsymbol{K}}\,{\rm \pi}\,\delta(\varOmega)\,\frac{\partial^2 \overline{W}}{\partial z^{\mu} \partial z^{\nu}}. \end{gather}

As shown in Appendix B.5, the contributions of these two functions to (3.33) are

(5.20)

\begin{equation} \frac{\partial}{\partial z^{\alpha}} \left(\varrho^{\alpha\beta}\, \frac{\partial\overline{f}}{\partial z^{\beta}} \right) = \mathcal{O}(\epsilon\varepsilon^2), \qquad \frac{\partial}{\partial z^{\alpha}} \left( \varsigma^{\alpha\beta}\,\frac{\partial\overline{f}}{\partial z^{\beta}} \right) = \mathcal{O}(\epsilon^2\varepsilon^2). \end{equation}

Thus, $\smash {\varrho ^{\alpha \beta }}$ must be retained and $\smash {\varsigma ^{\alpha \beta }}$ must be neglected, which leads to

(5.21)

\begin{equation} D_0^{\alpha\beta} \approx {\mathsf{D}}^{\alpha\beta} + \varrho^{\alpha\beta}. \end{equation}

The function $\smash {\varTheta ^{\alpha \beta c} = \mathrm {i}\, \operatorname {im} \varTheta ^{\alpha \beta c}}$ can be written as follows:

\[\varTheta^{\alpha \beta c}({\boldsymbol{X}}) = \mathrm{i} J^{\alpha \mu}J^{\beta \nu} \int \mathrm{d}{\boldsymbol{K}}\,q_{\mu}q_{\nu}\overline{W}({\boldsymbol{X}}, {\boldsymbol{K}})\,\frac{\partial}{\partial K_c}\,\operatorname{pv} \frac{1}{\varOmega ({\boldsymbol{X}}, {\boldsymbol{K}})} ={-}\mathrm{i} V^c({\boldsymbol{X}})\, \Theta^{\alpha\beta}({\boldsymbol{X}}), \]

where we introduced

(5.22)

\begin{equation} \Theta^{\alpha\beta} \doteq J^{\alpha \mu}J^{\beta \nu} \frac{\eth}{\partial \varOmega} {\unicode{x2A0F}} \mathrm{d}{\boldsymbol{K}}\,\frac{q_{\mu}q_{\nu}\overline{W}}{\varOmega} \end{equation}

and $\smash {\eth }$ is defined as in (4.27). Then finally, one can rewrite (5.11) as follows:

(5.23)

\begin{align} \widehat{D}^{\alpha\beta}\psi & \approx ({\mathsf{D}}^{\alpha\beta} + \varrho^{\alpha\beta})\psi -\Theta^{\alpha\beta} V^c \partial_c\psi - \frac{1}{2}\,V^c(\partial_c\Theta^{\alpha\beta})\psi \nonumber\\ & = ({\mathsf{D}}^{\alpha\beta} + \varrho^{\alpha\beta})\psi - \Theta^{\alpha\beta} (\partial_t + v^{\lambda}\partial_{\lambda})\psi -\frac{1}{2}\,((\partial_t+v^{\lambda}\partial_{\lambda})\Theta^{\alpha\beta})\psi, \end{align}

where we used (3.15). With some algebra (Appendix B.6), and assuming the notation

(5.24)

\begin{equation} \varPhi ={-} J^{\mu \nu}\,\frac{\partial}{\partial z^{\mu}} {\unicode{x2A0F}} \mathrm{d}{\boldsymbol{K}}\,\frac{q_{\nu}\overline{W}}{2\varOmega}, \end{equation}

one finds that (5.23) leads to

(5.25)

\begin{equation} \partial_{\alpha}(\widehat{D}^{\alpha\beta}\partial_{\beta}\overline{f}) = \partial_{\alpha}({\mathsf{D}}^{\alpha\beta}\partial_{\beta}\overline{f}) - \frac{1}{2}\,\mathrm{d}_t\partial_{\alpha}(\Theta^{\alpha\beta} \partial_{\beta}\overline{f}) + \lbrace \varPhi, \overline{f} \rbrace. \end{equation}

Hence, (3.33) becomes (to the extent that $\varGamma$ is negligible; see § 6.7)

(5.26)

\begin{equation} \mathrm{d}_t \overline{f} + \frac{1}{2}\,\mathrm{d}_t\partial_{\alpha}(\Theta^{\alpha\beta} \partial_{\beta}\overline{f}) - \lbrace \varPhi, \overline{f} \rbrace = \partial_{\alpha}({\mathsf{D}}^{\alpha \beta} \partial_{\beta} \overline{f}). \end{equation}

The functions $\smash {\Theta ^{\alpha \beta }}$, $\smash {\varPhi }$ and $\smash {{\mathsf {D}}^{\alpha \beta }}$ that determine the coefficients in this equation are fundamental and, for the lack of a better term, will be called nonlinear potentials.

5.4. Oscillation-centre distribution

Let us introduce

(5.27)

\begin{equation} F \doteq \overline{f} + \frac{1}{2}\,\partial_{\alpha}(\Theta^{\alpha\beta} \partial_{\beta}\overline{f}). \end{equation}

Then, using (5.25), one can rewrite (5.26) asFootnote ²¹

(5.28)

\begin{equation} \partial_t F - \lbrace \mathcal{H}, F \rbrace = \partial_{\alpha}({\mathsf{D}}^{\alpha \beta} \partial_{\beta} F), \end{equation}

where corrections $\mathcal {O}(\varepsilon ^4)$ have been neglected and we introduced $\smash {\mathcal {H} \doteq \overline {H} + \varPhi }$. As a reminder, the nonlinear potentials in (5.28) are as follows:

(5.29)

\begin{align} {\mathsf{D}}^{\alpha\beta} &= J^{\alpha\mu}J^{\beta\nu} \int \mathrm{d}{\boldsymbol{K}}\, {\rm \pi}\,\delta(\varOmega)\, q_{\mu}q_{\nu}\overline{W}, \end{align}

(5.30)

\begin{align} \Theta^{\alpha\beta} &= J^{\alpha \mu}J^{\beta \nu} \frac{\eth}{\partial \varOmega} {\unicode{x2A0F}} \mathrm{d}{\boldsymbol{K}}\,\frac{q_{\mu}q_{\nu}\overline{W}}{\varOmega}, \end{align}

(5.31)

\begin{align} \varPhi & ={-} J^{\mu \nu}\,\frac{\partial}{\partial z^{\mu}} {\unicode{x2A0F}} \mathrm{d}{\boldsymbol{K}}\,\frac{q_{\nu}\overline{W}}{2\varOmega}. \end{align}

Equations (5.27)–(5.31) form a closed model that describes the evolution of the average distribution $\smash {\overline {f}}$ in turbulence with prescribed $\overline {W}$. In particular, (5.28) can be interpreted as a Liouville-type equation for $F$ as an effective, or ‘dressed’, distribution. The latter can be understood as the distribution of ‘dressed’ particles called OCs. Then, $\mathcal {H}$ serves as the OC Hamiltonian, $\smash {{\mathsf {D}}^{\alpha \beta }}$ is the phase-space diffusion coefficient, $\smash {\varPhi }$ is the ponderomotive energy, $\smash {\varOmega = \omega - q_\alpha v^\alpha }$ and $\smash {v^\alpha \doteq J^{\alpha \beta }\partial _\beta \overline {H}}$. Within the assumed accuracy, one can redefine $\smash {v^\alpha }$ to be the OC velocity rather than the particle velocity; specifically,Footnote ²²

(5.32)

\begin{equation} v^\alpha \doteq J^{\alpha\beta}\partial_\beta\mathcal{H} = J^{\alpha\beta}\partial_\beta\overline{H} + \mathcal{O}(\epsilon^2). \end{equation}

Then, the presence of $\smash {\delta (\varOmega )}$ in (5.29) signifies that OCs diffuse in phase space in response to waves they are resonant with. Below, we use the terms ‘OCs’ and ‘particles’ interchangeably except where specified otherwise.

That said, the interpretation of OCs as particle-like objects is limited. Single-OC motion equations are not introduced in our approach. (They would have been singular for resonant interactions.) Accordingly, the transformation (5.27) of the distribution function $\smash {\overline {f} \mapsto F}$ is not derived from a coordinate transformation but rather is fundamental. As a result, particles and OCs live in the same phase space, but the ‘dynamics of OCs’ can be irreversible (§ 5.5). This qualitatively distinguishes our approach from the traditional OC theory (Dewar Reference Dewar1973) and from the conceptually similar gyrokinetic theory (Littlejohn Reference Littlejohn1981; Cary & Brizard Reference Cary and Brizard2009), where coordinate transformations are central.

5.5. ${H}$-theorem

Because $\smash {\overline {W}}$ is non-negative (§ 2.1.6), $\smash {{\mathsf {D}}^{\alpha \beta }}$ is positive-semidefinite; that is,

(5.33)

\begin{equation} {\mathsf{D}}^{\alpha\beta} \psi_\alpha \psi_\beta = \int \mathrm{d}{\boldsymbol{K}}\, {\rm \pi}\,\delta(\varOmega)\, a^2 \overline{W} \geqslant 0, \qquad a \doteq J^{\alpha\mu} \psi_\alpha q_{\mu} \end{equation}

for any real $\smash {\psi }$. This leads to the following theorem. Consider the OC entropy defined as

(5.34)

\begin{equation} \mathscr{S} \doteq{-} \int \mathrm{d}{\boldsymbol{z}}\,F(t, {\boldsymbol{z}})\ln F(t, {\boldsymbol{z}}). \end{equation}

According to (5.28), $\smash {\mathscr {S}}$ satisfies

(5.35)

\begin{align} \frac{\mathrm{d}\mathscr{S}}{\mathrm{d} t} & ={-} \int \mathrm{d}{\boldsymbol{z}}\,\frac{\mathrm{d}(F \ln F)}{\mathrm{d} F}\left( \lbrace \mathcal{H}, F \rbrace + \partial_{\alpha}({\mathsf{D}}^{\alpha \beta} \partial_{\beta} F) \right)\nonumber\\ & ={-} \int \mathrm{d}{\boldsymbol{z}}\,\frac{\mathrm{d}(F \ln F)}{\mathrm{d} F}\,J^{\alpha\beta}(\partial_\alpha \mathcal{H})(\partial_\beta F) - \int \mathrm{d}{\boldsymbol{z}}\,(1 + \ln F)\,\partial_{\alpha}({\mathsf{D}}^{\alpha \beta} \partial_{\beta} F) \nonumber\\ & ={-} \int \mathrm{d}{\boldsymbol{z}}\,J^{\alpha\beta}(\partial_\alpha \mathcal{H}) \partial_\beta(F \ln F) - \int \mathrm{d}{\boldsymbol{z}}\,\ln F\,\partial_{\alpha}({\mathsf{D}}^{\alpha \beta} \partial_{\beta} F) \nonumber\\ & = \int \mathrm{d}{\boldsymbol{z}}\,(J^{\alpha\beta} \partial^2_{\alpha\beta} \mathcal{H})\,F \ln F + \int \mathrm{d}{\boldsymbol{z}}\,{\mathsf{D}}^{\alpha \beta} (\partial_\alpha\ln F) (\partial_{\beta} \ln F) F. \end{align}

The first integral vanishes due to $\smash {J^{\alpha \beta } \partial ^2_{\alpha \beta } = 0}$. The second integral is non-negative due to (5.33). Thus,

(5.36)

\begin{equation} \frac{\mathrm{d}\mathscr{S}}{\mathrm{d} t} \geqslant 0, \end{equation}

which is recognized as the $\smash {H}$-theorem (Lifshitz & Pitaevskii Reference Lifshitz and Pitaevskii1981, § 4) for QL OC dynamics.

5.6. Summary of § 5

From now on, we assume that the right-hand side of (5.28) scales not as $\smash {\mathcal {O}(\varepsilon ^2)}$ but as $\smash {\mathcal {O}(\epsilon \varepsilon ^2)}$, either due to the scarcity of resonant particles or, for QL diffusion driven by microscopic fluctuations (§ 6), due to the plasma parameter's being large. Also, the spatial derivatives can be neglected within the assumed accuracy in the definition of $F$ (5.27) and on the right-hand side of (5.28). Using this together with (2.69), and with (2.56) for the Poisson bracket, our results can be summarized as follows.

QL evolution of a particle distribution in a prescribed wave field is governed byFootnote ²³

(5.37)

\begin{equation} \frac{\partial F}{\partial t} - \frac{\partial \mathcal{H}}{\partial {\boldsymbol{x}}} \cdot \frac{\partial F}{\partial {\boldsymbol{p}}} + \frac{\partial \mathcal{H}}{\partial {\boldsymbol{p}}} \cdot \frac{\partial F}{\partial {\boldsymbol{x}}} = \frac{\partial}{\partial {\boldsymbol{p}}} \cdot \left({\boldsymbol {\mathsf{D}}}\,\frac{\partial F}{\partial {\boldsymbol{p}}}\right). \end{equation}

The OC distribution $\smash {F}$ is defined as

(5.38)

\begin{equation} F = \overline{f} + \frac{1}{2}\,\frac{\partial}{\partial {\boldsymbol{p}}} \cdot \left({\boldsymbol{\Theta}}\, \frac{\partial \overline{f}}{\partial {\boldsymbol{p}}}\right), \end{equation}

so the density of OCs is the same as the locally averaged density of the true particles:

(5.39)

\begin{equation} \mathcal{N} \doteq \int \mathrm{d}{\boldsymbol{p}}\,F = \int \mathrm{d}{\boldsymbol{p}}\,\overline{f}. \end{equation}

The function $\smash {\mathcal {H}}$ is understood as the OC Hamiltonian. It is given by

(5.40)

\begin{equation} \mathcal{H} \doteq \overline{H} + \varPhi, \end{equation}

where $\smash {\overline {H}}$ is the average Hamiltonian, which may include interaction with background fields, and $\smash {\varPhi }$ is the ponderomotive potential. The nonlinear potentials that enter (5.37) can be calculated to the zeroth order in $\epsilon$ and are given byFootnote ²⁴

(5.41)

\begin{align} {{\boldsymbol {\mathsf{D}}}} &= \int \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\, {\rm \pi}\,{\boldsymbol{k}}{\boldsymbol{k}} \overline{{\mathsf{W}}}(t, {\boldsymbol{k}} \cdot {\boldsymbol{v}}, {\boldsymbol{k}}; {\boldsymbol{p}}) , \end{align}

(5.42)

\begin{align} {\boldsymbol{\Theta}} &=\frac{\partial}{\partial \vartheta} {\unicode{x2A0F}} \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\, \left. \frac{{\boldsymbol{k}} {\boldsymbol{k}} \overline{{\mathsf{W}}}}{\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}} + \vartheta} \right|_{\vartheta=0}, \end{align}

(5.43)

\begin{align} \varPhi &= \frac{1}{2}\frac{\partial}{\partial {\boldsymbol{p}}} \cdot {\unicode{x2A0F}} \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\,\frac{{\boldsymbol{k}} \overline{{\mathsf{W}}}}{\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}}, \end{align}

where $\smash {{\boldsymbol {k}} {\boldsymbol {k}}}$ is a dyadic matrix with two lower indices, and the same conventions apply as in § 2.1.2. Also, $\smash {{\boldsymbol {v}}}$ is hereby redefined as the OC spatial velocity, namely,

(5.44)

\begin{equation} {\boldsymbol{v}} \doteq \partial_{\boldsymbol{p}}\mathcal{H} = \partial_{\boldsymbol{p}}\overline{H} + \mathcal{O}(\epsilon^2). \end{equation}

The function $\smash {\overline {{\mathsf {W}}}}$ is defined as

(5.45)

\begin{equation} \overline{{\mathsf{W}}}(t, {\boldsymbol{x}}, \omega, {\boldsymbol{k}}; {\boldsymbol{p}}) \doteq \int \mathrm{d}{\boldsymbol{r}}\,\overline{W}(t, {\boldsymbol{x}}, {\boldsymbol{p}}, \omega, {\boldsymbol{k}}, {\boldsymbol{r}}), \end{equation}

where $\overline {W}$ is the average Wigner function (3.4) of the perturbation Hamiltonian, i.e. the spectrum of its symmetrized autocorrelation function (3.5). Due to (2.78), it can be understood as the average of $\smash {{\mathsf {W}} \doteq \text {symb}_{{\mathsf {x}}} \widehat {{\mathsf {W}}}}$ (where $\smash {\widehat {{\mathsf {W}}}}$ is defined in (3.3)), i.e. as the Wigner function of the perturbation Hamiltonian with $\smash {{\boldsymbol {p}}}$ treated as a parameter. As such, $\smash {\overline {{\mathsf {W}}}}$ is non-negative, so $\smash {{{\boldsymbol {\mathsf {D}}}}}$ is positive-semidefinite. This leads to an $\smash {H}$-theorem (proven similarly to (5.36)) for the entropy density $\smash {\sigma \doteq - \int \mathrm {d}{\boldsymbol {p}}\,F\ln F}$:

(5.46)

\begin{equation} \left(\frac{\mathrm{d}\sigma}{\mathrm{d} t}\right)_{{\mathsf{D}}} \geqslant 0, \qquad \left(\frac{\partial \psi}{\partial t}\right)_{{\mathsf{D}}} \doteq \frac{\partial}{\partial {\boldsymbol{p}}} \cdot \left({\boldsymbol {\mathsf{D}}}\,\frac{\partial \psi}{\partial {\boldsymbol{p}}}\right). \end{equation}

Also note that for homogeneous turbulence in particular, where $\smash {\overline {{\mathsf {W}}}}$ is independent of $\smash {{\boldsymbol {x}}}$, (2.46) yields that

(5.47)

\begin{equation} \int \mathrm{d}\omega \,\overline{{\mathsf{W}}}(t, {\boldsymbol{x}}, \omega, {\boldsymbol{k}}; {\boldsymbol{p}}) = \frac{1}{\mathscr{V}_n} \int \mathrm{d}\omega \,\mathrm{d}{\boldsymbol{x}}\,\overline{{\mathsf{W}}}(t, {\boldsymbol{x}}, \omega, {\boldsymbol{k}}; {\boldsymbol{p}}) = \frac{1}{\mathscr{V}_n}\,\overline{|\mathring{\widetilde{H}}(t, {\boldsymbol{k}}, {\boldsymbol{p}})|^2}, \end{equation}

where $\smash {\mathscr {V}_n}$ is the plasma volume (the index $\smash {n}$ denotes the number of spatial dimensions) and $\smash {\mathring {\widetilde {H}}}$ is the spatial spectrum of $\smash {\widetilde {H}}$ as defined in (2.23).

Equation (5.37) can be used to calculate the ponderomotive force $\smash {\partial _t\int \mathrm {d}{\boldsymbol {p}}\,\overline {f}}$ that a given wave field imparts on a plasma. This potentially resolves the controversies mentioned in Kentwell & Jones (Reference Kentwell and Jones1987). We will revisit this subject for on-shell waves in § 7.5.

6. Interaction with self-consistent fields

Here, we explain how to calculate the function $\overline {{\mathsf {W}}}$ in the presence of microscopic fluctuations (non-zero $g$). In particular, we reinstate the term $\smash {\varGamma }$ that was omitted in § 5. We also show that a collision operator of the Balescu–Lenard type emerges from our theory within a general interaction model. This calculation can be considered as a generalization of that in Rogister & Oberman (Reference Rogister and Oberman1968) for homogeneous plasmas. Another related calculation was proposed in Chavanis (Reference Chavanis2012) in application to potential interactions in inhomogeneous systems using action–angle variables, with global averaging over the angles. (See also Mynick (Reference Mynick1988) for a related calculation in action–angle variables based on the Fokker–Planck approach.) In contrast, our model holds for any Hamiltonian interactions via any vector fields and allows for weak inhomogeneities in canonical coordinates.

6.1. Interaction model

Let us assume that particles interact via an $M$-component real field ${\boldsymbol {\varPsi }} \equiv (\varPsi ^1, \varPsi ^2, \ldots, \varPsi ^M)^\intercal$. It is treated below as a column vector; hence the index $\smash {^\intercal }$. (A complex field can be accommodated by considering its real and imaginary parts as separate components.) We split this field into the average part $\smash {\overline {{\boldsymbol {\varPsi }}}}$ and the oscillating part $\smash {\widetilde {{\boldsymbol {\varPsi }}}}$. The former is considered given. For the latter, we assume the action integral of this field without plasma in the form

(6.1)

\begin{equation} S_0 = \int \mathrm{d}{\boldsymbol {\mathsf{x}}}\,\mathfrak{L}_0, \qquad \mathfrak{L}_0 = \frac{1}{2}\,\smash{\widetilde{{\boldsymbol{\varPsi}}}}^{{\dagger}}\widehat{\boldsymbol{\varXi}}_0\widetilde{{\boldsymbol{\varPsi}}} \end{equation}

(see § 9 for examples), where $\smash {\widehat {\boldsymbol {\varXi }}_0}$ is a Hermitian operatorFootnote ²⁵ and $\smash {\smash {\widetilde {{\boldsymbol {\varPsi }}}}^{{\dagger}} = \smash {\widetilde {{\boldsymbol {\varPsi }}}}^\intercal }$ is a row vector dual to $\smash {\widetilde {{\boldsymbol {\varPsi }}}}$. Plasma is allowed to consist of multiple species, henceforth denoted with index $\smash {s}$. Because $\smash {\widetilde {{\boldsymbol {\varPsi }}}}$ is assumed small, the generic Hamiltonian for each species $\smash {s}$ can be Taylor-expanded in $\smash {\widetilde {{\boldsymbol {\varPsi }}}}$ and represented in a generic form

(6.2)

\begin{equation} H_s(t, {\boldsymbol{x}}, {\boldsymbol{p}}) \approx H_{0s} + \widehat{\boldsymbol{\alpha}}_s^{{\dagger}} \widetilde{{\boldsymbol{\varPsi}}} + \frac{1}{2}\,(\widehat{\boldsymbol{L}}_s\widetilde{{\boldsymbol{\varPsi}}})^{{\dagger}} (\widehat{\boldsymbol{R}}_s\widetilde{{\boldsymbol{\varPsi}}}) \end{equation}

(see § 9 for examples), which can be considered as a second-order Taylor expansion of the full Hamiltonian in $\smash {\widetilde {{\boldsymbol {\varPsi }}}}$. Here, $\smash {H_{0s} \equiv H_{0s}(t, {\boldsymbol {x}}, {\boldsymbol {p}})}$ is independent of $\smash {\widetilde {{\boldsymbol {\varPsi }}} \equiv \widetilde {{\boldsymbol {\varPsi }}}(t, {\boldsymbol {x}})}$, $\smash {\widehat {\boldsymbol {\alpha }}_s \equiv (\widehat {\alpha }_{s,1}, \widehat {\alpha }_{s,2}, \ldots, \widehat {\alpha }_{s,M})^\intercal }$ is a column vector whose elements $\smash {\widehat {\alpha }_{s,i}}$ are linear operators on $\smash {\mathscr {H}_{{{\mathsf {x}}}}}$. The dagger is added so that $\smash {\widehat {\boldsymbol {\alpha }}_s^{{\dagger}} }$ could be understood as a row vector whose elements $\smash {\widehat {\alpha }_{s,i}^{{\dagger}} }$ act on the individual components of the field; i.e. $\smash {\widehat {\boldsymbol {\alpha }}_s^{{\dagger}} \widetilde {{\boldsymbol {\varPsi }}} \equiv \widehat {\alpha }^{{\dagger}} _{s,i}\widetilde {\varPsi }^i}$. We let $\smash {\widehat {\boldsymbol {\alpha }}_s}$ be non-local in $t$ and ${\boldsymbol {x}}$ (for example, $\smash {\widehat {\boldsymbol {\alpha }}_s}$ can be a spacetime derivative or a spacetime integral), and we also let $\smash {\widehat {\boldsymbol {\alpha }}_s}$ depend on $\smash {{\boldsymbol {p}}}$ parametrically so

(6.3)

\begin{equation} \text{symb}\, \widehat{\boldsymbol{\alpha}}_s = {\boldsymbol{\alpha}}_s(t, {\boldsymbol{x}}, \omega, {\boldsymbol{k}}; {\boldsymbol{p}}). \end{equation}

The matrix operators $\smash {\widehat {\boldsymbol {L}}_s}$ and $\smash {\widehat {\boldsymbol {R}}_s}$ and their symbols $\smash {{\boldsymbol {L}}_s}$ and $\smash {{\boldsymbol {R}}_s}$ are understood similarly.

The Lagrangian density of the oscillating-field–plasma system is

(6.4)

\begin{equation} \mathfrak{L}_p = \frac{1}{2}\,\smash{\widetilde{{\boldsymbol{\varPsi}}}}^{{\dagger}}\widehat{\boldsymbol{\varXi}}_0 \widetilde{{\boldsymbol{\varPsi}}} + \sum_s \sum_{\sigma_s} ({\boldsymbol{p}}_{\sigma_s} \cdot \dot{{\boldsymbol{x}}}_{\sigma_s} - H_s(t, {\boldsymbol{x}}_{\sigma_s}, {\boldsymbol{p}}_{\sigma_s})) \delta({\boldsymbol{x}} - {\boldsymbol{x}}_{\sigma_s}(t)), \end{equation}

where the sum is taken over individual particles. Note that

(6.5)

\begin{align} \sum_{\sigma_s} H_s(t, {\boldsymbol{x}}_{\sigma_s}, {\boldsymbol{p}}_{\sigma_s}) \delta({\boldsymbol{x}} - {\boldsymbol{x}}_{\sigma_s}(t)) & = \int \mathrm{d}{\boldsymbol{p}}\, \sum_{\sigma_s} \delta({\boldsymbol{z}} - {\boldsymbol{z}}_{\sigma_s}(t)) H_s(t, {\boldsymbol{x}}, {\boldsymbol{p}}) \nonumber\\ & = \int \mathrm{d}{\boldsymbol{p}}\, f_s(t, {\boldsymbol{x}}, {\boldsymbol{p}}) H_s(t, {\boldsymbol{x}}, {\boldsymbol{p}}), \end{align}

so the $\smash {\widetilde {{\boldsymbol {\varPsi }}}}$-dependent part of the system action can be written as $\smash {S = \int \mathrm {d}{\boldsymbol {\mathsf {x}}}\,\mathfrak {L}}$ with

(6.6)

\begin{gather} \displaystyle \mathfrak{L} = \frac{1}{2}\,\smash{\widetilde{{\boldsymbol{\varPsi}}}}^{{\dagger}} \widehat{\boldsymbol{\varXi}}_p \widetilde{{\boldsymbol{\varPsi}}} - \sum_s \int \mathrm{d}{\boldsymbol{p}}\,\widetilde{f}_s\widehat{\boldsymbol{\alpha}}_s^{{{\dagger}}}\widetilde{{\boldsymbol{\varPsi}}}, \end{gather}

(6.7)

\begin{gather}\displaystyle \widehat{\boldsymbol{\varXi}}_p \doteq \widehat{\boldsymbol{\varXi}}_0 - \sum_s \int \mathrm{d}{\boldsymbol{p}}\,\smash{\widehat{\boldsymbol{L}}}_s^{{\dagger}} f_s \widehat{\boldsymbol{R}}_s. \end{gather}

(The contribution of $\smash {\widetilde {f}_s}$ to the second term in (6.6) has been omitted because it averages to zero at integration over spacetime and thus does not contribute to $\smash {S}$.) This ‘abridged’ action is not sufficient to describe the particle motion, but it is sufficient to describe the dynamics of $\smash {\widetilde {{\boldsymbol {\varPsi }}}}$ at given $\smash {f_s}$, as discussed below. The operator $\smash {\widehat {\boldsymbol {\varXi }}_p}$ can be considered Hermitian without loss of generality, because its anti-Hermitian part does not contribute to $\smash {S}$. Also, we assume that unless either of $\smash {\widehat {\boldsymbol {L}}}$ and $\smash {\widehat {\boldsymbol {R}}}$ is zero, the high-frequency field has no three-wave resonances, so terms cubic in $\smash {\widetilde {{\boldsymbol {\varPsi }}}}$ can be neglected in $\smash {S}$;Footnote ²⁶ then,

(6.8)

\begin{equation} \widehat{\boldsymbol{\varXi}}_p \approx \widehat{\boldsymbol{\varXi}}_0 - \sum_s \int \mathrm{d}{\boldsymbol{p}}\,(\smash{\widehat{\boldsymbol{L}}}_s^{{\dagger}} F_s \widehat{\boldsymbol{R}}_s)_\text{H}. \end{equation}

Using the same assumption, one can also adopt

(6.9)

\begin{equation} \overline{H}_s = H_{0s} + \frac{1}{2}\,\overline{(\widehat{\boldsymbol{L}}_s\widetilde{{\boldsymbol{\varPsi}}})^{{\dagger}} (\widehat{\boldsymbol{R}}_s\widetilde{{\boldsymbol{\varPsi}}})}, \qquad \widetilde{H}_s \approx \widehat{\boldsymbol{\alpha}}_s^{{\dagger}} \widetilde{{\boldsymbol{\varPsi}}}, \end{equation}

because in the absence of three-wave resonances, the oscillating part of $\smash {(\widehat {\boldsymbol {L}}_s\widetilde {{\boldsymbol {\varPsi }}})^{{\dagger}} (\widehat {\boldsymbol {R}}_s\widetilde {{\boldsymbol {\varPsi }}})}$ contributes only $\smash {\mathcal {O}(\varepsilon ^4)}$ terms to the equation for $\smash {F_s}$.

6.2. Field equations

The Euler–Lagrange equation for $\smash {\widetilde {{\boldsymbol {\varPsi }}}}$ derived from (6.6) is

(6.10)

\begin{equation} \widehat{\boldsymbol{\varXi}}_p \widetilde{{\boldsymbol{\varPsi}}} = \sum_s \int \mathrm{d}{\boldsymbol{p}}\,\widehat{\boldsymbol{\alpha}}_s \widetilde{f}_s. \end{equation}

Then, to the extent that the linear approximation for $\smash {\tilde {f}_s}$ is sufficient (see below), one finds that the oscillating part of the field satisfies

(6.11)

\begin{equation} \widehat{\boldsymbol{\varXi}}_p \widetilde{{\boldsymbol{\varPsi}}} - \sum_s \int \mathrm{d}{\boldsymbol{p}}\,\widehat{\boldsymbol{\alpha}}_s \widehat{G}_s \lbrace \widehat{\boldsymbol{\alpha}}_s^{{\dagger}} \widetilde{{\boldsymbol{\varPsi}}}, \overline{f}_s \rbrace = \sum_s \int \mathrm{d}{\boldsymbol{p}}\,\widehat{\boldsymbol{\alpha}}_s g_s, \end{equation}

where we used (3.24). Note that the right-hand side of (6.11) is determined by microscopic fluctuations $g_s(t, {\boldsymbol {x}}, {\boldsymbol {p}})$ (§ 4.3). Equation (6.11) can also be expressed as

(6.12)

\begin{equation} \widehat{\boldsymbol{\varXi}}\widetilde{{\boldsymbol{\varPsi}}} = \sum_s \int \mathrm{d}{\boldsymbol{p}}\,\widehat{\boldsymbol{\alpha}}_s g_s, \end{equation}

where $\smash {\widehat {\boldsymbol {\varXi }}}$ is understood as the plasma dispersion operator and is given by

(6.13)

\begin{equation} \widehat{\boldsymbol{\varXi}} \doteq \widehat{\boldsymbol{\varXi}}_p - \sum_s \int \mathrm{d}{\boldsymbol{p}}\,\widehat{\boldsymbol{\alpha}}_s \widehat{G}_s \lbrace \widehat{\boldsymbol{\alpha}}_s^{{\dagger}} \,{\unicode{x25AA}}\,, \overline{f}_s \rbrace, \end{equation}

where $\smash {{\unicode{x25AA}} }$ is a placeholder. The general solution of (6.12) can be written as

(6.14)

\begin{equation} \widetilde{{\boldsymbol{\varPsi}}} = \underline{\widetilde{{\boldsymbol{\varPsi}}}} + \underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle\thicksim}$}}{\widetilde{{\boldsymbol{\varPsi}}}}, \qquad \underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle\thicksim}$}}{\widetilde{{\boldsymbol{\varPsi}}}} = \sum_s \int \mathrm{d}{\boldsymbol{p}}\,\smash{\widehat{\boldsymbol{\varXi}}}^{{-}1}\widehat{\boldsymbol{\alpha}}_s g_s. \end{equation}

Here, $\smash {\smash {\widehat {\boldsymbol {\varXi }}}^{-1}}$ is the right inverse of $\smash {\widehat {\boldsymbol {\varXi }}}$ (specifically, $\smash {\widehat {\boldsymbol {\varXi }} \smash {\widehat {\boldsymbol {\varXi }}}^{-1} = \widehat {\boldsymbol {1}}}$ yet $\smash {\smash {\widehat {\boldsymbol {\varXi }}}^{-1}\widehat {\boldsymbol {\varXi }} \ne \widehat {\boldsymbol {1}}}$) such that $\smash {\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle\thicksim}$}} {\widetilde {{\boldsymbol {\varPsi }}}}}$ vanishes at zero $g$.Footnote ²⁷ The rest of the solution, $\smash {\underline {\widetilde {{\boldsymbol {\varPsi }}}}}$, is the macroscopic field that satisfies

(6.15)

\begin{equation} \widehat{\boldsymbol{\varXi}}\underline{\widetilde{{\boldsymbol{\varPsi}}}} = {\boldsymbol{0}}. \end{equation}

In the special case when the dispersion operator is Hermitian ($\smash {\widehat {\boldsymbol {\varXi }} = \widehat {\boldsymbol {\varXi }}_\text {H}}$), (6.15) also flows from the ‘adiabatic’ macroscopic part of the action $\smash {S}$, namely,

(6.16)

\begin{equation} S_{\text{ad}} \doteq \frac{1}{2}\int \mathrm{d}{\boldsymbol {\mathsf{x}}}\,\smash{\underline{\widetilde{{\boldsymbol{\varPsi}}}}}^{{\dagger}}\widehat{\boldsymbol{\varXi}}_\text{H}\underline{\widetilde{{\boldsymbol{\varPsi}}}}. \end{equation}

Because we have assumed a linear model for $\smash {f_s}$ in (6.11), $\smash {\underline {\widetilde {{\boldsymbol {\varPsi }}}}}$ is decoupled from $\smash {\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle\thicksim}$}} {\widetilde {{\boldsymbol {\varPsi }}}}}$, and hence the dynamics of $\smash {\underline {\widetilde {{\boldsymbol {\varPsi }}}}}$ turns out to be collisionless. This is justified, because collisional dissipation is assumed to be much slower that collisionless dissipation (§ 3.3). One can reinstate collisions in (6.15) by modifying $\smash {\widehat {G}_s}$ ad hoc, if necessary. Alternatively, one can avoid separating $\smash {\underline {\widetilde {{\boldsymbol {\varPsi }}}}}$ and $\smash {\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle\thicksim}$}} {\widetilde {{\boldsymbol {\varPsi }}}}}$ and, instead, derive an equation for the average Wigner matrix of the whole $\smash {\widetilde {{\boldsymbol {\varPsi }}}}$ (McDonald Reference McDonald1991). However, this approach is beyond QLT, so it is not considered in this paper.

6.3. Dispersion matrix

As readily seen from the definition (6.13), the operator $\smash {\widehat {\boldsymbol {\varXi }}}$ can be expressed as

(6.17)

\begin{equation} \widehat{\boldsymbol{\varXi}} = \widehat{\boldsymbol{\varXi}}_p - \mathrm{i}\widehat{k}_j \sum_s \int \mathrm{d}{\boldsymbol{p}}\,\widehat{\boldsymbol{\alpha}}_s \widehat{G}_s \widehat{\boldsymbol{\alpha}}_s^{{\dagger}} \,\frac{\partial F_s}{\partial p_j} + \mathcal{O}(\epsilon, \varepsilon^2). \end{equation}

The corrections caused by non-zero $\epsilon$ and $\varepsilon$ in this formula will be insignificant for our purposes, so they will be neglected. In particular, this means that $\smash {G_s \doteq \text {symb}_X \widehat {G}_s}$ can be adopted in the form independent of $\smash {{\boldsymbol {r}}}$ (§ 4.2):

(6.18)

\begin{equation} G_s \approx \frac{\mathrm{i}}{\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s + \mathrm{i} 0} = {\rm \pi}\,\delta(\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s) + \mathrm{i}\,\operatorname{pv}\frac{1}{\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s}. \end{equation}

Then, $\smash {\widehat {G}_s}$ can be considered as an operator on $\smash {\mathscr {H}_{{{\mathsf {x}}}}}$ with ${\boldsymbol {p}}$ as a parameter, and $\smash {\text{symb}\, \widehat {G}_s = G_s(t, {\boldsymbol {x}}, \omega, {\boldsymbol {k}}; {\boldsymbol {p}})}$. Also,

(6.19)

\begin{equation} \text{symb}\,(\widehat{\boldsymbol{\alpha}}_s \widehat{G}_s \widehat{\boldsymbol{\alpha}}_s^{{\dagger}}) = {\boldsymbol{\alpha}}_s \star G_s \star {\boldsymbol{\alpha}}_s^{{\dagger}} \approx {\boldsymbol{\alpha}}_s G_s {\boldsymbol{\alpha}}_s^{{\dagger}}. \end{equation}

This readily yields the ‘dispersion matrix’ $\smash {{\boldsymbol {\varXi }} \doteq \text {symb}_{{\mathsf {x}}} \widehat {\boldsymbol {\varXi }}}$:

(6.20)

\begin{gather} \displaystyle {\boldsymbol{\varXi}}(\omega, {\boldsymbol{k}}) \approx {\boldsymbol{\varXi}}_p(\omega, {\boldsymbol{k}}) + \sum_s \int \mathrm{d}{\boldsymbol{p}}\, \frac{{\boldsymbol{\alpha}}_s(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}){\boldsymbol{\alpha}}_s^{{\dagger}}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}})}{\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s + \mathrm{i} 0}\, {\boldsymbol{k}} \cdot \frac{\partial F_s({\boldsymbol{p}})}{\partial {\boldsymbol{p}}}, \end{gather}

(6.21)

\begin{gather}\displaystyle {\boldsymbol{\varXi}}_p(\omega, {\boldsymbol{k}}) \approx {\boldsymbol{\varXi}}_0(\omega, {\boldsymbol{k}}) - \sum_s \int \mathrm{d}{\boldsymbol{p}}\,{\boldsymbol{\wp}}_s(\omega, {\boldsymbol{k}}; {\boldsymbol{p}})F_s({\boldsymbol{p}}) \end{gather}

(see § 9 for examples). Here, $\smash {{\boldsymbol {\alpha }}_s {\boldsymbol {\alpha }}_s^{{\dagger}} }$ is a dyadic matrix, and the arguments $t$ and ${\boldsymbol {x}}$ are henceforth omitted for brevity. Also, we introduced the operators $\smash {\widehat {\boldsymbol {\wp }}_s = \widehat {\boldsymbol {\wp }}_s^{{\dagger}} }$ and their symbols $\smash {{\boldsymbol {\wp }}_s = {\boldsymbol {\wp }}_s^{{\dagger}} }$ as

(6.22)

\begin{equation} \widehat{\boldsymbol{\wp}}_s \doteq (\smash{\widehat{\boldsymbol{L}}}_s^{{\dagger}}\widehat{\boldsymbol{R}})_\text{H}, \qquad {\boldsymbol{\wp}}_s \doteq \text{symb}\, \widehat{\boldsymbol{\wp}}_s \approx ({\boldsymbol{L}}_s^{{\dagger}} {\boldsymbol{R}}_s)_\text{H}. \end{equation}

The appearance of $+ \mathrm {i} 0$ in the denominator in (6.20) is related to the Landau rule. (Remember that as arguments of Weyl symbols, $\omega$ and ${\boldsymbol {k}}$ are real by definition.) The Hermitian and anti-Hermitian parts of the dispersion matrix are

(6.23)

\begin{align} {\boldsymbol{\varXi}}_\text{H}(\omega, {\boldsymbol{k}}) & \approx {\boldsymbol{\varXi}}_p(\omega, {\boldsymbol{k}}) + \sum_s {\unicode{x2A0F}} \mathrm{d}{\boldsymbol{p}}\, \frac{{\boldsymbol{\alpha}}_s(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}){\boldsymbol{\alpha}}_s^{{\dagger}}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}})}{\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s}\, {\boldsymbol{k}} \cdot \frac{\partial F_s({\boldsymbol{p}})}{\partial {\boldsymbol{p}}}, \end{align}

(6.24)

\begin{align} {\boldsymbol{\varXi}}_\text{A}(\omega, {\boldsymbol{k}}) & \approx{-}{\rm \pi} \sum_s \int \mathrm{d}{\boldsymbol{p}}\, {\boldsymbol{\alpha}}_s(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}){\boldsymbol{\alpha}}_s^{{\dagger}}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}) \delta(\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s)\, {\boldsymbol{k}} \cdot \frac{\partial F_s({\boldsymbol{p}})}{\partial {\boldsymbol{p}}}. \end{align}

Assuming the notation $\smash {{\boldsymbol {\varXi }}^{-{{\dagger}} } \doteq ({\boldsymbol {\varXi }}^{{\dagger}} )^{-1}}$, the inverse dispersion matrix can be expressed as

(6.25)

\begin{equation} {\boldsymbol{\varXi}}^{{-}1} = {\boldsymbol{\varXi}}^{{-}1} {\boldsymbol{\varXi}}^{{{\dagger}}} {\boldsymbol{\varXi}}^{-{{\dagger}}} = {\boldsymbol{\varXi}}^{{-}1} {\boldsymbol{\varXi}}_\text{H} {\boldsymbol{\varXi}}^{-{{\dagger}}} - \mathrm{i} {\boldsymbol{\varXi}}^{{-}1} {\boldsymbol{\varXi}}_\text{A} {\boldsymbol{\varXi}}^{-{{\dagger}}}. \end{equation}

Because $\smash {{\boldsymbol {\varXi }}^{-{{\dagger}} } = ({\boldsymbol {\varXi }}^{-1})^{{\dagger}} }$, this leads to the following formulas, which we will need later:

(6.26)

\begin{equation} ({\boldsymbol{\varXi}}^{{-}1})_\text{H} = {\boldsymbol{\varXi}}^{{-}1} {\boldsymbol{\varXi}}_\text{H} {\boldsymbol{\varXi}}^{-{{\dagger}}}, \qquad ({\boldsymbol{\varXi}}^{{-}1})_\text{A} ={-}{\boldsymbol{\varXi}}^{{-}1} {\boldsymbol{\varXi}}_\text{A} {\boldsymbol{\varXi}}^{-{{\dagger}}}. \end{equation}

6.4. Spectrum of microscopic fluctuations

Other objects to be used below are the density operators of the oscillating fields:

(6.27)

and the corresponding average Wigner matrices on $\smash {({\boldsymbol {\mathsf {x}}}, {\boldsymbol {\mathsf {k}}})}$. The former, $\smash {{{\boldsymbol {\mathsf {U}}}} \doteq \overline {{{\boldsymbol {\mathsf {W}}}}}_{\underline {\widetilde {{\boldsymbol {\varPsi }}}}}}$, is readily found by definition (2.51), and the latter, $\smash {{\boldsymbol {\mathfrak {W}}} \doteq \overline {{{\boldsymbol {\mathsf {W}}}}}_{\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle\thicksim}$}} {\widetilde {{\boldsymbol {\varPsi }}}}}}$, is calculated as follows. Let us consider $\smash {g_s(t, {\boldsymbol {x}}, {\boldsymbol {p}})}$ as a ket in $\smash {\mathscr {H}_{{{\mathsf {x}}}}}$, with $\smash {{\boldsymbol {p}}}$ as a parameter. Then, (6.14) readily yields

(6.28)

\begin{equation} \widehat{\boldsymbol{{\boldsymbol {\mathsf{W}}}}}_{\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle\thicksim}$}}{\widetilde{{\boldsymbol{\varPsi}}}}} = \frac{1}{(2{\rm \pi})^{{\mathsf{n}}}}\sum_{s,s'} \int \mathrm{d}{\boldsymbol{p}}\,\mathrm{d}{\boldsymbol{p}}'\, \smash{\widehat{\boldsymbol{\varXi}}}^{{-}1}\widehat{\boldsymbol{\alpha}}_s({\boldsymbol{p}}) {\left. {\boldsymbol {\mathsf {|{g_s({\boldsymbol{p}})}}}} \right\rangle}{\left\langle {{\boldsymbol {\mathsf {{g_{s'}({\boldsymbol{p}}')}|}}}} \right.} \widehat{\boldsymbol{\alpha}}^{{\dagger}}_{s'}({\boldsymbol{p}}')\smash{\widehat{\boldsymbol{\varXi}}}^{-{{\dagger}}}. \end{equation}

By applying $\smash {\text {symb}_{{\mathsf {x}}} }$ to this, one obtains

(6.29)

\begin{equation} {\boldsymbol{\mathfrak{W}}}= \sum_{s,s'} \int \mathrm{d}{\boldsymbol{p}}\,\mathrm{d}{\boldsymbol{p}}'\, {\boldsymbol{\varXi}}^{{-}1} \star {\boldsymbol{\alpha}}_s({\boldsymbol{p}}) \star {\boldsymbol{\mathfrak{G}}}_{ss'}({\boldsymbol{p}}, {\boldsymbol{p}}') \star {\boldsymbol{\alpha}}^{{\dagger}}_{s'}({\boldsymbol{p}}') \star {\boldsymbol{\varXi}}^{-{{\dagger}}}, \end{equation}

where most arguments are omitted for brevity and (Appendix B.7)

(6.30)

\begin{align} \mathfrak{G}_{ss'}({\boldsymbol{p}}, {\boldsymbol{p}}') & \doteq \frac{1}{(2{\rm \pi})^{{\mathsf{n}}}} \int \mathrm{d}{\boldsymbol {\mathsf{s}}}\,\mathrm{e}^{- \mathrm{i}{\boldsymbol {\mathsf{k}}}\cdot{\boldsymbol {\mathsf{s}}}}\, \overline{g_{s}({\boldsymbol {\mathsf{x}}} + {\boldsymbol {\mathsf{s}}}/2, {\boldsymbol{p}})\, g_{s'}({\boldsymbol {\mathsf{x}}} - {\boldsymbol {\mathsf{s}}}/2, {\boldsymbol{p}}')} \nonumber\\ & \approx \frac{1}{(2{\rm \pi})^n}\,\delta_{ss'}\delta({\boldsymbol{p}} - {\boldsymbol{p}}') \delta(\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s)F_s({\boldsymbol{p}}), \end{align}

assuming corrections due to inter-particle correlations are negligible. Then, (6.29) gives

(6.31)

\begin{align} {\boldsymbol{\mathfrak{W}}}(\omega, {\boldsymbol{k}}) = \frac{1}{(2{\rm \pi})^{n}} \sum_{s'}\int \mathrm{d}{\boldsymbol{p}}'\,&\delta(\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}'_{s'})F_{s'}({\boldsymbol{p}}') \nonumber\\ & \times {\boldsymbol{\varXi}}^{{-}1}(\omega, {\boldsymbol{k}}) ({\boldsymbol{\alpha}}_{s'}{\boldsymbol{\alpha}}_{s'}^{{\dagger}})(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}') {\boldsymbol{\varXi}}^{-{{\dagger}}}(\omega, {\boldsymbol{k}}), \end{align}

where $\smash {{\boldsymbol {v}}'_{s'} \doteq {\boldsymbol {v}}_{s'}(t, {\boldsymbol {x}}, {\boldsymbol {p}}')}$. It is readily seen from (6.31) that $\smash {{\boldsymbol {\mathfrak {W}}}}$ is positive-semidefinite. One can also recognize (6.31) as a manifestation of the dressed-particle superposition principle (Rostoker Reference Rostoker1964). Specifically, (6.31) shows that the contributions of individual OCs to $\smash {{\boldsymbol {\mathfrak {W}}}}$ are additive and affected by the plasma collective response, i.e. by the difference between $\smash {{\boldsymbol {\varXi }}}$ and the vacuum dispersion matrix $\smash {{\boldsymbol {\varXi }}_0}$.

Using (6.31), one can also find other averages quadratic in the field via (cf. (2.53a))

(6.32)

\begin{equation} \overline{(\widehat{\boldsymbol{{\boldsymbol {\mathsf{L}}}}}\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle\thicksim}$}}{\widetilde{{\boldsymbol{\varPsi}}}})(\widehat{\boldsymbol{{\boldsymbol {\mathsf{R}}}}}\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle\thicksim}$}}{\widetilde{{\boldsymbol{\varPsi}}}})^{{\dagger}}} \approx \int \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\, ({{\boldsymbol {\mathsf{L}}}} {\boldsymbol{\mathfrak{W}}} {{\boldsymbol {\mathsf{R}}}}^{{\dagger}})(\omega, {\boldsymbol{k}}), \end{equation}

where $\smash {\widehat {\boldsymbol {{\boldsymbol {\mathsf {L}}}}}}$ and $\smash {\widehat {\boldsymbol {{\boldsymbol {\mathsf {R}}}}}}$ are any linear operators and $\smash {{{\boldsymbol {\mathsf {L}}}}}$ and $\smash {{{\boldsymbol {\mathsf {R}}}}}$ are their symbols; for example,

(6.33)

\begin{equation} \overline{\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle\thicksim}$}}{\widetilde{{\boldsymbol{\varPsi}}}}(t, {\boldsymbol{x}})\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle\thicksim}$}}{\widetilde{{\boldsymbol{\varPsi}}}}^{{\dagger}}(t, {\boldsymbol{x}})} \approx \int \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\,{\boldsymbol{\mathfrak{W}}}(\omega, {\boldsymbol{k}}). \end{equation}

Because of this, we loosely attribute $\smash {{\boldsymbol {\mathfrak {W}}}}$ as the spectrum of microscopic oscillations, but see also § 8.2, where an alternative notation is introduced and a fluctuation–dissipation theorem is derived from (6.31) for plasma in thermal equilibrium. See also § 9 for specific examples.

6.5. Nonlinear potentials

From (6.14), the oscillating part of the Hamiltonian (6.9) can be split into the macroscopic part and the microscopic part as $\smash {\widetilde {H}_s = \underline {\widetilde {H}}_s + \underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle\thicksim}$}} {\widetilde {H}}_s}$, $\smash {\underline {\widetilde {H}}_s = \widehat {\boldsymbol {\alpha }}_s^{{\dagger}} \underline {\widetilde {{\boldsymbol {\varPsi }}}}}$, and

(6.34)

\begin{equation} \underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle\thicksim}$}}{\widetilde{H}}_s({\boldsymbol{p}}) = \sum_{s'} \int \mathrm{d}{\boldsymbol{p}}'\,\widehat{\mathcal{X}}_{ss'}({\boldsymbol{p}}, {\boldsymbol{p}}')g_{s'}({\boldsymbol{p}}'). \end{equation}

Here, $\smash {\widehat {\mathcal {X}}_{ss'}}$ is an operator on $\smash {\mathscr {H}_{{{\mathsf {x}}}}}$ given by

(6.35)

\begin{equation} \widehat{\mathcal{X}}_{ss'}({\boldsymbol{p}}, {\boldsymbol{p}}') \doteq \widehat{\boldsymbol{\alpha}}_s^{{\dagger}}({\boldsymbol{p}})\smash{\widehat{\boldsymbol{\varXi}}}^{{-}1} \widehat{\boldsymbol{\alpha}}_{s'}({\boldsymbol{p}}'), \end{equation}

with the symbol

(6.36)

\begin{equation} \mathcal{X}_{ss'}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}') \approx {\boldsymbol{\alpha}}_s^{{\dagger}}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}) {\boldsymbol{\varXi}}^{{-}1}(\omega, {\boldsymbol{k}}) {\boldsymbol{\alpha}}_{s'}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}') \end{equation}

(see § 9 for examples). The corresponding average Wigner functions on $\smash {({\boldsymbol {\mathsf {x}}}, {\boldsymbol {\mathsf {k}}})}$ are $\smash {\overline {{\mathsf {W}}}_s = \overline {{\mathsf {W}}}_s^{\text {(m)}} + \overline {{\mathsf {W}}}_s^{(\mu )}}$, where the index ‘m’ stands for ‘macroscopic’ and the index ‘$\smash {\mu }$’ stands for ‘microscopic’. Because the dependence on $\smash {t}$ and $\smash {{\boldsymbol {x}}}$ is slow, one can approximate them as follows:

(6.37a)

\begin{gather} \overline{{\mathsf{W}}}_s^{\text{(m)}} \approx {\boldsymbol{\alpha}}_s^{{\dagger}}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}) {{\boldsymbol {\mathsf{U}}}}(\omega, {\boldsymbol{k}}) {\boldsymbol{\alpha}}_s(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}), \end{gather}

(6.37b)

\begin{gather}\overline{{\mathsf{W}}}_s^{(\mu)} \approx {\boldsymbol{\alpha}}_s^{{\dagger}}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}) {\boldsymbol{\mathfrak{W}}}(\omega, {\boldsymbol{k}}) {\boldsymbol{\alpha}}_s(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}). \end{gather}

The matrix $\smash {{{\boldsymbol {\mathsf {U}}}}}$ is positive-semidefinite as an average Wigner tensor (§ 2.1.7), and so is $\smash {{\boldsymbol {\mathfrak {W}}}}$ (§ 6.4). Hence, both $\smash {\overline {{\mathsf {W}}}_s^{\text {(m)}}}$ and $\smash {\overline {{\mathsf {W}}}_s^{(\mu )}}$ are non-negative. Using (6.31), one can also rewrite the Wigner function of $\smash {\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle\thicksim}$}} {\widetilde {H}}_s}$ more compactly as

(6.38)

\begin{equation} \overline{{\mathsf{W}}}_s^{(\mu)} (\omega, {\boldsymbol{k}}; {\boldsymbol{p}}) = (2{\rm \pi})^{{-}n} \sum_{s'}\int \mathrm{d}{\boldsymbol{p}}'\, \delta(\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}'_{s'}) |\mathcal{X}_{ss'}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}')|^2\,F_{s'}({\boldsymbol{p}}'). \end{equation}

Now we can represent the nonlinear potentials (5.41)–(5.43) as

(6.39)

\begin{equation} {{\boldsymbol {\mathsf{D}}}}_s = {{\boldsymbol {\mathsf{D}}}}_s^{\text{(m)}} + {{\boldsymbol {\mathsf{D}}}}_s^{(\mu)}, \qquad {\boldsymbol{\Theta}}_s = {\boldsymbol{\Theta}}_s^{\text{(m)}} + {\boldsymbol{\Theta}}_s^{(\mu)}, \qquad \varPhi_s = \varPhi_s^{\text{(m)}} + \varPhi_s^{(\mu)}. \end{equation}

Here, the index $\smash {^{\text {(m)}}}$ denotes contributions from $\smash {\overline {{\mathsf {W}}}_s^{\text {(m)}}}$ and the index $\smash {^{(\mu )}}$ denotes contributions from $\smash {\overline {{\mathsf {W}}}_s^{(\mu )}}$. Specifically,

(6.40)

\begin{align} {{\boldsymbol {\mathsf{D}}}}_s^{\text{(m)}} & = \int \mathrm{d}{\boldsymbol{k}}\, {\rm \pi}\, {\boldsymbol{k}} {\boldsymbol{k}} \overline{{\mathsf{W}}}_s^{\text{(m)}}({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s, {\boldsymbol{k}}; {\boldsymbol{p}}), \end{align}

(6.41)

\begin{align} {\boldsymbol{\Theta}}_s^{\text{(m)}} & =\frac{\partial}{\partial \vartheta} {\unicode{x2A0F}} \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\, \left. \frac{{\boldsymbol{k}} {\boldsymbol{k}} \overline{{\mathsf{W}}}_s^{\text{(m)}}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}})}{\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s + \vartheta} \right|_{\vartheta=0}, \end{align}

(6.42)

\begin{align} \varPhi_s^{\text{(m)}} & = \frac{1}{2}\frac{\partial}{\partial {\boldsymbol{p}}} \cdot {\unicode{x2A0F}} \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\,\frac{{\boldsymbol{k}} \overline{{\mathsf{W}}}_s^{\text{(m)}}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}})}{\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s}. \end{align}

Here, $\smash {\overline {{\mathsf {W}}}_s^{\text {(m)}}}$ is a non-negative function (6.37a), so $\smash {{{\boldsymbol {\mathsf {D}}}}_s^{\text {(m)}}}$ is positive-semidefinite and leads to an $\smash {H}$-theorem similar to (5.46). One also has

(6.43)

\begin{align} {{\boldsymbol {\mathsf{D}}}}_s^{(\mu)} & = \sum_{s'}\int \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\,\mathrm{d}{\boldsymbol{p}}'\,{\rm \pi}\, {\boldsymbol{k}} {\boldsymbol{k}} \, \delta({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s - {\boldsymbol{k}} \cdot {\boldsymbol{v}}'_{s'})\, |\mathcal{X}_{ss'}({\boldsymbol{k}} \cdot {\boldsymbol{v}}_{s}, {\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}')|^2\,F_{s'}({\boldsymbol{p}}'), \end{align}

(6.44)

\begin{align} {\boldsymbol{\Theta}}_s^{(\mu)} &= \sum_{s'} \frac{\partial}{\partial \vartheta} {\unicode{x2A0F}} \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\,\mathrm{d}{\boldsymbol{p}}'\, \left. \frac{{\boldsymbol{k}} {\boldsymbol{k}} F_{s'}({\boldsymbol{p}}')}{{\boldsymbol{k}} \cdot ({\boldsymbol{v}}'_{s'} - {\boldsymbol{v}}_s) + \vartheta}\, |\mathcal{X}_{ss'}({\boldsymbol{k}} \cdot {\boldsymbol{v}}'_{s'}, {\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}') |^2 \right|_{\vartheta=0}, \end{align}

(6.45)

\begin{align} \varPhi_s^{(\mu)} & = \sum_{s'}\frac{\partial}{\partial {\boldsymbol{p}}} \cdot {\unicode{x2A0F}} \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\,\mathrm{d}{\boldsymbol{p}}'\, \frac{{\boldsymbol{k}} F_{s'}({\boldsymbol{p}}')}{2{\boldsymbol{k}} \cdot ({\boldsymbol{v}}'_{s'} - {\boldsymbol{v}}_s)} \, |\mathcal{X}_{ss'}({\boldsymbol{k}} \cdot {\boldsymbol{v}}'_{s'}, {\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}') |^2. \end{align}

The functions $\smash {{\boldsymbol {\Theta }}_s^{(\mu )}}$ and $\smash {\varPhi _s^{(\mu )}}$ scale as $\smash {\overline {{\mathsf {W}}}_s^{(\mu )}}$, i.e. as $\smash {\epsilon \varepsilon ^2}$ (§ 3.3). Their contribution to (5.37) is of order $\smash {\epsilon {\boldsymbol {\Theta }}_s^{(\mu )}}$ and $\smash {\epsilon \varPhi _s^{(\mu )}}$, respectively, so it scales as $\smash {\epsilon ^2\varepsilon ^2}$ and therefore is negligible within our model. In contrast, $\smash {{{\boldsymbol {\mathsf {D}}}}_s^{(\mu )}}$ must be retained alongside with $\smash {{{\boldsymbol {\mathsf {D}}}}_s^{\text {(m)}}}$. This is because although weak, macroscopic fluctuations can resonate with particles from the bulk distribution, while the stronger macroscopic fluctuations are assumed to resonate only with particles from the tail distribution, which are few.

6.6. Oscillation-centre Hamiltonian

Within the assumed accuracy, the OC Hamiltonian is $\smash {\mathcal {H}_s = \overline {H}_s + \varPhi _s^{\text {(m)}}}$, and $\smash {\overline {H}_s}$ is given by (6.9). Combined with the general theorem (2.53c), the latter readily yields $\smash {\overline {H}_s = H_{0s} + \phi _s}$, where

(6.46)

\begin{equation} \phi_s \approx \frac{1}{2}\int \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\, \operatorname{tr}\big({{\boldsymbol {\mathsf{U}}}}{\boldsymbol{\wp}}_s\big) \end{equation}

and the contribution of $\smash {{\boldsymbol {\mathfrak {W}}}}$ has been neglected. Because both $\smash {\varPhi _s^{\text {(m)}}}$ and $\smash {\phi _s}$ are quadratic in $\smash {\widetilde {{\boldsymbol {\varPsi }}}}$ and enter $\smash {\mathcal {H}_s}$ only in the combination $\smash {\varDelta _s \doteq \varPhi _s^{\text {(m)}} + \phi _s}$, it is convenient to attribute the latter as the ‘total’ ponderomotive energy. Using (6.42) in combination with (6.37a), one can express it as follows:

(6.47)

\begin{equation} \varDelta_s = \frac{1}{2}\frac{\partial}{\partial {\boldsymbol{p}}} \cdot {\unicode{x2A0F}} \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\,{\boldsymbol{k}}\, \frac{{\boldsymbol{\alpha}}_s^{{\dagger}} {{\boldsymbol {\mathsf{U}}}} {\boldsymbol{\alpha}}_s}{\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s} + \frac{1}{2}\int \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\,\operatorname{tr}\big({{\boldsymbol {\mathsf{U}}}}{\boldsymbol{\wp}}_s\big) \end{equation}

(see § 9 for examples). Notably,

(6.48)

\begin{equation} \varDelta_s ={-}\frac{1}{2} \frac{\delta}{\delta F_s}{\unicode{x2A0F}} \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\, \operatorname{tr}({\boldsymbol{\varXi}}_\text{H} {{\boldsymbol {\mathsf{U}}}}) ={-}\frac{\delta S_{\text{ad}}}{\delta F_s}, \end{equation}

where $\smash {\delta /\delta F_s}$ denotes a functional derivative and $\smash {S_{\text {ad}}}$ is the adiabatic action defined in (6.16). Equation (6.48) is a generalization of the well-known ‘$\smash {K}$–$\smash {\chi }$ theorem’ (Kaufman & Holm Reference Kaufman and Holm1984; Kaufman Reference Kaufman1987). Loosely speaking, it says that the coefficient connecting $\smash {\varDelta _s}$ with $\smash {{{\boldsymbol {\mathsf {U}}}}}$ is proportional to the linear polarizability of an individual particle of type $\smash {s}$ (Dodin et al. Reference Dodin, Zhmoginov and Ruiz2017; Dodin & Fisch Reference Dodin and Fisch2010a) (‘$\smash {K}$’ in the name of this theorem is the same as our $\smash {\varDelta _s}$, and ‘$\smash {\chi }$’ is the linear susceptibility). Also, the OC Hamiltonian and the OC velocity can be expressed as

(6.49)

\begin{equation} \mathcal{H}_s = H_{0s} + \varDelta_s, \qquad {\boldsymbol{v}} = \partial_{\boldsymbol{p}} H_{0s} + \partial_{\boldsymbol{p}}\varDelta_s. \end{equation}

6.7. Polarization drag

Within the assumed accuracy, the OC distribution can be expressed as

(6.50)

\begin{equation} F_s = \overline{f}_s + \frac{1}{2}\,\frac{\partial}{\partial {\boldsymbol{p}}} \cdot \left({\boldsymbol{\Theta}}_s^{\text{(m)}}\, \frac{\partial \overline{f}_s}{\partial {\boldsymbol{p}}}\right), \end{equation}

and (5.37) becomes

(6.51)

\begin{gather} \displaystyle \frac{\partial F_s}{\partial t} - \frac{\partial \mathcal{H}_s}{\partial {\boldsymbol{x}}} \cdot \frac{\partial F_s}{\partial {\boldsymbol{p}}} + \frac{\partial \mathcal{H}_s}{\partial {\boldsymbol{p}}} \cdot \frac{\partial F_s}{\partial {\boldsymbol{x}}} = \frac{\partial}{\partial {\boldsymbol{p}}} \cdot \left({\boldsymbol {\mathsf{D}}}_s^{\text{(m)}}\,\frac{\partial F_s}{\partial {\boldsymbol{p}}}\right) + \mathcal{C}_s, \end{gather}

(6.52)

\begin{gather}\displaystyle \mathcal{C}_s \doteq \frac{\partial}{\partial {\boldsymbol{p}}} \cdot \left({\boldsymbol {\mathsf{D}}}_s^{(\mu)}\,\frac{\partial F_s}{\partial {\boldsymbol{p}}}\right) + \varGamma_s, \end{gather}

where we have reinstated the term $\smash {\varGamma _s}$ introduced in § 3.3. As a collisional term, $\smash {\varGamma _s}$ is needed only to the zeroth order in $\smash {\epsilon }$, so

(6.53)

\begin{equation} \displaystyle \varGamma_s = \overline{\lbrace \widetilde{H}_s, g_s \rbrace} \approx \partial_{{\boldsymbol{p}}} \cdot (\overline{g_s \partial_{{\boldsymbol{x}}}\widetilde{H}_s}) \equiv \partial_{{\boldsymbol{p}}} \cdot {\boldsymbol{\zeta}}_s, \qquad {\boldsymbol{\zeta}}_s = \mathrm{i} \overline{\left\langle{{\boldsymbol {\mathsf{x}}}|\widehat{\boldsymbol{k}} \widetilde{H}_s}\right\rangle \left\langle{{\boldsymbol {\mathsf{x}}}|g_s}\right\rangle}. \end{equation}

Correlating with $\smash {g_s}$ is only the microscopic part of $\smash {\widetilde {H}_s}$, so using (6.34) one obtains

(6.54)

\begin{equation} {\boldsymbol{\zeta}}_s = \mathrm{i} \sum_{s'} \int \mathrm{d}{\boldsymbol{p}}'\,\langle{\boldsymbol {\mathsf{x}}}|\widehat{\boldsymbol{k}}\widehat{\mathcal{X}}_{ss'}({\boldsymbol{p}}, {\boldsymbol{p}}') \overline{{\left. {\boldsymbol {\mathsf {|{g_{s'}({\boldsymbol{p}}')}}}} \right\rangle}{\left\langle {{\boldsymbol {\mathsf {{g_s({\boldsymbol{p}})}|}}}} \right.}} {\boldsymbol {\mathsf{x}}}\rangle. \end{equation}

Next, let us use (2.28) and $\smash {{\boldsymbol {\zeta }} = \operatorname {re} {\boldsymbol {\zeta }}}$ to express this result as follows:

(6.55)

\begin{align} {\boldsymbol{\zeta}}_s & = \mathrm{i} \sum_{s'} \int \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\,\mathrm{d}{\boldsymbol{p}}'\,{\boldsymbol{k}} \star \mathcal{X}_{ss'}(\omega, {\boldsymbol{k}}, {\boldsymbol{p}}, {\boldsymbol{p}}') \star {\boldsymbol{\mathfrak{G}}}_{s's}(\omega, {\boldsymbol{k}}, {\boldsymbol{p}}', {\boldsymbol{p}})\nonumber\\ & \approx \mathrm{i} \int \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\,\mathrm{d}{\boldsymbol{p}}'\,{\boldsymbol{k}} \mathcal{X}_{ss'}(\omega, {\boldsymbol{k}}, {\boldsymbol{p}}, {\boldsymbol{p}}') \,(2{\rm \pi})^{{-}n}\delta({\boldsymbol{p}} - {\boldsymbol{p}}')\,\delta(\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s)\,F_s({\boldsymbol{p}}) \nonumber\\ & ={-} \int \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\,{\boldsymbol{k}} \operatorname{im} \mathcal{X}_{ss'}({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s, {\boldsymbol{k}}, {\boldsymbol{p}}, {\boldsymbol{p}})\,F_s({\boldsymbol{p}}), \end{align}

where we have approximated $\smash {\star }$ with the usual product and substituted (6.30). Hence,

(6.56)

\begin{equation} \varGamma_s \approx{-}\partial_{\boldsymbol{p}} \cdot ({\boldsymbol{\mathfrak{F}}}_s F_s), \end{equation}

where $\smash {{\boldsymbol {\mathfrak {F}}}_s}$ can be interpreted as the polarization drag (i.e. the average force that is imposed on an OC by its dress) and is given by

(6.57)

\begin{equation} {\boldsymbol{\mathfrak{F}}}_s = \int \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\,{\boldsymbol{k}} \operatorname{im} \mathcal{X}_{ss'}({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s, {\boldsymbol{k}}, {\boldsymbol{p}}, {\boldsymbol{p}}). \end{equation}

Using (6.36), one also rewrite this as follows:

(6.58a)

\begin{align} {\boldsymbol{\mathfrak{F}}}_s & \approx \int \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\,{\boldsymbol{k}}\, ({\boldsymbol{\alpha}}_s^{{\dagger}} ({\boldsymbol{\varXi}}^{{-}1})_\text{A} {\boldsymbol{\alpha}}_s)({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s, {\boldsymbol{k}}; {\boldsymbol{p}}) \end{align}

(6.58b)

\begin{align} & \approx{-}\int \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\,{\boldsymbol{k}}\, ({\boldsymbol{\alpha}}_s^{{\dagger}} {\boldsymbol{\varXi}}^{{-}1} {\boldsymbol{\varXi}}_\text{A} {\boldsymbol{\varXi}}^{-{{\dagger}}}{\boldsymbol{\alpha}}_s)({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s, {\boldsymbol{k}}; {\boldsymbol{p}}), \end{align}

where we have substituted (6.26) for $\smash {({\boldsymbol {\varXi }}^{-1})_\text {A}}$. With (6.24) for $\smash {{\boldsymbol {\varXi }}_\text {A}}$, this yields

(6.59)

\begin{align} {\boldsymbol{\mathfrak{F}}}_s \approx \sum_{s'}\int \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\,\mathrm{d}{\boldsymbol{p}}'\, &{\rm \pi}\,\delta({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s - {\boldsymbol{k}} \cdot {\boldsymbol{v}}'_{s'})\,{\boldsymbol{k}}{\boldsymbol{k}}\cdot\frac{\partial F_{s'}({\boldsymbol{p}}')}{\partial {\boldsymbol{p}}'} \nonumber\\ & \times {\boldsymbol{\alpha}}_s^{{\dagger}}({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s, {\boldsymbol{k}}; {\boldsymbol{p}}) {\boldsymbol{\varXi}}^{{-}1}({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s, {\boldsymbol{k}}) {\boldsymbol{\alpha}}_{s'}({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s, {\boldsymbol{k}}; {\boldsymbol{p}}') \nonumber\\ & \times {\boldsymbol{\alpha}}_{s'}^{{\dagger}}({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s, {\boldsymbol{k}}; {\boldsymbol{p}}') {\boldsymbol{\varXi}}^{-{{\dagger}}}({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s, {\boldsymbol{k}}) {\boldsymbol{\alpha}}_s({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s, {\boldsymbol{k}}; {\boldsymbol{p}}). \end{align}

The product of the last two lines equals $\smash {|\mathcal {X}_{ss'}({\boldsymbol {k}} \cdot {\boldsymbol {v}}_{s}, {\boldsymbol {k}}; {\boldsymbol {p}}, {\boldsymbol {p}}')|^2}$. Hence,

(6.60)

\begin{equation} {\boldsymbol{\mathfrak{F}}}_s = \sum_{s'}\int \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\,\mathrm{d}{\boldsymbol{p}}'\, {\rm \pi}\,\delta({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s - {\boldsymbol{k}} \cdot {\boldsymbol{v}}'_{s'})\, |\mathcal{X}_{ss'}({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s, {\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}')|^2\,{\boldsymbol{k}}{\boldsymbol{k}}\cdot \frac{\partial F_{s'}({\boldsymbol{p}}')}{\partial {\boldsymbol{p}}'}. \end{equation}

6.8. Collision operator

By combining (6.56) for $\smash {\varGamma _s}$ with (6.43) for $\smash {{{\boldsymbol {\mathsf {D}}}}_s^{(\mu )}}$, one can express $\smash {\mathcal {C}_s}$ as

(6.61)

\begin{align} \mathcal{C}_s = \frac{\partial}{\partial {\boldsymbol{p}}}\cdot \sum_{s'}\int \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\,\mathrm{d}{\boldsymbol{p}}'\, &{\rm \pi} \,\delta({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s - {\boldsymbol{k}} \cdot {\boldsymbol{v}}'_{s'})\, |\mathcal{X}_{ss'}({\boldsymbol{k}} \cdot {\boldsymbol{v}}_{s}, {\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}')|^2 \nonumber\\ &\times {\boldsymbol{k}}{\boldsymbol{k}} \cdot \left( \frac{\partial F_s({\boldsymbol{p}})}{\partial {\boldsymbol{p}}}\,F_{s'}({\boldsymbol{p}}') - F_s({\boldsymbol{p}})\,\frac{\partial F_{s'}({\boldsymbol{p}}')}{\partial {\boldsymbol{p}}'} \right), \end{align}

where $\mathcal {X}_{ss'}$ is given by (6.36). One can recognize this as a generalization of the Balescu–Lenard collision operator (Krall & Trivelpiece Reference Krall and Trivelpiece1973, § 11.11) to interactions via a general multi-component field ${\boldsymbol {\varPsi }}$. Specific examples can be found in § 9.

It is readily seen that $\smash {\mathcal {C}_s}$ conserves particles, i.e.

(6.62)

\begin{equation} \int \mathrm{d}{\boldsymbol{p}}\,\mathcal{C}_s = 0, \end{equation}

and vanishes in thermal equilibrium (§ 8.1). Other properties of $\smash {\mathcal {C}_s}$ are determined by the properties of the coupling coefficient $\smash {\mathcal {X}_{ss'}}$, which are as follows. Note that

(6.63)

\begin{equation} |\mathcal{X}_{ss'}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}')|^2 = \mathcal{Q}_{ss'}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}') + \mathcal{R}_{ss'}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}')/2, \end{equation}

where we introduced

(6.64)

\begin{align} \mathcal{Q}_{ss'}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}') & \doteq (|\mathcal{X}_{ss'}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}')|^2 + |\mathcal{X}_{s's}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}', {\boldsymbol{p}})|^2)/2, \end{align}

(6.65)

\begin{align} \mathcal{R}_{ss'}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}') & \doteq |\mathcal{X}_{ss'}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}')|^2 - |\mathcal{X}_{s's}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}', {\boldsymbol{p}})|^2. \end{align}

To calculate $\smash {\mathcal {R}_{ss'}}$, note that (6.36) yields

(6.66)

\begin{align} |\mathcal{X}_{ss'}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}')|^2 & \approx |{\boldsymbol{\alpha}}_s^{{\dagger}}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}) {\boldsymbol{\varXi}}^{{-}1}(\omega, {\boldsymbol{k}}) {\boldsymbol{\alpha}}_{s'}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}')|^2,\nonumber\\ |\mathcal{X}_{s's}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}', {\boldsymbol{p}})|^2 & \approx |{\boldsymbol{\alpha}}_s^{{\dagger}}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}) {\boldsymbol{\varXi}}^{-{{\dagger}}}(\omega, {\boldsymbol{k}}) {\boldsymbol{\alpha}}_{s'}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}')|^2, \end{align}

whence one obtains

(6.67)

\begin{align} \mathcal{R}_{ss'}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}') \approx 4\operatorname{im}\big(&{\boldsymbol{\alpha}}_s^{{\dagger}}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}) ({\boldsymbol{\varXi}}^{{-}1})_\text{H}(\omega, {\boldsymbol{k}}){\boldsymbol{\alpha}}_{s'}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}')\nonumber\\ & {\boldsymbol{\alpha}}_{s'}^{{\dagger}}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}') ({\boldsymbol{\varXi}}^{{-}1})_\text{A}(\omega, {\boldsymbol{k}}) {\boldsymbol{\alpha}}_{s'}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}') \big). \end{align}

The operators $\smash {({\boldsymbol {\varXi }}^{-1})_\text {H}}$, $\smash {({\boldsymbol {\varXi }}^{-1})_\text {A}}$ and $\smash {\widehat {\boldsymbol {\alpha }}_s}$ (for all $\smash {s}$) have been introduced for real fields, so their matrix elements in the coordinate representation are real. Then, the corresponding symbols satisfy $\smash {{\boldsymbol {A}}(-\omega, -{\boldsymbol {k}}) = {\boldsymbol {A}}^*(\omega, {\boldsymbol {k}})}$, where $\smash {{\boldsymbol {A}}}$ is any of the three symbols. This gives

(6.68)

\begin{equation} \mathcal{R}_{ss'}({\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}') \doteq \mathcal{R}_{ss'}({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s, {\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}') ={-}\mathcal{R}_{ss'}(-{\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}'). \end{equation}

Because the rest of the integrand in (6.61) is even in $\smash {{\boldsymbol {k}}}$, (6.68) signifies that $\smash {\mathcal {R}_{ss'}}$ does not contribute to $\smash {\mathcal {C}_s}$. Thus, $\smash {\mathcal {X}_{ss'}}$ in (6.61) can also be replaced with $\smash {\mathcal {Q}_{ss'}}$:

(6.69)

\begin{align} \mathcal{C}_s = \frac{\partial}{\partial {\boldsymbol{p}}} \cdot \sum_{s'}\int \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\,\mathrm{d}{\boldsymbol{p}}'\, &{\rm \pi} \,\delta({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s - {\boldsymbol{k}} \cdot {\boldsymbol{v}}'_{s'})\, \mathcal{Q}_{ss'}({\boldsymbol{k}} \cdot {\boldsymbol{v}}_{s}, {\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}') \nonumber\\ & \times {\boldsymbol{k}}{\boldsymbol{k}} \cdot \left( \frac{\partial F_s({\boldsymbol{p}})}{\partial {\boldsymbol{p}}}\,F_{s'}({\boldsymbol{p}}') - F_s({\boldsymbol{p}})\,\frac{\partial F_{s'}({\boldsymbol{p}}')}{\partial {\boldsymbol{p}}'} \right). \end{align}

In this representation, the coupling coefficient in $\smash {\mathcal {C}_s}$ is manifestly symmetric,

(6.70)

\begin{equation} \mathcal{Q}_{ss'}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}') = \mathcal{Q}_{s's}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}', {\boldsymbol{p}}), \end{equation}

which readily leads to momentum and energy conservation (Appendix C):Footnote ²⁸

(6.71)

\begin{equation} \sum_s \int \mathrm{d}{\boldsymbol{p}}\,{\boldsymbol{p}}\mathcal{C}_s = 0, \qquad \sum_s \int \mathrm{d}{\boldsymbol{p}}\,\mathcal{H}_s\mathcal{C}_s = 0. \end{equation}

The collision operator $\smash {\mathcal {C}_s}$ also satisfies the $\smash {H}$-theorem (Appendix C.3):

(6.72)

\begin{equation} \left(\frac{\mathrm{d}\sigma}{\mathrm{d} t}\right)_{\text{coll}} \geqslant 0, \end{equation}

where the entropy density $\smash {\sigma }$ is defined as

(6.73)

\begin{equation} \sigma \doteq{-}\sum_s \int \mathrm{d}{\boldsymbol{p}}\,F_s({\boldsymbol{p}})\ln F_s({\boldsymbol{p}}), \end{equation}

and $\smash {(\partial _t F_s)_{\text {coll}} \doteq \mathcal {C}_s}$. Note that these properties are not restricted to any particular $\smash {\mathcal {H}_s}$. Also note that if applied in proper variables (§ 3.1.2), our formula (6.69) can describe collisions in strong background fields. This topic, including comparison with the relevant literature, is left to future work.

6.9. Summary of § 6

Let us summarize the above general results (for examples, see § 9). We consider species $\smash {s}$ governed by a Hamiltonian of the form

(6.74)

\begin{equation} H_s = H_{0s} + \widehat{\boldsymbol{\alpha}}_s^{{\dagger}}({\boldsymbol{p}}) \widetilde{{\boldsymbol{\varPsi}}} + \frac{1}{2}\,(\widehat{\boldsymbol{L}}_s\widetilde{{\boldsymbol{\varPsi}}})^{{\dagger}} (\widehat{\boldsymbol{R}}_s\widetilde{{\boldsymbol{\varPsi}}}), \end{equation}

where $\smash {\widetilde {{\boldsymbol {\varPsi }}}}$ is a real oscillating field (of any dimension $\smash {M}$), which generally consists of a macroscopic part $\smash {\underline {\widetilde {{\boldsymbol {\varPsi }}}}}$ and a microscopic part $\smash {\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle\thicksim}$}} {\widetilde {{\boldsymbol {\varPsi }}}}}$. The term $\smash {H_{0s}}$ is independent of $\smash {\widetilde {{\boldsymbol {\varPsi }}}}$, and the operators $\smash {\smash {\widehat {\boldsymbol {\alpha }}}_s^{{\dagger}} }$, $\smash {\widehat {\boldsymbol {L}}_s}$ and $\smash {\widehat {\boldsymbol {R}}_s}$ may be non-local in $t$ and ${\boldsymbol {x}}$ and may depend on the momentum ${\boldsymbol {p}}$ parametrically. The dynamics of this system averaged over the fast oscillations can be described in terms of the OC distribution function

(6.75)

\begin{equation} F_s = \overline{f}_s + \frac{1}{2}\,\frac{\partial}{\partial {\boldsymbol{p}}} \cdot \left({\boldsymbol{\Theta}}_s\, \frac{\partial \overline{f}_s}{\partial {\boldsymbol{p}}}\right) \end{equation}

(the index $\smash {^{\text {(m)}}}$ is henceforth omitted for brevity), which is governed by the following equation of the Fokker–Planck type:

(6.76)

\begin{equation} \frac{\partial F_s}{\partial t} - \frac{\partial \mathcal{H}_s}{\partial {\boldsymbol{x}}} \cdot \frac{\partial F_s}{\partial {\boldsymbol{p}}} + \frac{\partial \mathcal{H}_s}{\partial {\boldsymbol{p}}} \cdot \frac{\partial F_s}{\partial {\boldsymbol{x}}} = \frac{\partial}{\partial {\boldsymbol{p}}} \cdot \left({\boldsymbol {\mathsf{D}}}_s\,\frac{\partial F_s}{\partial {\boldsymbol{p}}}\right) + \mathcal{C}_s. \end{equation}

Here, $\smash {\mathcal {H}_s = H_{0s} + \varDelta _s}$ is the OC Hamiltonian, $\smash {{\boldsymbol {\Theta }}_s}$ is the dressing function and $\smash {\varDelta _s}$ is the total ponderomotive energy (i.e. the part of the OC Hamiltonian that is quadratic in $\smash {\widetilde {{\boldsymbol {\varPsi }}}}$), so $\smash {{\boldsymbol {v}}_s(t, {\boldsymbol {x}}, {\boldsymbol {p}}) \doteq \partial _{\boldsymbol {p}}\mathcal {H}_s}$ is the OC velocity. Specifically,

(6.77a)

\begin{align} {{\boldsymbol {\mathsf{D}}}}_s & = \int \mathrm{d}{\boldsymbol{k}}\,{\rm \pi} \,{\boldsymbol{k}}{\boldsymbol{k}} \overline{{\mathsf{W}}}_s(t, {\boldsymbol{x}}, {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s, {\boldsymbol{k}}; {\boldsymbol{p}}), \end{align}

(6.77b)

\begin{align} {\boldsymbol{\Theta}}_s & =\frac{\partial}{\partial \vartheta} {\unicode{x2A0F}} \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\,{\boldsymbol{k}}{\boldsymbol{k}} \left. \frac{\overline{{\mathsf{W}}}_s(t, {\boldsymbol{x}}, \omega, {\boldsymbol{k}}; {\boldsymbol{p}})}{\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s + \vartheta} \right|_{\vartheta=0}, \end{align}

(6.77c)

\begin{align} \varDelta_s & = \frac{1}{2}\frac{\partial}{\partial {\boldsymbol{p}}} \cdot {\unicode{x2A0F}} \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\,{\boldsymbol{k}}\,\frac{\overline{{\mathsf{W}}}_s(t, {\boldsymbol{x}}, \omega, {\boldsymbol{k}}; {\boldsymbol{p}})}{\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s} \nonumber\\ & \quad +\frac{1}{2}\int \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\,\operatorname{tr}\big({{\boldsymbol {\mathsf{U}}}}{\boldsymbol{\wp}}_s\big)(t, {\boldsymbol{x}}, \omega, {\boldsymbol{k}}; {\boldsymbol{p}}). \end{align}

Here, $\smash {\overline {{\mathsf {W}}}_s = {\boldsymbol {\alpha }}_s^{{\dagger}} {{\boldsymbol {\mathsf {U}}}} {\boldsymbol {\alpha }}_s}$ is a scalar function, the average Wigner matrix $\smash {{{\boldsymbol {\mathsf {U}}}}}$ is understood as the Fourier spectrum of the symmetrized autocorrelation matrix of the macroscopic oscillations:

(6.78)

\begin{equation} {{\boldsymbol {\mathsf{U}}}}(t, {\boldsymbol{x}}, \omega, {\boldsymbol{k}}) = \int \frac{\mathrm{d}\tau}{2{\rm \pi}}\,\frac{\mathrm{d}{\boldsymbol{s}}}{(2{\rm \pi})^{n}}\,\, \overline{ {\underline{\widetilde{{\boldsymbol{\varPsi}}}}}(t + \tau/2, {\boldsymbol{x}} + {\boldsymbol{s}}/2)\, \smash{\underline{\widetilde{{\boldsymbol{\varPsi}}}}}^{{\dagger}}(t - \tau/2, {\boldsymbol{x}} - {\boldsymbol{s}}/2) } \,\mathrm{e}^{\mathrm{i}\omega \tau - \mathrm{i} {\boldsymbol{k}} \cdot {\boldsymbol{s}}}, \end{equation}

with $\smash {n \doteq \dim {\boldsymbol {x}}}$. Also, the vector $\smash {{\boldsymbol {\alpha }}_s(t, {\boldsymbol {x}}, \omega, {\boldsymbol {k}}; {\boldsymbol {p}})}$ is the Weyl symbol of $\smash {\widehat {\boldsymbol {\alpha }}_s}$ as defined in (2.26), $\smash {{\boldsymbol {\wp }}_s(t, {\boldsymbol {x}}, \omega, {\boldsymbol {k}}; {\boldsymbol {p}}) \approx ({\boldsymbol {L}}_s^{{\dagger}} {\boldsymbol {R}}_s)_\text {H}}$, $\smash {{\boldsymbol {L}}_s}$ and $\smash {{\boldsymbol {R}}_s}$ are the Weyl symbols of $\smash {\widehat {\boldsymbol {L}}_s}$ and $\smash {\widehat {\boldsymbol {R}}_s}$, respectively, and $\smash {_\text {H}}$ denotes the Hermitian part. The matrix $\smash {{{\boldsymbol {\mathsf {D}}}}_s}$ is positive-semidefinite and satisfies an $\smash {H}$-theorem of the form (5.46). Also, $\smash {\varDelta _s}$ satisfies the ‘$\smash {K}$–$\smash {\chi }$ theorem’

(6.79)

The matrix ${\boldsymbol {\varXi }}$ characterizes the collective plasma response to the field $\widetilde {{\boldsymbol {\varPsi }}}$ and is given by

(6.80)

\begin{equation} {\boldsymbol{\varXi}}\approx {\boldsymbol{\varXi}}_0 + \sum_s \int \mathrm{d}{\boldsymbol{p}}\, \Bigg( \frac{{\boldsymbol{\alpha}}_s({\boldsymbol{p}})\,{\boldsymbol{\alpha}}_s^{{\dagger}}({\boldsymbol{p}})}{\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s({\boldsymbol{p}}) + \mathrm{i} 0}\, {\boldsymbol{k}}\cdot \frac{\partial F_s({\boldsymbol{p}})}{\partial {\boldsymbol{p}}} - {\boldsymbol{\wp}}_s({\boldsymbol{p}})F_s({\boldsymbol{p}}) \Bigg). \end{equation}

Here, the arguments $\smash {(t, {\boldsymbol {x}}, \omega, {\boldsymbol {k}})}$ are omitted for brevity, $\smash {{\boldsymbol {\alpha }}_s{\boldsymbol {\alpha }}_s^{{\dagger}} }$ is a dyadic matrix and $\smash {{\boldsymbol {\varXi }}_0}$ is the symbol of the Hermitian dispersion operator $\smash {\widehat {\boldsymbol {\varXi }}_0}$ that governs the field $\smash {\widetilde {{\boldsymbol {\varPsi }}}}$ in the absence of plasma. Specifically, $\smash {\widehat {\boldsymbol {\varXi }}_0}$ is defined such that the field Lagrangian density without plasma is $\smash {\mathfrak {L}_0 = \smash {\widetilde {{\boldsymbol {\varPsi }}}}^{{\dagger}} \widehat {\boldsymbol {\varXi }}_0 \widetilde {{\boldsymbol {\varPsi }}}/2}$.

The spectrum of microscopic fluctuations (specifically, the spectrum of the symmetrized autocorrelation function of the microscopic field $\smash {\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle\thicksim}$}} {\widetilde {{\boldsymbol {\varPsi }}}}}$) is a positive-semidefinite matrix function and given by

(6.81)

\begin{align} {\boldsymbol{\mathfrak{W}}}(\omega, {\boldsymbol{k}}) = \frac{1}{(2{\rm \pi})^{n}} \sum_{s'}\int \mathrm{d}{\boldsymbol{p}}'\,&\delta(\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}'_{s'})F_{s'}({\boldsymbol{p}}') \nonumber\\ &\times {\boldsymbol{\varXi}}^{{-}1}(\omega, {\boldsymbol{k}}) ({\boldsymbol{\alpha}}_{s'}{\boldsymbol{\alpha}}_{s'}^{{\dagger}})(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}') {\boldsymbol{\varXi}}^{-{{\dagger}}}(\omega, {\boldsymbol{k}}), \end{align}

where $\smash {{\boldsymbol {v}}'_{s'} \doteq {\boldsymbol {v}}_{s'}({\boldsymbol {p}}')}$. (The dependence on t and $\boldsymbol{x}$ is assumed too but not emphasized.) The microscopic fluctuations give rise to a collision operator of the Balescu–Lenard type:

(6.82)

\begin{align} \mathcal{C}_s = \frac{\partial}{\partial {\boldsymbol{p}}}\cdot \sum_{s'}\int \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\,\mathrm{d}{\boldsymbol{p}}'\, &{\rm \pi} \,\delta({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s - {\boldsymbol{k}} \cdot {\boldsymbol{v}}'_{s'})\, \mathcal{Q}_{ss'}({\boldsymbol{k}} \cdot {\boldsymbol{v}}_{s}, {\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}')\nonumber\\ &\times {\boldsymbol{k}}{\boldsymbol{k}} \cdot \left( \frac{\partial F_s({\boldsymbol{p}})}{\partial {\boldsymbol{p}}}\,F_{s'}({\boldsymbol{p}}') - F_s({\boldsymbol{p}})\,\frac{\partial F_{s'}({\boldsymbol{p}}')}{\partial {\boldsymbol{p}}'} \right), \end{align}

where the coupling coefficient $\smash {\mathcal {Q}_{ss'}(\omega, {\boldsymbol {k}}; {\boldsymbol {p}}, {\boldsymbol {p}}') = \mathcal {Q}_{s's}(\omega, {\boldsymbol {k}}; {\boldsymbol {p}}', {\boldsymbol {p}})}$ is given by

(6.83)

\begin{align} \mathcal{Q}_{ss'}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}') &= (|\mathcal{X}_{ss'}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}')|^2 + |\mathcal{X}_{s's}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}', {\boldsymbol{p}})|^2)/2, \end{align}

(6.84)

\begin{align} \mathcal{X}_{ss'}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}') &\approx {\boldsymbol{\alpha}}_s^{{\dagger}}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}) {\boldsymbol{\varXi}}^{{-}1}(\omega, {\boldsymbol{k}}) {\boldsymbol{\alpha}}_{s'}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}'). \end{align}

The operator $\smash {\mathcal {C}_s}$ satisfies the $\smash {H}$-theorem and conserves particles, momentum and energy

\begin{equation*} \int \mathrm{d}{\boldsymbol{p}}\,\mathcal{C}_s = 0, \qquad \sum_s \int \mathrm{d}{\boldsymbol{p}}\,{\boldsymbol{p}}\mathcal{C}_s = 0, \qquad \sum_s \int \mathrm{d}{\boldsymbol{p}}\,\mathcal{H}_s\mathcal{C}_s = 0. \end{equation*}

7. Interaction with on-shell waves

Here, we discuss QL interaction of plasma with ‘on-shell’ waves, i.e. waves constrained by dispersion relations. To motivate the assumptions that will be adopted, and also to systematically introduce our notation, we start with briefly overviewing theory of linear waves in dispersive media (Whitham Reference Whitham1974; Tracy et al. Reference Tracy, Brizard, Richardson and Kaufman2014), including monochromatic waves (§ 7.1), conservative eikonal waves (§ 7.2), general eikonal waves (§ 7.3) and general broadband waves described by the WKE (§ 7.4). After that, we derive conservation laws for the total momentum and energy, which are exact within our model (§ 7.5). All waves in this section are considered macroscopic, so we adopt a simplified notation $\smash {\underline {\widetilde {{\boldsymbol {\varPsi }}}} \equiv \widetilde {{\boldsymbol {\varPsi }}}}$ and the index $\smash {^{(\text {m})}}$ will be omitted.

7.1. Monochromatic waves

Conservative (non-dissipative) waves can be described using the least-action principle $\delta S = 0$. Assuming the notation as in § 6.2, the action integral can be expressed as $S = \int \mathrm {d}{\boldsymbol {\mathsf {x}}}\,\mathfrak {L}$ with the Lagrangian density given by

(7.1)

\begin{equation} \mathfrak{L} = \frac{1}{2}\,\smash{\widetilde{{\boldsymbol{\varPsi}}}}^{{\dagger}}\widehat{\boldsymbol{\varXi}}_\text{H}\widetilde{{\boldsymbol{\varPsi}}}. \end{equation}

First, let us assume a homogeneous stationary medium, so $\smash {{\boldsymbol {\varXi }}_\text {H}(t, {\boldsymbol {x}}, \omega, {\boldsymbol {k}}) = {\boldsymbol {\varXi }}_\text {H}(\omega, {\boldsymbol {k}})}$. Because we assume real fields,Footnote ²⁹ $\smash {\left\langle {t_1, {\boldsymbol {x}}_1 |\widehat {\boldsymbol {\varXi }}|t_2, {\boldsymbol {x}}_2}\right\rangle}$ is real for all $\smash {(t_1, {\boldsymbol {x}}_1, t_2, {\boldsymbol {x}}_2)}$, one also has

(7.2)

\begin{equation} {\boldsymbol{\varXi}}_\text{H}(-\omega, -{\boldsymbol{k}}) = {\boldsymbol{\varXi}}_\text{H}^*(\omega, {\boldsymbol{k}}) = {\boldsymbol{\varXi}}_\text{H}^\intercal(\omega, {\boldsymbol{k}}), \end{equation}

where the latter equality is due to $\smash {{\boldsymbol {\varXi }}_\text {H}^{{\dagger}} (\omega, {\boldsymbol {k}}) = {\boldsymbol {\varXi }}_\text {H}(\omega, {\boldsymbol {k}})}$.

Because ${\boldsymbol {\varXi }}_\text {H}(\omega, {\boldsymbol {k}})$ is Hermitian, it has $M \doteq \dim {\boldsymbol {\varXi }}_\text {H}$ orthonormal eigenvectors ${\boldsymbol {\eta }}_b$:

(7.3)

\begin{equation} {\boldsymbol{\varXi}}_\text{H}(\omega, {\boldsymbol{k}}){\boldsymbol{\eta}}_b(\omega, {\boldsymbol{k}}) = \varLambda_b(\omega, {\boldsymbol{k}}){\boldsymbol{\eta}}_b(\omega, {\boldsymbol{k}}), \qquad {\boldsymbol{\eta}}_b^{{\dagger}}(\omega, {\boldsymbol{k}}){\boldsymbol{\eta}}_{b'}(\omega, {\boldsymbol{k}}) = \delta_{b,b'}. \end{equation}

Here, $\smash {\varLambda _b}$ are the corresponding eigenvalues, which are real and satisfy

(7.4)

\begin{equation} \varLambda_b(\omega, {\boldsymbol{k}}) = {\boldsymbol{\eta}}_b^{{\dagger}}(\omega, {\boldsymbol{k}}){\boldsymbol{\varXi}}_\text{H}(\omega, {\boldsymbol{k}}){\boldsymbol{\eta}}_b(\omega, {\boldsymbol{k}}). \end{equation}

Due to (7.2), one has

(7.5)

\begin{equation} \varLambda_b(-\omega, -{\boldsymbol{k}}) = \text{eigv}_b\, ({\boldsymbol{\varXi}}_\text{H}(\omega, {\boldsymbol{k}}))^\intercal = \text{eigv}_b\, {\boldsymbol{\varXi}}_\text{H}(\omega, {\boldsymbol{k}}) = \varLambda_b(\omega, {\boldsymbol{k}}), \end{equation}

where $\text {eigv}_b$ stands for the $b$th eigenvalue. Using this together with (7.2), one obtains from (7.3) that

(7.6)

\begin{equation} {\boldsymbol{\varXi}}_\text{H}^*(\omega, {\boldsymbol{k}}){\boldsymbol{\eta}}_b(-\omega, -{\boldsymbol{k}}) = \varLambda_b(\omega, {\boldsymbol{k}}){\boldsymbol{\eta}}_b(-\omega, -{\boldsymbol{k}}), \end{equation}

whence

(7.7)

\begin{equation} {\boldsymbol{\eta}}_b(-\omega, -{\boldsymbol{k}}) = {\boldsymbol{\eta}}_b^*(\omega, {\boldsymbol{k}}). \end{equation}

Let us consider a monochromatic wave of the form

(7.8)

with real frequency $\overline {\omega }$, real wavevector $\overline {{\boldsymbol {k}}}$ and complex amplitude ${\breve {{\boldsymbol {\varPsi }}}}$. For such a wave, the action integral can be expressed as $\smash {S = \int \mathrm {d}{\boldsymbol {\mathsf {x}}}\,\overline {\mathfrak {L}}}$, where the average Lagrangian density $\overline {\mathfrak {L}}$ is given byFootnote ³⁰

(7.9)

\begin{equation} \overline{\mathfrak{L}} = \frac{1}{2}\,\overline{\smash{\widetilde{{\boldsymbol{\varPsi}}}}^{{\dagger}} \widehat{\boldsymbol{\varXi}}_\text{H} \widetilde{{\boldsymbol{\varPsi}}}} = \frac{1}{4}\,\operatorname{re}(\smash{{\breve{{\boldsymbol{\varPsi}}}}}^{{\dagger}} {\boldsymbol{\varXi}}_\text{H}(\overline{\omega}, \overline{{\boldsymbol{k}}}) {\breve{{\boldsymbol{\varPsi}}}}) = \frac{1}{4}\,\smash{{\breve{{\boldsymbol{\varPsi}}}}}^{{\dagger}} {\boldsymbol{\varXi}}_\text{H}(\overline{\omega}, \overline{{\boldsymbol{k}}}) {\breve{{\boldsymbol{\varPsi}}}}. \end{equation}

Let us decompose ${\breve {{\boldsymbol {\varPsi }}}}$ in the basis formed by the eigenvectors $\smash {{\boldsymbol {\eta }}_b}$, that is, as

(7.10)

\begin{equation} {\breve{{\boldsymbol{\varPsi}}}} = \sum_b {\boldsymbol{\eta}}_b {\breve{a}}^b. \end{equation}

Then, (7.9) becomes

(7.11)

\begin{equation} \overline{\mathfrak{L}} = \frac{1}{4}\sum_b \varLambda_b(\overline{\omega}, \overline{{\boldsymbol{k}}})\,|{\breve{a}}^b|^2. \end{equation}

The real and imaginary parts of the amplitudes ${\breve {a}}^b$ can be treated as independent variables. This is equivalent to treating ${\breve {a}}^{b*}$ and ${\breve {a}}^b$ as independent variables, so one arrives at the following Euler–Lagrange equations:

(7.12)

\begin{equation} 0 = \frac{\delta S[{\breve{{\boldsymbol{a}}}}^*, {\breve{{\boldsymbol{a}}}}]}{\delta {\breve{a}}^{b*}} = \frac{1}{4}\,\varLambda_b(\overline{\omega}, \overline{{\boldsymbol{k}}}){\breve{a}}^b, \qquad 0 = \frac{\delta S[{\breve{{\boldsymbol{a}}}}^*, {\breve{{\boldsymbol{a}}}}]}{\delta {\breve{a}}^b} = \frac{1}{4}\,{\breve{a}}^{b*}\varLambda_b(\overline{\omega}, \overline{{\boldsymbol{k}}}). \end{equation}

Hence the $b$th mode with a non-zero amplitude ${\breve {a}}^b$ satisfies the dispersion relation

(7.13)

\begin{equation} 0 = \varLambda_b(\overline{\omega}, \overline{{\boldsymbol{k}}}) = \varLambda_b(-\overline{\omega}, -\overline{{\boldsymbol{k}}}). \end{equation}

Equation (7.13) determines a dispersion surface in the ${\boldsymbol {\mathsf {k}}}$ space where the waves can have non-zero amplitude. This surface is sometimes called a shell, so waves constrained by a dispersion relation are called on-shell. Also note that combining (7.13) with (7.3) yields that on-shell waves satisfy

(7.14)

\begin{equation} {\boldsymbol{\varXi}}_\text{H}(\overline{\omega}, \overline{{\boldsymbol{k}}}){\boldsymbol{\eta}}_b(\overline{\omega}, \overline{{\boldsymbol{k}}}) = {\boldsymbol{0}}, \qquad {\boldsymbol{\eta}}_b^{{\dagger}}(\overline{\omega}, \overline{{\boldsymbol{k}}}){\boldsymbol{\varXi}}_\text{H}(\overline{\omega}, \overline{{\boldsymbol{k}}}) = {\boldsymbol{0}}, \end{equation}

which are two mutually adjoint representations of the same equation.

Below, we consider the case when (7.13) is satisfied only for one mode at a time, so summation over $b$ and the index $b$ itself can be omitted. (A more general case is discussed, for example, in Dodin et al. (Reference Dodin, Ruiz, Yanagihara, Zhou and Kubo2019).) Then, $\smash {{\breve {{\boldsymbol {\varPsi }}}} = {\boldsymbol {\eta }}(\overline {\omega }, \overline {{\boldsymbol {k}}}){\breve {a}}}$,

(7.15)

\begin{equation} \overline{\mathfrak{L}} = \frac{1}{4}\,\varLambda(\overline{\omega}, \overline{{\boldsymbol{k}}})|{\breve{a}}|^2, \end{equation}

and $\overline {\omega }$ is connected with $\overline {{\boldsymbol {k}}}$ via $\smash {\overline {\omega } = w(\overline {{\boldsymbol {k}}})}$, where $w({\boldsymbol {k}}) = -w(-{\boldsymbol {k}})$ is the function that solves $\varLambda (w({\boldsymbol {k}}), {\boldsymbol {k}}) = 0$. Also importantly, (7.14) ensures that

(7.16)

\begin{align} \partial_{\unicode{x25AA}} \varLambda(\overline{\omega}, \overline{{\boldsymbol{k}}}) & = ((\partial_{\unicode{x25AA}} {\boldsymbol{\eta}}^{{\dagger}}) {\boldsymbol{\varXi}}_\text{H}{\boldsymbol{\eta}} + {\boldsymbol{\eta}}^{{\dagger}}(\partial_{\unicode{x25AA}} {\boldsymbol{\varXi}}_\text{H}){\boldsymbol{\eta}} + {\boldsymbol{\eta}}^{{\dagger}}{\boldsymbol{\varXi}}_\text{H}(\partial_{\unicode{x25AA}} {\boldsymbol{\eta}}))\big|_{(\omega, {\boldsymbol{k}}) = (\overline{\omega}, \overline{{\boldsymbol{k}}})} \nonumber\\ & = ({\boldsymbol{\eta}}^{{\dagger}}(\partial_{\unicode{x25AA}} {\boldsymbol{\varXi}}_\text{H}){\boldsymbol{\eta}})\big|_{(\omega, {\boldsymbol{k}}) = (\overline{\omega}, \overline{{\boldsymbol{k}}})}, \end{align}

where $\smash {{\unicode{x25AA}} }$ can be replaced with any variable.

7.2. Conservative eikonal waves

7.2.1. Basic properties

In case of a quasimonochromatic eikonal wave and, possibly, inhomogeneous non-stationary plasma, one can apply the same arguments as in § 7.1 except the above equalities are now satisfied up to $\smash {\mathcal {O}(\epsilon )}$. For a single-mode wave, one has

(7.17)

\begin{equation} \widetilde{{\boldsymbol{\varPsi}}}(t, {\boldsymbol{x}}) = \operatorname{re}(\widetilde{{\boldsymbol{\varPsi}}}_{\text{c}}(t, {\boldsymbol{x}})) + \mathcal{O}(\epsilon), \qquad \widetilde{{\boldsymbol{\varPsi}}}_{\text{c}} = \mathrm{e}^{\mathrm{i} \theta(t, {\boldsymbol{x}})} {\boldsymbol{\eta}}(t, {\boldsymbol{x}}) {\breve{a}}(t, {\boldsymbol{x}}), \end{equation}

where the local frequency and the wavevector,

(7.18)

\begin{equation} \overline{\omega} \doteq{-} \partial_t\theta, \qquad \overline{{\boldsymbol{k}}} \doteq \partial_{{\boldsymbol{x}}} \theta \end{equation}

are slow functions of $\smash {(t, {\boldsymbol {x}})}$, and so is $\smash {{\boldsymbol {\eta }}(t, {\boldsymbol {x}}) \doteq {\boldsymbol {\eta }}(t, {\boldsymbol {x}}, \overline {\omega }(t, {\boldsymbol {x}}), \overline {{\boldsymbol {k}}}(t, {\boldsymbol {x}}))}$, which satisfies (7.3). Then,

(7.19)

\begin{equation} \overline{\mathfrak{L}} = \frac{1}{4}\,\varLambda(t, {\boldsymbol{x}}, \overline{\omega}, \overline{{\boldsymbol{k}}})\,|{\breve{a}}(t, {\boldsymbol{x}})|^2 + \mathcal{O}(\epsilon). \end{equation}

Within the leading-order theory, the term $\smash {\mathcal {O}(\epsilon )}$ is neglected.Footnote ³¹ Then, the least-action principle

(7.20)

\begin{equation} 0 = \frac{\delta S[\theta, {\breve{{\boldsymbol{a}}}}^*, {\breve{{\boldsymbol{a}}}}]}{\delta {\breve{a}}^{b*}} \approx \frac{1}{4}\,\varLambda_b(\overline{\omega}, \overline{{\boldsymbol{k}}}){\breve{a}}^b, \qquad 0 = \frac{\delta S[\theta, {\breve{{\boldsymbol{a}}}}^*, {\breve{{\boldsymbol{a}}}}]}{\delta {\breve{a}}^b} \approx \frac{1}{4}\,{\breve{a}}^{b*}\varLambda_b(\overline{\omega}, \overline{{\boldsymbol{k}}}) \end{equation}

leads to the same (but now local) dispersion relation as for monochromatic waves, $\smash {\varLambda (t, {\boldsymbol {x}}, \overline {\omega }, \overline {{\boldsymbol {k}}}) = 0}$. This shows that quasimonochromatic waves are also on-shell, and thus they satisfy (7.16) as well. Also notice that the dispersion relation can now be understood as a Hamilton–Jacobi equation for the eikonal phase $\smash {\theta }$:

(7.21)

\begin{equation} \varLambda(t, {\boldsymbol{x}}, -\partial_t \theta, \partial_{{\boldsymbol{x}}}\theta) = 0. \end{equation}

Like in the previous section, let us introduce the function $\smash {w}$ that solves

(7.22)

\begin{equation} \varLambda(t, {\boldsymbol{x}}, w(t, {\boldsymbol{x}}, {\boldsymbol{k}}), {\boldsymbol{k}}) = 0 \end{equation}

and therefore satisfies

(7.23)

\begin{equation} w(t, {\boldsymbol{x}}, {\boldsymbol{k}}) ={-}w(t, {\boldsymbol{x}}, -{\boldsymbol{k}}). \end{equation}

Differentiating (7.22) with respect to $t$, ${\boldsymbol {x}}$ and ${\boldsymbol {k}}$ leads to

(7.24a)

\begin{align} & \partial_{t} \varLambda + (\partial_\omega \varLambda)\partial_{t}w = 0, \end{align}

(7.24b)

\begin{align} & \partial_{{\boldsymbol{x}}} \varLambda + (\partial_\omega \varLambda)\partial_{{\boldsymbol{x}}}w = 0, \end{align}

(7.24c)

\begin{align} & \partial_{{\boldsymbol{k}}} \varLambda + (\partial_\omega \varLambda)\partial_{{\boldsymbol{k}}}w = 0, \end{align}

where the derivatives of $\varLambda$ are evaluated at $\smash {(t, {\boldsymbol {x}}, w(t, {\boldsymbol {x}}, {\boldsymbol {k}}), {\boldsymbol {k}})}$. In particular, (7.24c) gives

(7.25)

\begin{equation} {\boldsymbol{v}}_{\text{g}} \doteq \frac{\partial w}{\partial {\boldsymbol{k}}} ={-} \frac{\partial_{{\boldsymbol{k}}} \varLambda}{\partial_\omega \varLambda}, \end{equation}

for the group velocity $\smash {{\boldsymbol {v}}_{\text {g}}}$, whose physical meaning is to be specified shortly.

Because $\theta$ is now an additional dynamical variable, one also obtains an additional Euler–Lagrange equation

(7.26)

\begin{equation} 0 = \delta_{\theta} S[\theta, {\breve{a}}^*, {\breve{a}}] = \partial_t\mathcal{I} + \partial_{{\boldsymbol{x}}} \cdot {\boldsymbol{\mathcal{J}}}, \end{equation}

where $\mathcal {I}$ is called the action density and ${\boldsymbol {\mathcal {J}}}$ is the action flux density:

(7.27)

\begin{align} & \mathcal{I} \doteq \frac{\partial \mathfrak{L}}{\partial \omega} = \frac{|{\breve{a}}|^2}{4}\frac{\partial \varLambda}{\partial \omega} = \frac{|{\breve{a}}|^2}{4}\,{\boldsymbol{\eta}}^{{\dagger}}\,\frac{\partial {\boldsymbol{\varXi}}_\text{H}}{\partial \omega}\, {\boldsymbol{\eta}}, \end{align}

(7.28)

\begin{align} & \mathcal{J}^i \doteq{-}\frac{\partial \mathfrak{L}}{\partial k_i} ={-} \frac{|{\breve{a}}|^2}{4}\frac{\partial \varLambda}{\partial k_i} ={-} \frac{|{\breve{a}}|^2}{4}\,{\boldsymbol{\eta}}^{{\dagger}}\,\frac{\partial {\boldsymbol{\varXi}}_\text{H}}{\partial k_i}\,{\boldsymbol{\eta}}, \end{align}

where we used (7.16) and the derivatives are evaluated on $\smash {(t, {\boldsymbol {x}}, w(t, {\boldsymbol {x}}, \overline {{\boldsymbol {k}}}(t, {\boldsymbol {x}})), \overline {{\boldsymbol {k}}}(t, {\boldsymbol {x}}))}$. Using (7.25), one can also rewrite (7.28) as

(7.29)

\begin{equation} {\boldsymbol{\mathcal{J}}} = \overline{{\boldsymbol{v}}}_{\text{g}} \mathcal{I}, \qquad \overline{{\boldsymbol{v}}}_{\text{g}}(t, {\boldsymbol{x}}) \doteq {\boldsymbol{v}}_{\text{g}}(t, {\boldsymbol{x}}, \overline{{\boldsymbol{k}}}(t, {\boldsymbol{x}})). \end{equation}

(The arguments $(t, {\boldsymbol {x}})$ will be omitted from now on for brevity. We will also use $({\boldsymbol {k}})$ as a shorthand for $(w({\boldsymbol {k}}), {\boldsymbol {k}})$ where applicable.) Then, (7.26) becomes

(7.30)

\begin{equation} \partial_t\mathcal{I} + \partial_{{\boldsymbol{x}}} \cdot (\overline{{\boldsymbol{v}}}_{\text{g}}\mathcal{I}) = 0,\end{equation}

which can be a understood as a continuity equation for quasiparticles (‘photons’ or, more generally, ‘wave quanta’) with density $\smash {\mathcal {I}}$ and fluid velocity $\smash {\overline {{\boldsymbol {v}}}_{\text {g}}}$ (see also § 7.2.2). Thus, if an eikonal wave satisfies the least-action principle, its total action $\smash {\int \mathrm {d}{\boldsymbol {x}}\,\mathcal {I}}$ (‘number of quanta’) is an invariant. This conservation law can be attributed to the fact that the wave Lagrangian density $\smash {\overline {\mathfrak {L}}}$ depends on derivatives of $\smash {\theta }$ but not on $\smash {\theta }$ per se.

Also notice the following. By expanding (7.19) in $\smash {\partial _t \theta }$ around $\smash {\partial _t \theta = -w(t, {\boldsymbol {x}}, \partial _{{\boldsymbol {x}}}\theta )}$, which is satisfied on any solution, one obtains

(7.31)

\begin{equation} \overline{\mathfrak{L}} \approx{-}\frac{1}{4}\,(\partial_t \theta + w(t, {\boldsymbol{x}}, \partial_{{\boldsymbol{x}}}\theta))\partial_\omega \varLambda\,|{\breve{a}}|^2 ={-}(\partial_t \theta + w(t, {\boldsymbol{x}}, \partial_{{\boldsymbol{x}}}\theta))\mathcal{I}, \end{equation}

where we used that $\smash {\overline {\mathfrak {L}}(t, {\boldsymbol {x}}, -\partial _t\theta, \partial _{{\boldsymbol {x}}}\theta ) = 0}$ due to (7.21). Then, one arrives at the canonical form of the action integral (Hayes Reference Hayes1973)

(7.32)

\begin{equation} S[\mathcal{I}, \theta] ={-} \int \mathrm{d} t\,\mathrm{d}{\boldsymbol{x}}\,(\partial_t \theta + w(t, {\boldsymbol{x}}, {\boldsymbol{k}}))\mathcal{I}. \end{equation}

From here, $\smash {\delta _{\mathcal {I}} S = 0}$ yields the dispersion relation in the Hamilton–Jacobi form $\smash {\partial _t \theta + w(t, {\boldsymbol {x}}, {\boldsymbol {k}}) = 0}$, and $\smash {\delta _{\theta} S = 0}$ yields the action conservation (7.30).

7.2.2. Ray equations

By (7.18), one has the so-called consistency relations

(7.33)

\begin{equation} \partial_t \overline{k}_i + \partial_i \overline{\omega} = 0, \qquad \partial_i \overline{k}_j = \partial_j \overline{k}_i. \end{equation}

These lead to

(7.34)

\begin{align} \bigg(\frac{\partial}{\partial t} & + \overline{{\boldsymbol{v}}}_{\text{g}} \cdot \frac{\partial}{\partial {\boldsymbol{x}}}\bigg) \overline{k}_i(t, {\boldsymbol{x}}) ={-} \frac{\partial w(t, {\boldsymbol{x}}, \overline{{\boldsymbol{k}}}(t, {\boldsymbol{x}}))}{\partial x^i} + \overline{{\boldsymbol{v}}}_{\text{g}} \cdot \frac{\partial \overline{k}_i(t, {\boldsymbol{x}})}{\partial {\boldsymbol{x}}} \nonumber\\ & ={-} \left(\frac{\partial w(t, {\boldsymbol{x}}, {\boldsymbol{k}})}{\partial x^i} \right)_{{\boldsymbol{k}} = \overline{{\boldsymbol{k}}}(t, {\boldsymbol{x}})} - \overline{v}_{\text{g}}^j\,\frac{\partial \overline{k}_j(t, {\boldsymbol{x}})}{\partial x^i} + \overline{v}_{\text{g}}^j\, \frac{\partial \overline{k}_i(t, {\boldsymbol{x}})}{\partial x^j} \nonumber\\ & ={-} \left(\frac{\partial w(t, {\boldsymbol{x}}, {\boldsymbol{k}})}{\partial x^i}\right)_{{\boldsymbol{k}} = \overline{{\boldsymbol{k}}}(t, {\boldsymbol{x}})}, \end{align}

and similarly,

(7.35)

\begin{align} \bigg(\frac{\partial}{\partial t} & + \overline{{\boldsymbol{v}}}_{\text{g}} \cdot \frac{\partial}{\partial {\boldsymbol{x}}}\bigg) \overline{\omega}(t, {\boldsymbol{x}}) = \bigg(\frac{\partial}{\partial t} + \overline{{\boldsymbol{v}}}_{\text{g}} \cdot \frac{\partial}{\partial {\boldsymbol{x}}}\bigg) w(t, {\boldsymbol{x}}, \overline{{\boldsymbol{k}}}(t, {\boldsymbol{x}})) \nonumber\\ & = \left(\frac{\partial w(t, {\boldsymbol{x}}, {\boldsymbol{k}})}{\partial t} + \overline{v}_{\text{g}}^i\, \frac{\partial w(t, {\boldsymbol{x}}, {\boldsymbol{k}})}{\partial x^i}\right)_{{\boldsymbol{k}} = \overline{{\boldsymbol{k}}}(t, {\boldsymbol{x}})} + \overline{v}_{\text{g}}^i \left(\frac{\partial}{\partial t}+ \overline{{\boldsymbol{v}}}_{\text{g}} \cdot \frac{\partial}{\partial {\boldsymbol{x}}}\right) \overline{k}_i(t, {\boldsymbol{x}})\nonumber\\ & = \left(\frac{\partial w(t, {\boldsymbol{x}}, {\boldsymbol{k}})}{\partial t}\right)_{{\boldsymbol{k}} = \overline{{\boldsymbol{k}}}(t, {\boldsymbol{x}})}, \end{align}

where we used (7.34). Using the convective derivative associated with the group velocity,

(7.36)

\begin{equation} \mathrm{d}/\mathrm{d} t \equiv \mathrm{d}_t \doteq \partial_t + (\overline{{\boldsymbol{v}}}_{\text{g}} \cdot \partial_{\boldsymbol{x}}), \end{equation}

one can rewrite these compactly as

(7.37)

\begin{equation} \frac{\mathrm{d}\overline{k}_i(t, {\boldsymbol{x}})}{\mathrm{d} t} ={-} \left(\frac{\partial w(t, {\boldsymbol{x}}, {\boldsymbol{k}})}{\partial x^i}\right)_{{\boldsymbol{k}} = \overline{{\boldsymbol{k}}}(t, {\boldsymbol{x}})}, \quad \frac{\mathrm{d}\overline{\omega}(t, {\boldsymbol{x}})}{\mathrm{d} t} = \left(\frac{\partial w(t, {\boldsymbol{x}}, {\boldsymbol{k}})}{\partial t}\right)_{{\boldsymbol{k}} = \overline{{\boldsymbol{k}}}(t, {\boldsymbol{x}})}. \end{equation}

One can also represent (7.37) as ordinary differential equations for $\smash {\overline {{\boldsymbol {k}}}(t) \doteq \overline {{\boldsymbol {k}}}(t, \overline {{\boldsymbol {x}}}(t))}$ and $\smash {\overline {\omega }(t) \doteq \overline {\omega }(t, \overline {{\boldsymbol {x}}}(t))}$, where $\smash {\overline {{\boldsymbol {x}}}(t)}$ are the ‘ray trajectories’ governed by

(7.38)

\begin{equation} \frac{\mathrm{d}\overline{x}^i(t)}{\mathrm{d} t} = v_{\text{g}}^i(t, \overline{{\boldsymbol{x}}}(t), \overline{{\boldsymbol{k}}}(t)). \end{equation}

Specifically, together with (7.38), equations (7.37) become Hamilton's equations also known as the ray equations:

(7.39)

\begin{equation} \frac{\mathrm{d}\overline{x}^i}{\mathrm{d} t} = \frac{\partial w(t, \overline{{\boldsymbol{x}}}, \overline{{\boldsymbol{k}}})}{\partial \overline{k}_i}, \qquad \frac{\mathrm{d} \overline{k}_i}{\mathrm{d} t} ={-} \frac{\partial w(t, \overline{{\boldsymbol{x}}}, \overline{{\boldsymbol{k}}})}{\partial \overline{x}^i}, \qquad \frac{\mathrm{d} \overline{\omega}}{\mathrm{d} t} = \frac{\partial w(t, \overline{{\boldsymbol{x}}}, \overline{{\boldsymbol{k}}})}{\partial t}, \end{equation}

where $\smash {\overline {{\boldsymbol {x}}}}$ is the coordinate, $\smash {\hbar \overline {{\boldsymbol {k}}}}$ is the momentum, $\smash {\hbar \overline {\omega }}$ is the energy, $\smash {\hbar w}$ is the Hamiltonian and the constant factor $\smash {\hbar }$ can be anything. If $\smash {\hbar }$ is chosen to be the Planck constant, then (7.39) can be interpreted as the motion equations of individual wave quanta, for example, photons. Hamilton's equations for ‘true’ particles, such as electrons and ions, are also subsumed under (7.39) in that they can be understood as the ray equations of the particles considered as quantum-matter waves in the semiclassical limit.

Also notably, (7.39) can be obtained by considering the point-particle limit of (7.32) (Ruiz & Dodin Reference Ruiz and Dodin2015b). Specifically, adopting $\smash {\mathcal {I}(t, {\boldsymbol {x}}) \,\propto \, \delta ({\boldsymbol {x}} - \overline {{\boldsymbol {x}}}(t))}$ and taking the integral in (7.32) by parts leads to a canonical action $\smash {S\, \propto \, \int \mathrm {d} t\,(\overline {{\boldsymbol {k}}} \cdot \dot {\overline {{\boldsymbol {x}}}} - w(t, \overline {{\boldsymbol {x}}}, \overline {{\boldsymbol {k}}}))}$, whence Hamilton's equations follow as usual.

7.2.3. Wave momentum and energy

Using (7.30) and (7.36), one arrives at the following equality for any given field $\smash {{\mathsf {X}}}$:

(7.40)

\begin{align} \partial_t({\mathsf{X}} \mathcal{I})+\partial_{\boldsymbol{x}} \cdot ({\mathsf{X}} \mathcal{I} \overline{{\boldsymbol{v}}}_{\text{g}}) & =(\partial_t{\mathsf{X}})\mathcal{I}+{\mathsf{X}}(\partial_t\mathcal{I})+[\partial_{\boldsymbol{x}} \cdot (\mathcal{I} \overline{{\boldsymbol{v}}}_{\text{g}})]{\mathsf{X}}+\mathcal{I}(\overline{{\boldsymbol{v}}}_{\text{g}}\cdot \partial_{\boldsymbol{x}} ){\mathsf{X}} \nonumber\\ &=\mathcal{I}[\partial_t+(\overline{{\boldsymbol{v}}}_{\text{g}}\cdot \partial_{\boldsymbol{x}})]{\mathsf{X}}+{\mathsf{X}}[\partial_t\mathcal{I}+\partial_{\boldsymbol{x}} \cdot (\mathcal{I}\overline{{\boldsymbol{v}}}_{\text{g}})] \nonumber\\ &=\mathcal{I}\,\mathrm{d}_t{\mathsf{X}}. \end{align}

For $\smash {{\mathsf {X}} = \overline {k}_i}$ and $\smash {{\mathsf {X}} = \overline {\omega }}$, (7.40) yields, respectively,

(7.41a)

\begin{align} \partial_t P_{\text{w}, i} + \partial_{{\boldsymbol{x}}} \cdot (\overline{{\boldsymbol{v}}}_{\text{g}} P_{\text{w}, i}) & ={-} \mathcal{I}\partial_i w, \end{align}

(7.41b)

\begin{align} \partial_t\mathcal{E}_{\text{w}} + \partial_{{\boldsymbol{x}}} \cdot (\overline{{\boldsymbol{v}}}_{\text{g}}\mathcal{E}_{\text{w}}) & = \mathcal{I}\partial_t w, \end{align}

where we used (7.37) and introduced the following notation:

(7.42)

\begin{equation} {\boldsymbol{P}}_{\text{w}} \doteq \overline{{\boldsymbol{k}}}\mathcal{I}, \qquad \mathcal{E}_{\text{w}} \doteq \overline{\omega}\mathcal{I}. \end{equation}

When a medium is homogeneous along $\smash {x^i}$, (7.41a) yields $\smash {\int \mathrm {d}{\boldsymbol {x}}\,P_{\text {w},i} = \text {const}}$. Likewise, when a medium is stationary, (7.41b) yields $\smash {\int \mathrm {d}{\boldsymbol {x}}\,\mathcal {E}_{\text {w}} = \text {const}}$. Hence, by definition, $\smash {{\boldsymbol {P}}_{\text {w}}}$ and $\smash {\mathcal {E}_{\text {w}}}$ are the densities of the wave canonical momentum and energy, at least up to a constant factor $\smash {\kappa }$.Footnote ³² A proof that $\smash {\kappa = 1}$ can be found, for example, in Dodin & Fisch (Reference Dodin and Fisch2012). In § 7.5, we will show this using different arguments.

7.3. Non-conservative eikonal waves

In a medium with non-zero $\smash {{\boldsymbol {\varXi }}_\text {A}}$, where waves are non-conservative, the wave properties are defined as in the previous section but the wave action evolves differently. The variational principle is not easy to apply in this case (however, see Dodin et al. Reference Dodin, Zhmoginov and Ruiz2017), so a different approach will be used to derive the action equation. A more straightforward but less intuitive approach can be found in Dodin et al. (Reference Dodin, Ruiz, Yanagihara, Zhou and Kubo2019) and McDonald (Reference McDonald1988).

7.3.1. Monochromatic waves

First, consider a homogeneous stationary medium and a ‘monochromatic’ (exponentially growing at a constant rate) wave field in the form

(7.43)

\begin{equation} \widetilde{{\boldsymbol{\varPsi}}}(t, {\boldsymbol{x}}) = \operatorname{re}(\mathrm{e}^{-\mathrm{i} \overline{\omega}t + \mathrm{i} \overline{{\boldsymbol{k}}} \cdot {\boldsymbol{x}}}\,{\breve{{\boldsymbol{\varPsi}}}}_{\text{c}}), \qquad {\breve{{\boldsymbol{\varPsi}}}}_{\text{c}} = \mathrm{e}^{\overline{\gamma}t} \times \text{const}, \end{equation}

where the constants $\smash {\overline {\omega }}$ and $\smash {\overline {{\boldsymbol {k}}}}$ are, as usual, the real frequency and wavenumber, and $\smash {\overline {\gamma }}$ is the linear growth rate, which can have either sign. Then, (6.15) becomes

(7.44)

\begin{equation} {\boldsymbol{0}} = {\boldsymbol{\varXi}}(\overline{\omega} + \mathrm{i}\overline{\gamma}, \overline{{\boldsymbol{k}}}){\breve{{\boldsymbol{\varPsi}}}}_{\text{c}} = {\boldsymbol{\varXi}}_\text{H}(\overline{\omega}, \overline{{\boldsymbol{k}}}){\breve{{\boldsymbol{\varPsi}}}}_{\text{c}} + \mathrm{i}(\overline{\gamma} \partial_\omega {\boldsymbol{\varXi}}_\text{H}(\overline{\omega}, \overline{{\boldsymbol{k}}}) + {\boldsymbol{\varXi}}_\text{A}(\overline{\omega}, \overline{{\boldsymbol{k}}}) ){\breve{{\boldsymbol{\varPsi}}}}_{\text{c}} + \mathcal{O}(\epsilon^2), \end{equation}

where we assume that $\smash {{\boldsymbol {\varXi }}}$ is a smooth function of $\smash {\omega }$ and also that both $\smash {{\boldsymbol {\varXi }}_\text {A}}$ and $\smash {\overline {\gamma }}$ are $\smash {\mathcal {O}(\epsilon )}$. Like in § 7.2.1, we adopt $\smash {{\breve {{\boldsymbol {\varPsi }}}}_{\text {c}} = {\boldsymbol {\eta }}{\breve {a}} + \mathcal {O}(\epsilon )}$, where the polarization vector $\smash {{\boldsymbol {\eta }}}$ is the relevant eigenvector of $\smash {{\boldsymbol {\varXi }}_\text {H}}$. Then, by projecting (7.44) on $\smash {{\boldsymbol {\eta }}}$, one obtains

(7.45)

\begin{equation} 0 = \varLambda(\overline{\omega}, \overline{{\boldsymbol{k}}}){\breve{a}} + \mathrm{i}(\overline{\gamma} \partial_\omega \varLambda + {\boldsymbol{\eta}}^{{\dagger}}{\boldsymbol{\varXi}}_\text{A} {\boldsymbol{\eta}} )\big|_{(\omega, {\boldsymbol{k}}) = (\overline{\omega}, \overline{{\boldsymbol{k}}})}{\breve{a}} + \mathcal{O}(\epsilon^2), \end{equation}

where $\smash {\varLambda = {\boldsymbol {\eta }}^{{\dagger}} {\boldsymbol {\varXi }}_\text {H}{\boldsymbol {\eta }}}$ is the corresponding eigenvalue of $\smash {{\boldsymbol {\varXi }}_\text {H}}$ and we used (7.16). Let us neglect $\smash {\mathcal {O}(\epsilon ^2)}$, divide (7.45) by $\smash {{\breve {a}}}$ and consider the real and imaginary parts of the resulting equation separately:

(7.46)

\begin{equation} \varLambda(\overline{\omega}, \overline{{\boldsymbol{k}}}) = 0, \qquad (\overline{\gamma} \partial_\omega \varLambda + {\boldsymbol{\eta}}^{{\dagger}}{\boldsymbol{\varXi}}_\text{A} {\boldsymbol{\eta}} )\big|_{(\omega, {\boldsymbol{k}})} = 0. \end{equation}

The former is the same dispersion relation for $\smash {\overline {\omega }}$ as for conservative waves, and the latter yields $\overline {\gamma } = \gamma ({\boldsymbol {\overline {k}}})$, where

(7.47)

\begin{equation} \gamma({\boldsymbol{k}}) \doteq{-} \frac{{\boldsymbol{\eta}}^{{\dagger}} {\boldsymbol{\varXi}}_\text{A} {\boldsymbol{\eta}}}{\partial_{\omega}\varLambda}. \end{equation}

Because $\smash {|{\breve {a}}|\, \propto \, \mathrm {e}^{\overline {\gamma }t}}$, one can write the amplitude equation as

(7.48)

\begin{equation} \partial_t |{\breve{a}}|^2 = 2 \overline{\gamma} |{\breve{a}}|^2. \end{equation}

One can also define the action density $\smash {\mathcal {I}}$ as in § 7.2.1 and rewrite (7.48) in terms of that. Because $\smash {\mathcal {I} = |{\breve {a}}|^2 \times \text {const}}$, one obtains

(7.49)

\begin{equation} \partial_t \mathcal{I} = 2 \overline{\gamma} \mathcal{I}. \end{equation}

7.3.2. Non-monochromatic waves

When weak inhomogeneity and weak dissipation coexist, their effect on the action density is additive, so (7.30) and (7.49) merge into a general equation

(7.50)

\begin{equation} \partial_t\mathcal{I} + \partial_{\boldsymbol{x}} (\overline{{\boldsymbol{v}}}_{\text{g}} \mathcal{I}) = 2 \overline{\gamma} \mathcal{I}. \end{equation}

(A formal derivation of (7.50), which uses the Weyl expansion (2.41) and projection of the field equation on the polarization vector, can be found in Dodin et al. Reference Dodin, Ruiz, Yanagihara, Zhou and Kubo2019.) Then, (7.40) is modified as follows:

(7.51)

\begin{equation} \partial_t({\mathsf{X}} \mathcal{I})+\partial_{\boldsymbol{x}} \cdot ({\mathsf{X}} \mathcal{I} \overline{{\boldsymbol{v}}}_{\text{g}}) = \mathcal{I}\,\mathrm{d}_t{\mathsf{X}} + 2 \overline{\gamma} {\mathsf{X}} \mathcal{I}, \end{equation}

and the equations (7.41) for the wave momentum and energy (7.42) become

(7.52a)

\begin{align} \partial_t P_{\text{w}, i} + \partial_{{\boldsymbol{x}}} \cdot (\overline{{\boldsymbol{v}}}_{\text{g}} P_{\text{w}, i}) & = 2 \overline{\gamma}P_{\text{w}, i} - \mathcal{I}\partial_i w, \end{align}

(7.52b)

\begin{align} \partial_t\mathcal{E}_{\text{w}} + \partial_{{\boldsymbol{x}}} \cdot (\overline{{\boldsymbol{v}}}_{\text{g}}\mathcal{E}_{\text{w}}) & = 2 \overline{\gamma}\mathcal{E}_{\text{w}} + \mathcal{I}\partial_t w. \end{align}

A comment is due here regarding the relation between (7.50) and the amplitude equation (7.48) that is commonly used in the standard QLT for homogeneous plasma (for example, see (2.21) in Drummond & Pines Reference Drummond and Pines1962). In a nutshell, the latter is incorrect, even when $\smash {\partial _{\boldsymbol {x}} = 0}$. Because $\smash {\overline {f}}$ is time-dependent, waves do not grow or decay exponentially. Rather, they can be considered as geometrical-optics (WKB) waves, and unlike in § 7.3.1, the ratio $\smash {|{\breve {a}}|^2/\mathcal {I}}$ generally evolves at a rate comparable to $\smash {\overline {\gamma }}$. The standard QLT remains conservative only because it also incorrectly replaces (3.19) with its stationary-plasma limit ($\smash {\epsilon = 0}$) and the two errors cancel each other. These issues are less of a problem for waves in not-too-hot plasmas (e.g. Langmuir waves), because in such plasmas, changing the distribution functions does not significantly affect the dispersion relations and thus $\smash {|{\breve {a}}|^2/\mathcal {I}}$ does in fact approximately remain constant. See also the discussion in § 9.1.4.

7.4. General waves

Let us now discuss a more general case that includes broadband waves. The evolution of such waves can be described statistically in terms of their average Wigner matrix $\smash {{{\boldsymbol {\mathsf {U}}}}}$. This matrix also determines the function $\smash {\overline {{\mathsf {W}}}_s}$ that is given by (6.37a) and enters the nonlinear potentials (6.77). Below, we derive the general form of $\smash {{{\boldsymbol {\mathsf {U}}}}}$ in terms of the phase-space action density $\smash {J}$ and the governing equation for $\smash {J}$ (§§ 7.4.1–7.4.3). Then, we also express the function $\smash {\overline {{\mathsf {W}}}_s}$ through $\smash {J}$ (§ 7.4.4). Related calculations can also be found in McDonald & Kaufman (Reference McDonald and Kaufman1985) and Ruiz (Reference Ruiz2017).

7.4.1. Average Wigner matrix of an eikonal-wave field

Let us start with calculating the average Wigner matrix of an eikonal field $\smash {\widetilde {{\boldsymbol {\varPsi }}}}$ of the form (7.17) (see also Appendix A.2). Using $\smash {\widetilde {{\boldsymbol {\varPsi }}} = (\smash {\widetilde {{\boldsymbol {\varPsi }}}}_{\text {c}} + \smash {\widetilde {{\boldsymbol {\varPsi }}}}_{\text {c}}^*)/2}$, it can be readily expressed through the average Wigner functions of the complexified fieldFootnote ³³ $\smash {\smash {\widetilde {{\boldsymbol {\varPsi }}}}_{\text {c}}}$ and of its complex conjugate:

(7.53)

\begin{equation} {{\boldsymbol {\mathsf{U}}}} \approx (\overline{{{\boldsymbol {\mathsf{W}}}}}_{\smash{\widetilde{{\boldsymbol{\varPsi}}}}_{\text{c}}} + \overline{{{\boldsymbol {\mathsf{W}}}}}_{\smash{\widetilde{{\boldsymbol{\varPsi}}}}_{\text{c}}^*})/4 \equiv ({{\boldsymbol {\mathsf{U}}}}_{\text{c}+} + {{\boldsymbol {\mathsf{U}}}}_{\text{c}-})/4. \end{equation}

For $\smash {\smash {\widetilde {{\boldsymbol {\varPsi }}}}_{\text {c}} = {\breve {a}}{\boldsymbol {\eta }}(\overline {\omega }, \overline {{\boldsymbol {k}}})\mathrm {e}^{\mathrm {i} \theta }}$, where the arguments $(t, {\boldsymbol {x}})$ are omitted for brevity, one has

(7.54)

\begin{equation} {{\boldsymbol {\mathsf{U}}}}_{\text{c}} \equiv {{\boldsymbol {\mathsf{U}}}}_{\text{c}+} \approx ({\boldsymbol{\eta}}{\boldsymbol{\eta}}^{{\dagger}})(\overline{\omega}, \overline{{\boldsymbol{k}}}) |{\breve{a}}|^2\int \frac{\mathrm{d}\tau\,\mathrm{d}{\boldsymbol{s}}}{(2{\rm \pi})^{{\mathsf{n}}}}\, \mathrm{e}^{\mathrm{i} \theta (t + \tau/2, {\boldsymbol{x}} + {\boldsymbol{s}}/2)} \mathrm{e}^{- \mathrm{i}\theta(t - \tau/2, {\boldsymbol{x}} - {\boldsymbol{s}}/2)} \mathrm{e}^{\mathrm{i} \omega \tau - \mathrm{i} {\boldsymbol{k}} \cdot {\boldsymbol{s}}}, \end{equation}

where we neglected the dependence of $\smash {{\breve {a}}}$ and $\smash {{\boldsymbol {\eta }}}$ on $\smash {(t, {\boldsymbol {x}})}$ because it is weak compared with that of $\smash {\mathrm {e}^{\pm \mathrm {i}\theta }}$. By Taylor-expanding $\smash {\theta }$, one obtains

\[ {{\boldsymbol {\mathsf{U}}}}_{\text{c}} \approx ({\boldsymbol{\eta}}{\boldsymbol{\eta}}^{{\dagger}})(\overline{\omega}, \overline{{\boldsymbol{k}}}) |{\breve{a}}|^2 \int \frac{\mathrm{d}\tau\,\mathrm{d}{\boldsymbol{s}}}{(2{\rm \pi})^{{\mathsf{n}}}}\, \mathrm{e}^{\mathrm{i} (\omega - \overline{\omega}) \tau - \mathrm{i} ({\boldsymbol{k}} - \overline{{\boldsymbol{k}}}) \cdot {\boldsymbol{s}}} = ({\boldsymbol{\eta}}{\boldsymbol{\eta}}^{{\dagger}})(\overline{\omega}, \overline{{\boldsymbol{k}}}) |{\breve{a}}|^2 \delta(\omega - \overline{\omega}) \delta({\boldsymbol{k}} - \overline{{\boldsymbol{k}}}). \]

For $\smash {\smash {\widetilde {{\boldsymbol {\varPsi }}}}_{\text {c}}^* = {\breve {a}}^*{\boldsymbol {\eta }}^*(\overline {\omega }, \overline {{\boldsymbol {k}}})\mathrm {e}^{-\mathrm {i} \theta }}$, which can also be written as $\smash {\smash {\widetilde {{\boldsymbol {\varPsi }}}}_{\text {c}}^* = {\breve {a}}^*{\boldsymbol {\eta }}(-\overline {\omega }, -\overline {{\boldsymbol {k}}})\mathrm {e}^{-\mathrm {i} \theta }}$ due to (7.7), the result is the same up to replacing $\smash {\overline {\omega } \to - \overline {\omega }}$ and $\smash {\overline {{\boldsymbol {k}}} \to -\overline {{\boldsymbol {k}}}}$. Also notice that

(7.55)

\begin{align} \delta(\omega \mp \overline{\omega}) \delta({\boldsymbol{k}} \mp \overline{{\boldsymbol{k}}}) & = \delta(\omega \mp w(\overline{{\boldsymbol{k}}})) \delta({\boldsymbol{k}} \mp \overline{{\boldsymbol{k}}}) \nonumber\\ & = \delta(\omega \mp w({\pm} {\boldsymbol{k}})) \delta({\boldsymbol{k}} \mp \overline{{\boldsymbol{k}}}) \nonumber\\ & = \delta(\omega - w({\boldsymbol{k}})) \delta({\boldsymbol{k}} \mp \overline{{\boldsymbol{k}}}), \end{align}

so one can rewrite $\smash {{{\boldsymbol {\mathsf {U}}}}_{\text {c}\pm }}$ as follows:

(7.56)

\begin{equation} {{\boldsymbol {\mathsf{U}}}}_{\text{c}\pm} = {\boldsymbol{\eta}}({\boldsymbol{k}}) {\boldsymbol{\eta}}^{{\dagger}}({\boldsymbol{k}}) |{\breve{a}}|^2 \delta(\omega - w({\boldsymbol{k}})) \delta({\boldsymbol{k}} \mp \overline{{\boldsymbol{k}}}), \end{equation}

where $\smash {({\boldsymbol {k}}) \equiv (w({\boldsymbol {k}}), {\boldsymbol {k}})}$. Thus finally,

(7.57)

\begin{equation} {{\boldsymbol {\mathsf{U}}}}(\omega, {\boldsymbol{k}}) \approx {\boldsymbol{\eta}}({\boldsymbol{k}}){\boldsymbol{\eta}}^{{\dagger}}({\boldsymbol{k}})\, |{\breve{a}}|^2 (\delta({\boldsymbol{k}} - \overline{{\boldsymbol{k}}}) + \delta({\boldsymbol{k}} + \overline{{\boldsymbol{k}}})) \delta(\omega - w({\boldsymbol{k}}))/4. \end{equation}

7.4.2. Average Wigner matrix of a general wave

Assuming the background medium is sufficiently smooth, a general wave field can be represented as a superposition of eikonal fields

(7.58)

\begin{equation} \textstyle \widetilde{{\boldsymbol{\varPsi}}} = \operatorname{re} \smash{\widetilde{{\boldsymbol{\varPsi}}}}_{\text{c}}, \qquad \smash{\widetilde{{\boldsymbol{\varPsi}}}}_{\text{c}} = \sum_\sigma \smash{\widetilde{{\boldsymbol{\varPsi}}}}_{\sigma,\text{c}}, \qquad \smash{\widetilde{{\boldsymbol{\varPsi}}}}_{\sigma,\text{c}} = {\breve{a}}_\sigma \mathrm{e}^{\mathrm{i}\theta_\sigma}. \end{equation}

As a quadratic functional, its average Wigner matrix $\smash {{{\boldsymbol {\mathsf {U}}}}}$ equals the sum of the average Wigner matrices $\smash {{{\boldsymbol {\mathsf {U}}}}_\sigma }$ of the individual eikonal waves:

(7.59)

\begin{equation} \textstyle {{\boldsymbol {\mathsf{U}}}} = \sum_\sigma {{\boldsymbol {\mathsf{U}}}}_\sigma = \sum_\sigma({{\boldsymbol {\mathsf{U}}}}_{\sigma,\text{c}+} + {{\boldsymbol {\mathsf{U}}}}_{\sigma,\text{c}-})/4, \end{equation}

where $\smash {{{\boldsymbol {\mathsf {U}}}}_{\sigma,\text {c}+} \equiv {{\boldsymbol {\mathsf {U}}}}_{\sigma,\text {c}}}$ and $\smash {{{\boldsymbol {\mathsf {U}}}}_{\sigma,\text {c}-}}$ are the average Wigner matrices of $\smash {\smash {\widetilde {{\boldsymbol {\varPsi }}}}_{\sigma,\text {c}}}$ and $\smash {\smash {\widetilde {{\boldsymbol {\varPsi }}}}_{\sigma,\text {c}}^*}$, respectively,

(7.60)

\begin{equation} {{\boldsymbol {\mathsf{U}}}}_{\sigma,\text{c}\pm} = {\boldsymbol{\eta}}({\boldsymbol{k}}){\boldsymbol{\eta}}^{{\dagger}}({\boldsymbol{k}})|{\breve{a}}_\sigma|^2 \delta({\boldsymbol{k}} \mp \overline{{\boldsymbol{k}}}_\sigma) \delta(\omega - w({\boldsymbol{k}})). \end{equation}

Equation (7.59) can also be expressed as

(7.61)

\begin{equation} \textstyle {{\boldsymbol {\mathsf{U}}}} = ({{\boldsymbol {\mathsf{U}}}}_{\text{c}+} + {{\boldsymbol {\mathsf{U}}}}_{\text{c}-})/4, \qquad {{\boldsymbol {\mathsf{U}}}}_{\text{c}\pm} = \sum_\sigma {{\boldsymbol {\mathsf{U}}}}_{\sigma,\text{c}\pm}, \end{equation}

where $\smash {{{\boldsymbol {\mathsf {U}}}}_{\text {c}\pm }}$ are the average Wigner matrices of $\smash {\smash {\widetilde {{\boldsymbol {\varPsi }}}}_{\text {c}}}$ and $\smash {\smash {\widetilde {{\boldsymbol {\varPsi }}}}_{\text {c}}^*}$, respectively,

(7.62)

\begin{equation} {{\boldsymbol {\mathsf{U}}}}_{\text{c}\pm} = {\boldsymbol{\eta}}({\boldsymbol{k}}){\boldsymbol{\eta}}^{{\dagger}}({\boldsymbol{k}}) h_{\text{c}\pm}({\boldsymbol{k}}) \delta(\omega - w({\boldsymbol{k}})), \qquad h_{\text{c}\pm}({\boldsymbol{k}}) \doteq \textstyle \sum_{\sigma} |{\breve{a}}_\sigma|^2\delta({\boldsymbol{k}} \mp \overline{{\boldsymbol{k}}}_\sigma). \end{equation}

Because $\smash {h_{\text {c}-}({\boldsymbol {k}}) = h_{\text {c}+}(-{\boldsymbol {k}}) \equiv h_{\text {c}}(-{\boldsymbol {k}})}$, the matrix $\smash {{{\boldsymbol {\mathsf {U}}}}}$ can also be written as follows:

(7.63)

\begin{equation} \textstyle {{\boldsymbol {\mathsf{U}}}}(\omega, {\boldsymbol{k}}) = ({\boldsymbol{\eta}} {\boldsymbol{\eta}}^{{\dagger}})({\boldsymbol{k}})\,(h({\boldsymbol{k}}) + h(-{\boldsymbol{k}}))\delta(\omega - w({\boldsymbol{k}})), \end{equation}

where $\smash {h({\boldsymbol {k}}) \doteq h_{\text {c}}({\boldsymbol {k}})/4}$ is given by

(7.64)

\begin{equation} \textstyle h({\boldsymbol{k}}) = \frac{1}{4} \sum_{\sigma} |{\breve{a}}_\sigma|^2 \delta({\boldsymbol{k}} - \overline{{\boldsymbol{k}}}_\sigma) \geqslant 0. \end{equation}

This shows that, for broadband waves comprised of eikonal waves, $\smash {{{\boldsymbol {\mathsf {U}}}}}$ has the same form as for an eikonal wave except ${h({\boldsymbol {k}})}$ is not necessarily delta-shaped.

7.4.3. Phase-space action density and the WKE

The wave equation for the complexified field ${{\widetilde {{\boldsymbol {\varPsi }}}}_{\text {c}}}$ can be written in the invariant form as ${\widehat {\boldsymbol {\varXi }}{\left. {\boldsymbol {\mathsf {{\widetilde {{\boldsymbol {|\varPsi }}}}_{\text {c}}}}} \right\rangle}= {\left. {\boldsymbol {\mathsf {\widehat {\boldsymbol {|0}}}}} \right\rangle}}$. Multiplying it by $ {\left. {\boldsymbol {\mathsf {{\widetilde {{\boldsymbol {\varPsi }}}}_{\text {c}}|}}} \right\rangle}$ from the right leads to

(7.65)

This readily yields an equation for the Wigner matrix: $\smash {{\boldsymbol {\varXi }} \star {{\boldsymbol {\mathsf {U}}}}_{\text {c}} = {\boldsymbol {0}}}$. Let us integrate this equation over $\smash {\omega }$ to make the left-hand side a smooth function of $\smash {(t, {\boldsymbol {x}}, {\boldsymbol {k}})}$. Let us also take the trace of the resulting equation to put it in a scalar form

(7.66)

\begin{equation} \textstyle \operatorname{tr} \int \mathrm{d}\omega\,{\boldsymbol{\varXi}} \star {{\boldsymbol {\mathsf{U}}}}_{\text{c}} = 0. \end{equation}

As usual, we assume ${\boldsymbol {\varXi }} = {\boldsymbol {\varXi }}_\text {H} + \mathrm {i} {\boldsymbol {\varXi }}_\text {A}$ with ${\boldsymbol {\varXi }}_\text {A} = \mathcal {O}(\epsilon ) \ll {\boldsymbol {\varXi }}_\text {H} = \mathcal {O}(1)$ for generic $({\boldsymbol {\mathsf {x}}}, {\boldsymbol {\mathsf {k}}})$. The integrand in (7.66) can be written as $\smash {{\boldsymbol {\varXi }} \star {{\boldsymbol {\mathsf {U}}}}_{\text {c}} = {\boldsymbol {\varXi }}\mathrm {e}^{\mathrm {i}\widehat {\mathcal {L}}_{{\mathsf {x}}}/2}{{\boldsymbol {\mathsf {U}}}}_{\text {c}}}$, and its expansion in the differential operator $\smash {\widehat {\mathcal {L}}_{{\mathsf {x}}}}$ (2.32) contains derivatives of all orders. High-order derivatives on $\smash {{{\boldsymbol {\mathsf {U}}}}_{\text {c}}}$ are not negligible per se, because for on-shell waves this function is delta-shaped. However, using integration by parts, one can reapply all derivatives with respect to $\smash {\omega }$ to $\smash {{\boldsymbol {\varXi }}}$ and take the remaining derivatives (with respect to $\smash {t}$, $\boldsymbol{x}$ and $\boldsymbol{k}$) outside the integral. Then it is seen that each power $\smash {m}$ of $\smash {\widehat {\mathcal {L}}_{{\mathsf {x}}}}$ in the expansion of $\smash {{\boldsymbol {\varXi }}\mathrm {e}^{\mathrm {i}\widehat {\mathcal {L}}_{{\mathsf {x}}}/2}{{\boldsymbol {\mathsf {U}}}}_{\text {c}}}$ contributes $\smash {\mathcal {O}(\epsilon ^m)}$ to the integral. Let us neglect terms with $\smash {m \geqslant 2}$ and use (7.62). Hence, one obtainsFootnote ³⁴

(7.67)

\begin{align} 0 & \approx \operatorname{tr} \int \mathrm{d}\omega\,\left({\boldsymbol{\varXi}}_\text{H} {{\boldsymbol {\mathsf{U}}}}_{\text{c}} + \mathrm{i} {\boldsymbol{\varXi}}_\text{A} {{\boldsymbol {\mathsf{U}}}}_{\text{c}} + \frac{\mathrm{i}}{2}\,\lbrace {\boldsymbol{\varXi}}_\text{H}, {{\boldsymbol {\mathsf{U}}}}_{\text{c}} \rbrace_{{{\mathsf{x}}}}\right) \nonumber\\ & \approx ({\boldsymbol{\eta}}^{{\dagger}} {\boldsymbol{\varXi}}_\text{H} {\boldsymbol{\eta}} + \mathrm{i} {\boldsymbol{\eta}}^{{\dagger}} {\boldsymbol{\varXi}}_\text{A} {\boldsymbol{\eta}})h_{\text{c}} + \frac{\mathrm{i}}{2}\,\operatorname{tr} \int \mathrm{d}\omega\left( \frac{\partial {\boldsymbol{\varXi}}_\text{H}}{\partial {\mathsf{x}}^i}\frac{\partial {{\boldsymbol {\mathsf{U}}}}_{\text{c}}}{\partial {\mathsf{k}}_i} - \frac{\partial {\boldsymbol{\varXi}}_\text{H}}{\partial {\mathsf{k}}_i}\frac{\partial {{\boldsymbol {\mathsf{U}}}}_{\text{c}}}{\partial {\mathsf{x}}^i} \right). \end{align}

Let us also re-express this as follows, using (7.4) and (7.47):

(7.68)

\begin{align} 0 & \approx \left(\varLambda - \mathrm{i}\gamma\, \frac{\partial\varLambda}{\partial\omega}\right)h_{\text{c}} - \frac{\mathrm{i}}{2} \operatorname{tr} \int \mathrm{d}\omega\,\frac{\partial}{\partial \omega}\left(\frac{\partial {\boldsymbol{\varXi}}_\text{H}}{\partial t}\,{{\boldsymbol {\mathsf{U}}}}_{\text{c}}\right) + \frac{\mathrm{i}}{2}\frac{\partial}{\partial t}\operatorname{tr}\int \mathrm{d}\omega\,\frac{\partial {\boldsymbol{\varXi}}_\text{H}}{\partial \omega}\,{{\boldsymbol {\mathsf{U}}}}_{\text{c}} \nonumber\\ & \hspace{1.25in} + \frac{\mathrm{i}}{2}\frac{\partial}{\partial k_i} \operatorname{tr} \int \mathrm{d}\omega\,\frac{\partial {\boldsymbol{\varXi}}_\text{H}}{\partial x^i}\,{{\boldsymbol {\mathsf{U}}}}_{\text{c}} - \frac{\mathrm{i}}{2}\frac{\partial}{\partial x^i}\operatorname{tr}\int \mathrm{d}\omega\,\frac{\partial {\boldsymbol{\varXi}}_\text{H}}{\partial k_i}\,{{\boldsymbol {\mathsf{U}}}}_{\text{c}}. \end{align}

Clearly,

(7.69)

\begin{equation} \int \mathrm{d}\omega\,\frac{\partial}{\partial \omega}\left(\frac{\partial {\boldsymbol{\varXi}}_\text{H}}{\partial t}\,{{\boldsymbol {\mathsf{U}}}}_{\text{c}}\right) = 0. \end{equation}

To simplify the remaining terms, we proceed as follows. As a Hermitian matrix, $\smash {{\boldsymbol {\varXi }}_\text {H}}$ can be represented in terms of its eigenvalues $\smash {\varLambda _b}$ and eigenvectors $\smash {{\boldsymbol {\eta }}_b }$ as $\smash {{\boldsymbol {\varXi }}_\text {H} = \varLambda _b {\boldsymbol {\eta }}_b {\boldsymbol {\eta }}_b^{{\dagger}} }$. For $\smash {{{\boldsymbol {\mathsf {U}}}}_{\text {c}}}$, let us use (7.62) again, where $\smash {{\boldsymbol {\eta }}}$ is one of the vectors $\smash {{\boldsymbol {\eta }}_b}$, say, $\smash {{\boldsymbol {\eta }} \equiv {\boldsymbol {\eta }}_0}$. (Accordingly, $\smash {\varLambda \equiv \varLambda _0}$.) Then, for any $\smash {{\unicode{x25AA}} \in \lbrace \omega, x^i, k_i \rbrace }$, one has

(7.70)

\begin{align} \operatorname{tr}\int \mathrm{d}\omega \left(\frac{\partial {\boldsymbol{\varXi}}_\text{H}}{\partial {\unicode{x25AA}}}\,{{\boldsymbol {\mathsf{U}}}}_{\text{c}}\right) & = \frac{\partial \varLambda_b}{\partial {\unicode{x25AA}}}\, |{\boldsymbol{\eta}}_b^{{\dagger}} {\boldsymbol{\eta}}|^2 h_{\text{c}} + \varLambda_b\,\Big({\boldsymbol{\eta}}^{{\dagger}}\frac{\partial {\boldsymbol{\eta}}_b}{\partial{\unicode{x25AA}}}\Big)({\boldsymbol{\eta}}_b^{{\dagger}}{\boldsymbol{\eta}})h_{\text{c}} + \varLambda_b\,({\boldsymbol{\eta}}^{{\dagger}} {\boldsymbol{\eta}}_b)\Big(\frac{\partial {\boldsymbol{\eta}}_b^{{\dagger}}}{\partial{\unicode{x25AA}}}\,{\boldsymbol{\eta}}\Big)h_{\text{c}} \nonumber\\ & = \frac{\partial \varLambda_b}{\partial {\unicode{x25AA}}}\,(\delta_{b,0})^2h_{\text{c}} + \varLambda_b\,\Big({\boldsymbol{\eta}}^{{\dagger}}\frac{\partial {\boldsymbol{\eta}}_b}{\partial{\unicode{x25AA}}}\Big)\delta_{b,0} h_{\text{c}} + \varLambda_b\,\delta_{b,0}\Big(\frac{\partial {\boldsymbol{\eta}}_b^{{\dagger}}}{\partial{\unicode{x25AA}}}\,{\boldsymbol{\eta}}\Big)h_{\text{c}} \nonumber\\ & = \frac{\partial \varLambda}{\partial {\unicode{x25AA}}}\,h_{\text{c}} + \Big({\boldsymbol{\eta}}^{{\dagger}}\frac{\partial{\boldsymbol{\eta}}}{\partial{\unicode{x25AA}}} + \frac{\partial {\boldsymbol{\eta}}^{{\dagger}}}{\partial{\unicode{x25AA}}}\,{\boldsymbol{\eta}}\Big)\,\varLambda h_{\text{c}} \nonumber\\ & = \frac{\partial \varLambda}{\partial {\unicode{x25AA}}}\,h_{\text{c}} + \frac{\partial({\boldsymbol{\eta}}^{{\dagger}}{\boldsymbol{\eta}})}{\partial{\unicode{x25AA}}}\,\varLambda h_{\text{c}} \nonumber\\ & = \frac{\partial \varLambda}{\partial {\unicode{x25AA}}}\,h_{\text{c}}, \end{align}

where we used $\smash {{\boldsymbol {\eta }}_b^{{\dagger}} {\boldsymbol {\eta }} = \delta _{b,0}}$ and, in particular, $\smash {{\boldsymbol {\eta }}^{{\dagger}} {\boldsymbol {\eta }} = 1}$. Then, (7.68) can be written as

(7.71)

\begin{equation} \varLambda h_{\text{c}} - 2\mathrm{i} \mathscr{E} = 0, \end{equation}

where

(7.72)

\begin{equation} \mathscr{E} = 2\gamma\left(\frac{\partial\varLambda}{\partial\omega}\,h\right) -\frac{\partial}{\partial t}\left(\frac{\partial\varLambda}{\partial\omega}\,h\right) -\frac{\partial}{\partial k_i}\left(\frac{\partial\varLambda}{\partial x^i}\,h\right) +\frac{\partial}{\partial x^i}\left(\frac{\partial\varLambda}{\partial k_i}\,h\right). \end{equation}

The real part of (7.71) gives $\smash {\varLambda = 0}$, which is the dispersion relation. The imaginary part of (7.71) gives $\smash {\mathscr {E} = 0}$. To understand this equation, let us rewrite $\smash {\mathscr {E}}$ as

(7.73)

\begin{equation} \mathscr{E} = 2\gamma J -\frac{\partial J}{\partial t} +\frac{\partial}{\partial k_i}\left(\frac{\partial w}{\partial x^i}\,J\right) -\frac{\partial}{\partial x^i}\left(\frac{\partial w}{\partial k_i}\,J\right). \end{equation}

Here, we introduced

(7.74)

\begin{equation} J({\boldsymbol{k}}) \doteq h({\boldsymbol{k}})\,\partial_{\omega}\varLambda({\boldsymbol{k}}), \qquad \varLambda({\boldsymbol{k}}) \doteq \varLambda(w({\boldsymbol{k}}), {\boldsymbol{k}}), \end{equation}

which, according to (7.24), satisfy

(7.75a)

\begin{align} J \partial_t w({\boldsymbol{k}}) & ={-}h (\partial_t \varLambda)({\boldsymbol{k}}), \end{align}

(7.75b)

\begin{align} J \partial_{{\boldsymbol{x}}} w({\boldsymbol{k}}) & ={-}h (\partial_{{\boldsymbol{x}}} \varLambda)({\boldsymbol{k}}), \end{align}

(7.75c)

\begin{align} J \partial_{{\boldsymbol{k}}} w({\boldsymbol{k}}) & ={-}h (\partial_{{\boldsymbol{k}}} \varLambda)({\boldsymbol{k}}). \end{align}

Note that using (7.64), one can also express $\smash {J}$ as

(7.76)

\begin{equation} \textstyle J = \sum_{\sigma} \big(\frac{1}{4}\,|{\breve{a}}_\sigma|^2 \partial_{\omega}\varLambda({\boldsymbol{k}}_\sigma)\big)\delta({\boldsymbol{k}} - \overline{{\boldsymbol{k}}}_\sigma) = \sum_{\sigma} \mathcal{I}_\sigma \delta({\boldsymbol{k}} - \overline{{\boldsymbol{k}}}_\sigma), \end{equation}

where $\smash {\mathcal {I}_\sigma }$ are the action densities (7.27) of the individual eikonal waves that comprise the total wave field (§ 7.4.2). In particular, $\smash {\int \mathrm {d}{\boldsymbol {k}}\,J = \sum _\sigma \mathcal {I}_\sigma }$, which is the total action density. Therefore, the function $\smash {J}$ can be interpreted as the phase-space action density. In terms of $\smash {J}$, the equation $\smash {\mathscr {E} = 0}$ can be written as

(7.77)

\begin{equation} \frac{\partial J}{\partial t} + \frac{\partial w}{\partial k_i}\,\frac{\partial J}{\partial x^i} - \frac{\partial w}{\partial x^i}\,\frac{\partial J}{\partial k_i} = 2 \gamma J. \end{equation}

This equation, called the WKE, serves the same role in QL wave-kinetic theory as the Vlasov equation serves in plasma kinetic theory.Footnote ³⁵ Unlike the field equation used in the standard QLT (Drummond & Pines Reference Drummond and Pines1962), (7.77) exactly conserves the action of non-resonant waves, i.e. those with $\smash {\gamma = 0}$. Also note that (7.50) for eikonal waves can be deduced from (7.77) as a particular case by assuming the ansatz

(7.78)

\begin{equation} J(t, {\boldsymbol{x}}, {\boldsymbol{k}}) = \mathcal{I}(t, {\boldsymbol{x}})\delta({\boldsymbol{k}} - \overline{{\boldsymbol{k}}}(t, {\boldsymbol{x}})) \end{equation}

and integrating over $\smash {{\boldsymbol {k}}}$. In other words, eikonal-wave theory can be understood as the ‘cold-fluid’ limit of wave-kinetic theory.

7.4.4. Function ${\overline {\mathsf{{W}}}}_s$ in terms of ${J}$

Here we explicitly calculate the function (6.37a) that determines the nonlinear potentials (6.77). Using (7.63), one obtains

(7.79)

\begin{equation} \overline{{\mathsf{W}}}_s(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}) = |{\boldsymbol{\alpha}}_s^{{\dagger}}{\boldsymbol{\eta}}|^2({\boldsymbol{k}}; {\boldsymbol{p}})\,(h({\boldsymbol{k}}) + h(-{\boldsymbol{k}}))\,\delta(\omega - w({\boldsymbol{k}})) \geqslant 0, \end{equation}

where $\smash {({\boldsymbol {k}}; {\boldsymbol {p}}) \equiv (w({\boldsymbol {k}}), {\boldsymbol {k}}; {\boldsymbol {p}})}$. By definition of $\smash {\widehat {\boldsymbol {\alpha }}_s}$, the function $\smash {\left\langle {t_1, {\boldsymbol {x}}_1 | \widehat {\boldsymbol {\alpha }}_s |t_2, {\boldsymbol {x}}_2}\right\rangle}$ is real for all $\smash {(t_1, {\boldsymbol {x}}_1)}$ and $\smash {(t_2, {\boldsymbol {x}}_2)}$, so $\smash {{\boldsymbol {\alpha }}_s(-\omega, -{\boldsymbol {k}}) = {\boldsymbol {\alpha }}_s^*(\omega, {\boldsymbol {k}})}$ by definition of the Weyl symbol (2.26). Together with (7.7), this gives $\smash {|{\boldsymbol {\alpha }}_s^{{\dagger}} {\boldsymbol {\eta }}|^2(\omega, {\boldsymbol {k}}; {\boldsymbol {p}})} = \smash {|{\boldsymbol {\alpha }}_s^{{\dagger}} {\boldsymbol {\eta }}|^2(-\omega, -{\boldsymbol {k}}; {\boldsymbol {p}})}$, so

(7.80a)

\begin{equation} |{\boldsymbol{\alpha}}_s^{{\dagger}}{\boldsymbol{\eta}}|^2 \equiv |{\boldsymbol{\alpha}}_s^{{\dagger}}{\boldsymbol{\eta}}|^2({\boldsymbol{k}}; {\boldsymbol{p}}) = |{\boldsymbol{\alpha}}_s^{{\dagger}}{\boldsymbol{\eta}}|^2(-{\boldsymbol{k}}; {\boldsymbol{p}}), \end{equation}

and similarly,

(7.80b)

\begin{equation} |{\boldsymbol{\eta}}^{{\dagger}} {\boldsymbol{\wp}}_s {\boldsymbol{\eta}}|^2 \equiv |{\boldsymbol{\eta}}^{{\dagger}} {\boldsymbol{\wp}}_s {\boldsymbol{\eta}}|^2({\boldsymbol{k}}; {\boldsymbol{p}}) = |{\boldsymbol{\eta}}^{{\dagger}} {\boldsymbol{\wp}}_s {\boldsymbol{\eta}}|^2(-{\boldsymbol{k}}; {\boldsymbol{p}}). \end{equation}

This also means that $\smash {\overline {{\mathsf {W}}}_s(\omega, {\boldsymbol {k}}; {\boldsymbol {p}})} = \smash {\overline {{\mathsf {W}}}_s(-\omega, -{\boldsymbol {k}}; {\boldsymbol {p}})}$. Then finally, using (7.74), one can express this function through the phase-space action density:

(7.81)

$$\begin{gather} \overline{{\mathsf{W}}}_s(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}) = |{\boldsymbol{\alpha}}_s^{{\dagger}}{\boldsymbol{\eta}}|^2 (\varsigma_{{\boldsymbol{k}}} J({\boldsymbol{k}}) + \varsigma_{-{\boldsymbol{k}}}J(-{\boldsymbol{k}})) \,\delta(\varLambda(\omega, {\boldsymbol{k}})), \end{gather}$$

(7.82)

$$\begin{gather}\varsigma_{{\boldsymbol{k}}} \doteq \operatorname{sgn} \partial_{\omega}\varLambda({\boldsymbol{k}}) = \operatorname{sgn}(J({\boldsymbol{k}})/h({\boldsymbol{k}})) = \operatorname{sgn} J({\boldsymbol{k}}). \end{gather}$$

7.5. Conservation laws

Let us rewrite (7.77) together with (6.76) in the ‘divergence’ form

(7.83)

\begin{align} \frac{\partial J}{\partial t} + \frac{\partial (v_{\text{g}}^i J)}{\partial x^i} - \frac{\partial}{\partial k_i}\left(\frac{\partial w}{\partial x^i}\,J\right) & = 2\gamma J, \end{align}

(7.84)

\begin{align} \frac{\partial F_s}{\partial t} + \frac{\partial (v_s^i\,F_s)}{\partial x^i} - \frac{\partial}{\partial p_i} \left(\frac{\partial \mathcal{H}_s}{\partial x^i} \,F_s\right) & = \frac{\partial}{\partial p_i} \left({\mathsf{D}}_{s,ij}\,\frac{\partial F_s}{\partial p_j}\right) + \mathcal{C}_s. \end{align}

Using (7.62), the diffusion matrix $\smash {{\mathsf {D}}_{s,ij}}$ can be represented as follows:

(7.85)

\begin{equation} {\mathsf{D}}_{s,ij} = 2 {\rm \pi}\int \mathrm{d}{\boldsymbol{k}}\,k_i k_j\, |{\boldsymbol{\alpha}}_s^{{\dagger}} {\boldsymbol{\eta}}|^2\, \frac{J({\boldsymbol{k}})}{\partial_\omega \varLambda({\boldsymbol{k}})}\,\delta({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s - w({\boldsymbol{k}})). \end{equation}

Also, by substituting (6.24) into (7.47), one finds

(7.86)

\begin{equation} \gamma = {\rm \pi}\sum_s \int\mathrm{d}{\boldsymbol{p}}\,\frac{|{\boldsymbol{\alpha}}_s^{{\dagger}} {\boldsymbol{\eta}}|^2}{\partial_{\omega}\varLambda({\boldsymbol{k}})}\, \delta(w({\boldsymbol{k}}) - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s({\boldsymbol{p}}))\,{\boldsymbol{k}} \cdot \frac{\partial F_s({\boldsymbol{p}})}{\partial {\boldsymbol{p}}}. \end{equation}

Together with (7.75), these yield the following notable corollaries. First of all, if $\smash {{\boldsymbol {\varXi }}_0}$, $\smash {|{\boldsymbol {\alpha }}_s^{{\dagger}} {\boldsymbol {\eta }}|^2}$ and $\smash {{\boldsymbol {\eta }}^{{\dagger}} {\boldsymbol {\wp }}_s{\boldsymbol {\eta }}}$ are independent of $\smash {{\boldsymbol {x}}}$,Footnote ³⁶ one has for each $\smash {l}$ that (Appendix D.1)

(7.87)

\begin{align} \frac{\partial}{\partial t} \left( \sum_s \int\mathrm{d}{\boldsymbol{p}}\,p_l F_s + \int \mathrm{d}{\boldsymbol{k}}\,k_l J \right) &+ \frac{\partial}{\partial x^i} \left( \sum_s \int\mathrm{d}{\boldsymbol{p}}\, p_l v_s^i F_s + \int \mathrm{d}{\boldsymbol{k}}\,k_l v_{\text{g}}^i J \right)\nonumber\\ &+ \frac{\partial}{\partial x^l}\sum_s \int\mathrm{d}{\boldsymbol{p}}\, \varDelta_s F_s ={-}\sum_s\int\mathrm{d}{\boldsymbol{p}}\,\frac{\partial H_{0s}}{\partial x^l}\,F_s. \end{align}

This can be viewed as a momentum-conservation theorem, because at $\smash {\partial _l H_{0s} = 0}$, one has

(7.88)

\begin{equation} \sum_s \int\mathrm{d}{\boldsymbol{x}}\,\mathrm{d}{\boldsymbol{p}}\,p_l F_s + \int \mathrm{d}{\boldsymbol{x}}\,\mathrm{d}{\boldsymbol{k}}\,k_l J = \text{const}. \end{equation}

Also, the ponderomotive force on a plasma is readily found from (7.87) as the sum of the terms quadratic in the wave amplitude (after $\smash {F_s}$ has been expressed through $\smash {\overline {f}_s}$). Similarly, if $\smash {{\boldsymbol {\varXi }}_0}$, $\smash {|{\boldsymbol {\alpha }}_s^{{\dagger}} {\boldsymbol {\eta }}|^2}$ and $\smash {{\boldsymbol {\eta }}^{{\dagger}} {\boldsymbol {\wp }}_s{\boldsymbol {\eta }}}$ are independent of $\smash {t}$, one has (Appendix D.2)

(7.89)

\begin{align} \frac{\partial}{\partial t} \left( \sum_s \int\mathrm{d}{\boldsymbol{p}}\,H_{0s} F_s + \int \mathrm{d}{\boldsymbol{k}}\,w J \right)& + \frac{\partial}{\partial x^i} \left( \sum_s \int\mathrm{d}{\boldsymbol{p}}\, H_{0s} v_s^i F_s + \int \mathrm{d}{\boldsymbol{k}}\,w v_{\text{g}}^i J \right)\nonumber\\ & + \frac{\partial}{\partial x^i}\sum_s \int\mathrm{d}{\boldsymbol{p}}\, \varDelta_s v_s^i F_s = \sum_s\int\mathrm{d}{\boldsymbol{p}}\,\frac{\partial H_{0s}}{\partial t}\,F_s. \end{align}

This can be viewed as an energy-conservation theorem, because at $\smash {\partial _t H_{0s} = 0}$, one has

(7.90)

\begin{equation} \sum_s \int\mathrm{d}{\boldsymbol{x}}\,\mathrm{d}{\boldsymbol{p}}\,H_{0s} F_s + \int \mathrm{d}{\boldsymbol{x}}\,\mathrm{d}{\boldsymbol{k}}\,w J = \text{const}. \end{equation}

Related equations are also discussed in Dodin & Fisch (Reference Dodin and Fisch2012) and Dewar (Reference Dewar1977).

The individual terms in (7.87) and (7.89) can be interpreted as described in table 1. The results of § 7.2.3 are reproduced as a particular case for the eikonal-wave ansatz (7.78).Footnote ³⁷ In particular, note that electrostatic waves carry non-zero momentum density $\smash {\int \mathrm {d}{\boldsymbol {k}}\,{\boldsymbol {k}} J}$ just like any other waves, even though an electrostatic field of these waves carries no momentum. The momentum is stored in the particle motion in this case (§ 9.1.3), and it is pumped there via either temporal dependence (Liu & Dodin Reference Liu and Dodin2015, § II.2) or spatial dependence (Ochs & Fisch Reference Ochs and Fisch2021b, Reference Ochs and Fisch2022) of the wave amplitude. This shows that homogeneous-plasma models that ignore ponderomotive effects cannot adequately describe the energy–momentum transfer between waves and plasma even when resonant absorption per se occurs in a homogeneous-plasma region. The OC formalism presented here provides means to describe such processes rigorously, generally, and without cumbersome calculations.

Table 1. Interpretation of the individual terms in (7.87) and (7.89). The wave energy–momentum is understood as the canonical (‘Minkowski’) energy–momentum, which must not be confused with the kinetic (‘Abraham’) energy–momentum (Dewar Reference Dewar1977; Dodin & Fisch Reference Dodin and Fisch2012). Whether the terms with $\smash {\varDelta _s F_s}$ should be attributed to OCs or to the wave is a matter of convention, because $\smash {\varDelta _s F_s}$ scales linearly both with $\smash {F_s}$ and with $\smash {J}$. In contrast, the wave energy density is defined unambiguously as $\smash {\mathcal {E}_s \doteq \int \mathrm {d}{\boldsymbol {p}}\,H_{0s} F_s}$ and does not contain $\smash {\varDelta _s}$. This is because $\smash {\int \mathrm {d}{\boldsymbol {p}}\,\varDelta _s F_s}$ is a part of the wave energy density $\smash {\mathcal {E}_{\text {w}}}$ (Dodin & Fisch Reference Dodin and Fisch2010a). Similarly, $\smash {\int \mathrm {d}{\boldsymbol {p}}\,(\partial _{{\boldsymbol {v}}_s}\varDelta _s) F_s}$ is a part of the wave momentum density (Dodin & Fisch Reference Dodin and Fisch2012).

7.6. Summary of § 7

In summary, we have considered plasma interaction with general broadband single-mode on-shell waves (for examples, see § 9). Assuming a general response matrix $\smash {{\boldsymbol {\varXi }}}$, these waves have a dispersion function $\smash {\varLambda (t, {\boldsymbol {x}}, \omega, {\boldsymbol {k}})}$ and polarization ${\boldsymbol {\eta }}(t, {\boldsymbol {x}}, \omega, {\boldsymbol {k}})$ determined by

(7.91)

\begin{equation} {\boldsymbol{\varXi}}_\text{H}{\boldsymbol{\eta}} = \varLambda {\boldsymbol{\eta}}, \qquad \varLambda = {\boldsymbol{\eta}}^{{\dagger}}{\boldsymbol{\varXi}}_\text{H}{\boldsymbol{\eta}}, \end{equation}

where the normalization $\smash {{\boldsymbol {\eta }}^{{\dagger}} {\boldsymbol {\eta }} = 1}$ is assumed. Specifically for $\smash {{\boldsymbol {\varXi }}}$ given by (6.80), one has

(7.92)

\begin{align} \varLambda(t, {\boldsymbol{x}}, \omega, {\boldsymbol{k}}) = {\boldsymbol{\eta}}^{{\dagger}} {\boldsymbol{\varXi}}_0 {\boldsymbol{\eta}} & - \sum_s \int \mathrm{d}{\boldsymbol{p}}\, {\boldsymbol{\eta}}^{{\dagger}} {\boldsymbol{\wp}}_s({\boldsymbol{p}}){\boldsymbol{\eta}}\,F_s({\boldsymbol{p}}) \nonumber\\ &+ \sum_s {\unicode{x2A0F}} \mathrm{d}{\boldsymbol{p}}\, \frac{|{\boldsymbol{\alpha}}_s^{{{\dagger}}}{\boldsymbol{\eta}}|^2 ({\boldsymbol{p}})}{\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s({\boldsymbol{p}})}\, {\boldsymbol{k}} \cdot \frac{\partial F_s({\boldsymbol{p}})}{\partial {\boldsymbol{p}}}, \end{align}

where the arguments $\smash {(t, {\boldsymbol {x}}, \omega, {\boldsymbol {k}})}$ are omitted for brevity. (Some notation is summarized in § 6.9.) The wave frequency $\smash {\omega = w(t, {\boldsymbol {x}}, {\boldsymbol {k}})}$ satisfies

(7.93)

\begin{equation} \varLambda(t, {\boldsymbol{x}}, w(t, {\boldsymbol{x}}, {\boldsymbol{k}}), {\boldsymbol{k}}) = 0 \end{equation}

and $\smash {w(t, {\boldsymbol {x}}, -{\boldsymbol {k}}) = -w(t, {\boldsymbol {x}}, {\boldsymbol {k}})}$, where $w$ is a real function at real arguments. The wave local linear growth rate $\smash {\gamma }$, which is assumed to be small in this section, is

(7.94)

\begin{equation} \gamma (t, {\boldsymbol{x}}, {\boldsymbol{k}}) ={-}\left( \frac{{\boldsymbol{\eta}}^{{\dagger}} {\boldsymbol{\varXi}}_\text{A} {\boldsymbol{\eta}}}{\partial_{\omega}\varLambda} \right)_{(t, {\boldsymbol{x}}, w(t, {\boldsymbol{x}}, {\boldsymbol{k}}), {\boldsymbol{k}})}, \end{equation}

or explicitly,

\[ \gamma (t, {\boldsymbol{x}}, {\boldsymbol{k}}) = {\rm \pi}\sum_s \int\mathrm{d}{\boldsymbol{p}}\, \frac{|{\boldsymbol{\alpha}}_s^{{\dagger}} {\boldsymbol{\eta}}|^2}{\partial_{\omega}\varLambda(t, {\boldsymbol{x}}, w, {\boldsymbol{k}})}\, \delta(w - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s(t, {\boldsymbol{x}}, {\boldsymbol{p}}))\,{\boldsymbol{k}} \cdot \frac{\partial F_s(t, {\boldsymbol{x}}, {\boldsymbol{p}})}{\partial {\boldsymbol{p}}}, \]

where $w \equiv w(t, {\boldsymbol {x}}, {\boldsymbol {k}})$ and $\smash {|{\boldsymbol {\alpha }}_s^{{\dagger}} {\boldsymbol {\eta }}|^2 \equiv |{\boldsymbol {\alpha }}_s^{{\dagger}} {\boldsymbol {\eta }}|^2(t, {\boldsymbol {x}}, w, {\boldsymbol {k}}; {\boldsymbol {p}})}$. The nonlinear potentials (6.77) are expressed through the scalar function

(7.95)

\begin{equation} \overline{{\mathsf{W}}}_s(t, {\boldsymbol{x}}, \omega, {\boldsymbol{k}}; {\boldsymbol{p}}) = |{\boldsymbol{\alpha}}_s^{{\dagger}}{\boldsymbol{\eta}}|^2 (\varsigma_{{\boldsymbol{k}}} J(t, {\boldsymbol{x}}, {\boldsymbol{k}}) + \varsigma_{-{\boldsymbol{k}}}J(t, {\boldsymbol{x}}, -{\boldsymbol{k}})) \,\delta(\varLambda(t, {\boldsymbol{x}}, \omega, {\boldsymbol{k}})), \end{equation}

where $\smash {\varsigma _{{\boldsymbol {k}}} \doteq \operatorname {sgn}(\partial _{\omega }\varLambda (t, {\boldsymbol {x}}, \omega, {\boldsymbol {k}}))}$ is evaluated at $\omega = w(t, {\boldsymbol {x}}, {\boldsymbol {k}})$; see also (7.82). The function $J$ is the phase-space action density governed by the WKE

(7.96)

\begin{equation} \frac{\partial J}{\partial t} - \frac{\partial w}{\partial {\boldsymbol{x}}} \cdot \frac{\partial J}{\partial {\boldsymbol{k}}} + \frac{\partial w}{\partial {\boldsymbol{k}}} \cdot \frac{\partial J}{\partial {\boldsymbol{x}}} = 2 \gamma J, \end{equation}

where $\smash {\partial _{\boldsymbol {k}} w = {\boldsymbol {v}}_{\text {g}}}$ is the group velocity. Collisional dissipation is assumed small compared with collisionless dissipation, so it is neglected in (7.96) but can be reintroduced by an ad hoc modification of $\smash {\gamma }$ (§ 6.2). Unlike the field equation used in the standard QLT, (7.96) exactly conserves the action of non-resonant waves, i.e. those with $\smash {\gamma = 0}$. The WKE must be solved together with the QL equation for the OC distribution $F_s$,

(7.97)

because $F_s$ determines the coefficients in (7.96) and $\smash {J}$ determines the coefficients in (7.97). When $\smash {{\boldsymbol {\varXi }}_0}$ and $\smash {|{\boldsymbol {\alpha }}_s^{{\dagger}} {\boldsymbol {\eta }}|^2}$ are independent of $\smash {t}$ and $\smash {{\boldsymbol {x}}}$, (7.96) and (7.97) conserve the total momentum and energy of the system; specifically,

(7.98)

\begin{align} \textstyle \partial_t (\sum_s P_{s,i} + P_{\text{w},i}) + \partial_j (\sum_s {\varPi_{s,i}}^j + {\varPi_{\text{w},i}}^j) & ={-} \sum_s \int \mathrm{d}{\boldsymbol{p}}\,F_s \partial_i H_{0s}, \end{align}

(7.99)

\begin{align} \textstyle \partial_t (\sum_s \mathcal{E}_{s} + \mathcal{E}_{\text{w}}) + \partial_j (\sum_s Q^j_s + Q^j_{\text{w}}) & = \sum_s \int \mathrm{d}{\boldsymbol{p}}\,F_s \partial_t H_{0s}. \end{align}

Here, the notation is as in table 1, or see (7.87) and (7.89) instead.

8. Thermal equilibrium

In this section, we discuss, for completeness, the properties of plasmas in thermal equilibrium.

8.1. Boltzmann–Gibbs distribution

As discussed in § 6.8, collisions conserve the density of each species, the total momentum density and the total energy density, while the plasma total entropy density $\smash {\sigma }$ either grows or remains constant. Let us search for equilibrium states in particular. At least one of the states in which $\smash {\sigma }$ remains constant is the one that maximizes the entropy density at fixed $\smash {\int \mathrm {d}{\boldsymbol {p}}\,F_s}$, $\smash {\sum _s \int \mathrm {d}{\boldsymbol {p}}\,{\boldsymbol {p}}F_s}$ and $\smash {\sum _s\int \mathrm {d}{\boldsymbol {p}}\,\mathcal {H}_s F_s}$. This ‘state of thermal equilibrium’ can be found as an extremizer of

(8.1)

\begin{equation} \sigma' \doteq \sigma - \sum_s \lambda_s^{(\mathcal{N})}\int \mathrm{d}{\boldsymbol{p}}\,F_s - {\boldsymbol{\lambda}}^{({\boldsymbol{P}})} \cdot \sum_s \int \mathrm{d}{\boldsymbol{p}}\,{\boldsymbol{p}}F_s - \lambda^{(\mathcal{H})} \sum_s\int \mathrm{d}{\boldsymbol{p}}\,\mathcal{H}_s F_s \end{equation}

considered as a functional of all $\smash {F_s}$, where $\smash {\lambda _s^{(\mathcal {N})}}$, $\smash {{\boldsymbol {\lambda }}^{({\boldsymbol {P}})}}$ and $\smash {\lambda ^{(\mathcal {H})}}$ are Lagrange multipliers. Using (6.73), one finds that extremizers of $\smash {\sigma '}$ satisfy

(8.2)

\begin{equation} 0 = \frac{\delta \sigma'}{\delta F_s} ={-} \ln F_s - 1 - \lambda_s^{(\mathcal{N})} - {\boldsymbol{\lambda}}^{({\boldsymbol{P}})} \cdot {\boldsymbol{p}} - \lambda^{(\mathcal{H})} \mathcal{H}_s, \end{equation}

whence

(8.3)

\begin{equation} F_s = \text{const}_s \times \exp(- {\boldsymbol{\lambda}}^{({\boldsymbol{P}})} \cdot {\boldsymbol{p}} - \lambda^{(\mathcal{H})}\mathcal{H}_s). \end{equation}

The pre-exponential constant is determined by the given density of species $\smash {s}$, while $\smash {{\boldsymbol {\lambda }}^{({\boldsymbol {P}})}}$ and $\smash {\lambda ^{(\mathcal {H})}}$ can be expressed through the densities of the plasma momentum and energy stored in the whole distribution. Because

(8.4)

\begin{equation} \frac{\delta^2 \sigma'}{\delta F_s \delta F_s} ={-} \frac{1}{F_s} < 0, \qquad \frac{\delta^2 \sigma'}{\delta F_s \delta F_{s' \ne s}} = 0, \end{equation}

the matrix $\smash {\delta ^2 \sigma '/\delta F_s \delta F_{s'}}$ is positive-definite, so the entropy is maximal (as opposed to minimal) at the extremizer (8.3).

The distribution (8.3) is known as the Boltzmann–Gibbs distribution, with $\smash {T \doteq ~1/\lambda ^{(\mathcal {H})}}$ being the temperature (common for all species). Also, let us introduce a new, rescaled Lagrange multiplier $\smash {{\boldsymbol {\mathfrak {u}}}}$ via $\smash {{\boldsymbol {\lambda }}^{({\boldsymbol {P}})} = -{\boldsymbol {\mathfrak {u}}}/T}$. Then,

(8.5)

\begin{equation} F_s({\boldsymbol{p}}) = F_s^{(0)}\exp\left(-\frac{\mathcal{H}_s({\boldsymbol{p}}) - {\boldsymbol{\mathfrak{u}}} \cdot {\boldsymbol{p}}}{T}\right), \end{equation}

where $\smash {F_s^{(0)}}$ is independent of $\smash {{\boldsymbol {p}}}$. Correspondingly,

(8.6)

\begin{equation} \frac{\partial F_s({\boldsymbol{p}})}{\partial {\boldsymbol{p}}} ={-} F_s({\boldsymbol{p}})\,\frac{{\boldsymbol{v}}_s - {\boldsymbol{\mathfrak{u}}}}{T}, \end{equation}

where we used (5.44). From (8.6), one obtains

(8.7)

\begin{align} &\delta({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s - {\boldsymbol{k}} \cdot {\boldsymbol{v}}'_{s'})\, {\boldsymbol{k}}\cdot \left(\frac{\partial F_s({\boldsymbol{p}})}{\partial {\boldsymbol{p}}}\,F_{s'}({\boldsymbol{p}}') - F_s({\boldsymbol{p}})\,\frac{\partial F_{s'}({\boldsymbol{p}}')}{\partial {\boldsymbol{p}}'}\right)\nonumber\\ &\quad ={-} \frac{1}{T}\,\delta({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s - {\boldsymbol{k}} \cdot {\boldsymbol{v}}'_{s'})\, ({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s - {\boldsymbol{k}} \cdot {\boldsymbol{v}}'_{s'})\, F_s({\boldsymbol{p}})F_{s'}({\boldsymbol{p}}')\nonumber\\ &\quad = 0, \end{align}

where $\smash {\mathcal {H}'_{s'} \doteq \mathcal {H}_{s'}({\boldsymbol {p}}')}$. Then, (6.61) yields that the collision operator vanishes on the Boltzmann–Gibbs distribution, and thus, expectedly, $\smash {(\mathrm {d}\sigma /\mathrm {d} t)_{\text {coll}} = 0}$. One can also show that the Boltzmann–Gibbs distribution is the only distribution (strictly speaking, a class of distributions parameterized by $\smash {T}$ and $\smash {{\boldsymbol {\mathfrak {u}}}}$) for which the entropy density is conserved (Appendix E).

The property (8.6) of the thermal-equilibrium state also leads to other notable results that we derive below. In doing so, we will assume the reference frame where $\smash {{\boldsymbol {\mathfrak {u}}} = 0}$, so the Boltzmann–Gibbs distribution has a better known form

(8.8)

\begin{equation} F_s({\boldsymbol{p}}) = F_s^{(0)}\exp\left(-\frac{\mathcal{H}_s({\boldsymbol{p}})}{T}\right), \qquad \frac{\partial F_s({\boldsymbol{p}})}{\partial {\boldsymbol{p}}} ={-} F_s({\boldsymbol{p}})\,\frac{{\boldsymbol{v}}_s}{T}. \end{equation}

(For $\smash {\mathcal {H}_s}$ isotropic in $\smash {{\boldsymbol {p}}}$, this is the frame where the plasma total momentum density $\smash {\sum _s \int \mathrm {d}{\boldsymbol {p}}\,{\boldsymbol {p}}F_s}$ is zero.) The generalizations to arbitrary $\smash {{\boldsymbol {\mathfrak {u}}}}$ are straightforward.

8.2. Fluctuation–dissipation theorem

Let us describe microscopic fluctuations in equilibrium plasmas in terms of $\smash {{{\boldsymbol {\mathsf {S}}}}(\omega, {\boldsymbol {k}}) \doteq (2{\rm \pi} )^{{{\mathsf {n}}}}\,{\boldsymbol {\mathfrak {W}}}(\omega, {\boldsymbol {k}})}$, i.e.

(8.9)

\begin{equation} {{\boldsymbol {\mathsf{S}}}}(\omega, {\boldsymbol{k}}) \doteq \int \mathrm{d}\tau \mathrm{d}{\boldsymbol{s}}\, \overline{\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle\thicksim}$}}{\widetilde{{\boldsymbol{\varPsi}}}}(t + \tau/2, {\boldsymbol{x}} + {\boldsymbol{s}}/2) \underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle\thicksim}$}}{\widetilde{{\boldsymbol{\varPsi}}}}^{{\dagger}}(t - \tau/2, {\boldsymbol{x}} - {\boldsymbol{s}}/2)}\,\mathrm{e}^{\mathrm{i} \omega \tau - \mathrm{i} {\boldsymbol{k}} \cdot {\boldsymbol{s}}}, \end{equation}

which can also be represented in terms of the Fourier image $\smash {\mathring {{\boldsymbol {\varPsi }}}(\omega, {\boldsymbol {k}})}$ of the microscopic field $\smash {\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle\thicksim}$}} {\widetilde {{\boldsymbol {\varPsi }}}}(t, {\boldsymbol {x}})}$:

(8.10)

\begin{equation} {{\boldsymbol {\mathsf{S}}}}(\omega, {\boldsymbol{k}}) = \frac{\overline{ \mathring{{\boldsymbol{\varPsi}}}(\omega, {\boldsymbol{k}}) \smash{\mathring{{\boldsymbol{\varPsi}}}}^{{\dagger}}(\omega, {\boldsymbol{k}}) }}{\mathscr{T}\mathscr{V}_n}. \end{equation}

For statistically homogeneous fields that persist on time $\smash {\mathscr {T} \to \infty }$ within volume $\smash {\mathscr {V}_n \to \infty }$, the Fourier transform is formally divergent; hence the appearance of the factors $\smash {\mathscr {T}}$ and $\smash {\mathscr {V}_n}$ in (8.10).Footnote ³⁸ Also, as seen from (6.32), any quadratic function of the microscopic field can be expressed through $\smash {{{\boldsymbol {\mathsf {S}}}}}$ via

(8.11)

\begin{equation} \overline{(\widehat{\boldsymbol{{\boldsymbol {\mathsf{L}}}}}{\boldsymbol{\psi}})(\widehat{\boldsymbol{{\boldsymbol {\mathsf{R}}}}}{\boldsymbol{\psi}})^{{\dagger}}} \approx \int \frac{\mathrm{d}\omega}{2{\rm \pi}}\,\frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\, ({{\boldsymbol {\mathsf{L}}}}{{\boldsymbol {\mathsf{S}}}}{{\boldsymbol {\mathsf{R}}}}^{{\dagger}})(\omega, {\boldsymbol{k}}), \end{equation}

From (6.31), one finds that, in general,

(8.12)

\begin{equation} {{\boldsymbol {\mathsf{S}}}}(\omega, {\boldsymbol{k}}) = 2{\rm \pi}\, \sum_{s'}\int \mathrm{d}{\boldsymbol{p}}'\,\delta(\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}'_{s'})F_{s'}({\boldsymbol{p}}') {\boldsymbol{\varXi}}^{{-}1}(\omega, {\boldsymbol{k}}) ({\boldsymbol{\alpha}}_{s'}{\boldsymbol{\alpha}}_{s'}^{{\dagger}})(\omega, {\boldsymbol{k}};{\boldsymbol{p}}') {\boldsymbol{\varXi}}^{-{{\dagger}}}(\omega, {\boldsymbol{k}}). \end{equation}

For a thermal distribution in particular, which satisfies (8.8), one can rewrite (6.24) as follows:

(8.13)

\begin{align} {\boldsymbol{\varXi}}_\text{A}(\omega, {\boldsymbol{k}}) & \approx \frac{\rm \pi}{T} \sum_s \int \mathrm{d}{\boldsymbol{p}}\, {\boldsymbol{\alpha}}_s(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}){\boldsymbol{\alpha}}_s^{{\dagger}}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}) \delta(\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s)\,({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s) F_s({\boldsymbol{p}}) \nonumber\\ & = \frac{{\rm \pi} \omega}{T} \sum_s \int \mathrm{d}{\boldsymbol{p}}\, {\boldsymbol{\alpha}}_s(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}){\boldsymbol{\alpha}}_s^{{\dagger}}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}) \delta(\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s) F_s({\boldsymbol{p}}). \end{align}

By comparing this with (8.12), one also finds that

(8.14)

\begin{equation} {{\boldsymbol {\mathsf{S}}}}(\omega, {\boldsymbol{k}}) = \frac{2T}{\omega}\,({\boldsymbol{\varXi}}^{{-}1}{\boldsymbol{\varXi}}_\text{A}{\boldsymbol{\varXi}}^{-{{\dagger}}})(\omega, {\boldsymbol{k}}). \end{equation}

Due to (6.26), this leads to the fluctuation–dissipation theorem in the following form:

(8.15)

\begin{equation} {{\boldsymbol {\mathsf{S}}}}(\omega, {\boldsymbol{k}}) ={-}\frac{2T}{\omega}\,({\boldsymbol{\varXi}}^{{-}1})_\text{A}(\omega, {\boldsymbol{k}}). \end{equation}

For examples of $\smash {{\boldsymbol {\varXi }}}$ for specific systems, see § 9.

8.3. Kirchhoff's law

Consider the power deposition via polarization drag

(8.16)

\begin{equation} \mathfrak{P} = \sum_s \int \mathrm{d}{\boldsymbol{p}}\,({\boldsymbol{v}}_s \cdot {\boldsymbol{\mathfrak{F}}}_s) F_s({\boldsymbol{p}}). \end{equation}

Using (6.58a) for $\smash {{\boldsymbol {\mathfrak {F}}}_s}$, (8.13) for $\smash {{\boldsymbol {\varXi }}_\text {A}}$ and (8.15) for $\smash {{{\boldsymbol {\mathsf {S}}}}}$, this can also be expressed as follows:

(8.17)

\begin{align} \mathfrak{P} & \approx \sum_s \int \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\,\mathrm{d}{\boldsymbol{p}}\,({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s)\, ({\boldsymbol{\alpha}}_s^{{\dagger}} ({\boldsymbol{\varXi}}^{{-}1})_\text{A} {\boldsymbol{\alpha}}_s)({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s, {\boldsymbol{k}}; {\boldsymbol{p}}) F_s({\boldsymbol{p}}) \nonumber\\ & = \sum_s\int \frac{\mathrm{d}\omega}{2{\rm \pi}} \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\,\mathrm{d}{\boldsymbol{p}}\, 2{\rm \pi}\omega\,\delta(\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s) ({\boldsymbol{\alpha}}_s^{{\dagger}} ({\boldsymbol{\varXi}}^{{-}1})_\text{A} {\boldsymbol{\alpha}}_s)(\omega, {\boldsymbol{k}}; {\boldsymbol{p}})F_s({\boldsymbol{p}}) \nonumber\\ & = 2T \int \frac{\mathrm{d}\omega}{2{\rm \pi}} \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n} \operatorname{tr}\bigg(({\boldsymbol{\varXi}}^{{-}1})_\text{A} \frac{{\rm \pi} \omega}{T}\, \sum_s \int \mathrm{d}{\boldsymbol{p}}\,\delta(\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s) ({\boldsymbol{\alpha}}_s{\boldsymbol{\alpha}}_s^{{\dagger}})(\omega, {\boldsymbol{k}}; {\boldsymbol{p}})F_s({\boldsymbol{p}})\bigg) \nonumber\\ & ={-}\int \frac{\mathrm{d}\omega}{2{\rm \pi}}\,\frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\,\omega \operatorname{tr}({{\boldsymbol {\mathsf{S}}}}{\boldsymbol{\varXi}}_\text{A})(\omega, {\boldsymbol{k}}). \end{align}

Thus, the spectral density of the power deposition via polarization drag is given by

(8.18)

\begin{equation} \mathfrak{P}_{\omega, {\boldsymbol{k}}} ={-}\omega \operatorname{tr}({{\boldsymbol {\mathsf{S}}}}{\boldsymbol{\varXi}}_\text{A}), \end{equation}

which is a restatement of Kirchhoff's law (Krall & Trivelpiece Reference Krall and Trivelpiece1973, § 11.4). For examples of $\smash {{\boldsymbol {\varXi }}}$ for specific systems, see § 9.

8.4. Equipartition theorem

As flows from § 7.5, the energy of on-shell waves of a field $\smash {{\boldsymbol {\widetilde {\varPsi }}}}$ in a homogeneous $\smash {n}$-dimensional plasma of a given volume $\smash {\mathscr {V}_n}$ can be written as

(8.19)

\begin{align} \mathscr{V}_n\mathcal{E}_{\text{w}} & = \int \mathrm{d}{\boldsymbol{k}}\,\mathscr{V}_n w({\boldsymbol{k}})J({\boldsymbol{k}}) \nonumber\\ & = (2{\rm \pi})^n\int \frac{\mathscr{V}_n \mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\int_0^\infty \mathrm{d}\omega\, \omega\,\partial_\omega\varLambda(\omega, {\boldsymbol{k}})\,h({\boldsymbol{k}})\,\delta(\omega - w({\boldsymbol{k}})) \nonumber\\ & = (2{\rm \pi})^n \sum_{{\boldsymbol{k}}} \int^{\infty}_0 \mathrm{d}\omega\,\omega\, \partial_\omega\varLambda(\omega, {\boldsymbol{k}})\, ({\boldsymbol{\eta}}^{{\dagger}}{{\boldsymbol {\mathsf{U}}}}{\boldsymbol{\eta}})(\omega, {\boldsymbol{k}}). \end{align}

To apply this to microscopic fluctuations, one can replace $\smash {{{\boldsymbol {\mathsf {U}}}}}$ with $\smash {{\boldsymbol {\mathfrak {W}}}}$ and substitute $\smash {{\boldsymbol {\mathfrak {W}}} = (2{\rm \pi} )^{-(n+1)}{{\boldsymbol {\mathsf {S}}}}}$. Then, the total energy of a mode with given wavevector $\smash {{\boldsymbol {k}}}$ and polarization $\smash {{\boldsymbol {\eta }}}$ can be expressed as

(8.20)

\begin{equation} \mathcal{E}_{{\boldsymbol{k}}, {\boldsymbol{\eta}}} = \frac{1}{2{\rm \pi}} \int^{\infty}_0 \mathrm{d}\omega\, \omega\,(\partial_\omega\varLambda)\,{\boldsymbol{\eta}}^{{\dagger}} {{\boldsymbol {\mathsf{S}}}} {\boldsymbol{\eta}}, \end{equation}

where the arguments $\smash {(\omega, {\boldsymbol {k}})}$ are omitted for brevity. For thermal equilibrium, one can substitute (8.15) for $\smash {{{\boldsymbol {\mathsf {S}}}}}$; then,

(8.21)

\begin{equation} \mathcal{E}_{{\boldsymbol{k}}, {\boldsymbol{\eta}}} ={-}\frac{T}{\rm \pi}\, \operatorname{im} \int^{\infty}_0 \mathrm{d}\omega\,(\partial_\omega\varLambda)\, {\boldsymbol{\eta}}^{{\dagger}} {\boldsymbol{\varXi}}^{{-}1} {\boldsymbol{\eta}}. \end{equation}

The integrand peaks at $\smash {\omega = w({\boldsymbol {k}})}$, where the mode eigenvalue $\smash {\varLambda }$ is small. Due to damping, the actual zero of $\smash {\varLambda }$ is slightly below the real axis in the complex-frequency space. Then, at infinitesimally small damping, $\smash {{\boldsymbol {\eta }}^{{\dagger}} {\boldsymbol {\varXi }}^{-1} {\boldsymbol {\eta }}}$ can be approximated near $\smash {\omega = w({\boldsymbol {k}})}$ as

(8.22)

\begin{equation} {\boldsymbol{\eta}}^{{\dagger}} {\boldsymbol{\varXi}}^{{-}1} {\boldsymbol{\eta}} \approx \frac{1}{\varLambda} \approx \frac{1}{\partial_\omega\varLambda(\omega, {\boldsymbol{k}})} \left(\operatorname{pv}\frac{1}{\omega - w({\boldsymbol{k}})} -\mathrm{i} {\rm \pi}\delta(\omega - w({\boldsymbol{k}})) \right). \end{equation}

This leads to the well-known equipartition theorem

(8.23)

\begin{equation} \mathcal{E}_{{\boldsymbol{k}}} = T. \end{equation}

Note that, according to (8.23), the sum $\smash {\mathscr {V}_n\mathcal {E}_{\text {w}} = \sum _{{\boldsymbol {k}}, {\boldsymbol {\eta }}}\mathcal {E}_{{\boldsymbol {k}}, {\boldsymbol {\eta }}}}$ is divergent. This indicates that not all modes can be classical and on-shell (weakly damped) simultaneously.

8.5. Summary of § 8

In thermal equilibrium, when all species have Boltzmann–Gibbs distributions with common temperature $\smash {T}$, the collision operator vanishes, the entropy is conserved and the spectrum of microscopic fluctuations (8.9) satisfies the fluctuation–dissipation theorem

(8.24)

\begin{equation} {{\boldsymbol {\mathsf{S}}}}(\omega, {\boldsymbol{k}}) ={-}\frac{2T}{\omega}\,({\boldsymbol{\varXi}}^{{-}1})_\text{A}(\omega, {\boldsymbol{k}}), \end{equation}

where $\smash {{\boldsymbol {\varXi }}}$ is the dispersion matrix (6.80) and $\smash {_\text {A}}$ denotes the anti-Hermitian part (or the imaginary part in case of scalar fields). From here, it is shown that the spectral density of the power deposition via polarization drag is given by $\smash {\mathfrak {P}_{\omega, {\boldsymbol {k}}} = - \omega \operatorname {tr}({{\boldsymbol {\mathsf {S}}}}{\boldsymbol {\varXi }}_\text {A})}$, which is a restatement of Kirchhoff's law. For on-shell waves, (8.24) reduces to the equipartition theorem, which says that the energy per mode equals $\smash {T}$. Applications to specific systems are discussed in § 9.

9. Examples

In this section, we show how to apply our general formulation to non-relativistic electrostatic interactions (§ 9.1), relativistic electromagnetic interactions (§ 9.2), Newtonian gravity (§ 9.3) and relativistic gravity, including gravitational waves (§ 9.4).

9.1. Non-relativistic electrostatic interactions

9.1.1. Main equations

Let us show how our general formulation reproduces (and generalizes) the well-known results for electrostatic turbulence in non-magnetized non-relativistic plasma. In this case,

(9.1)

\begin{equation} H_s = \frac{p^2}{2m_s} + e_s \overline{\varphi} + e_s\widetilde{\varphi}, \end{equation}

where $e_s$ is the electric charge, $\varphi$ is the electrostatic potential and $\smash {\overline {\varphi }}$ and $\smash {\widetilde {\varphi }}$ are its average and oscillating parts, respectively. Then, $\smash {H_{0s} = \overline {H}_s = p^2/(2m_s) + e_s \overline {\varphi }}$, $\smash {\widetilde {H}_s = e_s \widetilde {\varphi }}$, $\smash {\widehat {\boldsymbol {\alpha }}_s = e_s}$ and $\smash {\widehat {\boldsymbol {L}}_s = \widehat {\boldsymbol {R}}_s = \widehat {\boldsymbol {0}}}$, so $\smash {{\boldsymbol {\wp }}_s = {\boldsymbol {0}}}$. The matrix (6.78) is a scalar (Wigner function) given by

(9.2)

\begin{equation} {\mathsf{U}}(t, {\boldsymbol{x}}, \omega, {\boldsymbol{k}}) = \int \frac{\mathrm{d}\tau}{2{\rm \pi}}\,\frac{\mathrm{d}{\boldsymbol{s}}}{(2{\rm \pi})^{n}}\,\, \overline{ {\underline{\widetilde{\varphi}}}(t + \tau/2, {\boldsymbol{x}} + {\boldsymbol{s}}/2)\, {\underline{\widetilde{\varphi}}}(t - \tau/2, {\boldsymbol{x}} - {\boldsymbol{s}}/2) } \,\mathrm{e}^{\mathrm{i}\omega \tau - \mathrm{i} {\boldsymbol{k}} \cdot {\boldsymbol{s}}}. \end{equation}

(Underlining denotes the macroscopic part, $\smash {n \doteq \dim {\boldsymbol {x}}}$, and the arguments $\smash {(t, {\boldsymbol {x}})}$ will be omitted from now on.) Correspondingly,

(9.3)

\begin{align} {{\boldsymbol {\mathsf{D}}}}_s & \approx e_s^2 \int \mathrm{d}{\boldsymbol{k}}\, {\rm \pi}\,{\boldsymbol{k}}{\boldsymbol{k}} {\mathsf{U}}({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s, {\boldsymbol{k}}), \end{align}

(9.4)

\begin{align} {\boldsymbol{\Theta}}_s & = e_s^2\,\frac{\partial}{\partial \vartheta} {\unicode{x2A0F}} \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\, \left. \frac{{\boldsymbol{k}}{\boldsymbol{k}} {\mathsf{U}}(\omega, {\boldsymbol{k}})}{\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s + \vartheta} \right|_{\vartheta=0}, \end{align}

(9.5)

\begin{align} \varDelta_s = \varPhi_s & = \frac{e_s^2}{2}\frac{\partial}{\partial {\boldsymbol{p}}} \cdot {\unicode{x2A0F}} \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\,\frac{{\boldsymbol{k}} {\mathsf{U}}(\omega, {\boldsymbol{k}})}{\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s}, \end{align}

and also

(9.6)

\begin{equation} \mathcal{H}_s = \frac{p^2}{2m_s} + e_s\overline{\varphi} + \varDelta_s, \qquad {\boldsymbol{v}}_s = \frac{{\boldsymbol{p}}}{m_s} + \frac{\partial \varDelta_s}{\partial {\boldsymbol{p}}}. \end{equation}

The Lagrangian density of a free electrostatic field is

(9.7)

\begin{equation} \mathfrak{L}_0 = \frac{1}{8{\rm \pi}}\,\delta^{ij}(\partial_i\widetilde{\varphi})(\partial_j\widetilde{\varphi}) = \frac{\partial}{\partial x^i}\left(\frac{1}{8{\rm \pi}}\,\delta^{ij} \widetilde{\varphi}(\partial_j\widetilde{\varphi})\right) + \frac{1}{2}\,\widetilde{\varphi}\left(-\frac{\delta^{ij}\partial_i \partial_j}{4{\rm \pi}}\right)\,\widetilde{\varphi}. \end{equation}

The first term on the right-hand side does not contribute to the field action $\smash {S_0}$ and thus can be ignored. The second term is of the form (6.1) with $\smash {M = 1}$, $\smash {{\boldsymbol {\mathsf {g}}}} = 1$ (§ 2.1.2) and $\smash {\widehat {\boldsymbol {\varXi }}_0 = \widehat {k}^2/(4{\rm \pi} )}$, so $\smash {{\boldsymbol {\varXi }}_0 (\omega, {\boldsymbol {k}}) = k^2/(4{\rm \pi} )}$, where $\smash {k^2 \equiv {\boldsymbol {k}}^2 \equiv \delta ^{ij}k_i k_j}$. Then, (6.20) gives

(9.8)

\begin{equation} {\boldsymbol{\varXi}}(\omega, {\boldsymbol{k}}) = \varXi(\omega, {\boldsymbol{k}}) = \frac{k^2\epsilon_\parallel (\omega, {\boldsymbol{k}})}{4{\rm \pi}}, \end{equation}

where the arguments $\smash {t}$ and $\smash {{\boldsymbol {x}}}$ are omitted for brevity and $\smash {\epsilon _\parallel }$ is the parallel permittivity,

(9.9)

\begin{equation} \epsilon_\parallel(\omega, {\boldsymbol{k}}) = 1 + \sum_s \frac{4{\rm \pi} e_s^2}{k^2} \int \mathrm{d}{\boldsymbol{p}}\,\frac{{\boldsymbol{k}}}{\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s + \mathrm{i} 0} \cdot \frac{\partial F_s}{\partial {\boldsymbol{p}}}. \end{equation}

9.1.2. Collisions and fluctuations

By (8.12), the spectrum of microscopic oscillations of $\smash {\widetilde {\varphi }}$ is a scalar given by

(9.10)

\begin{equation} {\mathsf{S}}(\omega, {\boldsymbol{k}}) = 2{\rm \pi}\sum_{s} \left(\frac{4{\rm \pi} e_s}{k^2 |\epsilon_\parallel (\omega, {\boldsymbol{k}})|}\right)^2 \int \mathrm{d}{\boldsymbol{p}}\, \delta(\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_{s})F_{s}({\boldsymbol{p}}), \end{equation}

where we substituted $n = 3$ for three-dimensional plasma. For thermal equilibrium, (8.15) leads to the well-known formula (Lifshitz & Pitaevskii Reference Lifshitz and Pitaevskii1981, § 51)

(9.11)

\begin{equation} {\mathsf{S}}(\omega, {\boldsymbol{k}}) ={-} \frac{2 T}{\omega}\operatorname{im}\left(\frac{1}{\varXi(\omega, {\boldsymbol{k}})}\right) ={-}\frac{8{\rm \pi} T}{\omega k^2}\operatorname{im}\left(\frac{1}{\epsilon_\parallel(\omega, {\boldsymbol{k}})}\right) = \frac{8{\rm \pi} T}{\omega k^2} \frac{\operatorname{im} \epsilon_\parallel(\omega, {\boldsymbol{k}})}{|\epsilon_\parallel(\omega, {\boldsymbol{k}})|^2}. \end{equation}

The spectrum $\smash {{\mathsf {S}}_\rho }$ of charge-density fluctuations is found using Poisson's equation $\smash {\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle\thicksim}$}} {\widetilde {\rho }} = \widehat {k}^2\underset{\raise0.3em\hbox{$\smash{\scriptscriptstyle\thicksim}$}} {\widetilde {\varphi }}/4{\rm \pi} }$, whence $\smash {{\mathsf {S}}_\rho \approx (k^2/4{\rm \pi} )^2{\mathsf {S}}}$. Fluctuations of other fields are found similarly. Also, (6.36) leads to

(9.12)

\begin{equation} |\mathcal{X}_{ss'}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}')|^2 = \left(\frac{4{\rm \pi} e_s e_{s'}}{k^2 |\epsilon_\parallel(\omega, {\boldsymbol{k}})|}\right)^2. \end{equation}

Then, (6.61) yields the standard Balescu–Lenard collision operator

(9.13)

\begin{align} \mathcal{C}_s = \frac{\partial}{\partial {\boldsymbol{p}}} \cdot \sum_{s'}\int \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^3}\,\mathrm{d}{\boldsymbol{p}}'\, &\frac{{\rm \pi} {\boldsymbol{k}}{\boldsymbol{k}}}{|\epsilon_\parallel({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s, {\boldsymbol{k}})|^2}\, \left(\frac{4{\rm \pi} e_s e_{s'}}{k^2}\right)^2 \delta({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s - {\boldsymbol{k}} \cdot {\boldsymbol{v}}'_{s'}) \nonumber\\ & \cdot \left( \frac{\partial F_s({\boldsymbol{p}})}{\partial {\boldsymbol{p}}}\,F_{s'}({\boldsymbol{p}}') - F_s({\boldsymbol{p}})\,\frac{\partial F_{s'}({\boldsymbol{p}}')}{\partial {\boldsymbol{p}}'} \right). \end{align}

(As a reminder, the distribution functions are normalized such that $\int \mathrm {d}{\boldsymbol {p}}\,F_s$ is the local average density of species $\smash {s}$ (5.39).)

9.1.3. On-shell waves

For on-shell waves, (7.63) gives $\smash {{\mathsf {U}}(\omega, {\boldsymbol {k}}) = (h({\boldsymbol {k}}) + h(-{\boldsymbol {k}}))\delta (\omega - w({\boldsymbol {k}}))}$, where $\smash {w({\boldsymbol {k}})}$ is determined by the dispersion relation

(9.14)

\begin{equation} \epsilon_{{\parallel}\text{H}}(w({\boldsymbol{k}}), {\boldsymbol{k}}) = 0, \end{equation}

and $\smash {\epsilon _{\parallel \text {H}} \equiv \operatorname {re} \epsilon _\parallel }$ is given by

(9.15)

\begin{equation} \epsilon_{{\parallel}\text{H}}(\omega, {\boldsymbol{k}}) = 1 + \sum_s \frac{4{\rm \pi} e_s^2}{k^2} {\unicode{x2A0F}} \mathrm{d}{\boldsymbol{p}}\,\frac{{\boldsymbol{k}}}{\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s} \cdot \frac{\partial F_s}{\partial {\boldsymbol{p}}}. \end{equation}

The phase-space density of the wave action, defined in (7.74), is

(9.16)

\begin{equation} J({\boldsymbol{k}}) = h({\boldsymbol{k}})\,\frac{\partial\varXi_\text{H}({\boldsymbol{k}})}{\partial \omega} = h({\boldsymbol{k}}) \,\frac{k^2}{4{\rm \pi}}\frac{\partial\epsilon_{{\parallel}\text{H}}(w({\boldsymbol{k}}), {\boldsymbol{k}})}{\partial \omega}, \end{equation}

and the dressing function (9.4) is given by

(9.17)

\begin{align} {\boldsymbol{\Theta}}_{s} & = e_s^2\,\frac{\partial}{\partial \vartheta} {\unicode{x2A0F}} \mathrm{d}{\boldsymbol{k}}\,(h({\boldsymbol{k}}) + h(-{\boldsymbol{k}})) \left. \frac{{\boldsymbol{k}} {\boldsymbol{k}}}{w({\boldsymbol{k}}) - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s + \vartheta} \right|_{\vartheta=0}\nonumber\\ & = 2e_s^2\,\frac{\partial}{\partial \vartheta} {\unicode{x2A0F}} \mathrm{d}{\boldsymbol{k}}\,h({\boldsymbol{k}}) \left. \frac{{\boldsymbol{k}} {\boldsymbol{k}}}{w({\boldsymbol{k}}) - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s + \vartheta} \right|_{\vartheta=0}. \end{align}

Using these, one obtains (Appendix F.1.1)

(9.18)

\begin{equation} \sum_s \int \mathrm{d}{\boldsymbol{p}}\,{\boldsymbol{p}} F_s + \int \mathrm{d}{\boldsymbol{k}}\,{\boldsymbol{k}} J = \sum_s \int \mathrm{d}{\boldsymbol{p}}\,{\boldsymbol{p}} \overline{f}_s, \end{equation}

so the conserved quantity (7.88) is the average momentum of the plasma (while the electrostatic field carries no momentum, naturally). Also (Appendix F.1.2),

(9.19)

\begin{equation} \sum_s \int \mathrm{d}{\boldsymbol{p}}\,H_{0s} F_s + \int \mathrm{d}{\boldsymbol{k}}\,w J = \sum_s \int \mathrm{d}{\boldsymbol{p}}\,H_{0s}\overline{f}_s + \frac{1}{8{\rm \pi}}\,\overline{\smash{\widetilde{{\boldsymbol{E}}}}^{{\dagger}} \widetilde{{\boldsymbol{E}}}}, \end{equation}

so, expectedly, the conserved quantity (7.90) is the average particle energy plus the energy of the electrostatic field. In combination with our equations for $\smash {F_s}$ and $\smash {J}$ (§ 7.6), these results can be considered as a generalization and concise restatement of the OC QLT by Dewar (Reference Dewar1973), which is rigorously reproduced from our general formulation as a particular case.

9.1.4. Eikonal waves

As a particular case, let us consider an eikonal wave

(9.20)

\begin{equation} \underline{\widetilde{\varphi}} \approx \operatorname{re}(\mathrm{e}^{\mathrm{i} \theta}{\breve{\varphi}}), \qquad \overline{\omega}\doteq{-}\partial_t\theta, \qquad \overline{{\boldsymbol{k}}}\doteq \partial_{{\boldsymbol{x}}} \theta, \end{equation}

which may or may not be on-shell. As seen from § 7.4.1,

(9.21)

\begin{equation} {\mathsf{U}} \approx \frac{|{\breve{\varphi}}|^2}{4}\,\sum_{{\pm}} \delta(\omega \pm \overline{\omega})\,\delta({\boldsymbol{k}} \pm \overline{{\boldsymbol{k}}}). \end{equation}

For non-resonant particles, the dressing function is well defined is found as follows:

(9.22)

\begin{align} {\boldsymbol{\Theta}}_s & \approx{-} \int \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\, \frac{e_s^2 {\boldsymbol{k}}{\boldsymbol{k}}|{\breve{\varphi}}|^2}{4(\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s)^2}\,\sum_{{\pm}} \delta(\omega \pm \overline{\omega})\,\delta({\boldsymbol{k}} \pm \overline{{\boldsymbol{k}}})\nonumber\\ & ={-} \frac{e_s^2 \overline{{\boldsymbol{k}}}\,\overline{{\boldsymbol{k}}} |{\breve{\varphi}}|^2}{2(\overline{\omega} - \overline{{\boldsymbol{k}}} \cdot {\boldsymbol{v}}_s)^2}. \end{align}

Similarly, the ponderomotive energy for non-resonant particles is

(9.23)

\begin{align} \varDelta_s & \approx \frac{e_s^2 |{\breve{\varphi}}|^2}{8m_s} \frac{\partial}{\partial {\boldsymbol{v}}_s} \cdot \int \mathrm{d}{\omega}\,\mathrm{d}{\boldsymbol{k}}\, \frac{{\boldsymbol{k}}}{\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s} \sum_{{\pm}} \delta(\omega \pm \overline{\omega})\,\delta({\boldsymbol{k}} \pm \overline{{\boldsymbol{k}}}) \nonumber\\ & = \frac{e_s^2 |{\breve{\varphi}}|^2}{8m_s} \int \mathrm{d}{\omega}\,\mathrm{d}{\boldsymbol{k}}\, \frac{k^2}{(\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s)^2} \sum_{{\pm}} \delta(\omega \pm \overline{\omega})\,\delta({\boldsymbol{k}} \pm \overline{{\boldsymbol{k}}}) \nonumber\\ & = \frac{e_s^2 \overline{k}^2 |{\breve{\varphi}}|^2}{4m_s(\overline{\omega} - \overline{{\boldsymbol{k}}} \cdot {\boldsymbol{v}}_s)^2}, \end{align}

in agreement with Dewar (Reference Dewar1972) and Cary & Kaufman (Reference Cary and Kaufman1977). One can also express these functions in terms of the electric-field envelope $\smash {{\breve {{\boldsymbol {E}}}} \approx -\mathrm {i} \overline {{\boldsymbol {k}}} {\breve {\varphi }}}$:

(9.24)

\begin{equation} {\boldsymbol{\Theta}}_{s} \approx{-}\frac{e_s^2{\breve{{\boldsymbol{E}}}}\smash{{\breve{{\boldsymbol{E}}}}}^{{\dagger}}}{2(\overline{\omega} - \overline{{\boldsymbol{k}}} \cdot {\boldsymbol{v}}_s)^2}, \qquad \varDelta_s \approx \frac{e_s^2 |\smash{{\breve{{\boldsymbol{E}}}}}|^2}{4m_s (\overline{\omega} - \overline{{\boldsymbol{k}}} \cdot {\boldsymbol{v}}_s)^2}. \end{equation}

For on-shell in particular, one can use (9.16) together with $\smash {h({\boldsymbol {k}}) = \frac {1}{4} |{\breve {\varphi }}|^2 \delta ({\boldsymbol {k}} - \overline {{\boldsymbol {k}}})}$ (cf. (7.64)) to obtain the well-known expression for the wave action density $\smash {\mathcal {I} \doteq \int \mathrm {d}{\boldsymbol {k}}\,J}$:

(9.25)

\begin{equation} \mathcal{I} = \frac{|\smash{{\breve{{\boldsymbol{E}}}}}|^2}{16{\rm \pi}}\frac{\partial\epsilon_{{\parallel}\text{H}}(\omega, \overline{{\boldsymbol{k}}})}{\partial \omega}\,\Big|_{\omega = w(\overline{{\boldsymbol{k}}})}. \end{equation}

For not-too-hot plasma, one has $\smash {\epsilon _{\parallel \text {H}}(\omega, {\boldsymbol {k}}) \approx 1 - \omega _p^2/\omega ^2}$, where $\smash {\omega _p \doteq \sum _s 4{\rm \pi} \mathcal {N}_s e_s^2/m_s}$ is the plasma frequency. The corresponding waves are Langmuir waves. Their dispersion relation is $\smash {w(\overline {{\boldsymbol {k}}}) \approx \pm \omega _p}$, so $\smash {\mathcal {I} \approx \pm |{\breve {{\boldsymbol {E}}}}|^2/(8{\rm \pi} \omega _p)}$ (and accordingly, the wave energy density is $\smash {\mathcal {E}_{\text {w}} = w\mathcal {I} \geqslant 0}$ for either sign). Remember, though, that this expression is only approximate. Using it instead of (9.25) can result in violation of the exact conservation laws of QLT. Conservation of the Langmuir-wave action in non-stationary plasmas beyond the cold-plasma approximation is also discussed in Dodin, Geyko & Fisch (Reference Dodin, Geyko and Fisch2009), Dodin & Fisch (Reference Dodin and Fisch2010b) and Schmit, Dodin & Fisch (Reference Schmit, Dodin and Fisch2010).

9.1.5. Homogeneous plasma

In homogeneous $n$-dimensional plasma of a given volume $\mathscr {V}_n$, the Wigner function (9.2) has the form $\smash {{\mathsf {U}} = \mathcal {U}(t, {\boldsymbol {k}}) \delta (\omega - w(t, {\boldsymbol {k}}))}$. The function $\mathcal {U}$ is readily found using (5.47):

(9.26)

\begin{equation} \mathcal{U}(t, {\boldsymbol{k}}) = \frac{1}{\mathscr{V}_n} \int \mathrm{d}{\boldsymbol{x}}\,\mathrm{d}\omega\, U = \frac{1}{\mathscr{V}_n}\,|\overline{\mathring{\underline{\widetilde{\varphi}}}(t, {\boldsymbol{k}})|^2} = \frac{1}{\mathscr{V}_n}\,|\mathring{\underline{\widetilde{\varphi}}}(t, {\boldsymbol{k}})|^2. \end{equation}

Then,

(9.27)

\begin{equation} {{\boldsymbol {\mathsf{D}}}}_{s} \approx \frac{{\rm \pi} e_s^2}{\mathscr{V}_n} \int \mathrm{d}{\boldsymbol{k}}\, {\boldsymbol{k}}{\boldsymbol{k}}\,|\mathring{\underline{\widetilde{\varphi}}}(t,\, \boldsymbol{k})|^2\, \delta(w(t, {\boldsymbol{k}}) - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s). \end{equation}

This coincides with the well-known formula for the QL-diffusion coefficient in homogeneous electrostatic plasma.Footnote ³⁹ The functions $\smash {{\boldsymbol {\Theta }}_{s}}$ and $\smash {\varDelta _s}$ are also important in homogeneous turbulence in that they ensure the proper energy–momentum conservation; for example, see Stix (Reference Stix1992, § 16.3) and Liu & Dodin (Reference Liu and Dodin2015, § II.2). These functions can be expressed through $\smash {\mathring {\underline {\widetilde {\varphi }}}}$ too. However, they have a simpler representation in terms of the Wigner function $\smash {{\mathsf {U}}}$, as in (9.4) and (9.5), respectively. This is because $\smash {{\mathsf {U}}}$ is a local property of the field, which makes it more fundamental than the amplitudes of global Fourier harmonics commonly used in the literature.

9.2. Relativistic electromagnetic interactions

9.2.1. Main equations

Let us extend the above results to relativistic electromagnetic interactions. In this case,

(9.28)

\begin{equation} H_s = \sqrt{m_s^2 c^4 + ({\boldsymbol{p}}c - e_s {\boldsymbol{A}})^2} + e_s \varphi, \end{equation}

where $\smash {c}$ is the speed of light and $\smash {{\boldsymbol {A}}}$ is the vector potential. Let us adopt the Weyl gauge for the oscillating part of the electromagnetic field ($\smash {\widetilde {\varphi } = 0}$) and Taylor-expand $\smash {H_s}$ to the second order in $\smash {\widetilde {{\boldsymbol {A}}}}$. This leads to

(9.29)

\begin{align} & \displaystyle H_s \approx H_{0s} - e_s{\boldsymbol{\beta}}_s^{{\dagger}} \widetilde{{\boldsymbol{A}}} + \frac{e_s^2}{2 c^2}\,\smash{\widetilde{{\boldsymbol{A}}}}^{{\dagger}} {\boldsymbol{\mu}}_s^{{-}1} \widetilde{{\boldsymbol{A}}}, \end{align}

(9.30)

\begin{align} & \displaystyle H_{0s} = \sqrt{m_s^2 c^4 + ({\boldsymbol{p}}c - e_s \overline{{\boldsymbol{A}}})^2} + e_s \overline{\varphi} \end{align}

(although plasma is assumed non-magnetized, a weak average magnetic field $\smash {\overline {{\boldsymbol {B}}} = \nabla \times \overline {{\boldsymbol {A}}}}$ is allowed, so $\smash {\overline {{\boldsymbol {A}}}}$ can be order-one and thus generally must be retained), where

(9.31)

\begin{equation} {\boldsymbol{\beta}}_s = \frac{1}{m_s c \gamma_s}\left({\boldsymbol{p}} - \frac{e_s}{c}\,\overline{{\boldsymbol{A}}}\right), \qquad {\boldsymbol{\mu}}_s^{{-}1} = \frac{{\boldsymbol{1}} - {\boldsymbol{\beta}}_s{\boldsymbol{\beta}}_s^{{\dagger}}}{m_s \gamma_s}, \end{equation}

and $\smash {\gamma _s \doteq (1-\beta _s^2)^{-1/2}}$. In the equations presented below, $\smash {{\boldsymbol {\beta }}_s = {\boldsymbol {v}}_s/c}$ (where $\smash {{\boldsymbol {v}}_s}$ is the OC velocity) is a sufficiently accurate approximation. Also, $\smash {{\boldsymbol {\mu }}_s \equiv (\partial ^2_{{\boldsymbol {p}}{\boldsymbol {p}}} H_{0s})^{-1}}$ can be interpreted as the relativistic-mass tensor.

Let us choose the field $\smash {\widetilde {{\boldsymbol {\varPsi }}}}$ of our general theory to be the oscillating electric field $\smash {\widetilde {{\boldsymbol {E}}} = \mathrm {i} \widehat {\omega }\widetilde {{\boldsymbol {A}}}/c}$; then (cf. (6.2)),

(9.32)

\begin{equation} \widehat{\boldsymbol{\alpha}}_s = \mathrm{i} e_s{\boldsymbol{v}}_s \widehat{\omega}^{{-}1}, \qquad \widehat{\boldsymbol{L}}_s = e_s^2\widehat{\omega}^{{-}1}, \qquad \widehat{\boldsymbol{R}}_s = {\boldsymbol{\mu}}_s^{{-}1}\widehat{\omega}^{{-}1}. \end{equation}

(Other ways to identify $\smash {\widehat {\boldsymbol {L}}_s}$ and $\smash {\widehat {\boldsymbol {R}}_s}$ are also possible and lead to the same results.) Then,

(9.33)

\begin{equation} {\boldsymbol{\alpha}}_s = \frac{\mathrm{i} e_s{\boldsymbol{v}}_s}{\omega}, \qquad {\boldsymbol{\wp}}_s = \frac{e_s^2}{\omega^2}\,{\boldsymbol{\mu}}_s^{{-}1}. \end{equation}

The average Wigner matrix of $\smash {\widetilde {{\boldsymbol {E}}}}$ is

(9.34)

\begin{equation} {{\boldsymbol {\mathsf{U}}}}(t, {\boldsymbol{x}}, \omega, {\boldsymbol{k}}) = \int \frac{\mathrm{d}\tau}{2{\rm \pi}}\,\frac{\mathrm{d}{\boldsymbol{s}}}{(2{\rm \pi})^{3}}\,\, \overline{ \underline{\widetilde{{\boldsymbol{E}}}}(t + \tau/2, {\boldsymbol{x}} + {\boldsymbol{s}}/2)\, \smash{\underline{\widetilde{{\boldsymbol{E}}}}}^{{\dagger}}(t - \tau/2, {\boldsymbol{x}} - {\boldsymbol{s}}/2) } \,\mathrm{e}^{\mathrm{i}\omega \tau - \mathrm{i} {\boldsymbol{k}} \cdot {\boldsymbol{s}}} \end{equation}

(the arguments $\smash {t}$ and $\smash {{\boldsymbol {x}}}$ are henceforth omitted), and the nonlinear potentials are

(9.35)

\begin{align} {{\boldsymbol {\mathsf{D}}}}_s & = {\rm \pi}e_s^2 \int \mathrm{d}{\boldsymbol{k}}\,{\boldsymbol{k}}{\boldsymbol{k}}\,\frac{{\boldsymbol{v}}_s^{{\dagger}} {{\boldsymbol {\mathsf{U}}}}({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s, {\boldsymbol{k}}){\boldsymbol{v}}_s}{({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s)^2}, \end{align}

(9.36)

\begin{align} {\boldsymbol{\Theta}}_s & = e_s^2\,\frac{\partial}{\partial \vartheta} {\unicode{x2A0F}} \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\, \left. \frac{{\boldsymbol{k}}{\boldsymbol{k}}}{\omega^2} \frac{({\boldsymbol{v}}_s^{{\dagger}} {{\boldsymbol {\mathsf{U}}}} {\boldsymbol{v}}_s)}{\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s + \vartheta} \right|_{\vartheta=0}, \end{align}

(9.37)

\begin{align} \varDelta_s & = \frac{e_s^2}{2}\frac{\partial}{\partial{\boldsymbol{p}}} \cdot {\unicode{x2A0F}} \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\,\frac{{\boldsymbol{k}}}{\omega^2}\frac{({\boldsymbol{v}}_s^{{\dagger}} {{\boldsymbol {\mathsf{U}}}} {\boldsymbol{v}}_s)}{\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s} + \frac{e_s^2}{2} \int \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\,\frac{\operatorname{tr}({{\boldsymbol {\mathsf{U}}}} {\boldsymbol{\mu}}_s^{{-}1})}{\omega^2}. \end{align}

When plasma is non-relativistic and the field is electrostatic (so $\smash {{{\boldsymbol {\mathsf {U}}}} = {\boldsymbol {k}}{\boldsymbol {k}}^{{\dagger}} {\mathsf {U}}_\varphi }$, where $\smash {{\mathsf {U}}_\varphi }$ is scalar), (9.35) gives the same $\smash {{{\boldsymbol {\mathsf {D}}}}_s}$ as (9.3) and (9.37) gives the same $\smash {\varDelta _s}$ as (9.5). For $\smash {{\boldsymbol {\Theta }}_s}$, the equivalence between (9.36) and (9.4) should not be expected because $\smash {{\boldsymbol {\Theta }}_s}$ is a part of a distribution function, which is not gauge-invariant. (Canonical momenta in the Weyl gauge are different from those in the electrostatic gauge.) But it is precisely the dressing function (9.36) that leads to the correct expressions for the momentum and energy stored in the OC distribution (§ 9.2.3).

The Lagrangian density of a free electromagnetic field is

(9.38)

\begin{equation} \mathfrak{L}_0 = \frac{\smash{\widetilde{{\boldsymbol{E}}}}^{{\dagger}}\widetilde{{\boldsymbol{E}}} - \smash{\widetilde{{\boldsymbol{B}}}}^{{\dagger}}\widetilde{{\boldsymbol{B}}}}{8{\rm \pi}}. \end{equation}

From Faraday's law, one has $\smash {\widetilde {{\boldsymbol {B}}} = \widehat {\omega }^{-1}c(\widehat {\boldsymbol {k}} \times {\boldsymbol {E}})}$.Footnote ⁴⁰ Then, $\smash {- \smash {\widetilde {{\boldsymbol {B}}}}^{{\dagger}} \widetilde {{\boldsymbol {B}}}/c^2}$ can be represented as follows (up to a divergence, which is insignificant):

(9.39)

\begin{align} (\widetilde{{\boldsymbol{E}}} \times \widehat{\omega}^{{-}1}\widehat{\boldsymbol{k}}) \cdot (\widehat{\omega}^{{-}1}\widehat{\boldsymbol{k}} \times \widetilde{{\boldsymbol{E}}}) & = \widetilde{{\boldsymbol{E}}} \cdot \widehat{\omega}^{{-}2}(\widehat{\boldsymbol{k}} \times (\widehat{\boldsymbol{k}} \times \widetilde{{\boldsymbol{E}}})) \nonumber\\ & = \smash{\widetilde{{\boldsymbol{E}}}}^{{\dagger}} \widehat{\omega}^{{-}2}(\widehat{\boldsymbol{k}}(\widehat{\boldsymbol{k}} \cdot \widetilde{{\boldsymbol{E}}}) - \widetilde{{\boldsymbol{E}}} \widehat{k}^2) \nonumber\\ & = \smash{\widetilde{{\boldsymbol{E}}}}^{{\dagger}} \widehat{\omega}^{{-}2}(\widehat{\boldsymbol{k}\,}\smash{\widehat{\boldsymbol{k}}}^{{\dagger}} - {\boldsymbol{1}}\widehat{k\,}^2)\widetilde{{\boldsymbol{E}}}. \end{align}

Then, the vacuum dispersion operator can be written as (cf. (6.1))

(9.40)

\begin{equation} \widehat{\boldsymbol{\varXi}}_0(\omega, {\boldsymbol{k}}) = \frac{1}{4{\rm \pi}}\,({\boldsymbol{1}} + c^2 \widehat{\omega}^{{-}2}(\widehat{\boldsymbol{k}\,}\smash{\widehat{\boldsymbol{k}}}^{{\dagger}} - {\boldsymbol{1}}\widehat{k\,}^2)). \end{equation}

The total dispersion matrix is readily found to be

(9.41)

\begin{equation} {\boldsymbol{\varXi}}(\omega, {\boldsymbol{k}}) = \frac{1}{4{\rm \pi}}\left({\boldsymbol{\epsilon}}(\omega, {\boldsymbol{k}}) + \frac{c^2}{\omega^2}\,({\boldsymbol{k}}\smash{{\boldsymbol{k}}}^{{\dagger}} - {\boldsymbol{1}}k^2)\right), \end{equation}

where $\smash {{\boldsymbol {\epsilon }}}$ (not to be confused with the small parameter $\smash {\epsilon }$ that we introduced earlier) is the dielectric tensor

(9.42)

\begin{equation} {\boldsymbol{\epsilon}}(\omega, {\boldsymbol{k}}) = {\boldsymbol{1}} - \frac{{\boldsymbol{\mathfrak{w}}}_p}{\omega^2} + \sum_s \frac{4{\rm \pi} e_s^2}{\omega^2}\int \mathrm{d}{\boldsymbol{p}}\, \frac{{\boldsymbol{v}}_s {\boldsymbol{v}}_s^{{\dagger}}}{\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s + \mathrm{i} 0}\,{\boldsymbol{k}} \cdot \frac{\partial F_s}{\partial{\boldsymbol{p}}}. \end{equation}

Here, $\smash {{\boldsymbol {\mathfrak {w}}}_p}$ is the squared relativistic plasma frequency, which is a matrix, because the ‘masses’ $\smash {{\boldsymbol {\mu }}_s}$ are matrices:

(9.43)

\begin{equation} {\boldsymbol{\mathfrak{w}}}_p \doteq \sum_s 4 {\rm \pi}e_s^2 \int \mathrm{d}{\boldsymbol{p}}\,F_s {\boldsymbol{\mu}}_s^{{-}1}. \end{equation}

9.2.2. Collisions and fluctuations

By (8.12), the spectrum of microscopic oscillations of $\smash {\widetilde {{\boldsymbol {E}}}}$ is a matrix given by

(9.44)

\begin{equation} {{\boldsymbol {\mathsf{S}}}}(\omega, {\boldsymbol{k}}) = 2{\rm \pi}\, \sum_s \left(\frac{4\pi e_s}{\omega}\right)^2 \int \mathrm{d}{\boldsymbol{p}}\,\delta(\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_{s})F_{s}({\boldsymbol{p}}) {\boldsymbol{\epsilon}}^{{-}1}(\omega, {\boldsymbol{k}})\, {\boldsymbol{v}}_{s}{\boldsymbol{v}}_{s}^{{\dagger}}\, {\boldsymbol{\epsilon}}^{-{{\dagger}}}(\omega, {\boldsymbol{k}}). \end{equation}

In the electrostatic limit, one can replace $\smash {{\boldsymbol {\epsilon }}^{-1}}$ with $\smash {\epsilon _\parallel ^{-1}{\boldsymbol {k}}{\boldsymbol {k}}^{{\dagger}} /k^2}$, where $\smash {\epsilon _\parallel }$ is the relativistic generalization of (9.9); then (9.44) leads to (9.10) as a particular case. For thermal equilibrium, one can also use (8.15) and the following form of $\smash {{\boldsymbol {\epsilon }}^{-1}}$ for isotropic plasma:

(9.45)

\begin{equation} {\boldsymbol{\epsilon}}^{{-}1} = \frac{1}{\epsilon_\perp}\bigg({\boldsymbol{1}} - \frac{{\boldsymbol{k}}{\boldsymbol{k}}^{{\dagger}}}{k^2}\bigg) + \frac{1}{\epsilon_\parallel}\frac{{\boldsymbol{k}}{\boldsymbol{k}}^{{\dagger}}}{k^2}, \end{equation}

where $\smash {\epsilon _\perp }$ is the (scalar) transverse permittivity. Also, (6.36) leads to

(9.46)

\begin{equation} |\mathcal{X}_{ss'}(\omega, {\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}')|^2 \approx \left(\frac{4{\rm \pi} e_s e_{s'}}{\omega^2}\right)^2 |{\boldsymbol{v}}_s^{{\dagger}} {\boldsymbol{\epsilon}}^{{-}1}(\omega, {\boldsymbol{k}}) {\boldsymbol{v}'}_{s'}|^2. \end{equation}

Then the collision operator (6.61) is obtained in the form

(9.47)

\begin{align} \mathcal{C}_s = \frac{\partial}{\partial {\boldsymbol{p}}}\cdot \sum_{s'} 2 e_s^2 e_{s'}^2 \int & \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^3}\,\mathrm{d}{\boldsymbol{p}}'\, \frac{|{\boldsymbol{v}}_s^{{\dagger}} {\boldsymbol{\epsilon}}^{{-}1}({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s, {\boldsymbol{k}}) {\boldsymbol{v}'}_{s'}|^2}{({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s)^4}\, \delta({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s - {\boldsymbol{k}} \cdot {\boldsymbol{v}}'_{s'}) \nonumber\\ & \times {\boldsymbol{k}}{\boldsymbol{k}} \cdot \left( \frac{\partial F_s({\boldsymbol{p}})}{\partial {\boldsymbol{p}}}\,F_{s'}({\boldsymbol{p}}') - F_s({\boldsymbol{p}})\,\frac{\partial F_{s'}({\boldsymbol{p}}')}{\partial {\boldsymbol{p}}'} \right), \end{align}

which is in agreement with (Silin Reference Silin1961; Hizanidis, Molvig & Swartz Reference Hizanidis, Molvig and Swartz1983). Replacing $\smash {{\boldsymbol {\epsilon }}^{-1}}$ with $\smash {\epsilon _\parallel ^{-1}{\boldsymbol {k}}{\boldsymbol {k}}^{{\dagger}} /k^2}$ leads to the standard Balescu–Lenard operator (9.13) as a particular case.

9.2.3. On-shell waves

Electromagnetic on-shell waves satisfy

(9.48)

\begin{equation} \left({\boldsymbol{\epsilon}}_\text{H}(w({\boldsymbol{k}}), {\boldsymbol{k}}) + \frac{c^2}{w({\boldsymbol{k}})^2}\,({\boldsymbol{k}}\smash{{\boldsymbol{k}}}^{{\dagger}} - {\boldsymbol{1}}k^2)\right) {\breve{{\boldsymbol{E}}}} = 0, \end{equation}

where $\smash {{\breve {{\boldsymbol {E}}}}}$ is the complex envelope vector parallel to the polarization vector $\smash {{\boldsymbol {\eta }}}$; also,

(9.49)

\begin{equation} {\boldsymbol{\epsilon}}_\text{H}(\omega, {\boldsymbol{k}}) = {\boldsymbol{1}} - \frac{{\boldsymbol{\mathfrak{w}}}_p}{\omega^2} + \sum_s \frac{4{\rm \pi} e_s^2}{\omega^2}{\unicode{x2A0F}} \mathrm{d}{\boldsymbol{p}}\, \frac{{\boldsymbol{v}}_s {\boldsymbol{v}}_s^{{\dagger}}}{\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s}\,{\boldsymbol{k}} \cdot \frac{\partial F_s}{\partial{\boldsymbol{p}}}. \end{equation}

This yields (see (9.39))

(9.50)

\begin{equation} \smash{{\breve{{\boldsymbol{E}}}}}^{{\dagger}}{\boldsymbol{\epsilon}}_\text{H}(w({\boldsymbol{k}}), {\boldsymbol{k}}){\breve{{\boldsymbol{E}}}} ={-} \frac{c^2}{w({\boldsymbol{k}})^2}\,\smash{{\breve{{\boldsymbol{E}}}}}^{{\dagger}}({\boldsymbol{k}}\smash{{\boldsymbol{k}}}^{{\dagger}} - {\boldsymbol{1}}k^2){\breve{{\boldsymbol{E}}}} = \smash{{\breve{{\boldsymbol{B}}}}}^{{\dagger}} {\breve{{\boldsymbol{B}}}}. \end{equation}

Then, the phase-space density of the wave action (7.74) can be cast in the form

(9.51)

\begin{equation} J({\boldsymbol{k}}) = h({\boldsymbol{k}}){\boldsymbol{\eta}}^{{\dagger}}\,\frac{\partial {\boldsymbol{\varXi}}_\text{H}(\omega, {\boldsymbol{k}})}{\partial \omega}\,{\boldsymbol{\eta}}\,\Big|_{\omega = w(\overline{{\boldsymbol{k}}})} = \frac{h({\boldsymbol{k}})}{4{\rm \pi}\omega^2}\,{\boldsymbol{\eta}}^{{\dagger}}\,\frac{\partial(\omega^2{\boldsymbol{\epsilon}}_{\text{H}}(\omega, {\boldsymbol{k}}))}{\partial \omega}\,{\boldsymbol{\eta}}\,\Big|_{\omega = w(\overline{{\boldsymbol{k}}})} \end{equation}

(cf. Dodin et al. Reference Dodin, Ruiz, Yanagihara, Zhou and Kubo2019), and the dressing function (9.36) is given by

(9.52)

\begin{align} {\boldsymbol{\Theta}}_s & = e_s^2\,\frac{\partial}{\partial \vartheta} {\unicode{x2A0F}} \mathrm{d}{\boldsymbol{k}}\,(h({\boldsymbol{k}}) + h(-{\boldsymbol{k}})) \left. \frac{{\boldsymbol{k}}{\boldsymbol{k}}}{w^2({\boldsymbol{k}})} \frac{({\boldsymbol{v}}_s^{{\dagger}} {\boldsymbol{\eta}} {\boldsymbol{\eta}}^{{\dagger}} {\boldsymbol{v}}_s)}{w({\boldsymbol{k}}) - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s + \vartheta} \right|_{\vartheta=0} \nonumber\\ & = 2e_s^2\,\frac{\partial}{\partial \vartheta} {\unicode{x2A0F}} \mathrm{d}{\boldsymbol{k}}\,h({\boldsymbol{k}}) \left. \frac{{\boldsymbol{k}}{\boldsymbol{k}}}{w^2({\boldsymbol{k}})} \frac{({\boldsymbol{\eta}}^{{\dagger}} {\boldsymbol{v}}_s {\boldsymbol{v}}_s^{{\dagger}} {\boldsymbol{\eta}})}{w({\boldsymbol{k}}) - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s + \vartheta} \right|_{\vartheta=0}. \end{align}

Using these, one obtains (Appendix F.2.1)

(9.53)

\begin{equation} \sum_s \int \mathrm{d}{\boldsymbol{p}}\,{\boldsymbol{p}} F_s + \int \mathrm{d}{\boldsymbol{k}}\,{\boldsymbol{k}} J = {\boldsymbol{\mathcal{P}}}^{(\text{kin})} + \frac{\overline{\widetilde{{\boldsymbol{E}}} \times \widetilde{{\boldsymbol{B}}}}}{4{\rm \pi} c}, \end{equation}

where $\smash {{\boldsymbol {\mathcal {P}}}^{(\text {kin})}}$ is the average density of the plasma kinetic (up to $\smash {\overline {{\boldsymbol {A}}}}$) momentum,

(9.54)

\begin{equation} {\boldsymbol{\mathcal{P}}}^{(\text{kin})} \doteq \sum_s \int \mathrm{d}{\boldsymbol{p}}\,\overline{({\boldsymbol{p}} - e_s \widetilde{{\boldsymbol{A}}}/c)f_s} = \sum_s \int \mathrm{d}{\boldsymbol{p}}\,{\boldsymbol{p}}\,\overline{f_s^{\text{(kin)}}}, \end{equation}

the functions $\smash {f_s^{\text {(kin)}}({\boldsymbol {p}}) \doteq f_s({\boldsymbol {p}} + e_s \widetilde {{\boldsymbol {A}}}/c)}$ are the distributions of kinetic (up to $\smash {\overline {{\boldsymbol {A}}}}$) momenta, and the second term in (9.53) is the well-known average momentum of electromagnetic field. Similarly (Appendix F.2.2),

(9.55)

\begin{equation} \sum_s \int \mathrm{d}{\boldsymbol{p}}\,H_{0s} F_s + \int \mathrm{d}{\boldsymbol{k}}\,w J = \mathcal{K}^{(\text{kin})} + \frac{1}{8{\rm \pi}}\,\overline{\big( \smash{\widetilde{{\boldsymbol{E}}}}^{{\dagger}} \widetilde{{\boldsymbol{E}}} + \smash{\widetilde{{\boldsymbol{B}}}}^{{\dagger}} \widetilde{{\boldsymbol{B}}} \big)}, \end{equation}

where $\smash {\mathcal {K}^{(\text {kin})}}$ is given by

(9.56)

\begin{equation} \mathcal{K}^{(\text{kin})} \doteq \sum_s \int \mathrm{d}{\boldsymbol{p}}\,H_{0s}\,\overline{f_s^{(\text{kin})}}. \end{equation}

In other words, the total momentum and energy of the system in the OC–wave representation are the same as those in the original particle–field variables.

9.2.4. Eikonal waves

As a particular case, let us consider an eikonal wave

(9.57)

\begin{equation} \underline{\widetilde{{\boldsymbol{E}}}} \approx \operatorname{re}(\mathrm{e}^{\mathrm{i} \theta}{\breve{{\boldsymbol{E}}}}), \qquad \overline{\omega}\doteq{-}\partial_t\theta, \qquad \overline{{\boldsymbol{k}}}\doteq \partial_{{\boldsymbol{x}}} \theta, \end{equation}

which may or may not be on-shell. Then, (9.36) and (9.37) lead to (cf. § 9.1.4)

(9.58)

$$\begin{gather} \displaystyle {\boldsymbol{\Theta}}_s ={-} \frac{\overline{{\boldsymbol{k}}}\,\overline{{\boldsymbol{k}}}}{\overline{\omega}^2} \frac{e_s^2 |{\boldsymbol{v}}_s^{{\dagger}} {\breve{{\boldsymbol{E}}}}|^2}{2(\overline{\omega} - \overline{{\boldsymbol{k}}} \cdot {\boldsymbol{v}}_s)^2}, \end{gather}$$

(9.59)

$$\begin{gather}\displaystyle \varDelta_s = \frac{e_s^2\smash{{\breve{{\boldsymbol{E}}}}}^{{\dagger}}{\boldsymbol{\mu}}_s^{{-}1}{\breve{{\boldsymbol{E}}}}}{4\overline{\omega}^2} + \frac{e_s^2\overline{{\boldsymbol{k}}}}{4\overline{\omega}^2} \cdot\frac{\partial}{\partial{\boldsymbol{p}}} \bigg(\frac{|{\boldsymbol{v}}_s^{{\dagger}} {\breve{{\boldsymbol{E}}}}|^2}{\overline{\omega} - \overline{{\boldsymbol{k}}} \cdot {\boldsymbol{v}}_s}\bigg). \end{gather}$$

For on-shell waves in particular, one also obtains the action density in the form

(9.60)

\begin{align} \mathcal{I} & = \frac{1}{16{\rm \pi}\omega^2}\, \smash{{\breve{{\boldsymbol{E}}}}}^{{\dagger}}\, \frac{\partial(\omega^2{\boldsymbol{\epsilon}}_{\text{H}}(\omega, {\boldsymbol{k}}))}{\partial \omega}\, {\breve{{\boldsymbol{E}}}}\,\Big|_{\omega = w(\overline{{\boldsymbol{k}}})} \nonumber\\ & = \frac{1}{16{\rm \pi}\omega}\left( \smash{{\breve{{\boldsymbol{E}}}}}^{{\dagger}}\, \frac{\partial(\omega {\boldsymbol{\epsilon}}_{\text{H}}(\omega, {\boldsymbol{k}}))}{\partial \omega}\, {\breve{{\boldsymbol{E}}}} + \smash{{\breve{{\boldsymbol{B}}}}}^{{\dagger}} {\breve{{\boldsymbol{B}}}} \right)\Big|_{\omega = w(\overline{{\boldsymbol{k}}})}, \end{align}

where we used (9.50).

9.3. Newtonian gravity

For Newtonian interactions governed by a gravitostatic potential $\smash {\varphi _g}$, one has

(9.61)

\begin{equation} H_s = \frac{p^2}{2m_s} + m_s \overline{\varphi}_g + m_s\widetilde{\varphi}_g, \qquad \mathfrak{L}_0 ={-} \frac{(\nabla \widetilde{\varphi}_g)^2}{8{\rm \pi} {\mathsf{G}}}, \end{equation}

where $\smash {{\mathsf {G}}}$ is the gravitational constant. This system is identical to that considered in § 9.1 for non-relativistic electrostatic interactions up to coefficients. Specifically, $\smash {e_s}$ are replaced with $\smash {m_s}$, a factor $\smash {-{\mathsf {G}}^{-1}}$ appears in $\smash {{\boldsymbol {\varXi }}_0}$, and the dispersion matrix becomes

(9.62)

\begin{equation} {\boldsymbol{\varXi}}(\omega, {\boldsymbol{k}}) = \varXi(\omega, {\boldsymbol{k}}) ={-}\frac{k^2\epsilon_g(\omega, {\boldsymbol{k}})}{4{\rm \pi} {\mathsf{G}}}. \end{equation}

Thus, $\smash {\epsilon _\parallel }$ is replaced with $\smash {-\epsilon _g/{\mathsf {G}}}$, where $\smash {\epsilon _g}$ is the gravitostatic permittivity given by

(9.63)

\begin{equation} \epsilon_g(\omega, {\boldsymbol{k}}) = 1 - \sum_s \frac{4{\rm \pi} {\mathsf{G}} m_s^2}{k^2} \int \mathrm{d}{\boldsymbol{p}}\,\frac{{\boldsymbol{k}}}{\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s + \mathrm{i} 0} \cdot \frac{\partial F_s}{\partial {\boldsymbol{p}}}. \end{equation}

This readily yields, for example, the kinetic theory of the Jeans instability (Trigger et al. Reference Trigger, Ershkovich, van Heijst and Schram2004), whose dispersion relation is given by $\smash {\epsilon _g(\omega, {\boldsymbol {k}}) = 0}$ (modulo the usual analytic continuation of the permittivity to modes with $\smash {\operatorname {im}\omega < 0}$).

9.4. Relativistic gravity

9.4.1. Main equations

The dynamics of a relativistic neutral particle with mass $\smash {m}$ in a spacetime metric $\smash {g_{\alpha \beta }}$ with signature $\smash {(-+++)}$ is governed by a covariant Hamiltonian (see, for example, Garg & Dodin Reference Garg and Dodin2020)

(9.64)

\begin{equation} {\mathsf{H}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{p}}}) = \frac{1}{2m}\left(m^2 + g^{\alpha\beta}({\boldsymbol {\mathsf{x}}}) p_\alpha p_\beta\right) \equiv {\mathsf{H}}({\boldsymbol{g}}, {\boldsymbol {\mathsf{p}}}). \end{equation}

Here, $\smash {{\boldsymbol {\mathsf {x}}} \equiv (x^0, {\boldsymbol {x}})}$, and $\smash {x^0 = t}$, as usual. Also, $\smash {{\boldsymbol {\mathsf {p}}} \equiv (p_0, {\boldsymbol {p}})}$ is the index-free notation for the four-momentum $\smash {p_\alpha }$, $\smash {g^{\alpha \beta }}$ is the inverse metric, $\smash {{\boldsymbol {g}}}$ is the index-free notation for $\smash {g^{\alpha \beta }}$, the units are such that $\smash {c = 1}$ and the species index is omitted.Footnote ⁴¹ The corresponding Hamilton's equations, with $\smash {\tau }$ the proper time, are

(9.65)

\begin{equation} \frac{\mathrm{d} x^\alpha}{\mathrm{d} \tau} = \frac{\partial {\mathsf{H}}}{\partial p_\alpha}, \qquad \frac{\mathrm{d} p_\alpha}{\mathrm{d} \tau} ={-}\frac{\partial {\mathsf{H}}}{\partial x^\alpha}. \end{equation}

This dynamics is constrained to the shell $\smash {p_0 = P_0(t, {\boldsymbol {x}}, {\boldsymbol {p}})}$, where $\smash {P_0}$ is the (negative) solution of

(9.66)

\begin{equation} {\mathsf{H}}({\boldsymbol{g}}, P_0(t, {\boldsymbol{x}}, {\boldsymbol{p}}), {\boldsymbol{p}}) = 0. \end{equation}

This means that the particle distribution in the $\smash {({\boldsymbol {\mathsf {x}}}, {\boldsymbol {\mathsf {p}}})}$ space is delta-shaped and thus does not satisfy (3.35). Hence, we will consider particles in the six-dimensional space $\smash {({\boldsymbol {x}}, {\boldsymbol {p}})}$ instead. The corresponding dynamics is governed by the Hamiltonian

(9.67)

\begin{equation} H ={-} P_0(t, {\boldsymbol{x}}, {\boldsymbol{p}}). \end{equation}

This is seen from the fact that

(9.68)

\begin{equation} \frac{\partial H}{\partial {\unicode{x25AA}}} ={-} \frac{\partial P_0}{\partial {\unicode{x25AA}}} = \frac{\partial {\mathsf{H}}/\partial {\unicode{x25AA}}}{\partial {\mathsf{H}}/\partial p_0}, \end{equation}

where $\smash {{\unicode{x25AA}} \in \lbrace t, {\boldsymbol {x}}, {\boldsymbol {p}} \rbrace }$, so Hamilton's equations obtained from (9.67) are equivalent to (9.65):

(9.69)

\begin{equation} \frac{\mathrm{d} x^\alpha}{\mathrm{d} t} = \frac{\partial H}{\partial p_\alpha} = \frac{\partial {\mathsf{H}}/\partial p_\alpha}{\partial {\mathsf{H}}/\partial p_0}, \qquad \frac{\mathrm{d} p_\alpha}{\mathrm{d} t} ={-} \frac{\partial H}{\partial x^\alpha} = \frac{\partial {\mathsf{H}}/\partial x^\alpha}{\partial {\mathsf{H}}/\partial p_0}. \end{equation}

Let us decompose the metric into the average part and oscillations, $\smash {g_{\alpha \beta } = \overline {g}_{\alpha \beta } + \widetilde {g}_{\alpha \beta }}$, and approximate the inverse metric to the second order in $\smash {\widetilde {{\boldsymbol {g}}}}$:

(9.70)

\begin{equation} g^{\alpha\beta} \approx \overline{g}^{\alpha\beta} - \widetilde{g}^{\alpha\beta} + \widetilde{g}^{\alpha\gamma}\overline{g}_{\gamma\delta}\widetilde{g}^{\delta\beta}, \end{equation}

where the indices of $\smash {\widetilde {{\boldsymbol {g}}}}$ are manipulated using the background metric $\smash {\overline {g}_{\alpha \beta }}$. This gives

(9.71)

\begin{equation} {\mathsf{H}} = \frac{1}{2m}\left(m^2 + \overline{g}^{\alpha\beta}p_\alpha p_\beta - \widetilde{g}^{\alpha\beta}p_\alpha p_\beta + \widetilde{g}^{\alpha\beta}\overline{g}_{\beta\gamma}\widetilde{g}^{\gamma\delta}p_\alpha p_\delta\right). \end{equation}

The Hamiltonian (9.67) is expanded in $\smash {\widetilde {{\boldsymbol {g}}}}$ as follows:

(9.72)

\begin{equation} H({\boldsymbol{g}}, {\boldsymbol {\mathsf{p}}}) \approx{-} P_0 - \frac{\partial P_0}{\partial \widetilde{g}^{\alpha\beta}}\,\widetilde{g}^{\alpha\beta} - \frac{1}{2}\,\frac{\partial^2P_0}{\partial \widetilde{g}^{\alpha\beta} \partial \widetilde{g}^{\gamma\delta}}\,\widetilde{g}^{\alpha\beta}\widetilde{g}^{\gamma\delta}, \end{equation}

where $\smash {P_0}$ and the derivatives on the right-hand side are evaluated on $\smash {(\overline {{\boldsymbol {g}}}, {\boldsymbol {\mathsf {p}}})}$. To calculate these derivatives, let us differentiate (9.66) and use (9.71) for $\smash {{\mathsf {H}}}$. This gives

(9.73)

\begin{equation} 0 = \frac{\partial{\mathsf{H}}}{\partial \widetilde{g}^{\alpha\beta}} + \frac{\partial{\mathsf{H}}}{\partial p_0} \frac{\partial P_0}{\partial \widetilde{g}^{\alpha\beta}} = \frac{1}{2m}\left({-}p_\alpha p_\beta + 2P^0 \frac{\partial P_0}{\partial \widetilde{g}^{\alpha\beta}}\right), \end{equation}

where the derivatives with respect to the oscillating metric are taken at fixed $\smash {p_\alpha }$ and at $\smash {\widetilde {{\boldsymbol {g}}} \to 0}$, and $\smash {P^0 \equiv P^0({\boldsymbol {g}}, {\boldsymbol {\mathsf {p}}}) = \overline {g}^{0\alpha }p_\alpha }$; thus,

(9.74)

\begin{equation} \frac{\partial P_0}{\partial \widetilde{g}^{\alpha\beta}} = \frac{p_\alpha p_\beta}{2 P^0}. \end{equation}

Similarly, differentiating (9.66) twice gives

\begin{align*} 0 & = \frac{\partial^2{\mathsf{H}}}{\partial \widetilde{g}^{\alpha\beta}\partial \widetilde{g}^{\gamma\delta}} + \frac{\partial{\mathsf{H}}}{\partial p_0}\frac{\partial^2P_0}{\partial \widetilde{g}^{\alpha\beta}\partial \widetilde{g}^{\gamma\delta}} + \frac{\partial P_0}{\partial \widetilde{g}^{\alpha\beta}}\frac{\partial}{\partial p_0} \frac{\partial {\mathsf{H}}}{\partial \widetilde{g}^{\gamma\delta}} + \frac{\partial}{\partial p_0}\left( \frac{\partial{\mathsf{H}}}{\partial \widetilde{g}^{\alpha\beta}} + \frac{\partial{\mathsf{H}}}{\partial p_0}\frac{\partial P_0}{\partial \widetilde{g}^{\alpha\beta}} \right)\frac{\partial P_0}{\partial g^{\gamma\delta}} \nonumber\\ & = \frac{\partial^2{\mathsf{H}}}{\partial \widetilde{g}^{\alpha\beta}\partial \widetilde{g}^{\gamma\delta}} + \frac{\partial{\mathsf{H}}}{\partial p_0} \frac{\partial^2P_0}{\partial \widetilde{g}^{\alpha\beta}\partial \widetilde{g}^{\gamma\delta}} + \frac{\partial P_0}{\partial \widetilde{g}^{\alpha\beta}}\frac{\partial}{\partial p_0}\frac{\partial {\mathsf{H}}}{\partial \widetilde{g}^{\gamma\delta}} + \frac{\partial P_0}{\partial \widetilde{g}^{\gamma\delta}}\frac{\partial}{\partial p_0} \frac{\partial{\mathsf{H}}}{\partial \widetilde{g}^{\alpha\beta}} + \frac{\partial^2{\mathsf{H}}}{\partial p_0\partial p_0}\frac{\partial P_0}{\partial \widetilde{g}^{\alpha\beta}}\frac{\partial P_0}{\partial \widetilde{g}^{\gamma\delta}} \nonumber\\ & = \frac{1}{2m}\left(\overline{g}_{\beta\gamma}p_\alpha p_\delta + \overline{g}_{\delta\alpha} p_\gamma p_\beta + 2P^0 \frac{\partial^2P_0}{\partial \widetilde{g}^{\alpha\beta}\partial \widetilde{g}^{\gamma\delta}} - \frac{1}{2P^0}\frac{\partial(p_\alpha p_\beta p_\gamma p_\delta)}{\partial p_0} + \overline{g}^{00}\,\frac{p_\alpha p_\beta p_\gamma p_\delta}{2(P^0)^2}\right),\nonumber \end{align*}

whence

\begin{equation} \frac{\partial^2P_0}{\partial \widetilde{g}^{\alpha\beta}\partial \widetilde{g}^{\gamma\delta}} ={-} \frac{1}{2P^0}\,(\overline{g}_{\beta\gamma}p_\alpha p_\delta + \overline{g}_{\delta\alpha} p_\gamma p_\beta) + \frac{1}{4(P^0)^2}\frac{\partial(p_\alpha p_\beta p_\gamma p_\delta)}{\partial p_0} - \overline{g}^{00}\,\frac{p_\alpha p_\beta p_\gamma p_\delta}{4(P^0)^3}. \nonumber \end{equation}

Then, (9.72) yields

(9.75)

\begin{equation} H \approx H_0 + \alpha_{\alpha\beta}\,\widetilde{g}^{\alpha\beta} + \frac{1}{2}\,\widetilde{g}_{\alpha\beta} \wp^{\alpha\beta}{}_{\gamma\delta} \widetilde{g}^{\gamma\delta}, \end{equation}

where we introduced $\smash {H_0 = - P_0}$, $\smash {\alpha ^{\alpha \beta } = p^\alpha p^\beta /(2P^0)}$ and

(9.76)

\begin{equation} \wp^{\alpha\beta}{}_{\gamma\delta} = \frac{\delta^{\beta}_{\gamma} p^\alpha p_\delta + \delta_{\delta}^{\alpha} p^\beta p_\gamma}{2P^0} - \frac{1}{4(P^0)^2}\frac{\partial(p^\alpha p^\beta p_\gamma p_\delta)}{\partial p_0} + \overline{g}^{00}\,\frac{p^\alpha p^\beta p_\gamma p_\delta}{4(P^0)^3}. \end{equation}

9.4.2. Nonlinear potentials

Let us treat $\smash {\widetilde {{\boldsymbol {g}}}}$ as a 16-dimensional vector (Garg & Dodin Reference Garg and Dodin2021b), so $\smash {\alpha _{\alpha \beta }}$ serves as $\smash {{\boldsymbol {\alpha }}^{{\dagger}} }$ and $\smash {\wp ^{\alpha \beta }{}_{\gamma \delta }}$ serves as $\smash {{\boldsymbol {\wp }}}$. (Because these operators happen to be local in the $\smash {{\boldsymbol {\mathsf {x}}}}$ representation, here we do not distinguish them from their symbols.) Let us also introduce

(9.77)

\begin{equation} \mathfrak{E} \doteq p_\alpha p_\beta p_\gamma p_\delta {\mathsf{U}}^{\alpha\beta\gamma\delta} \end{equation}

and notice that $\smash {v^i \approx \dot {x}^i = p^i/p^0}$ (see (9.69)), so $\smash {\omega - {\boldsymbol {k}} \cdot {\boldsymbol {v}} = - k_\rho p^\rho /P^0}$ and $\smash {\delta (\omega - {\boldsymbol {k}} \cdot {\boldsymbol {v}})} = \smash {P^0\delta (k^\rho p_\rho )}$. Then, one finds from (6.77) that (Appendix B.8)

(9.78)

\begin{align} {{\boldsymbol {\mathsf{D}}}} & = \frac{\rm \pi}{4P^0} \int \mathrm{d}{\boldsymbol {\mathsf{k}}}\,{\boldsymbol{k}}{\boldsymbol{k}} \mathfrak{E}\,\delta(k^\rho p_\rho), \end{align}

(9.79)

\begin{align} {\boldsymbol{\Theta}} & = \frac{1}{4P^0}\frac{\partial}{\partial \vartheta} {\unicode{x2A0F}} \mathrm{d}{\boldsymbol {\mathsf{k}}} \left.\frac{{\boldsymbol{k}}{\boldsymbol{k}}\mathfrak{E}}{\vartheta P^0 - k^\rho p_\rho} \right|_{\vartheta=0}, \end{align}

(9.80)

\begin{align} \varDelta & =\frac{p_\alpha p_\beta}{2P^0}\int \mathrm{d}{\boldsymbol {\mathsf{k}}}\,{\mathsf{U}}^\alpha{}_\gamma{}^{\gamma\beta} - \frac{1}{8P^0} \frac{\partial}{\partial p_\lambda} {\unicode{x2A0F}} \mathrm{d}{\boldsymbol {\mathsf{k}}}\, \frac{k_\lambda\mathfrak{E}}{k^\rho p_\rho}. \end{align}

Equation (9.80) (where one takes $\smash {p_0 = P_0}$ after the differentiation) is in agreement with the result that was obtained for quasimonochromatic waves in Garg & Dodin (Reference Garg and Dodin2020). The derivation of the dispersion matrix $\smash {{\boldsymbol {\varXi }}}$ for relativistic gravitational interactions in matter is cumbersome, so it is not presented here, but see Garg & Dodin (Reference Garg and Dodin2022). The collision integral and fluctuations for relativistic gravitational interactions are straightforward to obtain from the general formulas presented in §§ 6.9 and 8. This can be used to describe QL interactions of gravitational waves, including not only the usual vacuum modesFootnote ⁴² but also waves coupled with matter, for example, the relativistic Jeans mode.

Also notice that the OC Hamiltonian $\smash {\mathcal {H} = H_0 + \varDelta }$ can be put in a covariant form as follows. Like in the original system (§ 9.4.1), $\smash {\mathcal {H}}$ determines the ponderomotively modified shell $\smash {p_0 = \mathcal {P}_0(t, {\boldsymbol {x}}, {\boldsymbol {p}})}$ via $\smash {\mathcal {H} = -\mathcal {P}_0}$. On one hand, the covariant OC Hamiltonian $\smash {\mathcal {H}'}$ vanishes on this shell,Footnote ⁴³ so it can be Taylor-expanded as follows:

(9.81)

\begin{equation} \mathcal{H}' \approx (p_0 - \mathcal{P}_0)\,\frac{\partial \mathcal{H}}{\partial p_0}\bigg|_{p = \mathcal{P}_0} \approx (p_0 - P_0 + \varDelta)\lambda, \qquad \lambda \doteq \frac{\partial {\mathsf{H}}}{\partial p_0}\bigg|_{p = P_0}. \end{equation}

On the other hand, it can also be represented as $\smash {\mathcal {H}' = {\mathsf {H}}(\overline {{\boldsymbol {g}}}, {\boldsymbol {\mathsf {p}}}) + \varDelta '}$ (here $\smash {\varDelta '}$ is the ponderomotive term yet to be found) and Taylor-expanded around the unperturbed shell $\smash {p_0 = P_0(t, {\boldsymbol {x}}, {\boldsymbol {p}})}$ as

(9.82)

\begin{equation} \mathcal{H}' \approx \varDelta' + (p_0 - P_0)\lambda = (p_0 - P_0 + \varDelta'/\lambda)\lambda. \end{equation}

By comparing (9.81) with (9.82), one finds that $\smash {\varDelta ' = \lambda \varDelta }$. Because $\smash {\lambda = P^0/m}$, this leads to the following covariant Hamiltonian for OCs:

(9.83)

$$\begin{gather} \displaystyle \mathcal{H}' = \frac{1}{2m}\left(m^2 + {g}_{\rm eff}^{\alpha\beta} p_\alpha p_\beta - \frac{1}{4} \frac{\partial}{\partial p_\lambda} {\unicode{x2A0F}} \mathrm{d}{\boldsymbol {\mathsf{k}}}\, \frac{k_\lambda\mathfrak{E}}{k^\rho p_\rho} \right), \end{gather}$$

(9.84)

$$\begin{gather}\displaystyle g_{\rm eff}^{\alpha\beta} \doteq \overline{g}^{\alpha\beta} + \int \mathrm{d}{\boldsymbol {\mathsf{k}}}\,{\mathsf{U}}^\alpha{}_\gamma{}^{\gamma\beta}. \end{gather}$$

9.4.3. Gauge invariance

As shown in Garg & Dodin (Reference Garg and Dodin2021a, Reference Garg and Dodinb) adiabatic QL interactions via gravitational waves (i.e. those determined by $\smash {{\boldsymbol {\Theta }}}$ and $\smash {\varDelta }$) can be formulated in a form invariant with respect to gauge transformations

(9.85)

\begin{equation} \widetilde{g}^{\alpha\beta} \to \widetilde{g}'^{\alpha\beta} = \widetilde{g}^{\alpha\beta} + \nabla^{(\alpha} \widetilde{\xi}^{\beta)}, \end{equation}

where $\smash {\nabla }$ is the covariant derivative associated with the background metric $\smash {\overline {{\boldsymbol {g}}}}$, $\smash {\widetilde {\xi }^\mu }$ is an arbitrary vector field and $\smash {\psi ^{(\alpha } \eta ^{\beta )}} \equiv \smash {(\psi ^{\alpha } \eta ^{\beta } + \psi ^{\beta } \eta ^{\alpha })/2}$. Let us show that this also extends to resonant interactions. Recall that within the assumed accuracy the nonlinear potentials are supposed to be calculated only to the zeroth order in the geometrical-optics parameter. Then, the modification of the average Wigner matrix of the metric oscillations under the transformation (9.85) can be written as

(9.86)

\begin{align} {\mathsf{U}}'^{\alpha\beta\gamma\delta} - {\mathsf{U}}^{\alpha\beta\gamma\delta} & = \text{symb}_{{\mathsf{x}}} \Big( \mathrm{i} \widehat{k}^{(\alpha}\, \overline{{\left. {\boldsymbol {\mathsf {|\widetilde{{\boldsymbol{\xi}^{\beta}}} }}} \right\rangle} {\left\langle {{\boldsymbol {\mathsf {{\widetilde{g}^{\gamma\delta}}|}}}} \right.}} -\mathrm{i} \overline{{\left. {\boldsymbol {\mathsf {|\widetilde{g}^{\alpha\beta} }}} \right\rangle} {\left\langle {{\boldsymbol {\mathsf {{\widetilde{\xi}^{(\gamma}}|}}}} \right.}}\, \widehat{k}^{\delta)} + \widehat{k}^{(\alpha} \overline{{\left. {\boldsymbol {\mathsf {|{\widetilde{\xi}^{\beta)}} }}} \right\rangle} {\left\langle {{\boldsymbol {\mathsf {{\widetilde{\xi}^{(\gamma}}|}}}} \right.}} \,\widehat{k}^{\delta)} \Big) \nonumber\\ & = \mathrm{i} k^{(\alpha} \mathcal{W}^{\beta)\gamma\delta} - \mathrm{i} k^{(\delta} \mathcal{W}^{\gamma)\alpha\beta *} + k^{(\alpha} {\mathsf{W}}_{\widetilde{\xi}}^{\beta)(\gamma} k^{\delta)}, \end{align}

where $\smash {\mathcal {W}^{\beta \gamma \delta } \doteq \overline {\text {symb}_{{\mathsf {x}}} {\left. {\boldsymbol {\mathsf {|{\widetilde {\xi }^{\beta }} }}} \right\rangle} {\left\langle {{\boldsymbol {\mathsf {{\widetilde {g}^{\gamma \delta }}|}}}} \right.}}}$ and $\smash {{\mathsf {W}}_{\widetilde {\xi }}^{\beta \gamma }}$ is the average Wigner matrix of $\smash {\widetilde {\xi }^\alpha }$. The corresponding change of $\smash {\mathfrak {E}}$ is

\[ \mathfrak{E}' - \mathfrak{E} = (k^\rho p_\rho) \Big( \mathrm{i} p_\beta p_\gamma p_\delta\mathcal{W}^{\beta\gamma\delta} - \mathrm{i} p_\alpha p_\beta p_\gamma \mathcal{W}^{\gamma\alpha\beta *} + k^\lambda p_\lambda p_\beta p_\gamma {\mathsf{W}}_{\widetilde{\xi}}^{\beta\gamma} \Big) \equiv (k^\rho p_\rho)A. \]

Then, the difference in the diffusion coefficients (9.78) is

(9.87)

\begin{equation} {{\boldsymbol {\mathsf{D}}}}' - {{\boldsymbol {\mathsf{D}}}} = \frac{\rm \pi}{4P^0} \int \mathrm{d}{\boldsymbol {\mathsf{k}}}\,{\boldsymbol{k}}{\boldsymbol{k}}\,\delta(k^\rho p_\rho)\,(k^\rho p_\rho)\,A = 0, \end{equation}

because $\smash {\delta (k^\rho p_\rho )\,(k^\rho p_\rho ) = 0}$. In particular, this rules out QL diffusion via ‘coordinate waves’.

9.4.4. Lorenz gauge and effective metric

Let us consider gravitational waves in the Lorenz gauge, $\smash {\nabla _\alpha \widetilde {g}^{\alpha \beta } = 0}$. In this case,

(9.88)

\begin{equation} k_\alpha {\mathsf{U}}^{\alpha\beta\gamma\delta} = k_\beta {\mathsf{U}}^{\alpha\beta\gamma\delta} = k_\gamma {\mathsf{U}}^{\alpha\beta\gamma\delta} = k_\delta {\mathsf{U}}^{\alpha\beta\gamma\delta} = 0, \end{equation}

and thus $\smash {k_\lambda \partial \mathfrak {E}/\partial p^\lambda = 0}$. Then,

(9.89)

\begin{equation} \frac{\partial}{\partial p_\lambda} {\unicode{x2A0F}} \mathrm{d}{\boldsymbol {\mathsf{k}}}\, \frac{k_\lambda\mathfrak{E}}{k^\rho p_\rho} = \frac{\partial}{\partial \vartheta}{\unicode{x2A0F}} \mathrm{d}{\boldsymbol {\mathsf{k}}}\left.\frac{(k_\lambda k^\lambda)\mathfrak{E}}{k^\rho p_\rho + \vartheta} \right|_{\vartheta = 0}. \end{equation}

This simplifies the expression (9.80) for $\smash {\varDelta }$ and (9.83) for $\smash {\mathcal {H}'}$. Furthermore, if the waves are not significantly affected by matter, so the vacuum dispersion $\smash {k_\lambda k^\lambda = 0}$ can be assumed, the term (9.89) vanishes completely. Then, (9.83) becomes

(9.90)

\begin{equation} \mathcal{H}' = \frac{1}{2m}\left(m^2 + g_{\rm eff}^{\alpha\beta} p_\alpha p_\beta\right) \end{equation}

and QL diffusion disappears, because particles cannot resonate with waves. This shows that the only average QL effect of vacuum gravitational waves on particles is the modification of the spacetime metric by $\smash {\int \mathrm {d}{\boldsymbol {\mathsf {k}}}\,{\mathsf {U}}_{\alpha \gamma }{}^{\gamma }{}_{\beta } = \mathcal {O}(\varepsilon ^2)}$. For quasimonochromatic waves, this effect is discussed in further detail in Garg & Dodin (Reference Garg and Dodin2020).

10. Summary

In summary, we have presented QLT for classical plasma interacting with inhomogeneous turbulence in the presence of background fields. Because we use the Weyl symbol calculus, global-mode decomposition is not invoked, so our formulation is local and avoids the usual issues with complex-frequency modes. Also, the particle Hamiltonian is kept general, so the results are equally applicable to relativistic, electromagnetic, and even non-electromagnetic (for example, gravitational) interactions. Because our approach is not bounded by the limitations of variational analysis either, effects caused by collisional and collisionless dissipation are also included naturally.

Our main results are summarized in §§ 5.6, 6.9, 7.6 and 8.5 and are as follows. Starting from the Klimontovich equation, we derive a Fokker–Planck model for the dressed OC distribution. This model captures QL diffusion, interaction with the background fields and ponderomotive effects simultaneously. The local diffusion coefficient is manifestly positive-semidefinite. Waves are allowed to be off-shell (not constrained by a dispersion relation), and a collision integral of the Balescu–Lenard type emerges in a form that is not restricted to any particular Hamiltonian. This operator conserves particles, momentum and energy, and it also satisfies the $\smash {H}$-theorem, as usual. As a spin-off, a general expression for the spectrum (average Wigner matrix) of microscopic fluctuations of the interaction field is derived. For on-shell waves, which satisfy a QL WKE, our theory conserves the momentum and energy of the wave–plasma system. Dewar's OC QLT of electrostatic turbulence (Dewar Reference Dewar1973) is proven formally as a particular case and given a concise formulation. Also discussed as examples are relativistic electromagnetic and gravitational interactions, and QLT for gravitational waves is proposed.

Aside from having the aesthetic appeal of a rigorous local theory, our formulation can help, for example, better understand and model QL plasma heating and current drive. First of all, it systematically accounts for the wave-driven evolution of the non-resonant particle distribution and for the ponderomotive effects caused by plasma inhomogeneity in both time and space. As discussed above (§ 7.5), this is generally important for adequately calculating the energy–momentum transfer between waves and plasma even when resonant absorption per se occurs in a homogeneous-plasma region. Second, our formulation provides general formulas that equally hold in any canonical variables and for any Hamiltonians that satisfy our basic assumptions (§ 3.1). Therefore, our results can be applied to various plasma models immediately. This eliminates the need for ad hoc calculations, which can be especially cumbersome beyond the homogeneous-plasma approximation. Discussing specific models of applied interest, however exciting, is beyond the scope of this paper and is left to future work.

Acknowledgements

Editor Alex Schekochihin and the author thank the referees for their advice in evaluating this article.

Funding

This work was supported by the U.S. DOE through Contract DE-AC02-09CH11466. It is also based upon the work supported by National Science Foundation under the grant No. PHY 1903130.

Declaration of interests

The author reports no conflict of interest.

Appendix A. Average Wigner matrices

A.1 Positive-semidefinitness

As known from Cartwright (Reference Cartwright1976), the average Wigner function of any scalar field on the real axis is non-negative if the averaging is done over a sufficiently large phase-space volume. Here, we extend this theorem to average Wigner matrices of vector fields in a multi-dimensional space, $\smash {{\boldsymbol {\psi }}({\boldsymbol {\mathsf {x}}})}$, and show that such matrices are positive-semidefinite.

For any given function $\smash {h({\boldsymbol {z}}) \equiv h({\boldsymbol {\mathsf {x}}}, {\boldsymbol {\mathsf {k}}})}$, we define its local phase-space average as the following convolution integral:Footnote ⁴⁴

(A1)

\begin{equation} \overline{h}({\boldsymbol{z}}) \doteq \int \mathrm{d}{\boldsymbol{z}}'\, \mathcal{G}({\boldsymbol{z}} - {\boldsymbol{z}}')\,h({\boldsymbol{z}}') \equiv \int \mathrm{d}{\boldsymbol {\mathsf{x}}}'\,\mathrm{d}{\boldsymbol {\mathsf{k}}}'\, \mathcal{G}({\boldsymbol {\mathsf{x}}} - {\boldsymbol {\mathsf{x}}}', {\boldsymbol {\mathsf{k}}} - {\boldsymbol {\mathsf{k}}}')\,h({\boldsymbol {\mathsf{x}}}', {\boldsymbol {\mathsf{k}}}') \end{equation}

with a Gaussian window function

(A2)

\begin{equation} \mathcal{G}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}) \doteq \frac{1}{(2{\rm \pi} \sigma_{{\mathsf{x}}} \sigma_{{\mathsf{k}}})^{{\mathsf{n}}}}\, \exp\left(-\frac{|{\boldsymbol {\mathsf{x}}}|^2}{2\sigma_{{\mathsf{x}}}^2} - \frac{|{\boldsymbol {\mathsf{k}}}|^2}{2\sigma_{{\mathsf{k}}}^2}\right) \end{equation}

and positive constants $\smash {\sigma _{{\mathsf {x}}}}$ and $\smash {\sigma _{{\mathsf {k}}}}$ yet to be specified. Unlike in § 2.1.1, the following notation will be assumed for the ‘scalar product’ for variables with upper, lower and mixed indices:

(A3)

\begin{equation} {\boldsymbol {\mathsf{x}}}' \cdot {\boldsymbol {\mathsf{x}}}'' \doteq \delta_{ij} {\mathsf{x}}'^i {\mathsf{x}}''^j, \qquad {\boldsymbol {\mathsf{k}}}' \cdot {\boldsymbol {\mathsf{k}}}'' \doteq \delta^{ij} {\mathsf{k}}'_i {\mathsf{k}}''_j, \qquad {\boldsymbol {\mathsf{k}}} \cdot {\boldsymbol {\mathsf{x}}} \doteq {\mathsf{k}}_i {\mathsf{x}}^i. \end{equation}

(The Latin indices in this appendix range from 0 to $\smash {n}$, $\smash {\delta _{ij}}$ and $\smash {\delta ^{ij}}$ are unit matrices, and summation over repeating indices is assumed.) In particular, note that $\smash {|{\boldsymbol {\mathsf {x}}}|^2 \doteq {\boldsymbol {\mathsf {x}}} \cdot {\boldsymbol {\mathsf {x}}} \geqslant 0}$ and must not be confused with the squared spacetime interval, which can have either sign. Likewise, $\smash {|{\boldsymbol {\mathsf {k}}}|^2 \doteq {\boldsymbol {\mathsf {k}}} \cdot {\boldsymbol {\mathsf {k}}} \geqslant 0}$ must not be confused with $\smash {{\mathsf {k}}_i{\mathsf {k}}^i = -\omega ^2 + {\boldsymbol {k}}^2}$.

The average Wigner matrix of any given vector field $\smash {{\boldsymbol {\psi }}}$ is given by

(A4)

\begin{align} \overline{{{\boldsymbol {\mathsf{W}}}}}_{{\boldsymbol{\psi}}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}) = \frac{1}{(2{\rm \pi})^{{\mathsf{n}}}}\frac{1}{(2{\rm \pi} \sigma_{{\mathsf{x}}} \sigma_{{\mathsf{k}}})^{{\mathsf{n}}}} \int &\mathrm{d}{\boldsymbol {\mathsf{s}}}\,\mathrm{d}{\boldsymbol {\mathsf{x}}}'\,\mathrm{d}{\boldsymbol {\mathsf{k}}}'\, {\boldsymbol{\psi}}({\boldsymbol {\mathsf{x}}}' + {\boldsymbol {\mathsf{s}}}/2) {\boldsymbol{\psi}}^{{\dagger}}({\boldsymbol {\mathsf{x}}}' - {\boldsymbol {\mathsf{s}}}/2) \nonumber\\ &\times \exp\left(-\frac{|{\boldsymbol {\mathsf{x}}} - {\boldsymbol {\mathsf{x}}}'|^2}{2\sigma_{{\mathsf{x}}}^2} - \frac{|{\boldsymbol {\mathsf{k}}} - {\boldsymbol {\mathsf{k}}}'|^2}{2\sigma_{{\mathsf{k}}}^2}-\mathrm{i} {\boldsymbol {\mathsf{k}}}' \cdot {\boldsymbol {\mathsf{s}}}\right). \end{align}

The integral over $\smash {{\boldsymbol {\mathsf {k}}}'}$ can be readily taken:

(A5)

\begin{equation} \int \mathrm{d}{\boldsymbol {\mathsf{k}}}'\,\exp\left( - \frac{|{\boldsymbol {\mathsf{k}}} - {\boldsymbol {\mathsf{k}}}'|^2}{2\sigma_{{\mathsf{k}}}^2}-\mathrm{i} {\boldsymbol {\mathsf{k}}}' \cdot {\boldsymbol {\mathsf{s}}}\right) = (2{\rm \pi})^{{{\mathsf{n}}}/2} \sigma_{{\mathsf{k}}}^{{\mathsf{n}}} \exp\left(- \frac{\sigma_{{\mathsf{k}}}^2|{\boldsymbol {\mathsf{s}}}|^2}{2}-\mathrm{i} {\boldsymbol {\mathsf{k}}}\cdot{\boldsymbol {\mathsf{s}}}\right). \end{equation}

Then, using the variables $\smash {{\boldsymbol {\mathsf {x}}}_{1,2} \doteq {\boldsymbol {\mathsf {x}}}' \pm {\boldsymbol {\mathsf {s}}}/2}$, one can rewrite (A4) as follows:

(A6)

$$\begin{gather} \overline{{{\boldsymbol {\mathsf{W}}}}}_{{\boldsymbol{\psi}}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}) = \frac{1}{(2{\rm \pi})^{3{{\mathsf{n}}}/2}\sigma_{{\mathsf{x}}}^{{\mathsf{n}}}} \int \mathrm{d}{\boldsymbol {\mathsf{x}}}_1\,\mathrm{d}{\boldsymbol {\mathsf{x}}}_2\,{\boldsymbol{\psi}}({\boldsymbol {\mathsf{x}}}_1) {\boldsymbol{\psi}}^{{\dagger}}({\boldsymbol {\mathsf{x}}}_2)\,\mathrm{e}^{-\phi}, \end{gather}$$

(A7)

$$\begin{gather}\phi = \frac{|{\boldsymbol {\mathsf{x}}} - ({\boldsymbol {\mathsf{x}}}_1 + {\boldsymbol {\mathsf{x}}}_2)/2|^2}{2\sigma_{{\mathsf{x}}}^2} + \frac{\sigma_{{\mathsf{k}}}^2|{\boldsymbol {\mathsf{x}}}_1 - {\boldsymbol {\mathsf{x}}}_2|^2}{2} + \mathrm{i} {\boldsymbol {\mathsf{k}}}\cdot ({\boldsymbol {\mathsf{x}}}_1 - {\boldsymbol {\mathsf{x}}}_2). \end{gather}$$

The function $\smash {\phi }$ can also be expressed as $\smash {\phi = |{\boldsymbol {\mathsf {x}}}|^2/(2\sigma _{{\mathsf {x}}}^2) + \phi ({\boldsymbol {\mathsf {x}}}_1) + \phi ^*({\boldsymbol {\mathsf {x}}}_2) - \lambda {\boldsymbol {\mathsf {x}}}_1 \cdot {\boldsymbol {\mathsf {x}}}_2}$, where

(A8)

\begin{equation} \phi({\boldsymbol {\mathsf{y}}}) \doteq \frac{|{\boldsymbol {\mathsf{y}}}|^2}{2}\left(\frac{1}{4\sigma_{{\mathsf{x}}}^2} + \sigma_{{\mathsf{k}}}^2\right) - \frac{{\boldsymbol {\mathsf{x}}} \cdot {\boldsymbol {\mathsf{y}}}}{2\sigma_{{\mathsf{x}}}^2} + \mathrm{i} {\boldsymbol {\mathsf{k}}} \cdot {\boldsymbol {\mathsf{y}}} \end{equation}

and $\smash {\lambda \doteq \sigma _{{\mathsf {k}}}^2 - (4\sigma _{{\mathsf {x}}}^2)^{-1}}$. Then, using $\smash {{\boldsymbol {\xi }}({\boldsymbol {\mathsf {y}}}) \doteq {\boldsymbol {\psi }}({\boldsymbol {\mathsf {y}}})\mathrm {e}^{-\phi ({\boldsymbol {\mathsf {y}}})}}$, one obtains from (A6) that

(A9)

\begin{equation} \overline{{{\boldsymbol {\mathsf{W}}}}}_{{\boldsymbol{\psi}}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}) = \frac{\mathrm{e}^{-|{\boldsymbol {\mathsf{x}}}|^2/(2\sigma_{{\mathsf{x}}}^2)}}{(2{\rm \pi})^{3{{\mathsf{n}}}/2}\sigma_{{\mathsf{x}}}^{{\mathsf{n}}}} \int \mathrm{d}{\boldsymbol {\mathsf{x}}}_1\,\mathrm{d}{\boldsymbol {\mathsf{x}}}_2\,{\boldsymbol{\xi}}({\boldsymbol {\mathsf{x}}}_1) {\boldsymbol{\xi}}^{{\dagger}}({\boldsymbol {\mathsf{x}}}_2)\,\mathrm{e}^{\lambda {\boldsymbol {\mathsf{x}}}_1 \cdot {\boldsymbol {\mathsf{x}}}_2}. \end{equation}

By Taylor-expanding $\smash {\mathrm {e}^{\lambda {\boldsymbol {\mathsf {x}}}_1 \cdot {\boldsymbol {\mathsf {x}}}_2}}$, one obtains

(A10)

where $\smash {{{\boldsymbol {\mathsf {J}}}}_m \doteq \int \mathrm {d}{\boldsymbol {\mathsf {x}}}_1\,\mathrm {d}{\boldsymbol {\mathsf {x}}}_2\,({\boldsymbol {\mathsf {x}}}_1 \cdot {\boldsymbol {\mathsf {x}}}_2)^m {\boldsymbol {\xi }}({\boldsymbol {\mathsf {x}}}_1) {\boldsymbol {\xi }}^{{\dagger}} ({\boldsymbol {\mathsf {x}}}_2)}$. Note that

(A11)

\begin{equation} ({\boldsymbol {\mathsf{x}}}_1 \cdot {\boldsymbol {\mathsf{x}}}_2)^m = \sum_{{\boldsymbol{\mu}}(m)} \prod_{i=1}^{{{\mathsf{n}}}} ({\mathsf{x}}_1^i{\mathsf{x}}^i_2)^{m_i}, \end{equation}

where the summation is performed over all combinations ${\boldsymbol {\mu }}(m) \equiv \{m_1, m_2, \ldots, m_{{\mathsf {n}}}\}$ of integers $\smash {m_i \geqslant 0}$ such that $\smash {\sum _i m_i = m}$. Thus,

(A12)

\begin{equation} {{\boldsymbol {\mathsf{J}}}}_m = \sum_{{\boldsymbol{\mu}}(m)} {\boldsymbol{\mathcal{J}}}_{{\boldsymbol{\mu}}}{\boldsymbol{\mathcal{J}}}^{{\dagger}}_{{\boldsymbol{\mu}}}, \qquad {\boldsymbol{\mathcal{J}}}_{{\boldsymbol{\mu}}} = \int \mathrm{d}{\boldsymbol {\mathsf{y}}}\,{\boldsymbol{\xi}}({\boldsymbol {\mathsf{y}}})\prod_{i=1}^{{{\mathsf{n}}}}({\mathsf{y}}^i)^{m_i}. \end{equation}

Because each $\smash {{{\boldsymbol {\mathsf {J}}}}_m}$ is positive-semidefinite, the Wigner matrix $\smash {\overline {{{\boldsymbol {\mathsf {W}}}}}_{{\boldsymbol {\psi }}}}$ is positive-semidefinite when $\smash {\lambda \geqslant 0}$, or equivalently, when $\smash {\sigma _{{\mathsf {x}}}\sigma _{{\mathsf {k}}} > 1/2}$. This condition is assumed to be satisfied for the phase-space averaging of $\smash {{{\boldsymbol {\mathsf {W}}}}_{{\boldsymbol {\psi }}}}$ used in the main text. Loosely, this means that the averaging is done over the phase-space volume $\smash {\varDelta {\boldsymbol {\mathsf {x}}}\,\varDelta {\boldsymbol {\mathsf {k}}} \sim (\sigma _{{\mathsf {x}}}\sigma _{{\mathsf {k}}})^{{\mathsf {n}}} \gtrsim 1}$.

A.2 Invariant limit for eikonal fields

For eikonal fields (7.17), one has

(A13)

\begin{equation} {\boldsymbol{\psi}}({\boldsymbol {\mathsf{x}}} + {\boldsymbol {\mathsf{s}}}/2) {\boldsymbol{\psi}}^{{\dagger}}({\boldsymbol {\mathsf{x}}} - {\boldsymbol {\mathsf{s}}}/2) \approx \big({\boldsymbol{A}}({\boldsymbol {\mathsf{x}}})\,\mathrm{e}^{\mathrm{i}\overline{{\boldsymbol {\mathsf{k}}}}({\boldsymbol {\mathsf{x}}}) \cdot {\boldsymbol {\mathsf{s}}}} + \text{c.c.}\big) + \big({\boldsymbol{B}}({\boldsymbol {\mathsf{x}}})\,\mathrm{e}^{2\mathrm{i}\theta({\boldsymbol {\mathsf{x}}})} + \text{c.c.}\big). \end{equation}

Here, $\smash {{\boldsymbol {A}} \doteq {\boldsymbol {\eta }}{\boldsymbol {\eta }}^{{\dagger}} |{\breve {a}}|^2/4}$, $\smash {{\boldsymbol {B}} \doteq {\boldsymbol {\eta }}{\boldsymbol {\eta }}^\intercal {\breve {a}}^2/4}$, ‘c.c.’ stands for complex conjugate, we used the linear approximation $\smash {\theta ({\boldsymbol {\mathsf {x}}} \pm {\boldsymbol {\mathsf{s}}}/2) \approx \theta ({\boldsymbol {\mathsf {x}}}) \pm \overline {{\boldsymbol {\mathsf {k}}}}({\boldsymbol {\mathsf {x}}})\cdot {\boldsymbol {\mathsf {s}}}/2}$, with $\smash {\overline {{\boldsymbol {\mathsf {k}}}} \equiv (-\overline {\omega }, \overline {{\boldsymbol {k}}})}$. Then,

(A14)

\begin{equation} {{\boldsymbol {\mathsf{W}}}}_{{\boldsymbol{\psi}}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}) \approx {\boldsymbol{A}}({\boldsymbol {\mathsf{x}}})\,\delta({\boldsymbol {\mathsf{k}}} - \overline{{\boldsymbol {\mathsf{k}}}}({\boldsymbol {\mathsf{x}}})) + {\boldsymbol{A}}^*({\boldsymbol {\mathsf{x}}})\,\delta({\boldsymbol {\mathsf{k}}} + \overline{{\boldsymbol {\mathsf{k}}}}({\boldsymbol {\mathsf{x}}})) + 2\operatorname{re}\big({\boldsymbol{B}}({\boldsymbol {\mathsf{x}}})\mathrm{e}^{2\mathrm{i}\theta({\boldsymbol {\mathsf{x}}})}\delta({\boldsymbol {\mathsf{k}}})\big). \end{equation}

Let us adopt $\smash {\sigma _{{\mathsf {x}}} \ll l_c}$, where $\smash {l_c}$ is the least characteristic scale of $\smash {{\breve {a}}}$, $\smash {{\boldsymbol {\eta }}}$ and $\smash {\overline {{\boldsymbol {\mathsf {k}}}}}$. Then,

(A15)

\begin{equation} \overline{{{\boldsymbol {\mathsf{W}}}}}_{{\boldsymbol{\psi}}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}) \approx {\boldsymbol{A}}({\boldsymbol {\mathsf{x}}})\,\mathcal{G}_{{\mathsf{k}}}({\boldsymbol {\mathsf{k}}} - \overline{{\boldsymbol {\mathsf{k}}}}) + {\boldsymbol{A}}^*({\boldsymbol {\mathsf{x}}})\,\mathcal{G}_{{\mathsf{k}}}({\boldsymbol {\mathsf{k}}} + \overline{{\boldsymbol {\mathsf{k}}}}) + 2\operatorname{re}\big({\boldsymbol{B}}({\boldsymbol {\mathsf{x}}})\mathcal{G}_{{\mathsf{k}}}({\boldsymbol {\mathsf{k}}})\zeta \mathrm{e}^{2\mathrm{i}\theta({\boldsymbol {\mathsf{x}}})}\big). \end{equation}

Here, $\smash {\mathcal {G}_{{\mathsf {k}}}({\boldsymbol {\mathsf {k}}})}$ are normalized Gaussians that can be replaced with delta functions if $\smash {\sigma _{{\mathsf {k}}}}$ is small compared with any scale of interest in the $\smash {{\boldsymbol {\mathsf {k}}}}$ space:

(A16)

\begin{equation} \mathcal{G}_{{\mathsf{k}}}({\boldsymbol {\mathsf{k}}}) \doteq \frac{1}{(\sqrt{2{\rm \pi}}\sigma_{{\mathsf{k}}})^{{\mathsf{n}}}}\, \exp\left( - \frac{|{\boldsymbol {\mathsf{k}}}|^2}{2\sigma_{{\mathsf{k}}}^2}\right) \to \delta({\boldsymbol {\mathsf{k}}}). \end{equation}

Also, the function

(A17)

\begin{equation} \zeta \approx \frac{1}{(\sqrt{2{\rm \pi}}\sigma_{{\mathsf{x}}})^{{\mathsf{n}}}} \int \mathrm{d}{\boldsymbol {\mathsf{x}}}'\, \exp\left(-\frac{|{\boldsymbol {\mathsf{x}}}' - {\boldsymbol {\mathsf{x}}}|^2}{2\sigma_{{\mathsf{x}}}^2} + 2\mathrm{i}\overline{{\boldsymbol {\mathsf{k}}}}({\boldsymbol {\mathsf{x}}}) \cdot ({\boldsymbol {\mathsf{x}}}' - {\boldsymbol {\mathsf{x}}})\right) = \mathrm{e}^{- 2|\overline{{\boldsymbol {\mathsf{k}}}}({\boldsymbol {\mathsf{x}}})|^2 \sigma_{{\mathsf{x}}}^2} \end{equation}

can be made exponentially small by adopting $\smash {\sigma _{{\mathsf {x}}} \gg |\overline {{\boldsymbol {\mathsf {k}}}}|^{-1}}$.Footnote ⁴⁵ In this limit, the average Wigner matrix of an eikonal field is independent of $\smash {\sigma _{{\mathsf {x}}}}$ and $\smash {\sigma _{{\mathsf {k}}}}$:

(A18)

\begin{equation} \overline{{{\boldsymbol {\mathsf{W}}}}}_{{\boldsymbol{\psi}}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}) \approx {\boldsymbol{A}}({\boldsymbol {\mathsf{x}}})\,\delta({\boldsymbol {\mathsf{k}}} - \overline{{\boldsymbol {\mathsf{k}}}}({\boldsymbol {\mathsf{x}}})) + {\boldsymbol{A}}^*({\boldsymbol {\mathsf{x}}})\,\delta({\boldsymbol {\mathsf{k}}} + \overline{{\boldsymbol {\mathsf{k}}}}({\boldsymbol {\mathsf{x}}})). \end{equation}

This $\smash {\overline {{{\boldsymbol {\mathsf {W}}}}}_{{\boldsymbol {\psi }}}}$ is also Hermitian and positive-semidefinite (in agreement with the general theory from § A.1), because so are $\smash {{\boldsymbol {A}}}$ and $\smash {{\boldsymbol {A}}^*}$. The same properties pertain to the Wigner matrix of an ensemble of randomly phased eikonal fields, because it equals the sum of the Wigner matrices of the individual components (see also § 7.4).

Appendix B. Auxiliary proofs

B.1 Proof of (2.53)

Like in the case of (2.45), one finds that

(B1)

\begin{align} (\widehat{\boldsymbol{{\boldsymbol {\mathsf{L}}}}}{\boldsymbol{\psi}}({\boldsymbol {\mathsf{x}}}))^i (\widehat{\boldsymbol{{\boldsymbol {\mathsf{R}}}}}{\boldsymbol{\psi}}({\boldsymbol {\mathsf{x}}}))^{*j} & = \left\langle{{\boldsymbol {\mathsf{x}}}|\widehat{{\mathsf{L}}\,}^i{}_{i'}|\psi^{i'}}\right\rangle \left\langle{\psi^{j'}|(\widehat{{\mathsf{R}}}^j{}_{j'})^{{\dagger}}|{\boldsymbol {\mathsf{x}}}}\right\rangle \nonumber\\ & = (2{\rm \pi})^{{\mathsf{n}}} \left\langle{{\boldsymbol {\mathsf{x}}}|\widehat{{\mathsf{L}\,}}^i{}_{i'}\widehat{{\mathsf{W}}}_{{\boldsymbol{\psi}}}^{i'j'}(\widehat{{\mathsf{R}}}^{{\dagger}})_{j'}{}^j|{\boldsymbol {\mathsf{x}}}}\right\rangle \nonumber\\ & = (2{\rm \pi})^{{\mathsf{n}}} \left\langle{{\boldsymbol {\mathsf{x}}}|(\widehat{\boldsymbol{{\boldsymbol {\mathsf{L}\,}}}}\widehat{\boldsymbol{{\boldsymbol {\mathsf{W}}}}}_\psi\smash{\widehat{\boldsymbol{{\boldsymbol {\mathsf{R}}}}}}^{{\dagger}})^{ij}|{\boldsymbol {\mathsf{x}}}}\right\rangle \nonumber\\ & = \textstyle \int \mathrm{d}{\boldsymbol {\mathsf{k}}}\, \big({{\boldsymbol {\mathsf{L}}}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}) \star {{\boldsymbol {\mathsf{W}}}}_{{\boldsymbol{\psi}}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}}) \star {{\boldsymbol {\mathsf{R}}}}^{{\dagger}}({\boldsymbol {\mathsf{x}}}, {\boldsymbol {\mathsf{k}}})\big)^{ij}. \end{align}

This proves (2.53a). At $\smash {\epsilon \to 0}$, when $\smash {\star }$ becomes the usual product, (B1) gives

(B2)

\begin{equation} (\widehat{\boldsymbol{{\boldsymbol {\mathsf{L}}}}}{\boldsymbol{\psi}})(\widehat{\boldsymbol{{\boldsymbol {\mathsf{R}}}}}{\boldsymbol{\psi}})^{{\dagger}} = \textstyle \int \mathrm{d}{\boldsymbol {\mathsf{k}}}\,{{\boldsymbol {\mathsf{L}}}} {{\boldsymbol {\mathsf{W}}}}_{{\boldsymbol{\psi}}} {{\boldsymbol {\mathsf{R}}}}^{{\dagger}}, \end{equation}

and in particular, taking the trace of (B2) yields

(B3)

\begin{equation} \textstyle (\widehat{\boldsymbol{{\boldsymbol {\mathsf{R}}}}}{\boldsymbol{\psi}})^{{\dagger}} (\widehat{\boldsymbol{{\boldsymbol {\mathsf{L}}}}}{\boldsymbol{\psi}}) = \operatorname{tr} \big((\widehat{\boldsymbol{{\boldsymbol {\mathsf{L}}}}}{\boldsymbol{\psi}})(\widehat{\boldsymbol{{\boldsymbol {\mathsf{R}}}}}{\boldsymbol{\psi}})^{{\dagger}}\big) = \int \mathrm{d}{\boldsymbol {\mathsf{k}}}\,\operatorname{tr}({{\boldsymbol {\mathsf{L}}}} {{\boldsymbol {\mathsf{W}}}}_{{\boldsymbol{\psi}}} {{\boldsymbol {\mathsf{R}}}}^{{\dagger}}) = \int \mathrm{d}{\boldsymbol {\mathsf{k}}}\,\operatorname{tr}({{\boldsymbol {\mathsf{W}}}}_{{\boldsymbol{\psi}}} {{\boldsymbol {\mathsf{Q}}}}). \end{equation}

Here, $\smash {{{\boldsymbol {\mathsf {Q}}}} \doteq {{\boldsymbol {\mathsf {R}}}}^{{\dagger}} {{\boldsymbol {\mathsf {L}}}}}$, and we used that $\smash {\operatorname {tr}({{\boldsymbol {\mathsf {A}}}}{{\boldsymbol {\mathsf {B}}}}) = \operatorname {tr}({{\boldsymbol {\mathsf {B}}}}{{\boldsymbol {\mathsf {A}}}})}$ for any matrices $\smash {{{\boldsymbol {\mathsf {A}}}}}$ and $\smash {{{\boldsymbol {\mathsf {B}}}}}$.

For real fields, one can also replace the integrand with

(B4)

\begin{equation} \operatorname{tr}\big({{\boldsymbol {\mathsf{W}}}}_{{\boldsymbol{\psi}}}^* {{\boldsymbol {\mathsf{Q}}}}^*\big) = \operatorname{tr}\big({{\boldsymbol {\mathsf{Q}}}}^{{\dagger}}{{\boldsymbol {\mathsf{W}}}}_{{\boldsymbol{\psi}}}^{{\dagger}} \big) = \operatorname{tr}\big({{\boldsymbol {\mathsf{Q}}}}^{{\dagger}}{{\boldsymbol {\mathsf{W}}}}_{{\boldsymbol{\psi}}}\big) = \operatorname{tr}\big({{\boldsymbol {\mathsf{W}}}}_{{\boldsymbol{\psi}}}{{\boldsymbol {\mathsf{Q}}}}^{{\dagger}}\big), \end{equation}

where we used $\smash {\operatorname {tr}{{\boldsymbol {\mathsf {A}}}}^\intercal = \operatorname {tr}{{\boldsymbol {\mathsf {A}}}}}$, $\smash {({{\boldsymbol {\mathsf {A}}}}{{\boldsymbol {\mathsf {B}}}})^\intercal = {{\boldsymbol {\mathsf {B}}}}^\intercal {{\boldsymbol {\mathsf {A}}}}^\intercal }$, $\smash {{{\boldsymbol {\mathsf {W}}}}_{{\boldsymbol {\psi }}}^{{\dagger}} = {{\boldsymbol {\mathsf {W}}}}_{{\boldsymbol {\psi }}}}$ and, again, $\smash {\operatorname {tr}({{\boldsymbol {\mathsf {A}}}}{{\boldsymbol {\mathsf {B}}}}) = \operatorname {tr}({{\boldsymbol {\mathsf {B}}}}{{\boldsymbol {\mathsf {A}}}})}$, respectively. In summary then,

(B5)

\begin{equation} \textstyle (\widehat{\boldsymbol{{\boldsymbol {\mathsf{R}}}}}{\boldsymbol{\psi}})^{{\dagger}} (\widehat{\boldsymbol{{\boldsymbol {\mathsf{L}}}}}{\boldsymbol{\psi}}) = \int \mathrm{d}{\boldsymbol {\mathsf{k}}}\,\operatorname{tr}({{\boldsymbol {\mathsf{W}}}}_{{\boldsymbol{\psi}}} {{\boldsymbol {\mathsf{Q}}}}) = \int \mathrm{d}{\boldsymbol {\mathsf{k}}}\,\operatorname{tr}\big({{\boldsymbol {\mathsf{W}}}}_{{\boldsymbol{\psi}}}{{\boldsymbol {\mathsf{Q}}}}^{{\dagger}}\big), \end{equation}

so the anti-Hermitian part of $\smash {{{\boldsymbol {\mathsf {Q}}}}}$ does not contribute to the integrals. Thus,

(B6)

Because $\smash {\widehat {\boldsymbol {{\boldsymbol {\mathsf {L}}}}}}$ and $\smash {\widehat {\boldsymbol {{\boldsymbol {\mathsf {R}}}}}}$ are arbitrary, they can as well be swapped; then one obtains (2.53c).

B.2 Proof of (4.28)

Suppose the dominant term in ${\boldsymbol {\mu }}$ in (4.16) has the form ${\boldsymbol {\mu }}_h \tau ^h$, where $h$ is a natural number and $\smash {\mu _h = \mathcal {O}(\epsilon ^2)}$. (Here, $h$ is a power index, so no summation over $h$ is assumed.) Let us Taylor-expand $\mathcal {J}[A, G]$ in $\smash {{\boldsymbol {\mu }}_h}$:

(B7)

\begin{align} \mathcal{J}[A, G] & - \mathcal{J}[A, G_0] - \mathcal{O}(\epsilon^2) \nonumber\\ &\quad \approx {\boldsymbol{\mu}}_h \cdot \frac{\partial}{\partial {\boldsymbol{\mu}}_h}\left( \int \mathrm{d}{\boldsymbol{K}}\,A({\boldsymbol{X}}, {\boldsymbol{K}}) \lim_{\nu\to 0+} \int_0^{\infty}\mathrm{d}\tau\,\mathrm{e}^{-\nu \tau + \mathrm{i} \varOmega \tau + \mathrm{i} {\boldsymbol{K}} \cdot {\boldsymbol{\mu}}_h \tau^h} \right)_{{\boldsymbol{\mu}}_h = {\boldsymbol{0}}} \nonumber\\ &\quad \approx {\boldsymbol{\mu}}_h \cdot \int \mathrm{d}{\boldsymbol{K}}\,A({\boldsymbol{X}}, {\boldsymbol{K}}) \lim_{\nu \to 0+} \frac{\partial}{\partial {\boldsymbol{\mu}}_h} \left( \int_0^{\infty} \mathrm{d}\tau\,\mathrm{e}^{-\nu \tau + \mathrm{i}\varOmega \tau + \mathrm{i} {\boldsymbol{K}} \cdot {\boldsymbol{\mu}}_h \tau^h} \right)_{{\boldsymbol{\mu}}_h = {\boldsymbol{0}}} \nonumber\\ &\quad \approx \mathrm{i} {\boldsymbol{\mu}}_h \cdot \int \mathrm{d}{\boldsymbol{K}}\,{\boldsymbol{K}} A({\boldsymbol{X}}, {\boldsymbol{K}}) \lim_{\nu \to 0+}\int_0^{\infty} \mathrm{d}\tau\,\tau^h \mathrm{e}^{-\nu \tau + \mathrm{i} \varOmega \tau} \nonumber\\ &\quad \approx \mathrm{i}^{1-h} {\boldsymbol{\mu}}_h \cdot \int \mathrm{d}{\boldsymbol{K}}\,{\boldsymbol{K}} A({\boldsymbol{X}}, {\boldsymbol{K}}) \,\frac{\mathrm{d}^h G_0(\varOmega ({\boldsymbol{X}}, {\boldsymbol{K}}))}{\mathrm{d}\varOmega^h} \nonumber\\ &\quad \approx \mathrm{i}^{1-h} {\boldsymbol{\mu}}_h \cdot \frac{\eth^h}{\partial \varOmega^h} \int \mathrm{d}{\boldsymbol{K}}\,{\boldsymbol{K}} A({\boldsymbol{X}}, {\boldsymbol{K}}) G_0(\varOmega({\boldsymbol{X}}, {\boldsymbol{K}})). \end{align}

Provided that $A$ is sufficiently smooth and well behaved, the overall coefficient here is $\mathcal {O}(1)$, so $\smash {\mathcal {J}[A, G] - \mathcal {J}[A, G_0] = \mathcal {O}(\mu _h) + \mathcal {O}(\epsilon ^2)}$. Because $\mu _h = \mathcal {O}(\epsilon ^2)$, this proves (4.28).

B.3 Proof of (5.4)

Here, we show that

(B8)

\begin{align} \text {symb}_{{\mathsf {x}}}(&\widehat{u\,}^{\alpha}\widehat{G\,}\widehat{u\,}^{\beta})\nonumber\\ & = \frac{1}{(2{\rm \pi})^N} \int \mathrm{d}{\boldsymbol{S}}\, \mathrm{e}^{-\mathrm{i} {\boldsymbol{K}} \cdot {\boldsymbol{S}}} \left\langle{{\boldsymbol{X}} + {\boldsymbol{S}}/2 | \widehat{u\,}^{\alpha}\widehat{G\,}\widehat{u\,}^{\beta} | {\boldsymbol{X}} - {\boldsymbol{S}}/2}\right\rangle \nonumber\\ & = \frac{1}{(2{\rm \pi})^N} \int \mathrm{d}{\boldsymbol{X}}'\,\mathrm{d}{\boldsymbol{K}}''\,\mathrm{d}{\boldsymbol{K}}'\,\mathrm{d}{\boldsymbol{S}}\,\mathrm{d}{\boldsymbol{S}}'\, {W}_{{\boldsymbol{u}}}^{\alpha\beta}({\boldsymbol{X}}', {\boldsymbol{K}}') G({\boldsymbol{X}}', {\boldsymbol{K}}'')\, \mathrm{e}^{-\mathrm{i} {\boldsymbol{K}} \cdot {\boldsymbol{S}} + \mathrm{i} ({\boldsymbol{K}}' + {\boldsymbol{K}}'') \cdot {\boldsymbol{S}}'} \nonumber\\ & \hspace{2cm}\times \delta({\boldsymbol{X}} + {\boldsymbol{S}}/2 - {\boldsymbol{X}}' - {\boldsymbol{S}}'/2) \delta({\boldsymbol{X}} - {\boldsymbol{S}}/2 - {\boldsymbol{X}}' + {\boldsymbol{S}}'/2) \nonumber\\ & = \frac{1}{(2{\rm \pi})^N} \int \mathrm{d}{\boldsymbol{X}}'\,\mathrm{d}{\boldsymbol{K}}'\,\mathrm{d}{\boldsymbol{K}}''\,\mathrm{d}{\boldsymbol{S}}\,\mathrm{d}{\boldsymbol{S}}'\, {W}_{{\boldsymbol{u}}}^{\alpha\beta}({\boldsymbol{X}}', {\boldsymbol{K}}') G({\boldsymbol{X}}', {\boldsymbol{K}}'')\, \mathrm{e}^{-\mathrm{i} {\boldsymbol{K}} \cdot {\boldsymbol{S}} + \mathrm{i} ({\boldsymbol{K}}' + {\boldsymbol{K}}'') \cdot {\boldsymbol{S}}'} \nonumber\\ &\hspace{2cm}\times \delta ({\boldsymbol{S}} - {\boldsymbol{S}}') \delta ({\boldsymbol{X}} - {\boldsymbol{S}}/2 - {\boldsymbol{X}}' + {\boldsymbol{S}}'/2) \nonumber\\ & = \frac{1}{(2{\rm \pi})^N} \int \mathrm{d}{\boldsymbol{X}}'\,\mathrm{d}{\boldsymbol{K}}'\,\mathrm{d}{\boldsymbol{K}}''\,\mathrm{d}{\boldsymbol{S}}\, {W}_{{\boldsymbol{u}}}^{\alpha\beta}({\boldsymbol{X}}', {\boldsymbol{K}}') G({\boldsymbol{X}}', {\boldsymbol{K}}'')\, \mathrm{e}^{\mathrm{i} ({\boldsymbol{K}}' + {\boldsymbol{K}}'' - {\boldsymbol{K}}) \cdot {\boldsymbol{S}}} \delta({\boldsymbol{X}} - {\boldsymbol{X}}') \nonumber\\ & = \frac{1}{(2{\rm \pi})^N} \int \mathrm{d}{\boldsymbol{K}}'\,\mathrm{d}{\boldsymbol{K}}''\,\mathrm{d}{\boldsymbol{S}}\, {W}_{{\boldsymbol{u}}}^{\alpha\beta}({\boldsymbol{X}}, {\boldsymbol{K}}') G({\boldsymbol{X}}, {\boldsymbol{K}}'')\, \mathrm{e}^{\mathrm{i} ({\boldsymbol{K}}' + {\boldsymbol{K}}'' - {\boldsymbol{K}}) \cdot {\boldsymbol{S}}} \nonumber\\ & = \int \mathrm{d}{\boldsymbol{K}}'\, {W}_{{\boldsymbol{u}}}^{\alpha\beta}({\boldsymbol{X}}, {\boldsymbol{K}}') G({\boldsymbol{X}}, {\boldsymbol{K}} - {\boldsymbol{K}}'). \end{align}

B.4 Proof of (5.10)

Using (4.19), (4.20) and (2.84) in application to $W_{{\boldsymbol {u}}}^{\alpha \beta }$, one finds that

(B9)

\begin{align} D_0^{\alpha\beta}({\boldsymbol{X}}) & \doteq \int \mathrm{d}{\boldsymbol{K}}\,\overline{W}_{{\boldsymbol{u}}}^{\alpha\beta}({\boldsymbol{X}}, {\boldsymbol{K}}) G^*({\boldsymbol{X}}, {\boldsymbol{K}}) \nonumber\\ & = \int \mathrm{d}{\boldsymbol{K}}\,\overline{W}_{{\boldsymbol{u}}}^{\alpha\beta}({\boldsymbol{X}}, - {\boldsymbol{K}}) G^*({\boldsymbol{X}},-{\boldsymbol{K}}) \nonumber\\ & = \int \mathrm{d}{\boldsymbol{K}}\,\overline{W}_{{\boldsymbol{u}}}^{\alpha\beta}({\boldsymbol{X}}, - {\boldsymbol{K}}) G({\boldsymbol{X}}, {\boldsymbol{K}}) \nonumber\\ & = \int \mathrm{d}{\boldsymbol{K}}\,\overline{W}_{{\boldsymbol{u}}}^{\alpha\beta *}({\boldsymbol{X}}, {\boldsymbol{K}}) G({\boldsymbol{X}}, {\boldsymbol{K}}) \nonumber\\ & = (D_0^{\alpha\beta}({\boldsymbol{X}}))^*, \end{align}

and also

(B10)

\begin{align} \Theta^{\alpha \beta c}({\boldsymbol{X}}) & \doteq{-}\int \mathrm{d}{\boldsymbol{K}}\,\overline{W}_{{\boldsymbol{u}}}^{\alpha\beta}({\boldsymbol{X}}, {\boldsymbol{K}})(G^{|c}({\boldsymbol{X}}, {\boldsymbol{K}}))^* \nonumber\\ & =\int \mathrm{d}{\boldsymbol{K}}\,\overline{W}_{{\boldsymbol{u}}}^{\alpha\beta}({\boldsymbol{X}}, {\boldsymbol{K}}) G^{|c}({\boldsymbol{X}}, - {\boldsymbol{K}}) \nonumber\\ & =\int \mathrm{d}{\boldsymbol{K}}\,\overline{W}_{{\boldsymbol{u}}}^{\alpha\beta}({\boldsymbol{X}}, - {\boldsymbol{K}}) G^{|c}({\boldsymbol{X}}, {\boldsymbol{K}}) \nonumber\\ & =\int \mathrm{d}{\boldsymbol{K}}\,(\overline{W}_{{\boldsymbol{u}}}^{\alpha\beta}({\boldsymbol{X}}, {\boldsymbol{K}}))^* G^{|c}({\boldsymbol{X}}, {\boldsymbol{K}}) \nonumber\\ & ={-} (\Theta^{\alpha\beta c}({\boldsymbol{X}}))^*. \end{align}

B.5 Proof of (5.20)

Let us estimate

(B11)

\begin{equation} \mathcal{L}^{(1)} \overline{f} \doteq \frac{\partial}{\partial z^{\alpha}} \left( J^{\alpha \mu} J^{\beta \nu} \mathcal{P}_{\mu \nu}^{(1)}\, \frac{\partial\overline{f}}{\partial z^{\beta}} \right), \end{equation}

where $\mathcal {P}_{\mu \nu }^{(1)}$ has the form

(B12)

\begin{equation} \mathcal{P}_{\mu \nu}^{(1)} \doteq \int \mathrm{d}{\boldsymbol{K}}\,q_{\nu}\, \frac{\partial \overline{W}({\boldsymbol{X}}, {\boldsymbol{K}})}{\partial z^{\mu}}\, G(\varOmega ({\boldsymbol{X}}, {\boldsymbol{K}})). \end{equation}

First, notice that

(B13)

\begin{align} \mathcal{P}_{\mu \nu}^{(1)} & = \frac{\partial}{\partial z^{\mu}} \int \mathrm{d}{\boldsymbol{K}}\,q_{\nu}\overline{W}G - \int \mathrm{d}{\boldsymbol{K}}\,q_{\nu}\overline{W}\,\frac{\partial G}{\partial z^{\mu}} \nonumber\\ & = \frac{\partial}{\partial z^{\mu}} \int \mathrm{d}{\boldsymbol{K}}\,q_{\nu}\overline{W}G + \frac{\partial v^{\lambda}}{\partial z^{\mu}} \int \mathrm{d}{\boldsymbol{K}}\,q_{\nu}q_{\lambda}\overline{W}G' \nonumber\\ & = \frac{\partial}{\partial z^{\mu}} \int \mathrm{d}{\boldsymbol{K}}\,q_{\nu}\overline{W}G + \frac{\partial v^{\lambda}}{\partial z^{\mu}} \frac{\eth}{\partial\varOmega} \int \mathrm{d}{\boldsymbol{K}}\, q_{\nu}q_{\lambda}\overline{W}G \nonumber\\ & \equiv \frac{\partial \mathcal{Q}_{\nu}^{(1)}}{\partial z^{\mu}} + \frac{\partial v^{\lambda}}{\partial z^{\mu}}\,\mathcal{R}_{\lambda \nu}^{(1)}. \end{align}

Because $\mathcal {Q}_{\nu }^{(1)}$ and $\mathcal {R}_{\lambda \nu }^{(1)}$ are $\mathcal {O}(\varepsilon ^2)$, one has $\smash {\mathcal {P}_{\mu \nu }^{(1)} \sim \kappa _{\mu } \varepsilon ^2}$, where $\kappa _{\mu }$ is the characteristic inverse scale along the $\mu$th phase-space axis. Thus,

(B14)

\begin{equation} \mathcal{L}^{(1)} \overline{f} \sim (J^{\alpha \mu} \kappa_{\alpha} \kappa_{\mu}) J^{\beta \nu} \kappa_{\beta}\varepsilon^2\overline{f} = \mathcal{O}(\epsilon\varepsilon^2), \end{equation}

where we used (see (2.69) and (3.2))

(B15)

\begin{equation} J^{\alpha \mu}\kappa_{\alpha}\kappa_{\mu} \sim \kappa_x\kappa_p = \mathcal{O}(\epsilon). \end{equation}

The first part of (5.20) is obtained by considering $\operatorname {im}\mathcal {L}^{(1)} \overline {f}$ and using (B14).

Let us also estimate

(B16)

\begin{equation} \mathcal{L}^{(2)} \overline{f} \doteq \frac{\partial}{\partial z^{\alpha}} \left( J^{\alpha \mu} J^{\beta \nu} \mathcal{P}_{\mu \nu}^{(2)}\,\frac{\partial\overline{f}}{\partial z^{\beta}} \right), \end{equation}

where $\mathcal {P}_{\mu \nu }^{(2)}$ has the form

(B17)

\begin{equation} \mathcal{P}_{\mu \nu}^{(2)} \doteq \int \mathrm{d}{\boldsymbol{K}}\, \frac{\partial^2\overline{W}}{\partial z^{\mu}\partial z^{\nu}}\,G(\varOmega ({\boldsymbol{X}}, {\boldsymbol{K}})). \end{equation}

First, note that

(B18)

\begin{align} \mathcal{P}_{\mu \nu}^{(2)} &= \int \mathrm{d}{\boldsymbol{K}}\,\frac{\partial^2\overline{W}}{\partial z^{\mu}\partial z^{\nu}}\,G \nonumber\\ &= \frac{\partial}{\partial z^{\mu}} \int \mathrm{d}{\boldsymbol{K}}\,\frac{\partial \overline{W}}{\partial z^{\nu}}\,G - \int \mathrm{d}{\boldsymbol{K}}\,\frac{\partial \overline{W}}{\partial z^{\nu}}\frac{\partial G}{\partial z^{\mu}} \nonumber\\ &= \frac{\partial^2}{\partial z^{\mu}\partial z^{\nu}} \int \mathrm{d}{\boldsymbol{K}}\, \overline{W} G - \frac{\partial}{\partial z^{\mu}} \int \mathrm{d}{\boldsymbol{K}}\,\overline{W}\, \frac{\partial G}{\partial z^{\nu}} - \frac{\partial}{\partial z^{\nu}} \int \mathrm{d}{\boldsymbol{K}}\,\overline{W}\,\frac{\partial G}{\partial z^{\mu}} + \int \mathrm{d}{\boldsymbol{K}}\,\overline{W}\,\frac{\partial^2G}{\partial z^{\mu}\partial z^{\nu}} \nonumber\\ &= \frac{\partial^2}{\partial z^{\mu}\partial z^{\nu}} \int \mathrm{d}{\boldsymbol{K}}\,\overline{W}\,G - \frac{\partial}{\partial z^{\mu}} \int \mathrm{d}{\boldsymbol{K}}\,\overline{W}\,G' \left(- q_{\lambda}\,\frac{\partial v^{\lambda}}{\partial z^{\nu}}\right) \nonumber\\ &\quad - \frac{\partial}{\partial z^{\nu}} \int \mathrm{d}{\boldsymbol{K}}\,\overline{W}\,G' \left(- q_{\lambda}\,\frac{\partial v^{\lambda}}{\partial z^{\mu}}\right) + \int \mathrm{d}{\boldsymbol{K}}\,\overline{W}\,\frac{\partial^2G}{\partial z^{\mu}\partial z^{\nu}} \nonumber\\ &= \frac{\partial^2}{\partial z^{\mu}\partial z^{\nu}} \int \mathrm{d}{\boldsymbol{K}}\,\overline{W}G + \frac{\partial}{\partial z^{\mu}} \left(\frac{\partial v^{\lambda}}{\partial z^{\nu}} \frac{\eth}{\partial\varOmega} \int \mathrm{d}{\boldsymbol{K}}\,q_{\lambda}\overline{W}G\right) \nonumber\\ &\quad + \frac{\partial}{\partial z^{\nu}} \left(\frac{\partial v^{\lambda}}{\partial z^{\mu}} \frac{\eth}{\partial\varOmega} \int \mathrm{d}{\boldsymbol{K}}\,q_{\lambda}\overline{W}G\right) + \int \mathrm{d}{\boldsymbol{K}}\,\overline{W}\frac{\partial^2G}{\partial z^{\mu}\partial z^{\nu}}. \end{align}

Next, note that

(B19)

\begin{align} \int \mathrm{d}{\boldsymbol{K}}\,\overline{W}\,\frac{\partial^2G}{\partial z^{\mu}\partial z^{\nu}} & = \int \mathrm{d}{\boldsymbol{K}}\,\overline{W}\,\frac{\partial}{\partial z^{\mu}} \left(- q_{\lambda}G'\frac{\partial v^{\lambda}}{\partial z^{\nu}}\right) \nonumber\\ & ={-} \frac{\partial v^{\lambda}}{\partial z^{\nu}} \int \mathrm{d}{\boldsymbol{K}}\, q_{\lambda}\overline{W}\,\frac{\partial G'}{\partial z^{\mu}} - \frac{\partial^2 v^{\lambda}}{\partial z^{\mu}\partial z^{\nu}} \int \mathrm{d}{\boldsymbol{K}}\,q_{\lambda}\overline{W}G'\nonumber\\ & = \frac{\partial v^{\lambda}}{\partial z^{\nu}} \frac{\partial v^{\delta}}{\partial z^{\mu}} \int \mathrm{d}{\boldsymbol{K}}\,q_{\lambda}q_{\delta} \overline{W} G'' - \frac{\partial^2 v^{\lambda}}{\partial z^{\mu}\partial z^{\nu}} \int \mathrm{d}{\boldsymbol{K}}\,q_{\lambda}\overline{W} G' \nonumber\\ & = \frac{\partial v^{\lambda}}{\partial z^{\nu}} \frac{\partial v^{\delta}}{\partial z^{\mu}} \frac{\eth^2}{\partial\varOmega^2} \int \mathrm{d}{\boldsymbol{K}}\,q_{\lambda}q_{\delta}\overline{W}G - \frac{\partial^2v^{\lambda}}{\partial z^{\mu}\partial z^{\nu}} \frac{\eth}{\partial\varOmega}\int \mathrm{d}{\boldsymbol{K}}\,q_{\lambda}\overline{W} G. \end{align}

Assuming the notation

(B20)

\begin{align} \mathcal{S}^{(2)} & \doteq \int \mathrm{d}{\boldsymbol{K}}\, \overline{W} G = \mathcal{O}\left(\varepsilon^2\right), \nonumber\\ \mathcal{Q}_{\lambda}^{(2)} & \doteq \frac{\eth}{\partial\varOmega}\int \mathrm{d}{\boldsymbol{K}}\, q_{\lambda}\overline{W} G = \mathcal{O}\left(\varepsilon^2\right), \nonumber\\ \mathcal{R}_{\lambda \delta}^{(2)} & \doteq \frac{\eth^2}{\partial\varOmega^2} \int \mathrm{d}{\boldsymbol{K}}\, q_{\lambda}q_{\delta}\overline{W}G = \mathcal{O}\left(\varepsilon^2\right), \end{align}

one can then rewrite $\mathcal {P}_{\mu \nu }^{(2)}$ as follows:

\begin{align*} \mathcal{P}_{\mu \nu}^{(2)} & = \frac{\partial^2\mathcal{S}^{(2)}}{\partial z^{\mu}\partial z^{\nu}} + \frac{\partial}{\partial z^{\mu}}\left(\frac{\partial v^{\lambda}}{\partial z^{\nu}}\,\mathcal{Q}_{\lambda}^{(2)}\right) + \frac{\partial}{\partial z^{\nu}}\left(\frac{\partial v^{\lambda}}{\partial z^{\mu}}\,\mathcal{Q}_{\lambda}^{(2)}\right) - \frac{\partial^2v^{\lambda}}{\partial z^{\mu}\partial z^{\nu}}\,\mathcal{Q}_{\lambda}^{(2)} + \frac{\partial v^{\lambda}}{\partial z^{\nu}}\frac{\partial v^{\delta}}{\partial z^{\mu}}\,\mathcal{R}_{\lambda \delta}^{(2)} \\ & = \frac{\partial^2\mathcal{S}^{(2)}}{\partial z^{\mu}\partial z^{\nu}} + \frac{\partial^2 v^{\lambda}}{\partial z^{\mu}\partial z^{\nu}}\,\mathcal{Q}_{\lambda}^{(2)} + \frac{\partial v^{\lambda}}{\partial z^{\nu}}\frac{\partial \mathcal{Q}_{\lambda}^{(2)}}{\partial z^{\mu}} + \frac{\partial v^{\lambda}}{\partial z^{\mu}}\frac{\partial \mathcal{Q}_{\lambda}^{(2)}}{\partial z^{\nu}} + \frac{\partial^2v^{\lambda}}{\partial z^{\mu}\partial z^{\nu}}\,\mathcal{Q}_{\lambda}^{(2)} + \frac{\partial v^{\lambda}}{\partial z^{\nu}}\frac{\partial v^{\delta}}{\partial z^{\mu}}\,\mathcal{R}_{\lambda\delta}^{(2)}. \end{align*}

Each term on the right-hand side of this equation scales as $\varepsilon ^2\kappa _{\mu }\kappa _{\nu }$, so

(B21)

\begin{equation} \mathcal{L}^{(2)} \overline{f} \sim (J^{\alpha \mu}\kappa_{\alpha}\kappa_{\mu}) (J^{\beta \nu}\kappa_{\beta}\kappa_{\nu}) \varepsilon^2 \overline{f} \sim \epsilon^2\varepsilon^2 \overline{f}, \end{equation}

where we again used (B15). The second part of (5.20) is obtained by considering $\operatorname {re}\mathcal {L}^{(2)} \overline {f}$ and using (B21).

B.6 Proof of (5.25)

Using (5.23) and assuming the notation $\mathrm {d}_t \doteq \partial _t + v^{\gamma }\partial _{\gamma }$, one finds that

(B22)

\begin{align} \partial_{\alpha}& (\widehat{D}^{\alpha\beta}\partial_{\beta}\overline{f}) - \partial_{\alpha}(({\mathsf{D}}^{\alpha\beta}+\varrho^{\alpha\beta})\partial_{\beta}\overline{f}) \nonumber\\ =\,& {-} \partial_{\alpha}\left(\Theta^{\alpha\beta}\mathrm{d}_t\partial_{\beta}\overline{f} + \frac{1}{2}\,(\mathrm{d}_t\Theta^{\alpha\beta}) \partial_{\beta}\overline{f}\right) \nonumber\\ =\,& {-} \partial_{\alpha} \left( \frac{1}{2}\,\Theta^{\alpha\beta}\mathrm{d}_t \partial_{\beta}\overline{f} + \frac{1}{2}\,\mathrm{d}_t(\Theta^{\alpha\beta}\partial_{\beta}\overline{f}) \right)\nonumber\\ =\, & {-}\partial_{\alpha} \left( \frac{1}{2}\,\Theta^{\alpha\beta}\partial_{\beta}\mathrm{d}_t\overline{f} - \frac{1}{2}\,\Theta^{\alpha\beta}(\partial_{\beta}v^{\gamma})\partial_{\gamma}\overline{f} \right) - \partial_{\alpha}\left( \frac{1}{2}\,\mathrm{d}_t(\Theta^{\alpha\beta}\partial_{\beta}\overline{f}) \right)\nonumber\\ = \,& {-}\partial_{\alpha}\left( \frac{1}{2}\,\Theta^{\alpha\beta}\partial_{\beta}\mathrm{d}_t\overline{f} \right) + \partial_{\alpha}\left(\frac{1}{2}\,\Theta^{\alpha\beta}(\partial_{\beta}v^{\gamma})\partial_{\gamma}\overline{f}\right) \nonumber\\ &{-} \mathrm{d}_t \left( \frac{1}{2}\,\partial_{\alpha}(\Theta^{\alpha\beta}\partial_{\beta}\overline{f}) \right) - (\partial_{\alpha}v^{\gamma}) \partial_{\gamma}\left( \frac{1}{2}\,\Theta^{\alpha\beta}\partial_{\beta}\overline{f} \right). \end{align}

Because $\Theta ^{\alpha \beta } = \mathcal {O}(\varepsilon ^2)$ and $\mathrm {d}_t \overline {f} = \mathcal {O}(\varepsilon ^2)$, the first term on the right-hand side of (B22) is negligible. Also note that due to (3.15), the factor $\smash {\partial _{\alpha }v^{\gamma }}$ in the last term on the right-hand side of (B22) commutes with $\smash {\partial _\gamma }$. Hence, one obtains

(B23)

\begin{align} \partial_{\alpha} (\widehat{D}^{\alpha\beta}\partial_{\beta}\overline{f}) & - \partial_{\alpha}({\mathsf{D}}^{\alpha\beta}\partial_{\beta}\overline{f}) + \mathrm{d}_t \left(\frac{1}{2}\,\partial_{\alpha}(\Theta^{\alpha \beta}\partial_{\beta}\overline{f})\right) \nonumber\\ & = \partial_{\alpha} \left( \varrho^{\alpha\beta} \partial_{\beta}\overline{f} + \frac{1}{2}\,\Theta^{\alpha\beta} \left(\partial_{\beta}v^{\gamma}\right) \partial_{\gamma}\overline{f} \right) - \partial_{\gamma} \left( \frac{1}{2}\,\Theta^{\alpha\beta} (\partial_{\alpha}v^{\gamma}) \partial_{\beta}\overline{f} \right)\nonumber\\ & = \partial_{\alpha}\left( \varrho^{\alpha\beta} \partial_{\beta} \overline{f} + \frac{1}{2} \left( \Theta^{\alpha \gamma}(\partial_{\gamma}v^{\beta}) -\Theta^{\gamma \beta}(\partial_{\gamma}v^{\alpha}) \right)\partial_{\beta}\overline{f} \right)\nonumber\\ & \equiv \partial_{\alpha} (U^{\alpha\beta} \partial_{\beta}\overline{f}). \end{align}

Next, notice that

(B24)

\begin{align} \varrho^{\alpha\beta} & ={-} \frac{1}{2}\, J^{\alpha \mu}J^{\beta \nu}{\unicode{x2A0F}} \mathrm{d}{\boldsymbol{K}}\left( q_{\nu}\,\frac{\partial\overline{W}}{\partial z^\mu} - q_{\mu}\,\frac{\partial\overline{W}}{\partial z^\nu} \right) \frac{1}{\varOmega} \nonumber\\ & ={-}\frac{1}{2}\,J^{\alpha \mu}J^{\beta \nu} {\unicode{x2A0F}} \mathrm{d}{\boldsymbol{K}}\, \frac{q_{\nu}}{\varOmega}\frac{\partial\overline{W}}{\partial z^\mu} + \frac{1}{2}\,J^{\alpha \mu}J^{\nu \beta}{\unicode{x2A0F}} \mathrm{d}{\boldsymbol{K}}\, \frac{q_{\mu}}{\varOmega}\frac{\partial\overline{W}}{\partial z^\nu} \nonumber\\ & ={-}\frac{1}{2}\,J^{\alpha \mu}J^{\beta\nu}\left( \frac{\partial}{\partial z^\mu}{\unicode{x2A0F}} \mathrm{d}{\boldsymbol{K}}\, \frac{q_{\nu}\overline{W}}{\varOmega} -{\unicode{x2A0F}} \mathrm{d}{\boldsymbol{K}}\,q_{\nu}\overline{W}\,\frac{\partial}{\partial z^\mu}\frac{1}{\varOmega} \right. \nonumber\\ & \hspace{2.5cm}\left. -\frac{\partial}{\partial z^\nu}{\unicode{x2A0F}} \mathrm{d}{\boldsymbol{K}}\,\frac{q_{\mu}\overline{W}}{\varOmega} +{\unicode{x2A0F}} \mathrm{d}{\boldsymbol{K}}\,q_{\mu}\overline{W}\frac{\partial}{\partial z^\nu}\frac{1}{\varOmega}\right) \nonumber\\ & ={-}\frac{1}{2}\,J^{\alpha\mu}J^{\beta \nu} \left( \frac{\partial}{\partial z^\mu} {\unicode{x2A0F}} \mathrm{d}{\boldsymbol{K}}\,\frac{q_{\nu}\overline{W}}{\varOmega} + \frac{\partial v^{\lambda}}{\partial z^\mu} \frac{\eth}{\partial \varOmega} {\unicode{x2A0F}} \mathrm{d}{\boldsymbol{K}}\,\frac{q_{\lambda}q_{\nu}\overline{W}}{\varOmega} \right.\nonumber\\ & \hspace{2.5cm}\left. - \frac{\partial}{\partial z^\nu} {\unicode{x2A0F}} \mathrm{d}{\boldsymbol{K}}\,\frac{q_{\mu}\overline{W}}{\varOmega} - \frac{\partial v^{\lambda}}{\partial z^\nu} \frac{\eth}{\partial \varOmega} {\unicode{x2A0F}} \mathrm{d}{\boldsymbol{K}}\, \frac{q_{\lambda}q_{\mu}\overline{W}}{\varOmega} \right). \end{align}

Assuming the notation

(B25)

\begin{equation} Q_{\mu} \doteq \frac{1}{2} {\unicode{x2A0F}} \mathrm{d}{\boldsymbol{K}}\,\frac{q_{\mu}\overline{W}}{\varOmega}, \qquad R_{\mu \nu}\doteq \frac{1}{2}\frac{\eth}{\partial \varOmega}{\unicode{x2A0F}} \mathrm{d}{\boldsymbol{K}}\, \frac{q_{\mu}q_{\nu}\overline{W}}{\varOmega}, \end{equation}

one can rewrite (B24) compactly as follows:

(B26)

\begin{equation} \varrho^{\alpha\beta} = J^{\alpha \mu}J^{\beta \nu} (\partial_{\nu}Q_{\mu} - \partial_{\mu}Q_{\nu}) + J^{\alpha \mu}J^{\beta \nu} ((\partial_{\nu}v^{\lambda})R_{\lambda \mu} - (\partial_{\mu}v^{\lambda})R_{\lambda \nu}). \end{equation}

Notice also that $\Theta ^{\alpha \beta } = 2J^{\alpha \mu }J^{\beta \nu }R_{\mu \nu }$. Hence, for $\smash {U^{\alpha \beta }}$ introduced in (B23), one has

(B27)

\begin{align} &U ^{\alpha\beta} - J^{\alpha \mu} J^{\beta \nu} (\partial_{\nu} Q_{\mu} - \partial_{\mu} Q_{\nu}) \nonumber\\ &\quad = \varrho^{\alpha\beta} - J^{\alpha \mu} J^{\beta \nu} (\partial_{\nu}Q_{\mu} - \partial_{\mu}Q_{\nu}) - ( \Theta^{\gamma \beta} (\partial_{\gamma}v^{\alpha}) - \Theta^{\alpha \gamma}(\partial_{\gamma}v^{\beta}) )/2 \nonumber\\ &\quad = J^{\alpha \mu} J^{\beta \nu} ((\partial_{\nu} v^{\lambda}) R_{\lambda \mu} - (\partial_{\mu}v^{\lambda}) R_{\lambda \nu}) + J^{\alpha \mu} J^{\gamma \nu} (\partial_{\gamma} v^{\beta}) R_{\mu \nu} - J^{\gamma \mu}J^{\beta \nu} (\partial_{\gamma}v^{\alpha}) R_{\mu \nu} \nonumber\\ &\quad = J^{\alpha \mu} J^{\beta \nu} (\partial_{\nu}v^{\lambda}) R_{\lambda \mu} - J^{\alpha \mu} J^{\beta \nu} (\partial_{\mu}v^{\lambda}) R_{\lambda \nu} + J^{\alpha \mu} J^{\gamma \nu} (\partial_{\gamma}v^{\beta}) R_{\mu \nu} - J^{\gamma \mu} J^{\beta \nu} (\partial_{\gamma}v^{\alpha})R_{\mu \nu} \nonumber\\ &\quad = J^{\alpha \mu} J^{\beta \lambda} (\partial_{\lambda}v^{\nu}) R_{\mu \nu} - J^{\alpha \lambda} J^{\beta \nu}(\partial_{\lambda} v^{\mu}) R_{\mu \nu} + J^{\alpha \mu} J^{\gamma \nu} (\partial_{\gamma}v^{\beta}) R_{\mu \nu} - J^{\gamma \mu}J^{\beta \nu}(\partial_{\gamma}v^{\alpha}) R_{\mu \nu} \nonumber\\ &\quad = ( J^{\alpha \mu} J^{\beta \lambda} J^{\nu \gamma} - J^{\alpha \lambda} J^{\beta \nu} J^{\mu \gamma} + J^{\alpha \mu}J^{\gamma \nu}J^{\beta \lambda} - J^{\gamma \mu}J^{\beta \nu}J^{\alpha \lambda} ) (\partial_{\gamma \lambda}^2\overline{H}) R_{\mu \nu} \nonumber\\ &\quad = ( J^{\alpha \mu} J^{\beta \lambda} J^{\nu \gamma} - J^{\alpha \lambda} J^{\beta \nu} J^{\mu \gamma} - J^{\alpha \mu} J^{\gamma \nu} J^{\lambda \beta} + J^{\mu \gamma} J^{\beta \nu} J^{\alpha \lambda} ) (\partial_{\gamma \lambda}^2\overline{H}) R_{\mu \nu} \nonumber\\ &\quad = 0, \end{align}

where we used (3.8) for $v^\alpha$ and the anti-symmetry of $J^{\alpha \beta }$. Therefore,

(B28)

\begin{equation} U^{\alpha\beta} = J^{\alpha \mu} J^{\beta \nu} (\partial_{\nu}Q_{\mu} - \partial_{\mu}Q_{\nu}) = (J^{\alpha \mu} J^{\beta \nu} - J^{\alpha \nu} J^{\beta \mu}) \partial_{\nu}Q_{\mu} ={-} U^{\beta \alpha}, \end{equation}

and accordingly,

(B29)

\begin{align} \partial_{\alpha}U^{\alpha\beta} & = (J^{\alpha \mu}J^{\beta \nu} - J^{\alpha \nu}J^{\beta \mu})\partial_{\nu \alpha}^2 Q_{\mu} = J^{\alpha \mu} J^{\beta \nu} \partial_{\nu \alpha}^2 Q_{\mu} \nonumber\\ & = J^{\nu \mu} J^{\beta \alpha} \partial_{\nu \alpha}^2 Q_{\mu} = J^{\beta \alpha} \partial_{\alpha} (J^{\nu \mu} \partial_{\nu} Q_{\mu}) ={-} J^{\alpha\beta} \partial_{\alpha} (J^{\mu \nu} \partial_{\mu}Q_{\nu}) \equiv J^{\alpha\beta} \partial_{\alpha}\varPhi. \end{align}

Here, $\smash {\varPhi \doteq -J^{\mu \nu }\partial _{\mu }Q_{\nu }}$, which is equivalent to (5.24). From (B29) and the fact that $\smash {U^{\alpha \beta }\partial _{\alpha \beta } = 0}$ due to the anti-symmetry of $\smash {U^{\alpha \beta }}$, one has

(B30)

\begin{equation} \partial_{\alpha} (U^{\alpha\beta}\partial_{\beta}\overline{f}) = J^{\alpha\beta} (\partial_{\alpha}\varPhi) (\partial_{\beta}\overline{f}) = \lbrace \varPhi, \overline{f} \rbrace. \end{equation}

Hence, (B23) leads to (5.25).

B.7 Proof of (6.30)

The correlation function

(B31)

\begin{equation} \mathfrak{C}_{ss'}(t, {\boldsymbol{x}}, \tau, {\boldsymbol{s}}; {\boldsymbol{p}}, {\boldsymbol{p}}') \doteq \overline{ g_{s}(t + \tau/2, {\boldsymbol{x}} + {\boldsymbol{s}}/2, {\boldsymbol{p}}) g_{s'}(t - \tau/2, {\boldsymbol{x}} - {\boldsymbol{s}}/2, {\boldsymbol{p}}') } \end{equation}

can be readily expressed as

\begin{align*} \mathfrak{C}_{ss'} = \bigg(\delta_{ss'}\!\sum_{\sigma_s = \sigma'_{s'}} + \sum_{\sigma_s \ne \sigma'_{s'}}\bigg)\, \langle &\delta({\boldsymbol{x}} + {\boldsymbol{s}}/2 - \overline{{\boldsymbol{x}}}_{\sigma_s}(t + \tau/2))\delta({\boldsymbol{p}} - \overline{{\boldsymbol{p}}}_{\sigma_s}(t + \tau/2))\\ & \times \delta({\boldsymbol{x}} - {\boldsymbol{s}}/2 - \overline{{\boldsymbol{x}}}_{\sigma'_{s'}}(t - \tau/2))\delta({\boldsymbol{p}}' - \overline{{\boldsymbol{p}}}_{\sigma'_{s'}}(t - \tau/2)) \rangle - \mathfrak{C}_{\overline{\mathfrak{f}}}. \end{align*}

Here, $\smash {\left\langle {\ldots }\right\rangle}$ is another (in addition to overbar) notation for averaging used in this appendix, the dependence of $\smash {\overline {\mathfrak {f}}_s}$ on $\smash {(t, {\boldsymbol {x}})}$ is neglected and ‘$\smash {\sigma _s \ne \sigma '_{s'}}$’ denotes that excluded are the terms that have $\smash {s' = s}$ and $\smash {\sigma _s = \sigma '_{s'}}$ simultaneously. Aside from this, the summations over $\smash {\sigma _s}$ are taken over all $\smash {N_s \gg 1}$ particles of type $\smash {s}$, and the summations over $\smash {\sigma _{s'}}$ are taken over all $\smash {N_{s'} \gg 1}$ particles of type $\smash {s'}$. Also,

(B32)

\begin{equation} \mathfrak{C}_{\overline{\mathfrak{f}}} \doteq \left\langle{ \overline{\mathfrak{f}}_{s}(t + \tau/2, {\boldsymbol{x}} + {\boldsymbol{s}}/2, {\boldsymbol{p}}) \overline{\mathfrak{f}}_{s'}(t - \tau/2, {\boldsymbol{x}} - {\boldsymbol{s}}/2, {\boldsymbol{p}}') }\right\rangle. \end{equation}

To the leading order, pair correlations can be neglected. Then,

(B33)

\begin{align} \sum_{\sigma_s \ne \sigma'_{s'}} \left\langle{\ldots}\right\rangle & = \sum_{\sigma_s \ne \sigma'_{s'}} \underbrace{\left\langle{\delta({\boldsymbol{x}} + {\boldsymbol{s}}/2 - \overline{{\boldsymbol{x}}}_{\sigma_s}(t + \tau/2))\delta({\boldsymbol{p}} - \overline{{\boldsymbol{p}}}_{\sigma_s}(t + \tau/2))}\right\rangle}_{\overline{\mathfrak{f}}_s(t + \tau/2, {\boldsymbol{x}} + {\boldsymbol{s}}/2, {\boldsymbol{p}})/N_s} \nonumber\\ &\hphantom{\sum_{\sigma_s \ne \sigma'_{s'}} }\times \underbrace{\left\langle{\delta({\boldsymbol{x}} - {\boldsymbol{s}}/2 - \overline{{\boldsymbol{x}}}_{\sigma'_{s'}}(t - \tau/2))\delta({\boldsymbol{p}}' - \overline{{\boldsymbol{p}}}_{\sigma'_{s'}}(t - \tau/2))}\right\rangle}_{\overline{\mathfrak{f}}_{s'}(t - \tau/2, {\boldsymbol{x}} - {\boldsymbol{s}}/2, {\boldsymbol{p}}')/N_{s'}} \nonumber\\ &= \frac{\mathfrak{C}_{\overline{\mathfrak{f}}}}{N_s N_{s'}}\sum_{\sigma_s, \sigma'_{s'}} (1 - \delta_{ss'}\delta_{\sigma_s\sigma'_{s'}}) = (1 - N_s^{{-}1} \delta_{ss'}) \mathfrak{C}_{\overline{\mathfrak{f}}} \approx \mathfrak{C}_{\overline{\mathfrak{f}}}. \end{align}

Let us also use $\smash {\overline {{\boldsymbol {p}}}_{\sigma _s}(t + \tau /2) \approx \overline {{\boldsymbol {p}}}_{\sigma _s}(t)}$. Then,

(B34)

\begin{align} \mathfrak{C}_{ss'} \approx \delta_{ss'}\delta({\boldsymbol{p}} - {\boldsymbol{p}}') \sum_{\sigma=1}^{N_s} \langle & \delta({\boldsymbol{x}} + {\boldsymbol{s}}/2 - \overline{{\boldsymbol{x}}}_{\sigma}(t + \tau/2))\nonumber\\ & \times \delta({\boldsymbol{x}} - {\boldsymbol{s}}/2 - \overline{{\boldsymbol{x}}}_{\sigma}(t - \tau/2)) \delta({\boldsymbol{p}} - \overline{{\boldsymbol{p}}}_{\sigma}(t)) \rangle. \end{align}

Next, notice that

(B35)

\begin{align} &\left\langle{ \delta({\boldsymbol{x}} + {\boldsymbol{s}}/2 - \overline{{\boldsymbol{x}}}_{\sigma}(t + \tau/2)) \delta({\boldsymbol{x}} - {\boldsymbol{s}}/2 - \overline{{\boldsymbol{x}}}_{\sigma}(t - \tau/2)) \delta({\boldsymbol{p}} - \overline{{\boldsymbol{p}}}_{\sigma}(t)) }\right\rangle \nonumber\\ & = \left\langle{ \delta({\boldsymbol{s}} + \overline{{\boldsymbol{x}}}_{\sigma}(t - \tau/2) - \overline{{\boldsymbol{x}}}_{\sigma}(t + \tau/2)) \delta({\boldsymbol{x}} - {\boldsymbol{s}}/2 - \overline{{\boldsymbol{x}}}_{\sigma}(t - \tau/2)) \delta({\boldsymbol{p}} - \overline{{\boldsymbol{p}}}_{\sigma}(t)) }\right\rangle \nonumber\\ & = \left\langle{ \delta({\boldsymbol{s}} + \overline{{\boldsymbol{x}}}_{\sigma}(t - \tau/2) - \overline{{\boldsymbol{x}}}_{\sigma}(t + \tau/2)) \delta({\boldsymbol{x}} - (\overline{{\boldsymbol{x}}}_{\sigma}(t + \tau/2) + \overline{{\boldsymbol{x}}}_{\sigma}(t - \tau/2))/2) \delta({\boldsymbol{p}} - \overline{{\boldsymbol{p}}}_{\sigma}(t)) }\right\rangle \nonumber\\ &\approx \left\langle{ \delta({\boldsymbol{s}} - {\boldsymbol{v}}_s(t, \overline{{\boldsymbol{x}}}_\sigma, \overline{{\boldsymbol{p}}}_\sigma) \tau) \delta({\boldsymbol{x}} - \overline{{\boldsymbol{x}}}_{\sigma}(t)) \delta({\boldsymbol{p}} - \overline{{\boldsymbol{p}}}_{\sigma}(t)) }\right\rangle \nonumber\\ &\approx \delta({\boldsymbol{s}} - {\boldsymbol{v}}_s(t, {\boldsymbol{x}}, {\boldsymbol{p}})\tau) \left\langle{ \delta({\boldsymbol{x}} - \overline{{\boldsymbol{x}}}_{\sigma}(t)) \delta({\boldsymbol{p}} - \overline{{\boldsymbol{p}}}_{\sigma}(t)) }\right\rangle \nonumber\\ & = \delta({\boldsymbol{s}} - {\boldsymbol{v}}_s(t, {\boldsymbol{x}}, {\boldsymbol{p}})\tau) \overline{\mathfrak{f}}_{s}(t, {\boldsymbol{x}}, {\boldsymbol{p}})/N_s. \end{align}

Hence,

(B36)

\begin{equation} \mathfrak{C}_{ss'} = \delta_{ss'}\delta({\boldsymbol{p}} - {\boldsymbol{p}}') \delta({\boldsymbol{s}} - {\boldsymbol{v}}_s(t, {\boldsymbol{x}}, {\boldsymbol{p}})\tau)F_{s}(t, {\boldsymbol{x}}, {\boldsymbol{p}}), \end{equation}

where we used $\smash {\overline {\mathfrak {f}}_{s} \approx F_s}$. Therefore,

(B37)

\begin{align} \mathfrak{G}_{ss'}(t, {\boldsymbol{x}}, \omega, {\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}') & =\int \frac{\mathrm{d}\tau}{2{\rm \pi}}\frac{\mathrm{d}{\boldsymbol{s}}}{(2{\rm \pi})^n}\,\mathrm{e}^{\mathrm{i}\omega\tau - \mathrm{i}{\boldsymbol{k}}\cdot{\boldsymbol{s}}}\, \mathfrak{C}_{ss'}(t, {\boldsymbol{x}}, \tau, {\boldsymbol{s}}; {\boldsymbol{p}}, {\boldsymbol{p}}')\nonumber\\ & \approx \frac{1}{(2{\rm \pi})^n}\,\delta_{ss'}\delta({\boldsymbol{p}} - {\boldsymbol{p}}') F_{s}(t, {\boldsymbol{x}}, {\boldsymbol{p}}). \end{align}

B.8 Proof of (9.80)

Using the symmetry $\smash {{\mathsf {U}}^{\alpha \beta \gamma \delta } = {\mathsf {U}}^{\beta \alpha \gamma \delta } = {\mathsf {U}}^{\beta \alpha \delta \gamma }}$, one readily obtains from (6.47) that

(B38)

$$\begin{gather} \displaystyle \varDelta = \frac{1}{2P^0}\int \mathrm{d}{\boldsymbol {\mathsf{k}}}\,\overline{g}_{\beta\gamma} p_\alpha p_\delta {\mathsf{U}}^{\alpha\beta\gamma\delta} + \frac{1}{8} {\unicode{x2A0F}} \mathrm{d}{\boldsymbol {\mathsf{k}}}\,\mathcal{J}, \end{gather}$$

(B39)

$$\begin{gather}\displaystyle \mathcal{J} ={-} \frac{\partial'}{\partial {\boldsymbol{p}}} \cdot \left(\frac{{\boldsymbol{k}}\mathfrak{E}}{P^0 \varpi}\right) - \frac{1}{(P^0)^2}\frac{\partial \mathfrak{E}}{\partial p_0} + \frac{\overline{g}^{00} \mathfrak{E}}{(P^0)^3}, \end{gather}$$

where $\smash {\varpi \doteq k_\rho p^\rho = P^0({\boldsymbol {k}} \cdot {\boldsymbol {v}} - \omega )}$ and the prime in $\smash {\partial '}$ denotes that $\smash {p_0}$ is considered as a function of $\smash {{\boldsymbol {p}}}$ at differentiation. One can also write this as follows:

(B40)

\begin{equation} \mathcal{J} = \frac{{\boldsymbol{k}}}{\varpi} \cdot \left(\frac{\partial' P_0}{\partial {\boldsymbol{p}}}\right) \frac{\mathfrak{E}}{(P^0)^2} - \frac{1}{P^0}\frac{\partial'}{\partial {\boldsymbol{p}}} \cdot \left(\frac{{\boldsymbol{k}}\mathfrak{E}}{\varpi}\right) - \frac{1}{(P^0)^2}\frac{\partial \mathfrak{E}}{\partial p_0} + \frac{\overline{g}^{00} \mathfrak{E}}{(P^0)^3}. \end{equation}

As shown in Garg & Dodin (Reference Garg and Dodin2020, Appendix B), the following equality is satisfied:

(B41)

\begin{equation} \frac{{\boldsymbol{k}}}{\varpi} \cdot \left(\frac{\partial' P_0}{\partial {\boldsymbol{p}}}\right) = \frac{1}{\varpi}\frac{\partial\varpi}{\partial p_0} - \frac{\overline{g}^{00}}{P^0}. \end{equation}

Also notice that

(B42)

\begin{align} \frac{\partial'}{\partial {\boldsymbol{p}}} \cdot \left(\frac{{\boldsymbol{k}}\mathfrak{E}}{\varpi}\right) & = \frac{\partial}{\partial {\boldsymbol{p}}} \cdot \left(\frac{{\boldsymbol{k}}\mathfrak{E}}{\varpi}\right) + \frac{\partial P_0}{\partial {\boldsymbol{p}}} \cdot\frac{\partial}{\partial p_0} \left(\frac{{\boldsymbol{k}}\mathfrak{E}}{\varpi}\right) \nonumber\\ & = \frac{\partial}{\partial {\boldsymbol{p}}} \cdot \left(\frac{{\boldsymbol{k}}\mathfrak{E}}{\varpi}\right) - {\boldsymbol{k}} \cdot {\boldsymbol{v}}\, \frac{\partial}{\partial p_0} \left(\frac{\mathfrak{E}}{\varpi}\right), \end{align}

where we used Hamilton's equation $\smash {\partial _{{\boldsymbol {p}}} P_0 = -\partial _{{\boldsymbol {p}}} H = -{\boldsymbol {v}}}$. Therefore,

(B43)

\begin{equation} \mathcal{J} = \frac{1}{\varpi}\frac{\partial\varpi}{\partial p_0} \frac{\mathfrak{E}}{(P^0)^2} - \frac{1}{P^0} \frac{\partial}{\partial {\boldsymbol{p}}} \cdot \left(\frac{{\boldsymbol{k}}\mathfrak{E}}{\varpi}\right) + \frac{{\boldsymbol{k}} \cdot {\boldsymbol{v}}}{P^0}\frac{\partial}{\partial p_0} \left(\frac{\mathfrak{E}}{\varpi}\right) - \frac{1}{(P^0)^2}\frac{\partial \mathfrak{E}}{\partial p_0}. \end{equation}

The first and the last terms can be merged; then, one obtains

(B44)

\begin{align} \mathcal{J} & ={-} \frac{1}{P^0} \frac{\partial}{\partial {\boldsymbol{p}}} \cdot \left(\frac{{\boldsymbol{k}}\mathfrak{E}}{\varpi}\right) + \frac{{\boldsymbol{k}} \cdot {\boldsymbol{v}}}{P^0}\frac{\partial}{\partial p_0} \left(\frac{\mathfrak{E}}{\varpi}\right) - \frac{\varpi}{(P^0)^2}\frac{\partial}{\partial p_0}\left(\frac{\mathfrak{E}}{\varpi}\right) \nonumber\\ & ={-} \frac{1}{P^0} \frac{\partial}{\partial {\boldsymbol{p}}} \cdot \left(\frac{{\boldsymbol{k}}\mathfrak{E}}{\varpi}\right) + \frac{\omega}{P^0}\frac{\partial}{\partial p_0}\left(\frac{\mathfrak{E}}{\varpi}\right) \nonumber\\ & ={-} \frac{1}{P^0} \frac{\partial}{\partial p_\lambda} \left(\frac{k_\lambda\mathfrak{E}}{\varpi}\right). \end{align}

In combination with (B38), this leads to (9.80).

Appendix C. Properties of the collision operator

Here, we prove the properties of the collision operator discussed in § 6.8. To shorten the calculations, we introduce two auxiliary functions,

(C1)

\begin{align} \displaystyle \mathcal{Z}_{ss'}({\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}') \doteq &\,{\rm \pi}\,\delta({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s - {\boldsymbol{k}} \cdot {\boldsymbol{v}}'_{s'}) \mathcal{Q}_{ss'}({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s, {\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}'),\nonumber\\ \displaystyle \mathcal{F}_{ss'}({\boldsymbol{p}}, {\boldsymbol{p}}') &\doteq \frac{\partial F_s({\boldsymbol{p}})}{\partial p_j}\,F_{s'}({\boldsymbol{p}}') - F_s({\boldsymbol{p}})\,\frac{\partial F_{s'}({\boldsymbol{p}}')}{\partial p_j'}, \end{align}

which have the following properties:

(C2)

\begin{equation} \mathcal{Z}_{ss'}({\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}') = \mathcal{Z}_{s's}({\boldsymbol{k}}; {\boldsymbol{p}}', {\boldsymbol{p}}), \qquad \mathcal{F}_{ss'}({\boldsymbol{p}}, {\boldsymbol{p}}') ={-} \mathcal{F}_{s's}({\boldsymbol{p}}', {\boldsymbol{p}}). \end{equation}

C.1 Momentum conservation

Momentum conservation is proven as follows. Using integration by parts, one obtains

(C3)

\begin{align} & \sum_s \int \mathrm{d}{\boldsymbol{p}}\,p_l\mathcal{C}_s\nonumber\\ & = \sum_{s,s'} \int \mathrm{d}{\boldsymbol{p}}\,p_l\, \frac{\partial}{\partial p_i} \int \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\,\mathrm{d}{\boldsymbol{p}}'\,k_i k_j\, \mathcal{Z}_{ss'}({\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}') \mathcal{F}_{ss'}({\boldsymbol{p}}, {\boldsymbol{p}}') \nonumber\\ & ={-}\sum_{s,s'} \int \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\,\mathrm{d}{\boldsymbol{p}}\,\mathrm{d}{\boldsymbol{p}}'\,k_l k_j\, \mathcal{Z}_{ss'}({\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}') \mathcal{F}_{ss'}({\boldsymbol{p}}, {\boldsymbol{p}}'). \end{align}

Now we swap the dummy variables $\smash {s \leftrightarrow s'}$ and $\smash {{\boldsymbol {p}} \leftrightarrow {\boldsymbol {p}}'}$ and then apply (C2):

(C4)

\begin{align} & \sum_s \int \mathrm{d}{\boldsymbol{p}}\,p_l\mathcal{C}_s\nonumber\\ & ={-}\sum_{s',s} \int \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\,\mathrm{d}{\boldsymbol{p}}'\,\mathrm{d}{\boldsymbol{p}}\,k_l k_j\, \mathcal{Z}_{s's}({\boldsymbol{k}}; {\boldsymbol{p}}', {\boldsymbol{p}}) \mathcal{F}_{s's}({\boldsymbol{p}}', {\boldsymbol{p}}) \nonumber\\ & = \sum_{s',s} \int \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\,\mathrm{d}{\boldsymbol{p}}'\,\mathrm{d}{\boldsymbol{p}}\,k_l k_j\, \mathcal{Z}_{ss'}({\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}') \mathcal{F}_{ss'}({\boldsymbol{p}}, {\boldsymbol{p}}'). \end{align}

The expression on the right-hand side of (C4) is minus that in (C3). Hence, both are zero, which proves that $\smash {\sum _s \int \mathrm {d}{\boldsymbol {p}}\,p_l\mathcal {C}_s = 0}$.

C.2 Energy conservation

Energy conservation is proven similarly, using that $\smash {v_s^i = \partial \mathcal {H}_s/\partial p_i}$ and the fact that $\smash {{\boldsymbol {k}} \cdot {\boldsymbol {v}}_s}$ and $\smash {{\boldsymbol {k}} \cdot {\boldsymbol {v}}'_{s'}}$ are interchangeable due to the presence of $\smash {\delta ({\boldsymbol {k}} \cdot {\boldsymbol {v}}_s - {\boldsymbol {k}} \cdot {\boldsymbol {v}}'_{s'})}$ in $\smash {\mathcal {Z}_{ss'}}$:

(C5)

\begin{align} & \sum_s \int \mathrm{d}{\boldsymbol{p}}\,\mathcal{H}_s\mathcal{C}_s\nonumber\\ & = \sum_{s,s'} \int \mathrm{d}{\boldsymbol{p}}\,\mathcal{H}_s\, \frac{\partial}{\partial p_i} \int \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\,\mathrm{d}{\boldsymbol{p}}'\,k_i k_j\, \mathcal{Z}_{ss'}({\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}') \mathcal{F}_{ss'}({\boldsymbol{p}}, {\boldsymbol{p}}') \nonumber\\ & ={-}\sum_{s,s'} \int \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\,\mathrm{d}{\boldsymbol{p}}\,\mathrm{d}{\boldsymbol{p}}'\,({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s) k_j\, \mathcal{Z}_{ss'}({\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}') \mathcal{F}_{ss'}({\boldsymbol{p}}, {\boldsymbol{p}}') \nonumber\\ & ={-}\sum_{s',s} \int \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\,\mathrm{d}{\boldsymbol{p}}'\,\mathrm{d}{\boldsymbol{p}}\,({\boldsymbol{k}} \cdot {\boldsymbol{v}}'_{s'}) k_j\, \mathcal{Z}_{s's}({\boldsymbol{k}}; {\boldsymbol{p}}', {\boldsymbol{p}}) \mathcal{F}_{s's}({\boldsymbol{p}}', {\boldsymbol{p}}) \nonumber\\ & = \sum_{s',s} \int \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\,\mathrm{d}{\boldsymbol{p}}'\,\mathrm{d}{\boldsymbol{p}}\,({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s) k_j\, \mathcal{Z}_{ss'}({\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}') \mathcal{F}_{ss'}({\boldsymbol{p}}, {\boldsymbol{p}}'). \end{align}

Like in the previous case, the third and the fifth lines are minus each other, whence $\smash {\sum _s \int \mathrm {d}{\boldsymbol {p}}\,\mathcal {H}_s\mathcal {C}_s = 0}$.

C.3 ${H}$-theorem

From (6.72) and (6.73), one has

(C6)

\begin{equation} \left(\frac{\mathrm{d}\sigma}{\mathrm{d} t}\right)_{\text{coll}} ={-} \sum_s \int \mathrm{d}{\boldsymbol{p}}\,(1 + \ln F_s({\boldsymbol{p}}))\mathcal{C}_s ={-} \sum_s \int \mathrm{d}{\boldsymbol{p}}\,\ln F_s({\boldsymbol{p}})\mathcal{C}_s, \end{equation}

where we used particle conservation, $\smash {\int \mathrm {d}{\boldsymbol {p}}\,\mathcal {C}_s = 0}$. Then,

(C7)

\begin{align} \left(\frac{\mathrm{d}\sigma}{\mathrm{d} t}\right)_{\text{coll}} & ={-} \sum_{ss'} \int \mathrm{d}{\boldsymbol{p}}\,\ln F_s({\boldsymbol{p}})\,\frac{\partial}{\partial p_i} \int \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\,\mathrm{d}{\boldsymbol{p}}'\,k_i k_j \mathcal{Z}_{ss'}({\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}') \mathcal{F}_{ss'}({\boldsymbol{p}}, {\boldsymbol{p}}') \nonumber\\ & = \sum_{ss'} \int \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\, \mathrm{d}{\boldsymbol{p}}\,\mathrm{d}{\boldsymbol{p}}'\, k_i k_j\, \frac{\partial \ln F_s({\boldsymbol{p}})}{\partial p_i}\,\mathcal{Z}_{ss'}({\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}') \mathcal{F}_{ss'}({\boldsymbol{p}}, {\boldsymbol{p}}'). \end{align}

Let us swap the dummy variables $\smash {s \leftrightarrow s'}$ and $\smash {{\boldsymbol {p}} \leftrightarrow {\boldsymbol {p}}'}$ and then apply (C2) to obtain

(C8)

\begin{equation} \left(\frac{\mathrm{d}\sigma}{\mathrm{d} t}\right)_{\text{coll}} ={-}\sum_{ss'} \int \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\,\int\mathrm{d}{\boldsymbol{p}}\,\mathrm{d}{\boldsymbol{p}}'\, k_i k_j\, \frac{\partial \ln F_{s'}({\boldsymbol{p}}')}{\partial p_i'}\,\mathcal{Z}_{ss'}({\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}') \mathcal{F}_{ss'}({\boldsymbol{p}}, {\boldsymbol{p}}'). \end{equation}

Upon comparing (C8) with (C7), one can put the result in a symmetrized form

(C9)

\begin{align} \left(\frac{\mathrm{d}\sigma}{\mathrm{d} t}\right)_{\text{coll}} = \frac{1}{2}\sum_{ss'} \int \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\,\mathrm{d}{\boldsymbol{p}}\,\mathrm{d}{\boldsymbol{p}}'\, k_i k_j & \mathcal{Z}_{ss'}({\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}') \mathcal{F}_{ss'}({\boldsymbol{p}}, {\boldsymbol{p}}') \nonumber\\ &\times \left(\frac{\partial \ln F_s({\boldsymbol{p}})}{\partial p_i} - \frac{\partial \ln F_{s'}({\boldsymbol{p}}')}{\partial p_i'}\right). \end{align}

But notice that

(C10)

\begin{equation} \mathcal{F}_{ss'}({\boldsymbol{p}}, {\boldsymbol{p}}') = \left( \frac{\partial \ln F_s({\boldsymbol{p}})}{\partial p_j} - \frac{\partial \ln F_{s'}({\boldsymbol{p}}')}{\partial p_j'} \right) F_s({\boldsymbol{p}})F_{s'}({\boldsymbol{p}}'). \end{equation}

Thus,

(C11)

\begin{align} \left(\frac{\mathrm{d}\sigma}{\mathrm{d} t}\right)_{\text{coll}} = \frac{1}{2}\sum_{ss'} \int \frac{\mathrm{d}{\boldsymbol{k}}}{(2{\rm \pi})^n}\, \mathrm{d}{\boldsymbol{p}}\,\mathrm{d}{\boldsymbol{p}}'\, &\mathcal{Z}_{ss'}({\boldsymbol{k}}; {\boldsymbol{p}}, {\boldsymbol{p}}') F_s({\boldsymbol{p}})F_{s'}({\boldsymbol{p}}') \nonumber\\ &\times \left({\boldsymbol{k}}\cdot \frac{\partial \ln F_s({\boldsymbol{p}})}{\partial {\boldsymbol{p}}} - {\boldsymbol{k}}\cdot\frac{\partial \ln F_{s'}({\boldsymbol{p}}')}{\partial {\boldsymbol{p}}'} \right)^2 \geqslant 0. \end{align}

Appendix D. Conservation laws for on-shell waves

Here, we prove the momentum-conservation theorem (7.87) and the energy-conservation theorem (7.89) for QL interactions of plasmas with on-shell waves.

D.1 Momentum conservation

Let us multiply (7.83) by $k_l$ and integrate over ${\boldsymbol {k}}$. Then, one obtains

(D1)

\begin{align} 0&= \int \mathrm{d}{\boldsymbol{k}}\,k_l\,\frac{\partial J}{\partial t} + \int \mathrm{d}{\boldsymbol{k}}\,k_l\,\frac{\partial(v_{\text{g}}^i J)}{\partial x^i} - \int \mathrm{d}{\boldsymbol{k}}\,k_l\frac{\partial}{\partial k_i}\left(\frac{\partial w}{\partial x^i}\,J\right) - 2\int \mathrm{d}{\boldsymbol{k}}\,k_l\gamma J \nonumber\\ &= \frac{\partial}{\partial t} \int \mathrm{d}{\boldsymbol{k}}\,k_l J + \frac{\partial}{\partial x^i} \int\mathrm{d}{\boldsymbol{k}}\,k_l v_{\text{g}}^i J + \int \mathrm{d}{\boldsymbol{k}}\,\frac{\partial w}{\partial x^l}\,J - 2\int \mathrm{d}{\boldsymbol{k}}\,w \gamma J. \end{align}

Similarly, multiplying (7.84) by $\smash {\mathcal {H}_s}$ and integrating over $\smash {{\boldsymbol {p}}}$ yields

(D2)

\begin{align} 0&= \int\mathrm{d}{\boldsymbol{p}}\,p_l\,\frac{\partial F_s}{\partial t} + \int\mathrm{d}{\boldsymbol{p}}\,p_l\,\frac{\partial (v_s^i F_s)}{\partial x^i} - \int\mathrm{d}{\boldsymbol{p}}\,p_l\,\frac{\partial}{\partial p_i} \left(\frac{\partial \mathcal{H}_s}{\partial x^i} \,F_s\right) \nonumber\\ &\quad - \int\mathrm{d}{\boldsymbol{p}}\,p_l\,\frac{\partial}{\partial p_i} \left({\mathsf{D}}_{s,ij}\,\frac{\partial F_s}{\partial p_j}\right) - \int\mathrm{d}{\boldsymbol{p}}\,p_l\,\mathcal{C}_s \nonumber\\ &= \frac{\partial}{\partial t} \int\mathrm{d}{\boldsymbol{p}}\,p_l F_s + \frac{\partial}{\partial x^i}\int\mathrm{d}{\boldsymbol{p}}\, p_l v_s^i F_s + \frac{\partial}{\partial x^l}\int\mathrm{d}{\boldsymbol{p}}\,\varDelta_s F_s + \int\mathrm{d}{\boldsymbol{p}}\,\frac{\partial H_{0s}}{\partial x^l} \,F_s \nonumber\\ &\quad - \int\mathrm{d}{\boldsymbol{p}}\,\varDelta_s\,\frac{\partial F_s}{\partial x^l} + \int\mathrm{d}{\boldsymbol{p}}\,{\mathsf{D}}_{s,lj}\,\frac{\partial F_s}{\partial p_j} - \int\mathrm{d}{\boldsymbol{p}}\,p_l\,\mathcal{C}_s. \end{align}

Let us sum up (D2) over species and also add it with (D1). The contribution of the collision integral disappears due to (6.71), so one obtains

(D3)

\begin{align} 0 &= \frac{\partial}{\partial t} \left( \sum_s \int\mathrm{d}{\boldsymbol{p}}\,p_l F_s + \int \mathrm{d}{\boldsymbol{k}}\,k_l J \right) + \frac{\partial}{\partial x^i} \left( \sum_s \int\mathrm{d}{\boldsymbol{p}}\, p_l v_s^i F_s + \int \mathrm{d}{\boldsymbol{k}}\,k_l v_{\text{g}}^i J \right)\nonumber\\ &\quad + \sum_s\frac{\partial}{\partial x^l}\int\mathrm{d}{\boldsymbol{p}}\,\varDelta_s F_s + \sum_s\int\mathrm{d}{\boldsymbol{p}}\,\frac{\partial H_{0s}}{\partial x^l}\,F_s\nonumber\\ &\quad + \sum_s\int\mathrm{d}{\boldsymbol{p}}\,{\mathsf{D}}_{s,lj}\,\frac{\partial F_s}{\partial p_j} - 2\int \mathrm{d}{\boldsymbol{k}}\,k_l \gamma J\nonumber\\ &\quad - \sum_s \int\mathrm{d}{\boldsymbol{p}}\,\varDelta_s\,\frac{\partial F_s}{\partial x^l} + \int \mathrm{d}{\boldsymbol{k}}\,J\,\frac{\partial w}{\partial x^l}. \end{align}

Next, notice that

(D4)

\begin{align} 2 \int\mathrm{d}{\boldsymbol{k}}\,k_l \gamma J & = 2 {\rm \pi}\sum_s \int\mathrm{d}{\boldsymbol{p}}\,\mathrm{d}{\boldsymbol{k}}\, k_l k_j\,\frac{|{\boldsymbol{\alpha}}_s^{{\dagger}} {\boldsymbol{\eta}}|^2}{\partial_{\omega}\varLambda}\,J \delta(w - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s)\,\frac{\partial F_s}{\partial p_j}\nonumber\\ &= \sum_s \int\mathrm{d}{\boldsymbol{p}}\,{\mathsf{D}}_{s,l j}\,\frac{\partial F_s}{\partial p_j}. \end{align}

Also, assuming that $\smash {{\boldsymbol {\varXi }}_0}$, $\smash {|{\boldsymbol {\alpha }}_s^{{\dagger}} {\boldsymbol {\eta }}|^2}$ and $\smash {{\boldsymbol {\eta }}^{{\dagger}} {\boldsymbol {\wp }}_s{\boldsymbol {\eta }}}$ are independent of $\smash {{\boldsymbol {x}}}$ and using (7.80), one gets

(D5)

\begin{align} & \sum_s \int \mathrm{d}{\boldsymbol{p}}\,\varDelta_s\frac{\partial F_s}{\partial x^l} \nonumber\\ & = \sum_s \int \mathrm{d}{\boldsymbol{p}}\,\frac{\partial F_s}{\partial x^l} \bigg( \frac{\partial}{\partial p_i} {\unicode{x2A0F}} \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\,\frac{k_i}{2(\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s)}\, |{\boldsymbol{\alpha}}_s^{{\dagger}} {\boldsymbol{\eta}}|^2 (h({\boldsymbol{k}}) + h(-{\boldsymbol{k}}))\, \delta(\omega - w({\boldsymbol{k}}))\nonumber\\ & \hphantom{= \sum_s \int \mathrm{d}{\boldsymbol{p}}\,\frac{\partial F_s}{\partial x^l} \bigg(} + \frac{1}{2} \int \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\,({\boldsymbol{\eta}}^{{\dagger}}{\boldsymbol{\wp}}_s{\boldsymbol{\eta}}) (h({\boldsymbol{k}}) + h(-{\boldsymbol{k}}))\,\delta(\omega - w({\boldsymbol{k}}))\bigg)\nonumber\\ & ={-} \frac{1}{2}\sum_s {\unicode{x2A0F}} \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\,\mathrm{d}{\boldsymbol{p}}\, (h({\boldsymbol{k}}) + h(-{\boldsymbol{k}}))\, \delta(\omega - w({\boldsymbol{k}}))\, \frac{k_i |{\boldsymbol{\alpha}}_s^{{\dagger}} {\boldsymbol{\eta}}|^2}{\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s}\, \frac{\partial^2 F_s}{\partial x^l\partial p_i} \nonumber\\ &\hphantom{\,=\,} + \frac{1}{2}\sum_s {\unicode{x2A0F}} \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\,\mathrm{d}{\boldsymbol{p}}\, (h({\boldsymbol{k}}) + h(-{\boldsymbol{k}}))\,\delta(\omega - w({\boldsymbol{k}}))\, \frac{\partial}{\partial x^l}\,({\boldsymbol{\eta}}^{{\dagger}}{\boldsymbol{\wp}}_s{\boldsymbol{\eta}} F_s)\nonumber\\ & ={-} \sum_s \int \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\, h({\boldsymbol{k}})\, \delta(\omega - w({\boldsymbol{k}}))\,\frac{\partial}{\partial x^l} {\unicode{x2A0F}}\mathrm{d}{\boldsymbol{p}}\,\left( \frac{k_i |{\boldsymbol{\alpha}}_s^{{\dagger}} {\boldsymbol{\eta}}|^2}{\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s} \frac{\partial F_s}{\partial p_i} - {\boldsymbol{\eta}}^{{\dagger}}{\boldsymbol{\wp}}_s{\boldsymbol{\eta}}F_s \right) \nonumber\\ & ={-} \sum_s \int \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\, h({\boldsymbol{k}})\, \delta(\omega - w({\boldsymbol{k}}))\,\frac{\partial ({\boldsymbol{\eta}}^{{\dagger}}{\boldsymbol{\varXi}}{\boldsymbol{\eta}})}{\partial x^l} \nonumber\\ & ={-} \sum_s \int \mathrm{d}{\boldsymbol{k}}\, h({\boldsymbol{k}})\,\frac{\partial \varLambda(w({\boldsymbol{k}}), {\boldsymbol{k}})}{\partial x^l} \nonumber\\ & = \sum_s \int \mathrm{d}{\boldsymbol{k}}\, J\,\frac{\partial w}{\partial x^l}, \end{align}

where we also used (7.75b). Substituting (D4) and (D5) into (D3) leads to (7.87).

D.2 Energy conservation

Let us multiply (7.83) by $w$ and integrate over ${\boldsymbol {k}}$. Then, one obtains

(D6)

\begin{align} 0&= \int \mathrm{d}{\boldsymbol{k}}\,w\,\frac{\partial J}{\partial t} + \int \mathrm{d}{\boldsymbol{k}}\,w\,\frac{\partial(v_{\text{g}}^i J)}{\partial x^i} - \int \mathrm{d}{\boldsymbol{k}}\,w\,\frac{\partial}{\partial k_i}\left(\frac{\partial w}{\partial x^i}J\right) - 2\int \mathrm{d}{\boldsymbol{k}}\,w \gamma J\nonumber\\ &= \frac{\partial}{\partial t} \int \mathrm{d}{\boldsymbol{k}}\,w J - \int \mathrm{d}{\boldsymbol{k}}\,\frac{\partial w}{\partial t}\,J + \frac{\partial}{\partial x^i} \int\mathrm{d}{\boldsymbol{k}}\,w v_{\text{g}}^i J - \int \mathrm{d}{\boldsymbol{k}}\,\frac{\partial w}{\partial x^i}\, v_{\text{g}}^i J \nonumber\\ &\quad + \int \mathrm{d}{\boldsymbol{k}}\,v_{\text{g}}^i\,\frac{\partial w}{\partial x^i}\,J - 2\int \mathrm{d}{\boldsymbol{k}}\,w \gamma J\nonumber\\ &= \frac{\partial}{\partial t} \int \mathrm{d}{\boldsymbol{k}}\,w J + \frac{\partial}{\partial x^i}\int \mathrm{d}{\boldsymbol{k}}\,w v_{\text{g}}^i J - \int \mathrm{d}{\boldsymbol{k}}\,\frac{\partial w}{\partial t}\,J - 2\int \mathrm{d}{\boldsymbol{k}}\,w \gamma J. \end{align}

Similarly, multiplying (7.84) by $\smash {\mathcal {H}_s}$ and integrating over $\smash {{\boldsymbol {p}}}$ yields

(D7)

\begin{align} 0&= \int\mathrm{d}{\boldsymbol{p}}\,\mathcal{H}_s\,\frac{\partial F_s}{\partial t} + \int\mathrm{d}{\boldsymbol{p}}\,\mathcal{H}_s\,\frac{\partial (v_s^i F_s)}{\partial x^i} - \int\mathrm{d}{\boldsymbol{p}}\,\mathcal{H}_s\,\frac{\partial}{\partial p_i} \left(\frac{\partial \mathcal{H}_s}{\partial x^i} \,F_s\right) \nonumber\\ &\quad - \int\mathrm{d}{\boldsymbol{p}}\,\mathcal{H}_s\,\frac{\partial}{\partial p_i} \left({\mathsf{D}}_{s,ij}\,\frac{\partial F_s}{\partial p_j}\right) - \int\mathrm{d}{\boldsymbol{p}}\,\mathcal{H}_s\,\mathcal{C}_s \nonumber\\ &= \frac{\partial}{\partial t} \int\mathrm{d}{\boldsymbol{p}}\,\mathcal{H}_s F_s - \int\mathrm{d}{\boldsymbol{p}}\,\frac{\partial \mathcal{H}_s}{\partial t}\,F_s + \frac{\partial}{\partial x^i}\int\mathrm{d}{\boldsymbol{p}}\, \mathcal{H}_s v_s^i F_s - \int\mathrm{d}{\boldsymbol{p}}\,\frac{\partial \mathcal{H}_s }{\partial x^i}\,v_s^i F_s \nonumber\\ &\quad + \int\mathrm{d}{\boldsymbol{p}}\,v_s^i\,\frac{\partial \mathcal{H}_s}{\partial x^i} \,F_s + \int\mathrm{d}{\boldsymbol{p}}\,v_s^i {\mathsf{D}}_{s,ij}\,\frac{\partial F_s}{\partial p_j} - \int\mathrm{d}{\boldsymbol{p}}\,\mathcal{H}_s\,\mathcal{C}_s \nonumber\\ &= \frac{\partial}{\partial t} \int\mathrm{d}{\boldsymbol{p}}\,\mathcal{H}_s F_s + \frac{\partial}{\partial x^i}\int\mathrm{d}{\boldsymbol{p}}\, \mathcal{H}_s v_s^i F_s - \int\mathrm{d}{\boldsymbol{p}}\,\frac{\partial H_{0s}}{\partial t}\,F_s - \int\mathrm{d}{\boldsymbol{p}}\,\frac{\partial \varDelta_s}{\partial t}\,F_s \nonumber\\ &\quad + \int\mathrm{d}{\boldsymbol{p}}\,v_s^i {\mathsf{D}}_{s,ij}\,\frac{\partial F_s}{\partial p_j} - \int\mathrm{d}{\boldsymbol{p}}\,\mathcal{H}_s\,\mathcal{C}_s \nonumber\\ &= \frac{\partial}{\partial t} \int\mathrm{d}{\boldsymbol{p}}\,H_{0s} F_s + \frac{\partial}{\partial x^i}\int\mathrm{d}{\boldsymbol{p}}\, H_{0s} v_s^i F_s + \frac{\partial}{\partial x^i}\int\mathrm{d}{\boldsymbol{p}}\, \varDelta_s v_s^i F_s - \int\mathrm{d}{\boldsymbol{p}}\,\frac{\partial H_{0s}}{\partial t}\,F_s \nonumber\\ &\quad + \int\mathrm{d}{\boldsymbol{p}}\,\varDelta_s\,\frac{\partial F_s}{\partial t} + \int\mathrm{d}{\boldsymbol{p}}\,v_s^i {\mathsf{D}}_{s,ij}\,\frac{\partial F_s}{\partial p_j} - \int\mathrm{d}{\boldsymbol{p}}\,\mathcal{H}_s\,\mathcal{C}_s. \end{align}

Let us sum up (D7) over species and also add it with (D6). The contribution of the collision integral disappears due to (6.71), so one obtains

(D8)

\begin{align} 0 &= \frac{\partial}{\partial t} \left( \sum_s \int\mathrm{d}{\boldsymbol{p}}\,H_{0s} F_s + \int \mathrm{d}{\boldsymbol{k}}\,w J \right) + \frac{\partial}{\partial x^i} \left( \sum_s \int\mathrm{d}{\boldsymbol{p}}\, H_{0s} v_s^i F_s + \int \mathrm{d}{\boldsymbol{k}}\,w v_{\text{g}}^i J \right)\nonumber\\ &\quad + \frac{\partial}{\partial x^i}\sum_s \int\mathrm{d}{\boldsymbol{p}}\, \varDelta_s v_s^i F_s - \sum_s\int\mathrm{d}{\boldsymbol{p}}\,\frac{\partial H_{0s}}{\partial t}\,F_s \nonumber\\ &\quad + \sum_s \int\mathrm{d}{\boldsymbol{p}}\,v_s^i {\mathsf{D}}_{s,ij}\,\frac{\partial F_s}{\partial p_j} - 2\int \mathrm{d}{\boldsymbol{k}}\,w \gamma J \nonumber\\ &\quad + \sum_s \int\mathrm{d}{\boldsymbol{p}}\,\varDelta_s\,\frac{\partial F_s}{\partial t} - \int \mathrm{d}{\boldsymbol{k}}\,J\,\frac{\partial w}{\partial t}. \end{align}

Next, notice that

(D9)

\begin{align} 2 \int\mathrm{d}{\boldsymbol{k}}\,w \gamma J & = 2 {\rm \pi}\sum_s \int\mathrm{d}{\boldsymbol{p}}\,\mathrm{d}{\boldsymbol{k}}\, w k_j\,\frac{|{\boldsymbol{\alpha}}_s^{{\dagger}} {\boldsymbol{\eta}}|^2}{\partial_{\omega}\varLambda}\,J \delta(w - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s)\,\frac{\partial F_s}{\partial p_j}\nonumber\\ &= \sum_s \int\mathrm{d}{\boldsymbol{p}}\,v_s^i {\mathsf{D}}_{s,i j}\,\frac{\partial F_s}{\partial p_j}. \end{align}

(D10)

\begin{align} & \sum_s \int \mathrm{d}{\boldsymbol{p}}\,\varDelta_s\frac{\partial F_s}{\partial t} \nonumber\\ & = \sum_s \int \mathrm{d}{\boldsymbol{p}}\,\frac{\partial F_s}{\partial t} \,\bigg( \frac{\partial}{\partial p_i} {\unicode{x2A0F}} \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\,\frac{k_i}{2(\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s)}\, |{\boldsymbol{\alpha}}_s^{{\dagger}} {\boldsymbol{\eta}}|^2 (h({\boldsymbol{k}}) + h(-{\boldsymbol{k}}))\, \delta(\omega - w({\boldsymbol{k}})) \nonumber\\ & \hphantom{= \sum_s \int \mathrm{d}{\boldsymbol{p}}\,\frac{\partial F_s}{\partial t} \bigg(} + \frac{1}{2}\int \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\,({\boldsymbol{\eta}}^{{\dagger}}{\boldsymbol{\wp}}_s{\boldsymbol{\eta}}) (h({\boldsymbol{k}}) + h(-{\boldsymbol{k}}))\,\delta(\omega - w({\boldsymbol{k}})) \bigg) \nonumber\\ & ={-} \frac{1}{2}\sum_s {\unicode{x2A0F}} \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\,\mathrm{d}{\boldsymbol{p}}\, (h({\boldsymbol{k}}) + h(-{\boldsymbol{k}}))\,\delta(\omega - w({\boldsymbol{k}}))\, \frac{k_i |{\boldsymbol{\alpha}}_s^{{\dagger}} {\boldsymbol{\eta}}|^2}{\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s}\frac{\partial^2 F_s}{\partial t\partial p_i} \nonumber\\ & \hphantom{\,=\,} + \frac{1}{2}\sum_s {\unicode{x2A0F}} \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\,\mathrm{d}{\boldsymbol{p}}\, (h({\boldsymbol{k}}) + h(-{\boldsymbol{k}}))\,\delta(\omega - w({\boldsymbol{k}}))\, \frac{\partial}{\partial t}\,({\boldsymbol{\eta}}^{{\dagger}}{\boldsymbol{\wp}}_s{\boldsymbol{\eta}} F_s) \nonumber\\ & ={-} \sum_s \int \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\, h({\boldsymbol{k}})\, \delta(\omega - w({\boldsymbol{k}}))\,\frac{\partial}{\partial t} {\unicode{x2A0F}}\mathrm{d}{\boldsymbol{p}}\left( \frac{k_i |{\boldsymbol{\alpha}}_s^{{\dagger}} {\boldsymbol{\eta}}|^2}{\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s} \frac{\partial F_s}{\partial p_i} - {\boldsymbol{\eta}}^{{\dagger}}{\boldsymbol{\wp}}_s{\boldsymbol{\eta}} F_s \right)\nonumber\\ & ={-} \sum_s \int \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\, h({\boldsymbol{k}})\, \delta(\omega - w({\boldsymbol{k}}))\,\frac{\partial ({\boldsymbol{\eta}}^{{\dagger}}{\boldsymbol{\varXi}}{\boldsymbol{\eta}})}{\partial t} \nonumber\\ &={-} \sum_s \int \mathrm{d}{\boldsymbol{k}}\, h({\boldsymbol{k}})\,\frac{\partial \varLambda(w({\boldsymbol{k}}), {\boldsymbol{k}})}{\partial t} \nonumber\\ &= \sum_s \int \mathrm{d}{\boldsymbol{k}}\, J\,\frac{\partial w}{\partial t}, \end{align}

where we also used (7.75a). Substituting (D9) and (D10) into (D8) leads to (7.89).

Appendix E. Uniqueness of the entropy-preserving distribution

Here, we prove that the Boltzmann–Gibbs distribution is the only distribution for which the entropy density $\smash {\sigma }$ is conserved. According to (C11), $\smash {\sigma }$ is conserved when

(E1)

\begin{equation} \delta({\boldsymbol{k}} \cdot ({\boldsymbol{v}}_s - {\boldsymbol{v}}'_{s'}))\, ({\boldsymbol{k}}\cdot{\boldsymbol{G}}_{ss'}({\boldsymbol{p}}, {\boldsymbol{p}}'))^2 = 0 \end{equation}

(for all $\smash {{\boldsymbol {p}}}$, $\smash {{\boldsymbol {p}}'}$ and $\smash {{\boldsymbol {k}}}$, as well as all $\smash {s}$ and $\smash {s'}$), where

(E2)

\begin{equation} {\boldsymbol{G}}_{ss'}({\boldsymbol{p}}, {\boldsymbol{p}}') \doteq \frac{\partial \ln F_s({\boldsymbol{p}})}{\partial {\boldsymbol{p}}} - \frac{\partial \ln F_{s'}({\boldsymbol{p}}')}{\partial {\boldsymbol{p}}'}. \end{equation}

Let us decompose the vector $\smash {{\boldsymbol {G}}_{ss'}({\boldsymbol {p}}, {\boldsymbol {p}}')}$ into components parallel and perpendicular to the vector $\smash {{\boldsymbol {v}}_s - {\boldsymbol {v}}'_{s'}}$:

(E3)

\begin{equation} {\boldsymbol{G}}_{ss'}({\boldsymbol{p}}, {\boldsymbol{p}}') = \lambda_{ss'}({\boldsymbol{v}}_s, {\boldsymbol{v}}'_{s'})\,({\boldsymbol{v}}_s - {\boldsymbol{v}}'_{s'}) + {\boldsymbol{G}}_{ss'}^\perp({\boldsymbol{p}}, {\boldsymbol{p}}'), \end{equation}

where $\smash {\lambda _{ss'}({\boldsymbol {v}}_s, {\boldsymbol {v}}'_{s'})}$ is a scalar function. (Because the velocities are functions of the momenta, one can as well consider $\smash {\lambda _{ss'}}$ as a function of $\smash {{\boldsymbol {p}}}$ and $\smash {{\boldsymbol {p}}'}$.) Due to the presence of the delta function in (E1), the contribution of the first term to (E1) is zero, so (E1) can be written as

(E4)

\begin{equation} \delta({\boldsymbol{k}} \cdot ({\boldsymbol{v}}_s - {\boldsymbol{v}}'_{s'}))\, ({\boldsymbol{k}}\cdot{\boldsymbol{G}}_{ss'}^\perp({\boldsymbol{p}}, {\boldsymbol{p}}'))^2 = 0. \end{equation}

By considering this formula for $\smash {{\boldsymbol {k}}}$ parallel to $\smash {{\boldsymbol {G}}_{ss'}^\perp ({\boldsymbol {p}}, {\boldsymbol {p}}')}$ (and thus perpendicular to $\smash {{\boldsymbol {v}}_s - {\boldsymbol {v}}'_{s'}}$), one finds that $\smash {{\boldsymbol {G}}_{ss'}^\perp ({\boldsymbol {p}}, {\boldsymbol {p}}') = 0}$. Combined with (E2) and (E3), this yields

(E5)

\begin{equation} \frac{\partial \ln F_s({\boldsymbol{p}})}{\partial {\boldsymbol{p}}} - \frac{\partial \ln F_{s'}({\boldsymbol{p}}')}{\partial {\boldsymbol{p}}'} = \lambda_{ss'}({\boldsymbol{v}}_s, {\boldsymbol{v}}'_{s'})\,({\boldsymbol{v}}_s - {\boldsymbol{v}}'_{s'}). \end{equation}

Also, by swapping $\smash {{\boldsymbol {p}} \leftrightarrow {\boldsymbol {p}}'}$ and $\smash {s \leftrightarrow s'}$, one finds that

(E6)

\begin{equation} \lambda_{ss'}({\boldsymbol{v}}_s, {\boldsymbol{v}}'_{s'}) = \lambda_{s's}({\boldsymbol{v}}'_{s'}, {\boldsymbol{v}}_s). \end{equation}

Equation (E5) yields, in particular, thatFootnote ⁴⁶

(E7a)

\begin{align} \frac{\partial \ln F_s({\boldsymbol{p}})}{\partial p_2} - \frac{\partial \ln F_{s'}({\boldsymbol{p}}')}{\partial p_2'} = \lambda_{ss'}({\boldsymbol{v}}_s, {\boldsymbol{v}}'_{s'})\,(v_{s,2} - v'_{s',2}), \end{align}

(E7b)

\begin{align} \frac{\partial \ln F_s({\boldsymbol{p}})}{\partial p_3} - \frac{\partial \ln F_{s'}({\boldsymbol{p}}')}{\partial p_3'} = \lambda_{ss'}({\boldsymbol{v}}_s, {\boldsymbol{v}}'_{s'})\,(v_{s,3} - v'_{s',3}), \end{align}

where we have assumed some coordinate axes in the momentum and velocity space labelled $\smash {(1, 2, 3, \ldots )}$. Then,

(E8a)

\begin{align} \frac{\partial^2\ln F_s({\boldsymbol{p}})}{\partial p_2 \partial v_{s,1}} = \frac{\partial \lambda_{ss'}({\boldsymbol{v}}_s, {\boldsymbol{v}}'_{s'})}{\partial v_{s,1}}\,(v_{s,2} - v'_{s',2}), \end{align}

(E8b)

\begin{align} \frac{\partial^2 \ln F_s({\boldsymbol{p}})}{\partial p_3 \partial v_{s,1}} = \frac{\partial \lambda_{ss'}({\boldsymbol{v}}_s, {\boldsymbol{v}}'_{s'})}{\partial v_{s,1}}\,(v_{s,3} - v'_{s',3}), \end{align}

where the derivative with respect to $\smash {v_{s,1}}$ is taken at fixed $\smash {v_{s, i \ne 1}}$ and at fixed $\smash {{\boldsymbol {v}}'_{s'}}$. Due to (E5), $\smash {\lambda _{ss'}({\boldsymbol {v}}_s, {\boldsymbol {v}}'_{s'})}$ is continuous for all $\smash {F_s}$ and $\smash {F_{s'}}$. (Here, we consider only physical distributions, which are always differentiable.) Then, (E8) leads to

(E9)

\begin{equation} \frac{1}{v_{s,2} - v'_{s',2}}\frac{\partial^2\ln F_s({\boldsymbol{p}})}{\partial p_2 \partial v_{s,1}} = \frac{1}{v_{s,3} - v'_{s',3}}\frac{\partial^2 \ln F_s({\boldsymbol{p}})}{\partial p_3 \partial v_{s,1}}. \end{equation}

By differentiating this with respect to $\smash {v'_{s',2}}$, one obtains

(E10)

\begin{equation} \frac{\partial^2\ln F_s({\boldsymbol{p}})}{\partial p_2 \partial v_{s,1}} = 0, \end{equation}

whence (E8a) yields

(E11)

\begin{equation} \frac{\partial \lambda_{ss'}({\boldsymbol{v}}_s, {\boldsymbol{v}}'_{s'})}{\partial v_{s,1}} = 0. \end{equation}

By repeating this argument for other axes and for $\smash {{\boldsymbol {v}}'}$ instead of $\smash {{\boldsymbol {v}}}$, one can also extend (E11) to

(E12)

\begin{equation} \frac{\partial \lambda_{ss'}({\boldsymbol{v}}_s, {\boldsymbol{v}}'_{s'})}{\partial {\boldsymbol{v}}} = 0, \qquad \frac{\partial \lambda_{ss'}({\boldsymbol{v}}_s, {\boldsymbol{v}}'_{s'})}{\partial {\boldsymbol{v}}'} = 0. \end{equation}

Hence, $\smash {\lambda _{ss'}({\boldsymbol {v}}_s, {\boldsymbol {v}}'_{s'})}$ is actually independent of the velocities; i.e. $\smash {\lambda _{ss'}({\boldsymbol {v}}_s, {\boldsymbol {v}}'_{s'}) = \lambda _{ss'}}$. Using this along with (E6), one also finds that

(E13)

\begin{equation} \lambda_{ss'} = \lambda_{s's}. \end{equation}

Let us rewrite (E5) as follows:

(E14)

\begin{equation} \frac{\partial \ln F_s({\boldsymbol{p}})}{\partial {\boldsymbol{p}}} - \lambda_{ss'} {\boldsymbol{v}}_s = \frac{\partial \ln F_{s'}({\boldsymbol{p}}')}{\partial {\boldsymbol{p}}'} - \lambda_{s's} {\boldsymbol{v}}'_{s'}. \end{equation}

Here, the left-hand side is independent of $\smash {{\boldsymbol {p}}'}$ and the right-hand side is independent of $\smash {{\boldsymbol {p}}}$, so both must be equal to some vector

(E15)

\begin{equation} {\boldsymbol{\mu}}_{ss'} = {\boldsymbol{\mu}}_{s's} \end{equation}

that is independent of both $\smash {{\boldsymbol {p}}}$ and $\smash {{\boldsymbol {p}}'}$. Because $\smash {{\boldsymbol {v}}_s = \partial _{{\boldsymbol {p}}}\mathcal {H}_s}$, this is equivalent to

(E16a)

\begin{equation} \ln F_s({\boldsymbol{p}}) - \lambda_{ss'} \mathcal{H}_s({\boldsymbol{p}}) = {\boldsymbol{\mu}}_{ss'} \cdot {\boldsymbol{p}} + \eta_{ss'} \end{equation}

(and similarly for $\smash {{\boldsymbol {p}}'}$), where the integration constant $\smash {\eta _{ss'}}$ is independent of both $\smash {{\boldsymbol {p}}}$ and $\smash {{\boldsymbol {p}}'}$. This is supposed to hold for any $\smash {s'}$, so one can also write

(E16b)

\begin{equation} \ln F_s({\boldsymbol{p}}) - \lambda_{ss''} \mathcal{H}_s({\boldsymbol{p}}) = {\boldsymbol{\mu}}_{ss''} \cdot {\boldsymbol{p}} + \eta_{ss''}, \end{equation}

where $\smash {s''}$ is any other species index. Subtracting equations (E16) from each other gives

(E17)

\begin{equation} (\lambda_{ss'} - \lambda_{ss''}) \mathcal{H}_s({\boldsymbol{p}}) = ({\boldsymbol{\mu}}_{ss'} - {\boldsymbol{\mu}}_{ss''}) \cdot {\boldsymbol{p}} + \eta_{ss'} - \eta_{ss''}. \end{equation}

By differentiating this with respect to $\smash {{\boldsymbol {p}}}$, one finds

(E18)

\begin{equation} (\lambda_{ss'} - \lambda_{ss''}) {\boldsymbol{v}}_s = {\boldsymbol{\mu}}_{ss'} - {\boldsymbol{\mu}}_{ss''}. \end{equation}

By differentiating this further with respect to $\smash {{\boldsymbol {v}}_s}$, one obtains $\smash {\lambda _{ss'} = \lambda _{ss''}}$. Then, (E18) yields $\smash {{\boldsymbol {\mu }}_{ss'} = {\boldsymbol {\mu }}_{ss''}}$, and (E17) yields $\smash {\eta _{ss'} = \eta _{ss''}}$. In other words, the functions $\smash {\lambda _{ss'}}$, $\smash {{\boldsymbol {\mu }}_{ss'}}$ and $\smash {\eta _{ss'}}$ are independent of their second index and thus can as well be written as

(E19)

\begin{equation} \lambda_{ss'} = \lambda_{s}, \qquad {\boldsymbol{\mu}}_{ss'} = {\boldsymbol{\mu}}_s, \qquad \eta_{ss'} = \eta_s. \end{equation}

But then, (E13) and (E15) also yield $\smash {\lambda _s = \lambda _{s'} \equiv \lambda }$ and $\smash {{\boldsymbol {\mu }}_s = {\boldsymbol {\mu }}_{s'} \equiv {\boldsymbol {\mu }}}$. Therefore, (E16) can be written as

(E20)

\begin{equation} F_s({\boldsymbol{p}}) = \text{const}_s \times \exp(\lambda \mathcal{H}_s({\boldsymbol{p}}) + {\boldsymbol{\mu}} \cdot {\boldsymbol{p}}), \end{equation}

which is the Boltzmann–Gibbs distribution (§ 8.1). This proves that a plasma that conserves its entropy density necessarily has the Boltzmann–Gibbs distribution.

Appendix F. Total momentum and energy

Here, we show that the total momentum and energy in the OC–wave representation equals the total momentum and energy in the particle–field representation.

F.1 Non-relativistic electrostatic interactions

F.1.1 Momentum

Assuming the notation $\smash {\mathcal {P}_l \doteq \sum _s \int \mathrm {d}{\boldsymbol {p}}\,p_l \overline {f}_s}$ and using (9.17) for $\smash {{\boldsymbol {\Theta }}_s}$, one can represent the OC momentum density as follows:

(F1)

\begin{align} \sum_s \int \mathrm{d}{\boldsymbol{p}}\,p_l F_s & = \mathcal{P}_l + \frac{1}{2}\sum_s \int \mathrm{d}{\boldsymbol{p}}\,p_l\,\frac{\partial}{\partial p_i}\left(\Theta_{s,ij}\,\frac{\partial \overline{f}_s}{\partial p_j}\right) \nonumber\\ & \approx \mathcal{P}_l - \frac{1}{2}\sum_s \int \mathrm{d}{\boldsymbol{p}}\,\Theta_{s,lj}\,\frac{\partial F_s}{\partial p_j} \nonumber\\ & = \mathcal{P}_l -\int \mathrm{d}{\boldsymbol{k}}\,h({\boldsymbol{k}})\left.\frac{\partial}{\partial \vartheta}\sum_s e_s^2 {\unicode{x2A0F}} \mathrm{d}{\boldsymbol{p}}\, \frac{k_l}{w({\boldsymbol{k}}) - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s + \vartheta}\, {\boldsymbol{k}}\cdot\frac{\partial F_s}{\partial {\boldsymbol{p}}} \right|_{\vartheta=0}\nonumber\\ & = \mathcal{P}_l -\int \mathrm{d}{\boldsymbol{k}}\, k_l h({\boldsymbol{k}})\left.\frac{\partial}{\partial \vartheta}\left(\frac{k^2(\epsilon_{{\parallel}\text{H}}(w({\boldsymbol{k}}) + \vartheta, {\boldsymbol{k}})-1)}{4{\rm \pi}}\right) \right|_{\vartheta=0} \nonumber\\ & = \mathcal{P}_l - \int \mathrm{d}{\boldsymbol{k}}\,k_l J, \end{align}

where we substituted (9.16). This leads to (9.18).

F.1.2 Energy

Assuming the notation $\smash {\mathcal {K} \doteq \sum _s \int \mathrm {d}{\boldsymbol {p}}\,H_{0s} \overline {f}_s}$ and using (9.17) for $\smash {{\boldsymbol {\Theta }}_s}$, one can represent the OC energy density as follows:

(F2)

\begin{align} \sum_s \int \mathrm{d}{\boldsymbol{p}}\,H_{0s} F_s & = \mathcal{K} + \frac{1}{2}\sum_s \int \mathrm{d}{\boldsymbol{p}}\,\frac{p^2}{2m_s}\,\frac{\partial}{\partial p_i}\left(\Theta_{s,ij}\,\frac{\partial \overline{f}_s}{\partial p_j}\right) \nonumber\\ & \approx \mathcal{K} - \frac{1}{2}\sum_s \int \mathrm{d}{\boldsymbol{p}}\,v_s^i\Theta_{s,ij}\,\frac{\partial F_s}{\partial p_j} \nonumber\\ & = \mathcal{K} -\int \mathrm{d}{\boldsymbol{k}}\,h({\boldsymbol{k}})\,\frac{\partial}{\partial \vartheta}\sum_s e_s^2 {\unicode{x2A0F}} \mathrm{d}{\boldsymbol{p}} \left. \frac{{\boldsymbol{k}}\cdot{{\boldsymbol{v}}}}{w({\boldsymbol{k}}) - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s + \vartheta}\, {\boldsymbol{k}}\cdot\frac{\partial F_s}{\partial {\boldsymbol{p}}} \right|_{\vartheta=0}. \end{align}

Notice that

(F3)

\begin{align} \frac{\partial}{\partial \vartheta}\frac{{\boldsymbol{k}}\cdot{{\boldsymbol{v}}}}{w({\boldsymbol{k}}) - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s + \vartheta} & = \frac{\partial}{\partial \vartheta}\left({-}1 + \frac{w({\boldsymbol{k}}) + \vartheta}{w({\boldsymbol{k}}) - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s + \vartheta}\right) \nonumber\\ & = \frac{1}{w({\boldsymbol{k}}) - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s + \vartheta} + w({\boldsymbol{k}})\,\frac{\partial}{\partial \vartheta} \frac{1}{w({\boldsymbol{k}}) - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s + \vartheta}. \end{align}

Then,

(F4)

\begin{align} \sum_s \int \mathrm{d}{\boldsymbol{p}}\,H_{0s} F_s &= \mathcal{K} -\int \mathrm{d}{\boldsymbol{k}}\,w({\boldsymbol{k}}) h({\boldsymbol{k}})\,\frac{\partial}{\partial \vartheta} \sum_s e_s^2 {\unicode{x2A0F}} \mathrm{d}{\boldsymbol{p}} \left. \frac{{\boldsymbol{k}}}{w({\boldsymbol{k}}) - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s + \vartheta}\, \cdot\frac{\partial F_s}{\partial {\boldsymbol{p}}} \right|_{\vartheta=0}\nonumber\\ &\quad -\int \mathrm{d}{\boldsymbol{k}}\,h({\boldsymbol{k}}) \sum_s e_s^2 {\unicode{x2A0F}} \mathrm{d}{\boldsymbol{p}}\, \frac{{\boldsymbol{k}}}{w({\boldsymbol{k}}) - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s}\, \cdot\frac{\partial F_s}{\partial {\boldsymbol{p}}}\nonumber\\ &= \mathcal{K} -\int \mathrm{d}{\boldsymbol{k}}\,w({\boldsymbol{k}}) h({\boldsymbol{k}}) \left.\frac{\partial}{\partial \vartheta}\left(\frac{k^2(\epsilon_{{\parallel}\text{H}}(w({\boldsymbol{k}}) + \vartheta, {\boldsymbol{k}})-1)}{4{\rm \pi}}\right) \right|_{\vartheta=0} \nonumber\\ &\quad -\int \mathrm{d}{\boldsymbol{k}}\,h({\boldsymbol{k}})\, \frac{k^2(\epsilon_{{\parallel}\text{H}}(w({\boldsymbol{k}}), {\boldsymbol{k}})-1)}{4{\rm \pi}}. \end{align}

Using (9.14) and (9.16), one obtains that the sum of the OC and wave energy is given by

(F5)

\begin{align} \sum_s \int \mathrm{d}{\boldsymbol{p}}\,H_{0s} F_s + \int \mathrm{d}{\boldsymbol{k}}\,w J & = \mathcal{K} + \int \mathrm{d}{\boldsymbol{k}}\,h({\boldsymbol{k}})\,\frac{k^2}{4{\rm \pi}} \nonumber\\ & = \mathcal{K} + \sum_\sigma\frac{\overline{k}_\sigma^2 |{\breve{\varphi}}_\sigma|^2}{16{\rm \pi}} \nonumber\\ & = \sum_s \int \mathrm{d}{\boldsymbol{p}}\left(\frac{p^2}{2m_s} + e_s\overline{\varphi}_s\right)\overline{f}_s + \frac{1}{8{\rm \pi}}\,\overline{\smash{\widetilde{{\boldsymbol{E}}}}^{{\dagger}} \widetilde{{\boldsymbol{E}}}}, \end{align}

where we also substituted (7.64).

F.2 Relativistic electromagnetic interactions

F.2.1 Momentum

Let us assume the notation $\smash {{\boldsymbol {\mathcal {P}}} \doteq \sum _s \int \mathrm {d}{\boldsymbol {p}}\,{\boldsymbol {p}} \overline {f}_s}$ and

(F6)

\begin{equation} {\boldsymbol{\chi}}(\omega, {\boldsymbol{k}}) \doteq \sum_s \frac{4{\rm \pi} e_s^2}{\omega^2}{\unicode{x2A0F}} \mathrm{d}{\boldsymbol{p}}\, \frac{{\boldsymbol{v}}_s {\boldsymbol{v}}_s^{{\dagger}}}{\omega - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s}\,{\boldsymbol{k}} \cdot \frac{\partial F_s}{\partial{\boldsymbol{p}}} = {\boldsymbol{\epsilon}}(\omega, {\boldsymbol{k}}) - {\boldsymbol{1}} + \frac{{\boldsymbol{\mathfrak{w}}}_p}{\omega^2}. \end{equation}

Then, using (9.52) for $\smash {{\boldsymbol {\Theta }}_s}$, one can represent the OC momentum density as follows:

(F7)

\begin{align} \sum_s \int \mathrm{d}{\boldsymbol{p}}\,p_l F_s & \approx \mathcal{P}_l - \frac{1}{2}\sum_s \int \mathrm{d}{\boldsymbol{p}}\,\Theta_{s,lj}\,\frac{\partial F_s}{\partial p_j} \nonumber\\ & = \mathcal{P}_l -\int \mathrm{d}{\boldsymbol{k}}\,k_l h({\boldsymbol{k}})\left.\frac{\partial}{\partial \vartheta}\sum_s e_s^2 {\unicode{x2A0F}} \mathrm{d}{\boldsymbol{p}}\, \frac{1}{w^2({\boldsymbol{k}})} \frac{({\boldsymbol{\eta}}^{{\dagger}} {\boldsymbol{v}}_s {\boldsymbol{v}}_s^{{\dagger}} {\boldsymbol{\eta}})}{w({\boldsymbol{k}}) - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s + \vartheta}\,{\boldsymbol{k}} \cdot \frac{\partial F_s}{\partial p_j} \right|_{\vartheta=0}\nonumber\\ & = \mathcal{P}_l -\int \mathrm{d}{\boldsymbol{k}}\, \frac{k_l h({\boldsymbol{k}})}{4{\rm \pi} w^2({\boldsymbol{k}})} \left.{\boldsymbol{\eta}}^{{\dagger}}\frac{\partial}{\partial \omega}\left( \omega^2{\boldsymbol{\chi}}(\omega, {\boldsymbol{k}}) \right){\boldsymbol{\eta}} \right|_{\omega = w({\boldsymbol{k}})}\nonumber\\ & = \mathcal{P}_l + \int \mathrm{d}{\boldsymbol{k}}\,\frac{k_l h({\boldsymbol{k}})}{2{\rm \pi} w({\boldsymbol{k}})} -\int \mathrm{d}{\boldsymbol{k}}\, \frac{k_l h({\boldsymbol{k}})}{4{\rm \pi} w^2({\boldsymbol{k}})} \left.{\boldsymbol{\eta}}^{{\dagger}}\frac{\partial}{\partial \omega}\left( \omega^2{\boldsymbol{\epsilon}}(\omega, {\boldsymbol{k}}) \right){\boldsymbol{\eta}} \right|_{\omega = w({\boldsymbol{k}})} \nonumber\\ & = \mathcal{P}_l + \int \mathrm{d}{\boldsymbol{k}}\,\frac{k_l}{4{\rm \pi} w({\boldsymbol{k}})}\,(h({\boldsymbol{k}}) + h(-{\boldsymbol{k}})) - \int \mathrm{d}{\boldsymbol{k}}\,k_l J, \end{align}

where we substituted (9.51) and used (7.23). Next, let us rewrite (9.54) as

(F8)

\begin{equation} {\boldsymbol{\mathcal{P}}} = {\boldsymbol{\mathcal{P}}}^{(\text{kin})} + \overline{(\widetilde{{\boldsymbol{A}}}/c)\sum_s e_s \textstyle \int \mathrm{d}{\boldsymbol{p}}\,f_s} = {\boldsymbol{\mathcal{P}}}^{(\text{kin})} + \frac{1}{4 {\rm \pi}c}\, \overline{\widetilde{{\boldsymbol{A}}} (\nabla \cdot \widetilde{{\boldsymbol{E}}})}, \end{equation}

where the last equality is due to Gauss's law. This gives

(F9)

\begin{equation} \mathcal{P}_l - \mathcal{P}^{(\text{kin})}_l = \frac{1}{4 {\rm \pi}}\, \overline{(-\mathrm{i}\widehat{\omega}^{{-}1}\widetilde{E}_l)(\partial_j \widetilde{E}^j)} ={-}\frac{\mathrm{i}}{4 {\rm \pi}}\, \overline{(\widehat{\omega}^{{-}1}\widetilde{E}_l)(\partial_j \widetilde{E}^j)^*}. \end{equation}

Then, using (2.53) and also (7.63) for $\smash {{{\boldsymbol {\mathsf {U}}}}}$, one obtains

(F10)

\begin{align} \mathcal{P}_l - \mathcal{P}^{(\text{kin})}_l & \approx{-}\frac{\mathrm{i}}{4 {\rm \pi}}\, \int \mathrm{d}\omega\,\mathrm{d}{\boldsymbol{k}}\,\omega^{{-}1} {\mathsf{U}}_l{}^j(\omega, {\boldsymbol{k}})(\mathrm{i} k_j)^* \nonumber\\ & \approx{-} \int \mathrm{d}{\boldsymbol{k}}\,\frac{{\boldsymbol{k}} \cdot {\boldsymbol{\eta}}^*}{4{\rm \pi} w({\boldsymbol{k}})}\,\eta_l (h({\boldsymbol{k}}) + h(-{\boldsymbol{k}})), \end{align}

and thus (F7) can be written as follows:

(F11)

\begin{align} \sum_s \int \mathrm{d}{\boldsymbol{p}}\,{\boldsymbol{p}} F_s & + \int \mathrm{d}{\boldsymbol{k}}\,{\boldsymbol{k}} J \approx {\boldsymbol{\mathcal{P}}} + \int \mathrm{d}{\boldsymbol{k}}\,\frac{{\boldsymbol{k}}}{4{\rm \pi} w({\boldsymbol{k}})}\,(h({\boldsymbol{k}}) + h(-{\boldsymbol{k}})) \nonumber\\ & \approx {\boldsymbol{\mathcal{P}}}^{(\text{kin})} + \int \mathrm{d}{\boldsymbol{k}}\,\frac{1}{4{\rm \pi} w({\boldsymbol{k}})}\, ({\boldsymbol{k}} - {\boldsymbol{\eta}} ({\boldsymbol{k}} \cdot {\boldsymbol{\eta}}^*))\, (h({\boldsymbol{k}}) + h(-{\boldsymbol{k}})) \nonumber\\ & = {\boldsymbol{\mathcal{P}}}^{(\text{kin})} + \operatorname{re} \int \mathrm{d}{\boldsymbol{k}}\,\frac{h({\boldsymbol{k}})}{2{\rm \pi} w({\boldsymbol{k}})}\, {\boldsymbol{\eta}}^* \times ({\boldsymbol{k}} \times {\boldsymbol{\eta}}), \end{align}

where we used (7.7) and (7.23) again. For an eikonal wave (9.57), which has $\smash {h({\boldsymbol {k}}) = \delta ({\boldsymbol {k}} - \overline {{\boldsymbol {k}}})|{\breve {{\boldsymbol {E}}}}|^2/4}$ (§ 7.4.1), this gives

(F12)

\begin{equation} \operatorname{re} \int \mathrm{d}{\boldsymbol{k}}\,\frac{h({\boldsymbol{k}})}{2{\rm \pi} w({\boldsymbol{k}})}\, {\boldsymbol{\eta}}^* \times ({\boldsymbol{k}} \times {\boldsymbol{\eta}}) = \frac{1}{8{\rm \pi} c} \,\operatorname{re} \left(\smash{{\breve{{\boldsymbol{E}}}}}^* \times \left(\frac{c\overline{{\boldsymbol{k}}}}{\overline{\omega}} \times {\breve{{\boldsymbol{E}}}}\right)\right) = \frac{\overline{\widetilde{{\boldsymbol{E}}} \times \widetilde{{\boldsymbol{B}}}}}{4{\rm \pi} c}. \end{equation}

In case of a broadband spectrum, the same equality applies as well, because contributions of the individual eikonal waves to both left-hand side and the right-hand side are additive. (Alternatively, one can invoke (2.53) again.) This leads to (9.53).

F.2.2 Energy

Assuming the notation $\smash {\mathcal {K} \doteq \sum _s \int \mathrm {d}{\boldsymbol {p}}\,H_{0s} \overline {f}_s}$ and using (9.52) for $\smash {{\boldsymbol {\Theta }}_s}$, one can represent the OC energy density as follows:

\begin{align*} \sum_s \int &\mathrm{d}{\boldsymbol{p}}\,H_{0s} F_s \approx \mathcal{K} - \frac{1}{2}\sum_s \int \mathrm{d}{\boldsymbol{p}}\,v_s^i\Theta_{s,ij}\,\frac{\partial F_s}{\partial p_j}\\ & \approx \mathcal{K} - \int \mathrm{d}{\boldsymbol{k}}\,h({\boldsymbol{k}})\,\frac{\partial}{\partial \vartheta}\sum_s e_s^2 {\unicode{x2A0F}} \mathrm{d}{\boldsymbol{p}} \left. \frac{({\boldsymbol{k}} \cdot {\boldsymbol{v}}_s)}{w^2({\boldsymbol{k}})} \frac{({\boldsymbol{\eta}}^{{\dagger}} {\boldsymbol{v}}_s {\boldsymbol{v}}_s^{{\dagger}} {\boldsymbol{\eta}})}{w({\boldsymbol{k}}) - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s + \vartheta}\, {\boldsymbol{k}} \cdot \frac{\partial F_s}{\partial {\boldsymbol{p}}} \right|_{\vartheta=0}. \end{align*}

Using (F3) and (F6) for $\smash {{\boldsymbol {\chi }}}$, one further obtains

\begin{align*} \sum_s \int &\mathrm{d}{\boldsymbol{p}}\,H_{0s} F_s \\ =\,& \mathcal{K} -\int \mathrm{d}{\boldsymbol{k}}\,\frac{h({\boldsymbol{k}})}{4{\rm \pi} w({\boldsymbol{k}})} \left. {\boldsymbol{\eta}}^{{\dagger}}\,\frac{\partial}{\partial \vartheta}\bigg( \sum_s 4{\rm \pi} e_s^2 {\unicode{x2A0F}} \mathrm{d}{\boldsymbol{p}}\, \frac{{\boldsymbol{v}}_s {\boldsymbol{v}}_s^{{\dagger}}}{w({\boldsymbol{k}}) - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s + \vartheta}\, {\boldsymbol{k}} \cdot \frac{\partial F_s}{\partial {\boldsymbol{p}}} \bigg){\boldsymbol{\eta}} \right|_{\vartheta=0} \\ & -\int \mathrm{d}{\boldsymbol{k}}\,\frac{h({\boldsymbol{k}})}{4{\rm \pi}}\, {\boldsymbol{\eta}}^{{\dagger}}\,\bigg( \sum_s \frac{4{\rm \pi} e_s^2}{w^2({\boldsymbol{k}})} {\unicode{x2A0F}} \mathrm{d}{\boldsymbol{p}}\, \frac{{\boldsymbol{v}}_s {\boldsymbol{v}}_s^{{\dagger}}}{w({\boldsymbol{k}}) - {\boldsymbol{k}} \cdot {\boldsymbol{v}}_s}\, {\boldsymbol{k}} \cdot \frac{\partial F_s}{\partial {\boldsymbol{p}}} \bigg){\boldsymbol{\eta}}\\ =\,& \mathcal{K} -\int \mathrm{d}{\boldsymbol{k}}\,\frac{h({\boldsymbol{k}})}{4{\rm \pi} w({\boldsymbol{k}})} \left. {\boldsymbol{\eta}}^{{\dagger}} \frac{\partial(\omega^2{\boldsymbol{\chi}}(\omega, {\boldsymbol{k}}))}{\partial\omega} {\boldsymbol{\eta}} \right|_{\omega = w({\boldsymbol{k}})} -\int \mathrm{d}{\boldsymbol{k}}\,\frac{h({\boldsymbol{k}})}{4{\rm \pi}}\,{\boldsymbol{\eta}}^{{\dagger}}{\boldsymbol{\chi}}{\boldsymbol{\eta}} \\ =\,& \mathcal{K} -\int \mathrm{d}{\boldsymbol{k}}\,\frac{h({\boldsymbol{k}})}{4{\rm \pi} w({\boldsymbol{k}})} \left. {\boldsymbol{\eta}}^{{\dagger}} \frac{\partial(\omega^2{\boldsymbol{\epsilon}}(\omega, {\boldsymbol{k}}))}{\partial\omega} {\boldsymbol{\eta}} \right|_{\omega = w({\boldsymbol{k}})} + \int \mathrm{d}{\boldsymbol{k}}\,\frac{h({\boldsymbol{k}})}{2{\rm \pi}}\\ &- \int \mathrm{d}{\boldsymbol{k}}\,\frac{h({\boldsymbol{k}})}{4{\rm \pi}}\,{\boldsymbol{\eta}}^{{\dagger}}\left( {\boldsymbol{\epsilon}}(w({\boldsymbol{k}}), {\boldsymbol{k}}) - {\boldsymbol{1}} + \frac{{\boldsymbol{\mathfrak{w}}}_p}{w^2({\boldsymbol{k}})} \right){\boldsymbol{\eta}}\\ =\,& \mathcal{K} -\int \mathrm{d}{\boldsymbol{k}}\,w J + \int \mathrm{d}{\boldsymbol{k}}\,\frac{3h({\boldsymbol{k}})}{4{\rm \pi}} - \int \mathrm{d}{\boldsymbol{k}}\,\frac{h({\boldsymbol{k}})}{4{\rm \pi}}\,{\boldsymbol{\eta}}^{{\dagger}}{\boldsymbol{\epsilon}}(w({\boldsymbol{k}}), {\boldsymbol{k}}){\boldsymbol{\eta}} \\ & - \int \mathrm{d}{\boldsymbol{k}}\,\frac{h({\boldsymbol{k}})}{4{\rm \pi} w^2({\boldsymbol{k}})}\,{\boldsymbol{\eta}}^{{\dagger}}{\boldsymbol{\mathfrak{w}}}_p{\boldsymbol{\eta}}. \end{align*}

Using (9.50) and proceeding as in § F.2.1, one can also cast this as follows:

(F13)

\begin{equation} \sum_s \int \mathrm{d}{\boldsymbol{p}} H_{0s} F_s + \int \mathrm{d}{\boldsymbol{k}}\,w J = \mathcal{K} + \frac{1}{8{\rm \pi}}\,\big(3\overline{\smash{\widetilde{{\boldsymbol{E}}}}^{{\dagger}} \widetilde{{\boldsymbol{E}}}} -\overline{\smash{\widetilde{{\boldsymbol{B}}}}^{{\dagger}} \widetilde{{\boldsymbol{B}}}}\big) - \frac{1}{8{\rm \pi} c^2}\, \overline{\smash{\widetilde{{\boldsymbol{A}}}}^{{\dagger}}{\boldsymbol{\mathfrak{w}}}_p\widetilde{{\boldsymbol{A}}}}. \end{equation}

Now, notice that

(F14)

\begin{align} \mathcal{K} & = \sum_s \int\mathrm{d}{\boldsymbol{p}}\,\overline{H_{0s}({\boldsymbol{p}})f_s^{(\text{kin})}({\boldsymbol{p}} - e_s\widetilde{{\boldsymbol{A}}}/c)} \nonumber\\ & = \sum_s \int\mathrm{d}{\boldsymbol{p}}\,\overline{H_{0s}({\boldsymbol{p}} + e_s\widetilde{{\boldsymbol{A}}}/c)f_s^{(\text{kin})}({\boldsymbol{p}})} \nonumber\\ & \approx \mathcal{K}^{(\text{kin})} + \overline{(\widetilde{{\boldsymbol{A}}}/c) \cdot \sum_s e_s \textstyle \int\mathrm{d}{\boldsymbol{p}}\,{\boldsymbol{v}}_s f_s^{(\text{kin})}} + \sum_s \frac{e_s^2}{2c^2} \int\mathrm{d}{\boldsymbol{p}}\,\overline{(\widetilde{{\boldsymbol{A}}} {\boldsymbol{\mu}}_s^{{-}1} \widetilde{{\boldsymbol{A}}})\,f_s^{(\text{kin})}} \nonumber\\ & \approx \mathcal{K}^{(\text{kin})} + \frac{1}{c}\,\overline{\widetilde{{\boldsymbol{A}}} \cdot \widetilde{{\boldsymbol{j}}}} + \frac{1}{8{\rm \pi} c^2}\, \overline{\smash{\widetilde{{\boldsymbol{A}}}}^{{\dagger}}{\boldsymbol{\mathfrak{w}}}_p\widetilde{{\boldsymbol{A}}}}, \end{align}

where $\smash {\widetilde {{\boldsymbol {j}}}}$ is the oscillating-current density. From Ampere's law,

(F15)

\begin{align} \frac{1}{c}\,\overline{\widetilde{{\boldsymbol{A}}} \cdot \widetilde{{\boldsymbol{j}}}} & = \overline{\frac{(-\mathrm{i} \widehat{\omega}^{{-}1}\widetilde{{\boldsymbol{E}}})^{{\dagger}}}{4{\rm \pi}} \big(\mathrm{i} c \widehat{\boldsymbol{k}} \times \widetilde{{\boldsymbol{B}}} + \mathrm{i} \widehat{\omega}\widetilde{{\boldsymbol{E}}}\big)} \nonumber\\ & \approx{-}\overline{\frac{\smash{\widetilde{{\boldsymbol{E}}}}^{{\dagger}} \widetilde{{\boldsymbol{E}}}}{4{\rm \pi}}} -\overline{\frac{\widetilde{{\boldsymbol{E}}}}{4{\rm \pi}} \cdot\bigg(\frac{c \widehat{\boldsymbol{k}}}{\widehat{\omega}} \times \widetilde{{\boldsymbol{B}}}\bigg)} \nonumber\\ & \approx{-}\overline{\frac{\smash{\widetilde{{\boldsymbol{E}}}}^{{\dagger}} \widetilde{{\boldsymbol{E}}}}{4{\rm \pi}}} -\overline{\bigg(\widetilde{{\boldsymbol{E}}} \times \frac{c \widehat{\boldsymbol{k}}}{\widehat{\omega}}\bigg) \cdot \frac{\widetilde{{\boldsymbol{B}}}}{4{\rm \pi}}} \nonumber\\ & \approx \frac{1}{4{\rm \pi}}\,\big(\overline{\smash{\widetilde{{\boldsymbol{B}}}}^{{\dagger}} \widetilde{{\boldsymbol{B}}}} - \overline{\smash{\widetilde{{\boldsymbol{E}}}}^{{\dagger}} \widetilde{{\boldsymbol{E}}}}\big). \end{align}

Substituting (F14) and (F15) into (F13) leads to (9.55).

Appendix G. Selected notation

This paper uses the following notation (also see § 2 for the index convention):

Footnotes

¹ The standard QLT (as in, for example, Drummond & Pines Reference Drummond and Pines1962) does not properly conserve energy–momentum either, even though it is formally conservative (see § 7.3.2).

² We treat the traditional QL approximation as a given mathematical model. We seek to push this model to its limits rather than to examine its validity, which is a separate issue. For discussions on the validity of the QL approximation, see Besse et al. (Reference Besse, Elskens, Escande and Bertrand2011), Escande et al. (Reference Escande, Bénisti, Elskens, Zarzoso and Doveil2018) and Crews & Shumlak (Reference Crews and Shumlak2022).

³ This excludes periodic boundary conditions, albeit not entirely (§ 3.1). Other than that, the spacetime metric can still be non-Euclidean, as illustrated by an application to relativistic gravity in § 9.4. See also the footnotes in §§ 2.1.3 and 6.1.

⁴ A common notation is $\smash {{\boldsymbol {\xi }}^{{\dagger}} {\boldsymbol {\psi }} = {\boldsymbol {\xi }} \cdot {\boldsymbol {\psi }}}$, but we reserve the dot-product notation for a scalar product of different quantities (§ 2.1.3).

⁵ Spaces with periodic boundary conditions require a different approach (Rigas et al. Reference Rigas, Sánchez-Soto, Klimov, Řeháček and Hradil2011), so they are not considered here (yet see § 3.1). That said, for a system that is large enough, the boundary conditions are unimportant; then the toolbox presented here is applicable as is.

⁶ More precisely, $\smash {\left. {\boldsymbol {\mathsf {|x}}} \right\rangle}$ is the ket $\smash {\left. {\boldsymbol {\mathsf {|{\mathfrak {e}(\widehat {{{\boldsymbol {\mathsf {x}}}}}; {{{\boldsymbol {\mathsf {x}}}}})} }}} \right\rangle}$ that is an eigenvector of each $\smash {\widehat {{\mathsf {x}}}^i}$ with the corresponding eigenvalues being $\smash {\mathsf{x}^i}$. A similar comment applies to $\smash {\left. {\boldsymbol {\mathsf {|k}}} \right\rangle}$.

⁷ Analytic continuation to complex arguments is possible, but by default, $\smash {{\boldsymbol {\mathsf {x}}}}$ and $\smash {{\boldsymbol {\mathsf {k}}}}$ are real.

⁸ By construction, $\smash {\widehat {\boldsymbol {{\boldsymbol {\mathsf {W}}}}}_{{\boldsymbol {\psi }}}}$ is a matrix with mixed indices, $\smash {(\widehat {\boldsymbol {{\boldsymbol {\mathsf {W}}}}}_{{\boldsymbol {\psi }}})^i{}_j}$. In §§ 5.1 and 5.2, we also operate with a Wigner matrix that has two upper indices. In such cases, the dagger $\smash {^{{\dagger}} }$ in (2.51) is assumed replaced with complex conjugation. For the said sections, this means that $\smash {^{{\dagger}} }$ in (2.51) can be simply omitted, because the field of interest there is real.

⁹ However, the index $\smash {\sigma }$ is reserved as a tag for individual particles and waves.

¹⁰ Note that the inner product (2.59) is different from (2.1). Still, we use the same notation assuming it will be clear from the context which inner product is used in each given case.

¹¹ An exception will be made for eikonal waves, specifically, for quantities evaluated on the local wavevector $\overline {{\boldsymbol {\mathsf {k}}}} \equiv (-\overline {\omega }, \overline {{\boldsymbol {k}}})$.

¹² As a reminder, the notation $\smash {A = \mathcal {O}(\epsilon )}$ does not rule out the possibility that $\smash {A/\epsilon }$ is small. Also note that the terms ‘$\sim$’ and ‘of order’ in this paper mean the same as ‘$\mathcal {O}$’.

¹³ Using the auxiliary variable $\smash {\tau }$ allows us to express the propagator as a regular exponential, rather than ordered exponential, even for $t$-dependent $\smash {\overline {H}}$, because $\smash {\widehat {L}}$ is independent of $\smash {\tau }$.

¹⁴ Unlike classic plasma-wave theory, this approach does not involve spectral decomposition, so there is no need to consider fields that are exponential in time on the whole interval $\smash {(-\infty, \infty )}$.

¹⁵ Starting with § 5.6, we will assume $\smash {\mathrm {d}_t\overline {f} \sim \epsilon \varepsilon ^2 \overline {f}}$ instead.

¹⁶ By the plasma parameter we mean the number of particles within the Debye sphere.

¹⁷ In terms of $\smash {t'\doteq t - \tau }$, (4.2) has a more recognizable form $\smash {\mathrm {d} Y^{a}/\mathrm {d} t' = V^{a}({\boldsymbol {Y}})}$, with $\smash {Y^a(t' = t) = X^a}$.

¹⁸ We use the Zassenhaus formula $\smash {\mathrm {e}^{\widehat {A} + \widehat {B}} = \mathrm {e}^{\widehat {A}}\, \mathrm {e}^{\widehat {B}}\, \mathrm {e}^{-[\widehat {A},\widehat {B}]/2} \mathrm {e}^{[\widehat {B}, [\widehat {A}, \widehat {B}]]/3+[\widehat {A}, [\widehat {A}, \widehat {B}]]/6} \cdots }$

¹⁹ Taylor-expanding delta functions is admittedly a questionable procedure, but here it is understood as a shorthand for Taylor-expanding integrals of $\smash {f}$.

²⁰ One can also derive (5.2) formally from (2.71).

²¹ The difference between $\smash {F}$ and $\smash {\overline {f}}$ is related to the concept of so-called adiabatic diffusion (Galeev & Sagdeev Reference Galeev and Sagdeev1985; Stix Reference Stix1992), which captures some but not all adiabatic effects.

²² The advantage of the amended definition (5.32) is that it will lead to exact conservation laws of our theory, as to be discussed in § 7.5.

²³ Remember that here we neglect $\smash {\varGamma }$ (3.34), which is a part of the collision operator to be reinstated in § 6.

²⁴ See § 9 for examples and § 6.6 for the explanation on how $\smash {\varPhi }$ is related to $\smash {\varDelta }$, which is yet to be introduced. Also note that in combination with (5.40), equation (5.43) generalizes the related results from Kentwell (Reference Kentwell1987), Fraiman & Kostyukov (Reference Fraiman and Kostyukov1995) and Dodin & Fisch (Reference Dodin and Fisch2014).

²⁵ The field action often has the form $\smash {S_0 = \frac {1}{2}\int \mathrm {d}{\boldsymbol {\mathsf {x}}}\,\sqrt {\mathfrak {g}}\,({\boldsymbol {\mathfrak {g}}}\smash {\widetilde {{\boldsymbol {\varPsi }}}}^*)\widehat {\boldsymbol {\varXi }}_0 \widetilde {{\boldsymbol {\varPsi }}}}$, where ${{\boldsymbol {\mathfrak {g}}}({\boldsymbol {\mathsf {x}}})}$ is a spacetime metric, $\smash {\mathfrak {g} \doteq |\det {\boldsymbol {\mathfrak {g}}}\,|}$, and $\smash {\widehat {\boldsymbol {\varXi }}_0}$ is Hermitian with respect to the inner product $\smash {\langle {\boldsymbol {\xi }}|{\boldsymbol {\psi }} \rangle _\mathfrak {g} \doteq \int \mathrm {d}{\boldsymbol {\mathsf {x}}}\,\sqrt {\mathfrak {g}}\,({\boldsymbol {\mathfrak {g}}}{\boldsymbol {\xi }}^*){\boldsymbol {\psi }}}$. Using $\smash {\smash {\widetilde {{\boldsymbol {\varPsi }}}}' \doteq \mathfrak {g}^{1/4}\widetilde {{\boldsymbol {\varPsi }}}}$ and $\smash {\smash {\widehat {\boldsymbol {\varXi }}}_0' \doteq \mathfrak {g}^{1/4}{\boldsymbol {\mathfrak {g}}}\widehat {\boldsymbol {\varXi }}_0 \mathfrak {g}^{-1/4}}$, one can cast this action in the form (6.1), with $\smash {\smash {\widehat {\boldsymbol {\varXi }}}_0'}$ that is Hermitian with respect to the inner product (2.6).

²⁶ This is tacitly assumed already in (6.2), where cubic terms are neglected. Also note that three-wave interactions that involve resonances between low-frequency oscillations of $\smash {F_s}$ and two high-frequency waves, like Raman scattering (Balakin et al. Reference Balakin, Dodin, Fraiman and Fisch2016), are still allowed.

²⁷ Most generally, the problem of finding $\smash {\smash {\widehat {\boldsymbol {\varXi }}}^{-1}}$ is the standard problem of calculating the field produced by a given radiation source.

²⁸ Remember that $\smash {{\boldsymbol {v}}_s}$ is defined as the OC velocity in the above formulas (§ 5.6). If $\smash {{\boldsymbol {v}}_s}$ is treated as the particle velocity instead, then $\smash {\mathcal {H}_s}$ in (6.71) should be replaced with $\smash {\overline {H}_s}$. Both options are admissible within the assumed accuracy, but the former option is preferable because it leads to other conservation laws that are exact within our model (§ 7.5).

²⁹ A complex field can be accommodated by considering its real and imaginary parts as separate components.

³⁰ Here we use that for any oscillating $\smash {a = \operatorname {re} (\mathrm {e}^{\mathrm {i} \theta }{\breve {a}})}$ and $\smash {b = \operatorname {re}(\mathrm {e}^{\mathrm {i} \theta }{\breve {b}})}$, one has $\smash {\overline {a b} = \operatorname {re} ({\breve {a}}^* {\breve {b}})/2}$ and that $\smash {\smash {{\breve {{\boldsymbol {\varPsi }}}}}^{{\dagger}} {\boldsymbol {\varXi }}_\text {H}(\overline {\omega }, \overline {{\boldsymbol {k}}}) {\breve {{\boldsymbol {\varPsi }}}}}$ is real because $\smash {{\boldsymbol {\varXi }}_\text {H}(\overline {\omega }, \overline {{\boldsymbol {k}}})}$ is Hermitian.

³¹ Corrections to the lowest-order dispersion relation produce the so-called spin Hall effect; see Dodin et al. (Reference Dodin, Ruiz, Yanagihara, Zhou and Kubo2019) and Ruiz & Dodin (Reference Ruiz and Dodin2017a) for an overview and Bliokh et al. (Reference Bliokh, Rodrıguez-Fortuño, Nori and Zayats2015), Ruiz & Dodin (Reference Ruiz and Dodin2015a), Oancea et al. (Reference Oancea, Joudioux, Dodin, Ruiz, Paganini and Andersson2020) and Andersson et al. (Reference Andersson, Joudioux, Oancea and Raj2021) for examples. These corrections are beyond the accuracy of the model considered, so they will be ignored.

³² Therefore, in a zero-dimensional wave, where $\smash {\int \mathrm {d}{\boldsymbol {x}}}$ can be omitted, conservation of the total action $\smash {\mathcal {I}}$ implies conservation of $\smash {\mathcal {E}_{\text {w}}/\omega }$, which is a well-known adiabatic invariant of a discrete harmonic oscillator with a slowly varying frequency (Landau & Lifshitz Reference Landau and Lifshitz1976, § 49).

³³ Field complexification is discussed, for example, in Brizard, Cook & Kaufman (Reference Brizard, Cook and Kaufman1993).

³⁴ McDonald & Kaufman (Reference McDonald and Kaufman1985) first Taylor-expand $\smash {{\boldsymbol {\varXi }} \star {{\boldsymbol {\mathsf {U}}}}_{\text {c}}}$ and then integrate over $\smash {\omega }$. Strictly speaking, that is incorrect (because $\smash {{\boldsymbol {\varXi }} \star {{\boldsymbol {\mathsf {U}}}}_{\text {c}}}$ is not smooth), but the final result is the same.

³⁵ The term ‘WKE’ is also used for the equation that describes nonlinear interactions of waves in statistically homogeneous media, or ‘wave–wave collisions’ (Zakharov, L'vov & Falkovich Reference Zakharov, L'vov and Falkovich1992). That is not what we consider here. Inhomogeneities are essential in our formulation, and the QL WKE is linear (in $\smash {J}$) by definition of the QL approximation. That said, the Weyl symbol calculus that we use can facilitate derivations of wave–wave collision operators as well (Ruiz, Glinsky & Dodin Reference Ruiz, Glinsky and Dodin2019).

³⁶ Having $\smash {{\boldsymbol {x}}}$-dependence in $\smash {{\boldsymbol {\varXi }}_0}$, $\smash {|{\boldsymbol {\alpha }}_s^{{\dagger}} {\boldsymbol {\eta }}|^2}$ or $\smash {{\boldsymbol {\eta }}^{{\dagger}} {\boldsymbol {\wp }}_s{\boldsymbol {\eta }}}$ would signify interaction with external fields not treated self-consistently. Such fields could exchange momentum with the wave–plasma system, so the momentum of the latter would not be conserved. A similar argument applies to the temporal dependence of these coefficients vs energy conservation considered below.

³⁷ There is no ambiguity in the definition of the wave momentum and energy in this case (i.e. $\smash {\kappa = 1}$), because (7.88) and (7.90) connect those with the momentum and energy of particles (OCs), which are defined unambiguously.

³⁸ To make (8.10) look more physical (local), one can absorb the global factors $\smash {\mathscr {T}}$ and $\smash {\mathscr {V}_n}$ in the definition of the Fourier transform; cf. § 9.1.5.

³⁹ See, for example, (16.17) in Stix (Reference Stix1992). The extra mass factor appears there because QL diffusion is considered in the velocity space instead of the momentum space.

⁴⁰ Here, the oscillating field $\smash {\widetilde {{\boldsymbol {\varPsi }}} = \widetilde {{\boldsymbol {E}}}}$ has the same dimension as $\smash {{\boldsymbol {x}}}$, so the standard vector notation (including the dot product and the cross product) is naturally extended to $\smash {\widetilde {{\boldsymbol {\varPsi }}}}$.

⁴¹ This section uses notation different from that used in the rest of the paper. In particular, the Greek indices span from 0 to 3, and the standard rules of index manipulations apply.

⁴² Vacuum gravitational waves satisfy $\smash {\omega ^2 = |{\boldsymbol {k}}|^2}$. Hence, satisfying the resonance condition $\smash {k^\rho p_\rho = 0}$ requires $\smash {|{\boldsymbol {k}} \cdot {\boldsymbol {v}}| = |{\boldsymbol {k}}|}$, which requires particle speeds not smaller than the speed of light (remember that the speed of light is equal to one in our units). For massive particles, this cannot be satisfied, so $\smash {{{\boldsymbol {\mathsf {D}}}}}$ vanishes for vacuum gravitational waves. However, such waves can still produce adiabatic ponderomotive effects determined by $\smash {\varDelta }$ (Garg & Dodin Reference Garg and Dodin2020).

⁴³ The covariant Hamiltonian is the dispersion function of particles as quantum waves in the semiclassical limit (Garg & Dodin Reference Garg and Dodin2020).

⁴⁴ This ensures that $\smash {\partial _{\boldsymbol {z}}\overline {h} = \overline {\partial _{\boldsymbol {z}}h}}$, as readily seen from (A1) using integration by parts.

⁴⁵ Even though $\smash {\sigma _{{\mathsf {x}}}}$ has been assumed small compared with $\smash {l_c}$, the smallness of the geometrical-optics parameter $\smash {\epsilon \doteq (|\overline {{\boldsymbol {\mathsf {k}}}}|l_c)^{-1} \ll 1}$ allows choosing $\smash {\sigma _{{\mathsf {x}}}}$ in the interval $\smash {|\overline {{\boldsymbol {\mathsf {k}}}}|^{-1} \ll \sigma _{{\mathsf {x}}} \ll l_c}$.

⁴⁶ The idea of this argument was brought to author's attention by G. W. Hammett and is taken from Landreman (Reference Landreman2017), where it is applied to single-species plasmas with a specific $\smash {\mathcal {H}_s}$.

References

REFERENCES

Andersson, L., Joudioux, J., Oancea, M.A. & Raj, A. 2021 Propagation of polarized gravitational waves. Phys. Rev. D 103, 044053.CrossRef Google Scholar

Balakin, A.A., Dodin, I.Y., Fraiman, G.M. & Fisch, N.J. 2016 Backward Raman amplification of broad-band pulses. Phys. Plasmas 23, 083115.Google Scholar

Besse, N., Elskens, Y., Escande, D.F. & Bertrand, P. 2011 Validity of quasilinear theory: refutations and new numerical confirmation. Plasma Phys. Control. Fusion 53, 025012.CrossRef Google Scholar

Binney, J. & Tremaine, S. 2008 Galactic Dynamics, 2nd edn. Princeton University Press.CrossRef Google Scholar

Bliokh, K.Y., Rodrıguez-Fortuño, F.J., Nori, F. & Zayats, A.V. 2015 Spin-orbit interactions of light. Nat. Photonics 9, 796.CrossRef Google Scholar

Brizard, A.J., Cook, D.R. & Kaufman, A.N. 1993 Wave-action conservation for pseudo-Hermitian fields. Phys. Rev. Lett. 70, 521.CrossRef Google Scholar PubMed

Cartwright, N.D. 1976 A non-negative Wigner-type distribution. Physica A 83, 210.CrossRef Google Scholar

Cary, J.R. & Brizard, A.J. 2009 Hamiltonian theory of guiding-center motion. Rev. Mod. Phys. 81, 693.CrossRef Google Scholar

Cary, J.R. & Kaufman, A.N. 1977 Ponderomotive force and linear susceptibility in Vlasov plasma. Phys. Rev. Lett. 39, 402.CrossRef Google Scholar

Cary, J.R. & Kaufman, A.N. 1981 Ponderomotive effects in collisionless plasma: a Lie transform approach. Phys. Fluids 24, 1238.CrossRef Google Scholar

Catto, P.J., Lee, J. & Ram, A.K. 2017 A quasilinear operator retaining magnetic drift effects in tokamak geometry. J. Plasma Phys. 83, 905830611.CrossRef Google Scholar

Chavanis, P.-H. 2012 Kinetic theory of long-range interacting systems with angle–action variables and collective effects. Physica A 391, 3680.CrossRef Google Scholar

Crews, D.W. & Shumlak, U. 2022 On the validity of quasilinear theory applied to the electron bump-on-tail instability. Phys. Plasmas 29, 043902.CrossRef Google Scholar

Dewar, R.L. 1972 A Lagrangian theory for nonlinear wave packets in a collisionless plasma. J. Plasma Phys. 7, 267.CrossRef Google Scholar

Dewar, R.L. 1973 Oscillation center quasilinear theory. Phys. Fluids 16, 1102.CrossRef Google Scholar

Dewar, R.L. 1977 Energy-momentum tensors for dispersive electromagnetic waves. Aust. J. Phys. 30, 533.CrossRef Google Scholar

Dodin, I.Y. 2014 On variational methods in the physics of plasma waves. Fusion Sci. Tech. 65, 54.Google Scholar

Dodin, I.Y. & Fisch, N.J. 2010 a On generalizing the

$K$-

$\chi$ theorem. Phys. Lett. A 374, 3472.CrossRef Google Scholar

Dodin, I.Y. & Fisch, N.J. 2010 b On the evolution of linear waves in cosmological plasmas. Phys. Rev. D 82, 044044.CrossRef Google Scholar

Dodin, I.Y. & Fisch, N.J. 2012 Axiomatic geometrical optics, Abraham–Minkowski controversy, and photon properties derived classically. Phys. Rev. A 86, 053834.CrossRef Google Scholar

Dodin, I.Y. & Fisch, N.J. 2014 Ponderomotive forces on waves in modulated media. Phys. Rev. Lett. 112, 205002.CrossRef Google Scholar

Dodin, I.Y., Geyko, V.I. & Fisch, N.J. 2009 Langmuir wave linear evolution in inhomogeneous nonstationary anisotropic plasma. Phys. Plasmas 16, 112101.CrossRef Google Scholar

Dodin, I.Y., Ruiz, D.E., Yanagihara, K., Zhou, Y. & Kubo, S. 2019 Quasioptical modeling of wave beams with and without mode conversion. I. Basic theory. Phys. Plasmas 26, 072110.CrossRef Google Scholar

Dodin, I.Y., Zhmoginov, A.I. & Ruiz, D.E. 2017 Variational principles for dissipative (sub)systems, with applications to the theory of linear dispersion and geometrical optics. Phys. Lett. A 381, 1411.CrossRef Google Scholar

Drummond, W.E. & Pines, D. 1962 Non-linear stability of plasma oscillations. Nucl. Fusion 3, 1049.Google Scholar

Eriksson, L.-G. & Helander, P. 1994 Monte Carlo operators for orbit-averaged Fokker–Planck equations. Phys. Plasmas 1, 308.Google Scholar

Escande, D.F., Bénisti, D., Elskens, Y., Zarzoso, D. & Doveil, F. 2018 Basic microscopic plasma physics from

$N$-body mechanics. Rev. Mod. Plasma Phys. 2, 1.CrossRef Google Scholar

Fetterman, A.J. & Fisch, N.J. 2008

$\alpha$ channeling in a rotating plasma. Phys. Rev. Lett. 101, 205003.CrossRef Google Scholar

Fisch, N.J. 1987 Theory of current drive in plasmas. Rev. Mod. Phys. 59, 175.CrossRef Google Scholar

Fisch, N.J. & Rax, J.M. 1992 Interaction of energetic alpha-particles with intense lower hybrid waves. Phys. Rev. Lett. 69, 612.Google Scholar PubMed

Fraiman, G.M. & Kostyukov, I.Yu. 1995 Influence of external inhomogeneous static fields on interaction between beam of charged-particles and packet of electromagnetic waves. Phys. Plasmas 2, 923.CrossRef Google Scholar

Galeev, A.A. & Sagdeev, R.Z. 1985 Theory of Weakly Turbulent Plasma, Part 4 in ‘Basic Plasma Physics I’ (ed. A.A. Galeev & R.N. Sudan). North–Holland.Google Scholar

Gaponov, A.V. & Miller, M.A. 1958 Potential wells for charged particles in a high-frequency electromagnetic field. Zh. Eksp. Teor. Fiz. 34, 242.Google Scholar

Garg, D. & Dodin, I.Y. 2020 Average nonlinear dynamics of particles in gravitational pulses: effective Hamiltonian, secular acceleration, and gravitational susceptibility. Phys. Rev. D 102, 064012.CrossRef Google Scholar

Garg, G. & Dodin, I.Y. 2021 a Gauge-invariant gravitational waves in matter beyond linearized gravity. arXiv:2106.05062.Google Scholar

Garg, G. & Dodin, I.Y. 2021 b Gauge invariants of linearized gravity with a general background metric. arXiv:2105.04680.Google Scholar

Garg, G. & Dodin, I.Y. 2022 Gravitational wave modes in matter. to appear in J. Cosmol. Astropart. Phys. arXiv:2204.09095.CrossRef Google Scholar

Hamilton, C. 2020 A simple, heuristic derivation of the Balescu–Lenard kinetic equation for stellar systems. Mon. Not. R. Astron. Soc. 501, 3371.Google Scholar

Hayes, W.D. 1973 Group velocity and nonlinear dispersive wave propagation. Proc. R. Soc. Lond. A 332, 199.Google Scholar

Hizanidis, K., Molvig, K. & Swartz, K. 1983 A retarded time superposition principle and the relativistic collision operator. J. Plasma Phys. 30, 223.Google Scholar

Kaufman, A.N. 1972 Quasilinear diffusion of an axisymmetric toroidal plasma. Phys. Fluids 15, 1063.CrossRef Google Scholar

Kaufman, A.N. 1987 Phase-space-Lagrangian action principle and the generalized

$K$-

$\chi$ theorem. Phys. Rev. A 36, 982.CrossRef Google Scholar PubMed

Kaufman, A.N. & Holm, D.D. 1984 The Lie-transformed Vlasov action principle: relativistically covariant wave propagation and self-consistent ponderomotive effects. Phys. Lett. A 105, 277.CrossRef Google Scholar

Kennel, C.F. & Engelmann, F. 1966 Velocity space diffusion from weak plasma turbulence in a magnetic field. Phys. Fluids 9, 2377.CrossRef Google Scholar

Kentwell, G.W. 1987 Oscillation-center theory at resonance. Phys. Rev. A 35, 4703.CrossRef Google Scholar PubMed

Kentwell, G.W. & Jones, D.A. 1987 The time-dependent ponderomotive force. Phys. Rep. 145, 319.CrossRef Google Scholar

Krall, N.A. & Trivelpiece, A.W. 1973 Principles of Plasma Physics. McGraw-Hill.CrossRef Google Scholar

Landau, L.D. & Lifshitz, E.M. 1976 Mechanics. Butterworth–Heinemann.Google Scholar

Landreman, M. 2017 The H theorem for the Landau–Fokker–Planck collision operator. Unpublished.Google Scholar

Lee, J., Smithe, D., Wright, J. & Bonoli, P. 2018 A positive-definite form of bounce-averaged quasilinear velocity diffusion for the parallel inhomogeneity in a tokamak. Plasma Phys. Control. Fusion 60, 025007.Google Scholar

Lichtenberg, A.J. & Lieberman, M.A. 1992 Regular and Chaotic Dynamics, 2nd edn. Springer.CrossRef Google Scholar

Lifshitz, E.M. & Pitaevskii, L.P. 1981 Physical Kinetics. Pergamon.Google Scholar

Littlejohn, R.G. 1979 A guiding center Hamiltonian: a new approach. J. Math. Phys. 20, 2445.CrossRef Google Scholar

Littlejohn, R.G. 1981 Hamiltonian formulation of guiding center motion. Phys. Fluids 24, 1730.CrossRef Google Scholar

Littlejohn, R.G. 1983 Variational principles of guiding centre motion. J. Plasma Phys. 29, 111.CrossRef Google Scholar

Littlejohn, R.G. 1986 The semiclassical evolution of wave packets. Phys. Rep. 138, 193.CrossRef Google Scholar

Liu, C. & Dodin, I.Y. 2015 Nonlinear frequency shift of electrostatic waves in general collisionless plasma: unifying theory of fluid and kinetic nonlinearities. Phys. Plasmas 22, 082117.CrossRef Google Scholar

Magorrian, J. 2021 Stellar dynamics in the periodic cube. Mon. Not. R. Astron. Soc. 507, 4840.CrossRef Google Scholar

McDonald, S.W. 1988 Phase-space representations of wave equations with applications to the eikonal approximation for short-wavelength waves. Phys. Rep. 158, 337.CrossRef Google Scholar

McDonald, S.W. 1991 Wave kinetic equation in a fluctuating medium. Phys. Rev. A 43, 4484.CrossRef Google Scholar

McDonald, S.W., Grebogi, C. & Kaufman, A.N. 1985 Locally coupled evolution of wave and particle distribution in general magnetoplasma geometry. Phys. Lett. A 111, 19.CrossRef Google Scholar

McDonald, S.W. & Kaufman, A.N. 1985 Weyl representation for electromagnetic waves: the wave kinetic equation. Phys. Rev. A 32, 1708.CrossRef Google Scholar PubMed

Motz, H. & Watson, C.J.H. 1967 The radio-frequency confinement and acceleration of plasmas. Adv. Electron. El. Phys. 23, 153.CrossRef Google Scholar

Moyal, J.E. 1949 Quantum mechanics as a statistical theory. Proc. Camb. Phil. Soc. 45, 99.CrossRef Google Scholar

Mynick, H.E. 1988 The generalized Balescu–Lenard collision operator. J. Plasma Phys. 39, 303.Google Scholar

Oancea, M.A., Joudioux, J., Dodin, I.Y., Ruiz, D.E., Paganini, C.F. & Andersson, L. 2020 Gravitational spin Hall effect of light. Phys. Rev. D 102, 024075.CrossRef Google Scholar

Ochs, I.E. 2021 Controlling and exploiting perpendicular rotation in magnetized plasmas. PhD thesis, Princeton University.Google Scholar

Ochs, I.E. & Fisch, N.J. 2021 a Nonresonant diffusion in alpha channeling. Phys. Rev. Lett. 127, 025003.CrossRef Google Scholar PubMed

Ochs, I.E. & Fisch, N.J. 2021 b Wave-driven torques to drive current and rotation. Phys. Plasmas 28, 102506.CrossRef Google Scholar

Ochs, I.E. & Fisch, N.J. 2022 Momentum conservation in current drive and alpha-channeling-mediated rotation drive. Phys. Plasmas 29, 062106.Google Scholar

Pinsker, R.I. 2001 Introduction to wave heating and current drive in magnetized plasmas. Phys. Plasmas 8, 1219.CrossRef Google Scholar

Rigas, I., Sánchez-Soto, L.L., Klimov, A., Řeháček, J. & Hradil, Z. 2011 Orbital angular momentum in phase space. Ann. Phys. 326, 426.CrossRef Google Scholar

Rogister, A. & Oberman, C. 1968 On the kinetic theory of stable and weakly unstable plasma. Part 1. J. Plasma Phys. 2, 33.CrossRef Google Scholar

Rogister, A. & Oberman, C. 1969 On the kinetic theory of stable and weakly unstable plasma. Part 2. J. Plasma Phys. 3, 119.CrossRef Google Scholar

Rostoker, N. 1964 Superposition of dressed test particles. Phys. Fluids 7, 479.CrossRef Google Scholar

Ruiz, D.E. 2017 Geometric theory of waves and its applications to plasma physics. PhD thesis, Princeton University.Google Scholar

Ruiz, D.E. & Dodin, I.Y. 2015 a First-principles variational formulation of polarization effects in geometrical optics. Phys. Rev. A 92, 043805.CrossRef Google Scholar

Ruiz, D.E. & Dodin, I.Y. 2015 b On the correspondence between quantum and classical variational principles. Phys. Lett. A 379, 2623.CrossRef Google Scholar

Ruiz, D.E. & Dodin, I.Y. 2017 a Extending geometrical optics: a Lagrangian theory for vector waves. Phys. Plasmas 24, 055704.CrossRef Google Scholar

Ruiz, D.E. & Dodin, I.Y. 2017 b Ponderomotive dynamics of waves in quasiperiodically modulated media. Phys. Rev. A 95, 032114.CrossRef Google Scholar

Ruiz, D.E., Glinsky, M.E. & Dodin, I.Y. 2019 Wave kinetic equation for inhomogeneous drift-wave turbulence beyond the quasilinear approximation. J. Plasma Phys. 85, 905850101.CrossRef Google Scholar

Schlickeiser, R. & Yoon, P.H. 2014 Quasilinear theory of general electromagnetic fluctuations in unmagnetized plasmas. Phys. Plasmas 21, 092102.CrossRef Google Scholar

Schmit, P.F., Dodin, I.Y. & Fisch, N.J. 2010 Controlling hot electrons by wave amplification and decay in compressing plasma. Phys. Rev. Lett. 105, 175003.CrossRef Google Scholar PubMed

Silin, V.P. 1961 Collision integral for charged particles. Zh. Eksp. Teor. Fiz. 40, 1768.Google Scholar

Stix, T.H. 1992 Waves in Plasmas. 2nd edn. AIP.Google Scholar

Tracy, E.R., Brizard, A.J., Richardson, A.S. & Kaufman, A.N. 2014 Ray Tracing and Beyond: Phase Space Methods in Plasma Wave Theory. Cambridge University Press.CrossRef Google Scholar

Trigger, S.A., Ershkovich, A.I., van Heijst, G.J.F. & Schram, P.P.J.M. 2004 Kinetic theory of Jeans instability. Phys. Rev. E 69, 066403.CrossRef Google Scholar PubMed

Vedenov, A.A., Velikhov, E.P. & Sagdeev, R.Z. 1961 Nonlinear oscillations of rarified plasma. Nucl. Fusion 1, 82.CrossRef Google Scholar

Weibel, E.S. 1981 Quasi-linear theory without the random phase approximation. Phys. Fluids 24, 413.CrossRef Google Scholar

Whitham, G.B. 1974 Linear and Nonlinear Waves. Wiley.Google Scholar

Wong, H.V. 2000 Particle canonical variables and guiding center Hamiltonian up to second order in the Larmor radius. Phys. Plasmas 7, 73.CrossRef Google Scholar

Yasseen, F. 1983 Quasilinear theory of inhomogeneous magnetized plasmas. Phys. Fluids 26, 468.CrossRef Google Scholar

Yasseen, F. & Vaclavik, J. 1986 Quasilinear theory of uniformly magnetized inhomogeneous plasmas: electromagnetic fluctuations. Phys. Fluids 29, 450.CrossRef Google Scholar

Ye, H. & Kaufman, A.N. 1992 Self-consistent theory for ion gyroresonance. Phys. Fluids B 4, 1735.CrossRef Google Scholar

Yoon, P.H., Ziebell, L.F., Kontar, E.P. & Schlickeiser, R. 2016 Weak turbulence theory for collisional plasmas. Phys. Rev. E 93, 033203.CrossRef Google Scholar PubMed

Zakharov, V.E., L'vov, V.S. & Falkovich, G. 1992 Kolmogorov Spectra of Turbulence I: Wave Turbulence. Springer.CrossRef Google Scholar

Zhu, H. & Dodin, I.Y. 2021 Wave-kinetic approach to zonal-flow dynamics: recent advances. Phys. Plasmas 28, 032303.CrossRef Google Scholar

Table 1. Interpretation of the individual terms in (7.87) and (7.89). The wave energy–momentum is understood as the canonical (‘Minkowski’) energy–momentum, which must not be confused with the kinetic (‘Abraham’) energy–momentum (Dewar 1977; Dodin & Fisch 2012). Whether the terms with $\smash {\varDelta _s F_s}$ should be attributed to OCs or to the wave is a matter of convention, because $\smash {\varDelta _s F_s}$ scales linearly both with $\smash {F_s}$ and with $\smash {J}$. In contrast, the wave energy density is defined unambiguously as $\smash {\mathcal {E}_s \doteq \int \mathrm {d}{\boldsymbol {p}}\,H_{0s} F_s}$ and does not contain $\smash {\varDelta _s}$. This is because $\smash {\int \mathrm {d}{\boldsymbol {p}}\,\varDelta _s F_s}$ is a part of the wave energy density $\smash {\mathcal {E}_{\text {w}}}$ (Dodin & Fisch 2010a). Similarly, $\smash {\int \mathrm {d}{\boldsymbol {p}}\,(\partial _{{\boldsymbol {v}}_s}\varDelta _s) F_s}$ is a part of the wave momentum density (Dodin & Fisch 2012).