1. Introduction
We consider the real-valued modified Korteweg–de Vries equation (mKdV):
posed on either the real line $\mathbb{R}$ or the circle $\mathbb{T}=\mathbb{R} /\mathbb{Z}$. We say that (1.1) is defocusing if $\mu =+1$ and focusing if $\mu=-1$. Equation (1.1) is also known as the Miura mKdV equation after [Reference Miura31], and this naming differentiates it from its generalization the complex-valued (Hirota) mKdV equation (1.9).
The mKdV equation (1.1) has attracted much attention from both applied and theoretical perspectives, in part since it is both Hamiltonian and completely integrable. Indeed, with respect to the Poisson structure
on Schwartz space $\mathcal{S}$, the mKdV equation (1.1) is the Hamiltonian flow of
Recall that on $\mathbb{T}$, Schwartz space $\mathcal{S}$ corresponds to the space $C^{\infty}(\mathbb{T})$, while on $\mathbb{R}$, it consists of those infinitely differentiable functions which, with all of their derivatives, decay faster than any polynomial as $|x|\to \infty$. Some important consequences of the completely integrable structure of mKdV (1.1) are the existence of a Lax pair and an infinite number of conservation laws. In particular, the simplest conservation law here is the mass $M(q)=\frac{1}{2}\int q^2 dx $. Under the Poisson structure (1.2), M generates translations and thus may be also thought of as the momentum.
The optimal well-posedness of mKdV (1.1) within the L 2-based Sobolev spaces H s, $s\in \mathbb{R}$, has been a long studied question. On the real line, mKdV (1.1) has the scaling symmetry $q_{\lambda}(t,x):=\lambda q(\lambda^{3}t,\lambda x)$, where λ > 0, and this induces the scaling-critical Sobolev exponent $s_{\text{crit}}=-\tfrac{1}{2}$, above which we expect well-posedness, and below which we expect ill-posedness. While this scaling is no longer available on the circle, the heuristic provided by s crit is believed to persist in this setting. However, due to the different ways that dispersion manifests, the well-posedness theory diverges depending on the underlying geometry.
On the real line, it has long been known that Schwartz initial data lead to global Schwartz solutions [Reference Kato20, Reference Tsutsumi41]. Well-posedness, without making use of dispersion, was obtained in $H^{s}(\mathbb{R})$, for $s \gt \tfrac{3}{2}$, either by viewing mKdV (1.1) from the perspective of quasilinear hyperbolic equations [Reference Kato19] or by energy methods [Reference Abdelouhab, Bona, Felland and Saut1, Reference Bona and Smith2]. This was improved by Kenig, Ponce, and Vega [Reference Kenig, Ponce and Vega21, Reference Kenig, Ponce and Vega22], who exploited dispersion through the use of local smoothing estimates and proved local well-posedness in $H^{s}(\mathbb{R})$ for $s\geq \tfrac{1}{4}$. An alternative proof of this result in $H^{1/4}(\mathbb{R})$ was given by Tao [Reference Tao40] using the Fourier restriction norm method. Global well-posedness for $s \gt \tfrac{1}{4}$ was established using the I-method by Colliander, Keel, Staffilani, Takaoka, and Tao [Reference Colliander, Keel, Staffilani, Takaoka and Tao11], and at the end-point $s=\tfrac{1}{4}$ in [Reference Guo12, Reference Kishimoto27]. We also mention the unconditional uniqueness results in [Reference Kwon, Oh and Yoon30, Reference Molinet, Pilod and Vento33].
It turns out that $H^{1/4}(\mathbb{R})$ is the threshold for the local uniform continuous dependence of the solution map for (1.1) with respect to the initial data [Reference Christ, Colliander and Tao8, Reference Kenig, Ponce and Vega23]. Consequently, it is impossible to construct solutions using the contraction mapping theorem when $s \lt \tfrac{1}{4}$. Using the short-time Fourier restriction norm method, Christ, Holmer, and Tataru [Reference Christ, Holmer and Tataru10] proved global existence of solutions for $-\tfrac{1}{8} \lt s \lt \tfrac{1}{4}$, with the uniqueness of these solutions unknown at the time; see also [Reference Molinet, Pilod and Vento33]. Using the complete integrability of mKdV (1.1) (or more accurately, the complex-valued mKdV (1.9)), Koch and Tataru [Reference Koch and Tataru29] and Killip, Vişan, and Zhang [Reference Killip, Vişan and Zhang26] obtained a priori bounds in the whole sub-critical range $s \gt -\tfrac{1}{2}$.
Recently, Harrop-Griffiths, Killip, and Vişan [Reference Harrop-Griffiths, Killip and Vişan14] completed the well-posedness study on the line by proving global well-posedness of mKdV (1.1) in the remaining range $-\tfrac{1}{2} \lt s \lt \tfrac{1}{4}$. Their notion of solution is that the solution map has a unique continuous extension from Schwartz space to $H^{s}(\mathbb{R})$. Note that uniqueness of such solutions is built into this definition although the solutions need not be distributional. Their result went far beyond this though: they proved global well-posedness in the same range for the complex-valued mKdV equation and resolved, in the positive, the long standing open question of well-posedness of the cubic nonlinear Schrödinger equation (NLS) on $\mathbb{R}$ for $-\tfrac{1}{2} \lt s \lt 0$. Moreover, because of their local smoothing estimates, their solutions are in fact also distributional solutions when $s\geq -\frac 16$ in the complex-valued case and for $s \gt -\frac 12$ for (1.1) on $\mathbb{R}$.
For the complex-valued mKdV and cubic NLS, these results are sharp as there is ill-posedness, in the sense of norm inflation, at the critical regularity $s_{\text{crit}}=-\tfrac{1}{2}$; see [Reference Christ, Colliander and Tao9, Reference Harrop-Griffiths, Killip and Vişan14, Reference Kishimoto28, Reference Oh36]. Interestingly, it is not known if their result is sharp for the defocusing real-valued mKdV (1.1) as there is no known ill-posedness at this time at the end point $s_{\text{crit}}=-\tfrac{1}{2}$, although see [Reference Harrop-Griffiths, Killip and Vişan14, proposition A.3] for an ill-posedness result in the focusing case.
The approach that we will use in this article is the method of commuting flows, which was introduced by Killip and Vişan [Reference Killip and Vişan25], and lead to the complete resolution of the well-posedness of KdV [Reference Killip and Vişan25] in the L 2-based Sobolev scale on the line. Their approach also obtained optimal well-posedness on the circle, providing an alternative to the argument by Kappeler and Topalov [Reference Kappeler and Topalov18]. Broadly speaking, it utilizes the completely integrable structure of the equation and proceeds by approximating the flow of the original equation by another flow, which (i) Poisson commutes, (ii) is easily-solved in low regularity, and (iii) converges, in some sense, to the original flow.
The method of commuting flows as in [Reference Killip and Vişan25] applies equally well on the line or the circle as it does not take into account the dispersive nature of the equation. Subsequently, this method was extended in the line setting in [Reference Bringmann, Killip and Vişan5, Reference Harrop-Griffiths, Killip and Vişan14] to incorporate dispersion through local-smoothing estimates, which led to impressive results regarding the well-posedness of the fifth-order KdV, as well as the cubic NLS and complex-valued mKdV equations on $\mathbb{R}$ (as discussed above). We also mention the very recent breakthroughs [Reference Harrop-Griffiths, Killip, Ntekoume and Vişan13, Reference Harrop-Griffiths, Killip and Vişan15, Reference Killip, Ntekoume and Vişan24], using these methods and more, to the low-regularity and large data well-posedness of the derivative NLS.
One of the goals of this article is to show how the arguments in [Reference Harrop-Griffiths, Killip and Vişan14] can be simplified in the more regular setting with $0\leq s \lt \tfrac{1}{4}$ and still yield global well-posedness for mKdV (1.1). Indeed, we show that in this regime, it is enough to argue as in [Reference Killip and Vişan25], without exploiting dispersion explicitly; see theorem 1.1. We hope that this article may therefore partly serve as a bridge for a reader familiar with [Reference Killip and Vişan25] to study [Reference Harrop-Griffiths, Killip and Vişan14].
We now discuss the current well-posedness theory for the mKdV equation (1.1) on the circle. The works [Reference Abdelouhab, Bona, Felland and Saut1, Reference Bona and Smith2] also apply on the circle yielding well-posedness in $H^{s}(\mathbb{T})$ for $s \gt \tfrac{3}{2}$. By introducing the Fourier restriction norm method, Bourgain [Reference Bourgain4] proved local well-posedness in $H^{s}(\mathbb{T})$, $s\geq \tfrac{1}{2}$, which was extended to global well-posedness using the I-method in [Reference Colliander, Keel, Staffilani, Takaoka and Tao11]. In stark contrast to the setting of the real line, the solution map on $\mathbb{T}$ fails to be locally uniformly continuous below $H^{\frac{1}{2}}(\mathbb{T})$ [Reference Christ, Colliander and Tao9]; see also [Reference Bourgain4]. Refined energy-based methods were applied in [Reference Molinet, Pilod and Vento34, Reference Nakanishi, Takaoka and Tsutsumi35, Reference Takaoka and Tsutsumi39], culminating in local well-posedness and unconditional uniqueness in $H^{s}(\mathbb{T})$ for $s\geq \tfrac{1}{3}$.
Combining their well-posedness argument for KdV in $H^{-1}(\mathbb{T})$ from [Reference Kappeler and Topalov18] and the Miura transform, Kappeler and Topalov [Reference Kappeler and Topalov17] proved that the real-valued defocusing mKdV (1.1) is globally well-posed in $L^2(\mathbb{T})$, in the sense that the solution map extends uniquely from $\mathcal{S}$ to $L^2(\mathbb{T})$. In [Reference Kappeler and Molnar16], Kappeler and Molnar also obtained small data global well-posedness in $L^2(\mathbb{T})$ for the focusing case.Footnote 1 This result is sharp, as Molinet [Reference Molinet32] proved that the mKdV equation (1.1) is ill-posed below $L^2(\mathbb{T})$. Moreover, Molinet used the short-time Fourier restriction norm method and also proved global existence of distributional solutions, without uniqueness, for both the focusing and the defocusing equations. This result also implies that the solutions constructed by Kappeler and Topalov are actually distributional solutions. We also mention that Schippa [Reference Schippa38] extended Molinet’s result to $H^{s}(\mathbb{T})$ for s > 0.
We now move onto the main result of this article.
Theorem 1.1 Let $0\leq s \lt \frac{1}{2}$ and consider either $\mathbb{R}$ or $\mathbb{T}$. Then, the real-valued mKdV (1.1) equation is globally well-posed for all initial data in H s in the sense that the solution map Φ extends uniquely from $\mathcal{S}$ to a jointly continuous map $\Phi:\mathbb{R} \times H^{s}\to H^{s}$.
In view of the above discussion, we believe that, while largely not new, theorem 1.1 and its proof have a number of benefits. First, by assuming higher regularity, we may drastically simplify the argument in [Reference Harrop-Griffiths, Killip and Vişan14] for the real line while still obtaining well-posedness that, prior to [Reference Harrop-Griffiths, Killip and Vişan14], was unknown in the range $0\leq s \lt \tfrac{1}{4}$. In particular, we show how the argument in [Reference Killip and Vişan25] can be adapted to the mKdV equation. As we do not need to exploit local smoothing, this allows us to treat the line and circle settings in a parallel fashion, and we show how the structures in [Reference Harrop-Griffiths, Killip and Vişan14] can be adapted to the circle. Moreover, theorem 1.1 on $\mathbb{T}$ provides an alternative argument to the global well-posedness of the defocusing mKdV in $L^2(\mathbb{T})$ by Kappeler and Topalov [Reference Kappeler and Topalov17]. In particular, we do not use the Miura transform and thus are able to obtain large data well-posedness results for the focusing case. Moreover, the solutions we construct agree with those of Kappeler and Topalov. By Sobolev embedding, our solutions are distributional solutions in H s for $s\geq \tfrac{1}{6}$; see remark 4.3. They are also distributional solutions in the remaining range $0\leq s \lt \tfrac{1}{6}$, leaning on the works [Reference Harrop-Griffiths, Killip and Vişan14] on $\mathbb{R}$ or [Reference Molinet32, Reference Schippa38] on $\mathbb{T}$.
As discussed, our proof of theorem 1.1 uses the method of commuting flows as in [Reference Killip and Vişan25]. This approach is motivated by the fact that mKdV (1.1) admits a Lax pair, where the Lax operator we use at spectral parameter $\kappa\geq 1$ is
The Lax pair suggests that formally the quantity
will be conserved under the flow of (1.1). For $q\in L^2$, $(\kappa-\partial)^{-1}q (\kappa+\partial)^{-1}q$ is a trace-class operator so that the functional determinant in (1.5) is well-defined. Unfortunately, the functional determinant may vanish, causing the logarithm of it in (1.5) to become ill-defined.
Following [Reference Killip, Vişan and Zhang26], we instead consider the quantity
where $\ell(\kappa) =1$ on $\mathbb{R}$ and $\ell(\kappa)=\tanh(\kappa/2)$ on $\mathbb{T}$. This quantity formally represents a series expansion of (1.5). For real-valued $q\in L^2$, modulo the constant $\ell(\kappa)$, this is exactly the quantity considered in [Reference Harrop-Griffiths, Killip and Vişan14, Reference Killip, Vişan and Zhang26], which can be seen by cycling the trace. As mentioned, while the logarithm in (1.5) may fail to be defined, the series (1.6) converges absolutely as long as $\kappa^{-\frac{1}{2}} \|q\|_{L^2}$ is small enough and it is conserved under the flow of mKdV (1.1). This smallness condition can be ensured either by scaling on the real line or by choosing κ sufficiently large when on the circle.
The quantity $A(\kappa,q)$ is a generating function (in inverse powers of κ) for the conservation laws of (1.1); see (3.2). We prove these statements about $A(\kappa,q)$ in § 3. The expansion of $A(\kappa,q)$ motivates the definition of the approximating flow Hκ in (4.1), which we show is globally well-posed in L 2 for small enough data or large enough κ. See the beginning of § 4. In order to understand the Hκ-flow, we follow [Reference Harrop-Griffiths, Killip and Vişan14] in § 2 and introduce some auxiliary functions based on the matrix Green’s function of $L_{q}(\kappa)$. In [Reference Harrop-Griffiths, Killip and Vişan14], the off-diagonal matrix elements $g_{12}(\kappa)$ and $g_{21}(\kappa)$ were of great use.
In the real-valued setting, we find that it is convenient and natural to consider linear combinations of these and introduce the new variables $g_{-}(\kappa)$ and $g_{+}(\kappa)$; see (2.10). Interestingly, while the map $q\mapsto g_{+}(\kappa)$ is bounded from L 2 to H 1, the map $q\mapsto \tfrac{\mu}{4\kappa}g_{-}(\kappa)$ is a real-analytic diffeomorphism from L 2 to H 2, thus gaining an additional derivative over each of $g_{12}(\kappa), g_{21}(\kappa)$, and $g_{+}(\kappa)$. This gain of regularity is not available in the complex-valued setting; see remark 2.4.
In § 4, we complete the well-posedness argument by following the approach in [Reference Killip and Vişan25] and using the change of variables $q\mapsto \tfrac{\mu}{4\kappa}g_{-}(\kappa)$ in order to show that the Hκ-flow well-approximates the H mKdV-flow in L 2.
We conclude this introduction with a few remarks.
Remark 1.2. While the well-posedness of mKdV (1.1) in $L^2(\mathbb{T})$ is sharp, the a priori bounds in Schwartz space in [Reference Killip, Vişan and Zhang26] suggest that some form of well-posedness may persist below $L^2(\mathbb{T})$. In this setting, it is expected that one should instead consider the following renormalized mKdV equation:
This equation was first introduced by Bourgain [Reference Bourgain4]. In $L^2(\mathbb{T})$, mKdV (1.1) and the renormalized mKdV (1.7) are equivalent, as solutions of the former can be related to solutions of the latter via the gauge transformation
However, outside of $L^2(\mathbb{T})$, we expect that (1.7) may be well-posed even though (1.1) is not. In this direction, Kappeler and Molnar [Reference Kappeler and Molnar16] proved that the defocusing renormalized mKdV equation (1.7) is locally well-posed in the Fourier–Lebesgue spaces $\mathcal{F}L^{p}(\mathbb{T})$ for p > 2 and also globally well-posed for small data (the small data global result extends to the focusing case too). Note that for p > 2, it holds that $L^2(\mathbb{T}) \subset \mathcal{F}L^p(\mathbb{T})$, so they construct solutions outside of $L^2(\mathbb{T})$. It would be of interest to understand how the gauge transformation (1.8) could be implemented within the method of commuting flows and, in particular, whether the arguments in this article could be extended to the Fourier–Lebesgue setting on $\mathbb{T}$.
Remark 1.3. On the real line, the proof of theorem 1.1 also applies to the complex-valued mKdV equation
where $q(t):\mathbb{R}\to \mathbb{C}$. In this setting, one no longer benefits from using the change of variables $\tfrac{\mu}{4\kappa} g_{-}(\kappa)$ as a map from L 2 to H 2, and instead one can proceed more closely to [Reference Harrop-Griffiths, Killip and Vişan14] and use a single component of the diagonal matrix Green’s function, such as $g_{12}(\kappa)$. However, we stress that our result does not extend to the complex-valued mKdV equation on $\mathbb{T}$. This is why we chose to focus on the real-valued setting.
To understand why this is the case, we recall that Chapouto [Reference Chapouto6, Reference Chapouto7] showed that (1.9) and even its renormalized variant, in the sense of the corresponding gauge transformation to (1.8), are ill-posed below $H^{1/2}(\mathbb{T})$. The cause of the instability is the possibility that the momentum
may be infinite outside $H^{1/2}(\mathbb{T})$. For smooth initial data, the momentum is a conserved quantity of (1.9). If additionally q is real-valued, this conservation is trivial since $P(q)=0$.
It was proposed instead in [Reference Chapouto6] that one should consider a second renormalized complex-valued mKdV equation for study below $H^{1/2}(\mathbb{T})$, which is equivalent to (1.9) for regular enough q via a gauge transformation involving the momentum. As discussed in remark 1.2, we do not know yet how to effectively implement such gauge transformations within the method of commuting flows.
2. The variables $\gamma(\kappa)$, $g_{-}(\kappa)$, and $g_{+}(\kappa)$
Our goal in this section is to understand the variables $\gamma(\kappa)$, $g_{-}(\kappa)$, and $g_{+}(\kappa)$. We begin with some notation and preliminary results. Our conventions for the Fourier transform on the line are
On the circle, we define the Fourier transform by
We denote by $\mathcal{S}$ the class of Schwartz functions on the line or the class of $C^{\infty}$ -functions if on the circle. For $\kappa\geq 1$, we define the L 2-based Sobolev space $H^{s}_{\kappa}$ via the norm
which we take with respect to Lebesgue measure on $\mathbb{R}$ or counting measure on $2\pi \mathbb{Z}$.
The functional derivative $\delta / \delta q$ is defined as
For $0 \lt \sigma \lt 1$ and $\kappa \geq 1$, we define the operator $(\kappa \pm \partial)^{-\sigma}$ using the Fourier multiplier $(\kappa \pm i\xi)^{-\sigma}$, where, for $\arg z\in (-\pi, \pi]$, we define
This convention implies that
In order to deal with both geometries in as parallel a fashion as possible, we consider the following class of potentials: Given δ > 0, let
for $\kappa\geq 1$. We recall the definition of the Lax operator in (1.4), which we view as an unbounded operator on $L^2 \times L^2$ with domain $H^{1} \times H^{1}$. The Green’s function of $L_{q}(\kappa)$ will be a matrix-valued perturbation of the zero potential case, i.e., when q = 0. In this case of zero potential, wehave
which has integral kernel
where $\lceil{x}\rceil$ and $\lfloor{x}\rfloor$ denote the smallest integer n such that $x\leq n$, or the largest integer m such that $m\leq x$, respectively. By formally iterating the resolvent identity, we arrive at an expression for the resolvent $R(\kappa):=L_{q}(\kappa)^{-1}$:
The following proposition justifies that this series converges if $q\in B_{\kappa,\delta}$, for δ > 0 sufficiently small and is an inverse to $L_{q}(\kappa)$:
Proposition 2.1. There exists δ > 0 such that for all $\kappa\geq 1$ and $q\in B_{\delta,\kappa}$, the inverse $R(\kappa)=L_{q}(\kappa)^{-1}$ admits an integral kernel $G(x,y;\kappa,q)$, for which the mapping
is continuous for any ɛ > 0. Moreover, $G-G_0$ is a continuous function of (x, y).
Before we give a proof of proposition 2.1, we state a useful lemma.
Lemma 2.2. Let $\kappa\geq 1$ and $0 \lt \alpha \lt \tfrac{1}{4}$. Then, for any $f\in \mathcal{S}$, we have
Proof. In the following, we consider the circle case, as the case of the real line is similar. For (2.6), we note that for any $h\in C^{\infty}(\mathbb{T})$, we may write
where the kernel $K_{\pm}$ is given by
Therefore, by Plancherel,
In the summation over η, we estimate separately the contributions due to $|\xi+\eta|\ll |\eta|$, $|\xi+\eta|\sim |\eta|$ and $|\xi+\eta|\gg |\eta|$ and obtain
Similarly, the proof of (2.7) reduces to showing
for any $\alpha \lt \tfrac{1}{4}$, which can also be dealt with by considering the same contributions to the sum as in showing (2.9). The restriction $\alpha \lt \tfrac{1}{4}$ appears naturally as a summation condition in the case $|\xi+\eta|\sim |\eta|$. The remaining estimate (2.8) follows from the observation that the kernels for the operators $(\kappa\pm\partial)^{-1}$ in (2.2) and (2.3) belong to $L^{\infty}(\mathbb{T} \times \mathbb{T})$, uniformly in $\kappa\geq 1$.
Proof. Proof of proposition 2.1
The series (2.4) converges in operator norm uniformly for $\kappa\geq 1$ and $q\in B_{\delta,\kappa}$ using (2.6) and (2.8). Moreover, this shows that $R(\kappa)-R_{0}(\kappa)\in \mathfrak{I}_{2}$ and hence admits an integral kernel $G(x,y;\kappa)$ in L 2. Then, the integral kernel $G-G_0$ is a continuous function of (x, y) as (2.7) and (2.8) imply that $R-R_{0}$ is Hilbert–Schmidt from $H^{-\alpha}\times H^{-\alpha}$ to $H^{\alpha}\times H^{\alpha}$ for any $\alpha \lt \tfrac{3}{4}$.
On the line and the circle, proposition 2.1 implies that $(G-G_0)(x,y)$ extends continuously to the diagonal x = y, and thus, we may define
where
Proposition 2.3. Properties of $\gamma, g_{+}$, and $g_{-}$
There exists $\delta_0 \gt 0$ such that for all $0 \lt \delta\leq \delta_0$ the following hold: given $\kappa\geq 1$ and $q\in B_{\delta,\kappa}$, the maps $q\to \gamma(\kappa)$ and $q\to g_{+}(\kappa)$ are bounded from $B_{\delta,\kappa}$ to $H^{1}_{\kappa}$ and satisfy
and the map $q\to \tfrac{\mu}{4\kappa}g_{-}(\kappa)$ is a real-analytic diffeomorphism from $ B_{\delta,\kappa}$ to $H^{2}_{\kappa}$ satisfying
Furthermore, the following identities hold
in the sense of distributions. Lastly, if $q\in \mathcal{S}\cap B_{\delta,\kappa}$, then $\gamma(\kappa), g_{+}(\kappa)$ and $g_{-}(\kappa)$ also belong to $\mathcal{S}$.
Proof. For $f\in \mathcal{S}$, the expansion (2.4) implies that
and $\langle f, g_{+}(\kappa) \rangle$ satisfies a similar formula as $\langle f, g_{-}(\kappa) \rangle$ does in (2.4) except the second term (2.20) in the summation is subtracted instead of added. Thus, in order to verify (2.12) and (2.13), it suffices to individually estimate the summands in (2.18) and (2.4). For (2.18), when m = 1, (2.6) and (2.8) imply
For $m\geq 2$, we proceed similarly along with (2.8) to obtain
Summing this estimate in m with (2.21) as in (2.18) and using duality proves (2.12), provided that δ > 0 is sufficiently small.
We move onto verifying (2.13). Similar to the above computations, for any $m\geq 1$, (2.6) and (2.8) imply
It remains to consider the case when m = 0, which we treat by direct computation, focusing on the circle case. Using (2.3),
Summing (2.23) over $m\geq 2$ completes the proof of (2.13). We note that these arguments also imply that $g_{-}(\kappa)\in H^{1}_{\kappa}$. Assuming for now the veracity of (2.17), this regularity property of $g_{-}(\kappa)$ is upgraded to $H^{2}_{\kappa}$ by (2.17) and (2.13). This gives (2.14).
Now, (2.15) follows from considering $\langle f,\gamma'(\kappa) \rangle$ and using (2.18) with the operator identities $f'=[\kappa+\partial, f]=-[\kappa-\partial,f]$, where $f\in \mathcal{S}$. For (2.16), the series representation similar to (2.4) and the identities
imply
Consider the last term on the right-hand side of (2.25). The integral kernel for the operator $(\kappa^2-\partial^2)^{-1}$ is
where $\|x-y\| :=\text{dist}(x-y,\mathbb{Z})$. See, for example, [Reference Killip, Vişan and Zhang26]. Therefore, on $\mathbb{R}$
while on $\mathbb{T}$,
In either geometry, these identities inserted into (2.25) verify (2.16). Finally, (2.17) follows in a similar way to (2.16) using (2.24).
To show that $\gamma,g_+,\text{and}\,\, g_-$ belong to $\mathcal{S}$ if $q\in \mathcal{S}$, it suffices to show that if $q\in H^{k}$, then $(\gamma,g_{+},g_{-})\in H^{k+1}\times H^{k+1}\times H^{k+2}$ for each $k\in \mathbb{N} \cup\{0\}$, and we have an estimate
where $C_{k} \gt 0$ is a polynomial. We prove this by induction on k, where we have already proved the base case k = 0. The inductive step follows from (2.15), (2.16), and (2.17).
In order to establish that $\gamma, g_{+}$, and $g_{-}$ belong to $\mathcal{S}(\mathbb{R})$ if $q\in \mathcal{S}(\mathbb{R})\cap B_{\delta,\kappa}$, we only need to show that
as the decay of all the derivatives will follow from (2.27), and (2.15), (2.16) and (2.17). To prove (2.27), we employ duality and the relations (2.18) and (2.4). The main point is to commute the weight x n so that it lands on a nearby factor of q, instead of the test function f. This can be achieved by using the identity
Finally, we verify the diffeomorphism claims for $\tfrac{\mu}{4\kappa}g_{-}(\kappa)$. We have from (2.17) and (2.16) that
Thus, $d\big(\tfrac{\mu}{4\kappa}g_{-}(\kappa)\big)\big\vert_{q=0} =(4\kappa^2-\partial^2)^{-1}$, which is an isomorphism from L 2 into $H^{2}_{\kappa}$. By the inverse function theorem, this alone ensures that $\tfrac{\mu}{4\kappa}g_{-}(\kappa)$ is a diffeomorphism from $B_{\widetilde{\delta},\kappa}$ to $H^{2}_{\kappa}$, where $\widetilde{\delta} \gt 0$ is small and may depend on $\kappa\geq 1$. To extend this diffeomorphism to the whole of $B_{\delta,\kappa}$, we need a uniform in $\kappa\geq 1$ estimate on the derivative $dr\vert_{q}$ throughout $B_{\delta,\kappa}$. As part of this, we need the estimate
for $q\in \mathcal{S}\cap B_{\delta,\kappa}$ and reducing δ if necessary. This follows easily by arguments similar to verifying that $\gamma(\kappa)\in H^{1}_{\kappa}$ using (2.18). From (2.28), it follows that for any test function f, we have
Therefore, assuming (2.29) and using (2.30), we have
where the implicit constant is uniform in $\kappa\geq 1$. Thus, $\frac{\mu}{4\kappa} g_{-}(\kappa):B_{\delta, \kappa}\to H^{2}_{\kappa}$ is a real-analytic diffeomorphism from $B_{\delta,\kappa}$ to $H^2_{\kappa}$. This completes the proof of the diffeomorphism claim and hence also the proof of proposition 2.3.
It will be useful later on to separately consider the first few terms in the series expansions (2.18) and (2.4) for γ and $g_{+}$. In particular, we write $\gamma(\kappa)=\gamma^{[2]}(\kappa)+\gamma^{[\geq 4]}(\kappa)$, where
The superscript notation here is motivated by the fact that $\gamma^{[2]}$ represents the term in $\gamma(\varkappa)$, which is quadratic in q. The formula for $\gamma^{[2]}$ follows by computing explicitly the first term in the sum (2.18). On the line, this reduces to computing a simple contour integral, while on the circle, we need the identity
which follows from partial fraction decompositions and the identity
Similarly, we also decompose $g_{+}(\kappa)$ as $g_{+}(\kappa)=g_{+}^{[1]}(\kappa)+g_{+}^{[3]}(\kappa) +g_{+}^{[\geq 5]}(\kappa),$ where
and $g_{+}^{[\geq 5]}(\kappa):=g_{+}(\kappa)-g_{+}^{[1]}(\kappa)-g_{+}^{[3]}(\kappa)$. We have the following bounds on the remainder pieces $\gamma^{[\geq 4]}(\kappa)$ and $g_{+}^{[\geq 5]}(\kappa)$:
The estimate on $g_{+}^{[\geq 5]}$ is immediate from using (2.23) with $m\geq 2$ and the bound for $\gamma^{[\geq 4]}$ follows by duality, and hence (2.18) with $f\in L^{\infty}$, and (2.22) this time placing $(\kappa\pm \partial)^{-1}f(\kappa\mp \partial)^{-1}$ in operator norm.
Remark 2.4. The additional regularity property of $g_{-}$ does not immediately seem to be available in the complex-valued setting. In the complex-valued setting,
In particular, the first-order term of this is
Hence, if $\text{Im} (q)\neq 0$, there does not appear to be an additional smoothing effect from considering any linear combination of diagonal Green’s functions.
3. Conservation laws, dynamics, and equicontinuity
In this section, we prove the claims in the introduction regarding the quantity $A(\kappa,q)$ in (1.5). Namely, we show that it is well-defined for $q\in B_{\delta,\kappa}$ and that it commutes with $A(\varkappa,q)$ for any $\varkappa\geq 1$. We then use the conservation of $A(\kappa)$ to derive a priori bounds and equicontinuity results for the flow of mKdV (1.1) in H s for $0\leq s \lt \tfrac{1}{2}$. We begin with properties of $A(\kappa,q)$.
Proposition 3.1. There exists δ > 0, so that for all $\kappa\geq 1$ and $q\in B_{\delta,\kappa}$, the following hold: the series (1.6) converges uniformly in $\kappa\geq 1$, it is differentiable with derivative
and for $q\in \mathcal{S} \cap B_{\delta,\kappa}$, it has the asymptotic expansion
and for any $\varkappa\geq 1$, it Poisson commutes with $A(\varkappa)$, that is, $\{A(\kappa), A(\varkappa)\}=0.$
Proof. The convergence of (1.6) readily follows from (2.8). We compute
by regrouping terms and cycling the trace with $(\kappa\pm \partial)^{-1}f$, which yields (3.1). For (3.2), since $A(\kappa; 0)=0$, Fubini’s theorem gives
The identities (2.15), (2.16), and (2.28) imply
where $g_{-}(\kappa)=g_{-}^{[1]}(\kappa)+g_{-}^{[3]}(\kappa)+g_{-}^{[\geq 5]}(\kappa)$. Then,
Finally, we verify the Poisson commutativity property. We assume $\kappa \neq \varkappa$. To begin, we note that (2.15), (2.16), and (2.17) imply the following identities for $q\in \mathcal{S}\cap B_{\delta,\kappa}\cap B_{\delta,\varkappa}$:
In the following, we argue that the boundary term from integration by parts always vanishes. This is obvious by periodicity on $\mathbb{T}$, while on $\mathbb{R}$, (2.12), (2.13), and (2.14) imply that $\gamma(\kappa), g_{+}(\kappa)$, and $g_{-}(\kappa)$ all vanish as $|x|\to \infty$. Then, by (3.1) and (2.17), we have
It follows from (2.17) and (3.5) that
Therefore, using (3.4) and (3.5), we have
Lemma 3.2. There exists δ > 0 such that the following holds: given $\kappa\geq 1$, $q(0)\in \mathcal{S}\cap B_{\delta,\kappa}$, let $q(t)\in \mathcal{S}$ denote the global-in-time solution to mKdV (1.1). Then, the function $g_{-}(\kappa,t)=g_{-}(\kappa, q(t))$ satisfies
Proof. Note that conservation of mass implies that $q(t) \in B_{\delta, \kappa}$ for every $t\in \mathbb{R}$ so that $g_{-}(\kappa,t)$ exists for every $t\in \mathbb{R}$ and belongs to $\mathcal{S}$ by proposition 2.3. As to Eq. (3.7), we recall that [Reference Harrop-Griffiths, Killip and Vişan14, corollary 4.8] implies that for each $\varkappa\geq 1$ such that $\varkappa\neq \kappa$, $g_{-}(\varkappa,t)$ evolves under the $A(\kappa)$ flow according to
This is a consequence of the Lax pair for the $A(\kappa)$-flow discovered in [Reference Harrop-Griffiths, Killip and Vişan14, proposition 4.7], which also holds in the real-valued setting and on both the line and the circle. Then, (3.7) follows from (3.8) and the asymptotic expansion (3.2).
Recall that a subset Q of Hs is equicontinuous if
While the conservation of the L 2-norm for regular solutions to (1.1) has long been known, in this section, we demonstrate that bounded orbits of (1.1) in L 2 are equicontinuous. On the circle, equicontinuity and uniform boundedness are equivalent to pre-compactness. The pre-compactness of solutions on the circle in $H^{s}(\mathbb{T})$ for $s\geq \tfrac{1}{3}$ was shown in [Reference Molinet, Pilod and Vento34]. In the line setting, this was also known from [Reference Harrop-Griffiths, Killip and Vişan14, proposition 8.1]. We provide an alternative argument using (1.6) and employing some ideas from [Reference Killip, Vişan and Zhang26]. Our main goal in this section is to prove the following:
Proposition 3.3. Boundedness and equicontinuity
Let $0\leq s \lt \frac{1}{2}$. Given δ > 0, there exists $\delta_0 \gt 0$ so that for each $\kappa\geq 1$, $q\in B_{\delta_{0},\kappa} \cap \mathcal{S}$ and for any Hamiltonian flow on $\mathcal{S}$, which conserves $A(\kappa)$, we have the a priori estimate
Moreover, if $Q\subseteq B_{\delta, \kappa}\cap \mathcal{S}$ is equicontinuous in H s, then, for any two Hamiltonians H 1 and H 2, which Poisson commute with $A(\varkappa)$ for all $\varkappa\geq 1$, the set
is equicontinuous in H s.
In order to prove proposition 3.3, we need to recall some preliminary results. We note that the results in this subsection are valid in geometry of either $\mathbb{R}$ or $\mathbb{T}$, with similar proofs. Thus, for any integral on the frequency side, we take it to mean that the integral is with respect to Lebesgue measure on $\mathbb{R}$ on the line or counting measure on $\mathbb{Z}$ if on the circle.
It is well known [Reference Pego37] that equicontinuity in H s is equivalent to tightness on the Fourier side, and that equicontinuity is the key tool in order to upgrade strong convergence in a low regularity H s-space to a higher regularity space. More precisely,
Lemma 3.4. Fix $-\infty \lt \sigma \lt s \lt \infty$. Then,
(i) a non-empty bounded subset Q of H s is equicontinuous in Hs if and only if
(ii) a sequence $\{q_n\}$ is convergent in H s if and only if it is convergent in H σ and equicontinuous in H s.
We use a number of equivalent ways to express equicontinuity in H s. To this end, we first state a useful lemma.
Lemma 3.5. Let $-1 \lt s \lt 1$ and consider the Fourier multiplier operator $w(-i\partial,\kappa)$ with multiplier
Then,
(i) for any $f\in \mathcal{S}$ and uniformly in $\kappa\geq 1$, we have
(ii) a non-empty bounded subset Q of H s is equicontinuous if and only if
Proof. All of part (i) was shown in [Reference Killip, Vişan and Zhang26, lemma 3.5]. For (ii), we first suppose that (3.15) holds. Then,
where $P_{ \lt \kappa}$ is the sharp projection onto frequencies $\{|\xi| \lt \kappa\}$ and $P_{\geq \kappa}:=\text{Id} -P_{ \lt \kappa}$. Hence, (3.15) implies Q is equicontinuous in Hs. For the reverse direction, given $\varkappa \geq 1$, (3.14) implies
On the other hand, for $0\leq s \lt 1$,
while if $-1 \lt s \lt 0$,
In either case, there exist $\beta_1(s), \beta_2(s) \gt 0$ such that
Using the equicontinuity of Q, we now obtain (3.15).
Lemma 3.6. Let $0\leq s \lt 1$. A non-empty bounded subset of Q of H s is equicontinuous in H s if and only if
The proof of this lemma is quite standard with the forward direction following from low and high frequency decomposition as in the proof of (ii) in lemma 3.5, and the converse follows from the inequality $\tfrac{2\xi^2}{\xi^2+4\kappa^2} \gt rsim 1-\chi_{[-\kappa,\kappa]}(\xi)$. We note that lemma 3.6 remains true with the condition (3.16) replaced by the condition
Proof. Proof of proposition 3.3
We let δ > 0 be small enough so that all prior results hold for any $q(0)\in B_{\delta_{0}, \kappa} \cap \mathcal{S}$, where $\delta_0 =\frac{1}{2}\delta$. We first show that $A(\varkappa)$ is conserved under the H mKdV flow and the mass M flow. Consider the M flow: by (2.15), (2.16), and (3.1), we have
Using (3.1), (2.15), (2.16), and (2.17), we have
Now we examine the first term. Using (2.16), (2.17), and (2.15),
Returning this to (3.18), we see that $\{H_{\text{mKdV}}, A(\varkappa)\}=0$.
To deduce the a priori bound and the equicontinuity claims, we exploit the conservation of $A(\kappa)$ under any Hamiltonian flow, which commutes with $A(\kappa)$. Given $q\in \mathcal{S}$, we let $\varkappa_0= 100(1+\|q\|_{L^2}^{2}) $ and define
for any $\varkappa \geq \varkappa_0$. The existence of $A(\varkappa,q)$ follows from the choice of $\varkappa_0$ and lemma 2.2. We may compute $A^{[2]}(\varkappa)$ explicitly, since
where in the second equality, we exploited the fact that q is real-valued. Therefore,
Recalling (1.6), lemma 2.2 implies
Fix $\kappa\geq 1$, let $q(0)\in B_{\delta, \kappa}\cap \mathcal{S}$, $\varkappa\geq \varkappa_0$ and let q(t) be the corresponding solutions in $\mathcal{S}$. In view of proposition 3.1, $A(\varkappa, q(t))$ is conserved and by commutativity of the mass M, we have that $q(t)\in B_{\delta,\kappa}\cap \mathcal{S}$ for all $t\in \mathbb{R}$. Moreover, (3.20) also holds uniformly in $t\in \mathbb{R}$.
Now, the conservation of $A(\kappa)$, (3.19) and (3.20) imply that for any $\varkappa\geq \varkappa_0$,
Given $N\in 2^{\mathbb{N}}$, we replace $\varkappa$ by $\varkappa N$ in (3.21) and apply (3.14) to obtain
provided that $s \lt \frac{1}{2}$. Choosing $\varkappa=\varkappa_0$, we arrive at (3.10).
For the equicontinuity claim, we return to (3.21) and replace $\varkappa$ by $\varkappa N$, multiply by $(\varkappa N)^{2s}$, and sum over $N\in 2^{\mathbb{N}}$ to obtain
Now, lemma 3.5 yields the claimed equicontinuity.
4. Well-posedness in H s, $s\geq 0$
The expansion (3.2) motivates us to define
Under the Poisson bracket (1.2), we have
since M and $A(\kappa)$ Poisson commute with H mKdV. We use the Hκ-flow to approximate the H mKdV-flow, as heuristically, (3.2) implies $H_{\kappa}=H_{\text{mKdV}}+\mathcal{O}(\kappa^{-2})$ for large κ.
Proposition 4.1. Well-posedness of the approximating flow
There exists δ > 0 so that for all $\kappa\geq 1$, the Hamiltonian flow induced by Hκ, namely
is globally well-posed for initial data in $B_{\delta, \kappa}$. These solutions conserve $A(\varkappa)$ for every $\varkappa\geq 1$ and whenever the initial data belong to $\mathcal{S}$, then the solutions do too. Moreover, under the Hκ flow, we have
Proof. The equation (4.2) follows from (3.1), which gives
Then, local well-posedness of (4.2) follows by first using the convenient change of variables $(t,x)\to (t,x-4\kappa^2 t)$ and noting that, by the diffeomorphism property of $g_{-}(\kappa)$, the nonlinearity is Lipschitz from $B_{\delta,\kappa}$ into H 2 so that the Cauchy–Lipschitz theorem applies. The estimates (2.26) and (2.27) imply that the solutions belong to $\mathcal{S}$ if the initial data do.
Global well-posedness follows from the conservation of $A(\varkappa)$ under the Hκ flow with the specific a priori bounds from proposition 3.3.
The claim that the solutions conserve $A(\varkappa)$ follows from proposition 3.1. Finally, the equation (4.3) follows from (3.8) and that the evolution $g_{-}(\varkappa)$ under the M-flow is simply linear transport in x at unit speed.
The key stepping stone to the proof of theorem 1.1 is the following result, which makes precise how we use the formal idea that Hκ approximates H mKdV for large κ:
Proposition 4.2. Let δ > 0 be sufficiently small, $\kappa\geq 1$, $\varkappa\geq 4$, and $Q\subseteq B_{\delta, \kappa}\cap \mathcal{S}$ be equicontinuous in L 2. Then,
Proof. Fix $\varkappa\geq 4$ and let $\kappa\geq 2\varkappa$. We verify the weaker statement
To see that this suffices to prove (4.4), we note that from proposition 3.3, the set $Q_{\ast}$ defined in (3.11), with $H_1=H_{\text{mKdV}}$ and $H_2=H_{\kappa}$, is equicontinuous in L 2. Hence, the diffeomorphism property of $\frac{\mu}{4\varkappa}g_{-}(\varkappa)$ from proposition 2.3 and the fact that for any $h\in \mathbb{R}$, $g_{-}(x+h;\varkappa, q(x))=g_{-}(x;\varkappa, q(x+h)),$ imply that the set
is equicontinuous in H 2. Combining this fact with (4.5), our desired (4.4) follows from lemma 3.4.
By the fundamental theorem of calculus, (4.5) follows from
Under the difference flow, (3.7) and (4.3) imply
Carefully rewriting the right-hand side of (4.7), we have
where
and we used (2.31) to express $\mathsf{err}_{8}$.
We need to estimate each of these in H −2. By proposition 2.3, the embedding $L^1\subset H^{-1}$, (2.13), and (2.33), we have
which are all acceptable bounds. For $\mathsf{err}_{4}$, we (2.12) and duality to estimate the product:
which yields the good bound
For $\mathsf{err}_{5}$, $\mathsf{err}_{7}$, and $\mathsf{err}_{8}$, we rely on equicontinuity to obtain acceptable bounds. Indeed, using $L^1 \subset H^{-1}$ and proposition 2.3, Hölder’s inequality and Sobolev embedding, we have
where the right-hand side of these estimates tend to zero as $\kappa \to \infty$, uniformly in $q\in Q_{\ast}$ and $|t|\leq T$, by lemma 3.6 and (3.17). Again, by duality, we obtain
and thus,
which tends to zero uniformly in $q\in Q_{\ast}$ as $\kappa \to \infty$ and $|t|\leq T$ by lemma 3.6.
It remains to estimate $\mathsf{err}_{6}$. We write
From (2.33) and proposition 2.3,
For $\mathsf{err}_{6,1}$, we start with some preliminary estimates. From (2.31) and (2.32), we have
Similarly, using (2.32), as well as arguing by duality, yields
and thus,
Thus, each individual term of (4.8) tends to zero in H −2, uniformly in $q\in Q_{\ast}$ and $|t|\leq T$ as $\kappa\to \infty$. This completes the proof of (4.6) and hence also the proof of (4.5).
Proof. Proof of theorem 1.1
We prove the following statement: given T > 0 and $q_{n}(0) \in \mathcal{S}$ converging in L 2, then the corresponding sequence of solutions $q_{n}(t)$ in $\mathcal{S}$ to (1.1) converge in L 2, uniformly in $|t|\leq T$.
To this end, let $\delta_0 \gt 0$ denote the smallest δ > 0 among all the claims in this article thus far. Then, we choose $\kappa \geq 1$ such that
Namely, we now have that $q_{n}(0)\in B_{\delta_0, \kappa}$, with κ independent of $n\in \mathbb{N}$.
Let $Q=\{q_{n}(0)\}$ and define $Q_{\ast}$ as in (3.11), with $H_1=H_{\text{mKdV}}$ and $H_2=H_{\kappa}$. Note that by proposition 3.3, $Q^{\ast}$ is equicontinuous in L 2. By the commutativity of the flows, we have
Hence,
By proposition 4.1, the well-posedness of the Hκ-flow in L 2 implies that the first term in (4.9) tends to zero as $n,m\to \infty$. To deal with the second term, we perform a change of variables: fix $\varkappa\geq 4$ and let $q(t):= e^{tJ\nabla (H_{\text{mKdV}}-H_{\kappa})}q$ for $q\in Q^{\ast}$, and we increase κ if necessary so that $\kappa\geq 2\varkappa$. Note that $q\in (Q_{\ast})_{\ast}$ for any $t\in \mathbb{R}$. We use the change of variables
By proposition 4.2, we have
The diffeomorphism property of the change of variables (4.10) then implies that (4.9) vanishes as $\kappa \to \infty$. The remaining claims in theorem 1.1 follow exactly as in [Reference Killip and Vişan25, corollary 5.2 and 5.3], with well-posedness in H s, for $0 \lt s \lt \tfrac{1}{2}$ employing the bounds and equicontinuity proved in proposition 3.3.
Remark 4.3. In the higher regularity setting of H s, for $s\geq \tfrac{1}{6}$, we do not need to apply the change of variables (4.10) and we can replace the approximation property in (4.4) with
Namely, we establish directly that the Hκ flow approximates the H mKdV flow. This is possible as the additional spatial regularity in $H^{\frac{1}{6}}$ implies that q 3 is a well-defined distribution. The change of variables (4.10) is chosen because, apart from its diffeomorphism property, it evolves simply under the linearized H mKdV-flow, and the nonlinear terms in (3.7) are well-defined in the sense of distributions under the weaker assumption of merely $q\in L^2$.
Acknowledgements
The author would like to thank Rowan Killip and Monica Vişan for their support and for helpful discussions.