1. Introduction
As a conservation law of energy, Hamiltonian systems are frequently considered to describe models arose in celestial mechanics or the motion of charged particles in magnetic fields, see [Reference Chierchia8, Reference Meyer24, Reference Wayne38].
The classical KAM theory, as presented by Arnold, Kolmogorov, and Moser [Reference Arnold1, Reference Kolmogorov20, Reference Moser25], posits that under the Kolmogorov non-degenerate condition, most invariant tori of an integrable Hamiltonian system can withstand small perturbations. While these tori might undergo minor deformations, they transform into other invariant tori that retain the original frequency.
Numerous methods have been explored to study the persistence of invariant tori and the preservation of toral frequency within Hamiltonian systems under certain non-degenerate conditions. For instance, the KAM approach was used in [Reference Benettin, Galgant, Giorgilli and Strelcyn2, Reference Bounemoura4, Reference de la Llave, González, Jorba and Villanueva12, Reference Li and Yi22, Reference Pöschel30, Reference Salamon35]. The direct method using Lindstedt series can be found in references [Reference Chierchia and Falcolini9, Reference Eliasson13, Reference Gallavotti15], while renormalization group techniques were discussed in [Reference Bricmont, Gawedzki and Kupiainen5, Reference Gallavotti, Gentile and Mastropietro16]. Notably, the study presented in [Reference Chow, Li and Yi10] introduced the idea of partial preservation of unperturbed frequencies and delved into the persistence problem on a specified smooth sub-manifold for real analytic Hamiltonian systems, particularly under the Rüssmann-like non-degenerate condition. For insights under analogous conditions, see also [Reference Sevryuk37].
Yet, in the context of persistence, two fundamental questions emerge that warrant attention:
Q1: In the event of a failure in the Kolmogorov non-degenerate condition, can the invariant tori with the same frequency still be preserved under small perturbations?
Q2: If the regularity of the frequency mapping diminishes to mere continuity, can the aforementioned result withstand small perturbations?
To shed light on these questions, we review previous findings and offer a more comprehensive overview.
1.1. Degeneracy
Consider the real analytic nearly integrable Hamiltonian system
where x is the angle variable in the standard torus $\mathbb{T}^{n}$, $n\geq2$ refers to the dimension; y is the action variable in a bounded closed region $G\subset \mathbb{R}^n$, and ɛ > 0 is a small parameter.
A fundamental assumption in historical research is the Kolmogorov non-degenerate condition. However, if we assume that there exists a $y_0\in G$ such that:
then the Kolmogorov non-degenerate condition is not satisfied. The spatial solar system serves as a prominent example of this situation, as detailed in [Reference Féjoz14]. Naturally, a question arises: does the persistence result still stand under these conditions? This question has been a primary motivation for this research.
In fact, even under weaker non-degenerate conditions, KAM tori might not preserve their frequencies. As demonstrated in [Reference Brjuno6, Reference Rüssmann34, Reference Sevryuk36], under the Brjuno non-degenerate condition and Rüssmann non-degenerate condition, the presumption of an unchanged frequency may not necessarily hold true. This is because the frequency of perturbed tori can undergo slight variations. Similar observations are noted in [Reference Biasco, Chierchia and Treschev3, Reference Cheng and Sun7, Reference Chow, Li and Yi10, Reference Cong, Küpper, Li and You11, Reference Heinz18, Reference Qian, Li and Yang32, Reference Xu, You and Qiu40]. Consequently, deriving conditions that assure the persistence of frequencies for KAM tori in the context of a degenerate Hamiltonian becomes rather challenging. Furthermore, the issue of the perturbed invariant tori maintaining a consistent frequency has seldom been tackled for degenerate systems.
1.2. Regularity
On the matter of regularity, it is worth noting the distinctions in the studies of various researchers. Kolmogorov [Reference Kolmogorov20] and Arnold [Reference Arnold1] focused on real analytic Hamiltonian systems. In contrast, Moser [Reference Moser25] illustrated that Hamiltonian systems do not necessarily need to be analytic; a high, albeit finite, level of regularity for the Hamiltonian suffices. This regularity requirement was later reduced to C 5 in work by [Reference Rüssmann and Kleine Nenner33]. Further important contributions on this topic can be found in [Reference Bounemoura4, Reference Herman19, Reference Koudjinan21, Reference Salamon35]. Moreover, the scenario where the frequency mapping has Lipschitz continuous parameters has been explored in [Reference Pöschel29]. A subsequent question of interest is: what are the implications when the regularity of the frequency mapping is merely Hölder continuous with respect to its parameters?
More precisely, we consider a family of Hamiltonian systems under small perturbations:
where $(y,x)\in G\times\mathbb{T}^n$ and ξ is a parameter in a bounded closed region $O\subset \mathbb{R}^n$. The function $\omega(\cdot)$, $\bar h(y,\cdot)=O(|y|^2)$ and $P(y,x,\cdot,\varepsilon)$ are Hölder continuous with respect to the parameter ξ with Hölder index β, for some $0 \lt \beta \lt 1$. The function $\bar h(\cdot,\xi)$ and $P(\cdot,\cdot,\xi,\varepsilon)$ are real analytic with respect to y and x. Additionally, ɛ > 0 is a small parameter.
It is important to note that in the conventional KAM iteration process, the regularity of the frequency mapping concerning the parameters must be at least Lipschitz continuous. This ensures that the parameter domain remains intact. However, when the regularity of the frequency mapping is less stringent than Lipschitz continuous, the traditional method of parameter excavation becomes infeasible. This necessitates the exploration of novel approaches to address the issue.
1.3. Our work
Regarding regularity, when the frequency mapping is Hölder continuous with respect to parameters, we prove that the perturbed invariant tori retain the same Diophantine frequency as their unperturbed counterparts for Hamiltonian systems as described in (1.2), see theorem 2.1. For the degeneracy problem, persistence results under the highly degenerate Hamiltonian system (1.1) are proved in theorem 2.5.
We establish sufficient conditions based on the topological degree condition $\mathrm{(A0)}$ and the weak convexity condition $\mathrm{(A1)}$ for frequency mapping. Detailed descriptions of these conditions are provided in section 2. In deriving our primary results, we employ the quasi-linear KAM iteration procedure as in [Reference Chow, Li and Yi10, Reference Han, Li and Yi17, Reference Li and Yi23, Reference Qian, Li and Yang31]. Notably, we use the technique of translation parameters to counteract frequency drift. Specifically, we adjust the action variable to maintain constant frequency for the highly degenerate Hamiltonian system (1.1). It is also noteworthy that the weak convexity condition proposed in this paper is necessary regardless of the smoothness level of the frequency mapping, as evidenced by proposition 2.3.
It should be pointed out that the KAM-type theorems associated with parameter family are due to Moser [Reference Moser26], Pöschel [Reference Pöschel28]. However, our results are different from theirs: a Diophantine frequency can be given in advance, but Moser’s systems need to be modified in KAM iteration and hence cannot be given beforehand; in Pöschel’s approach, the frequency set need to be dug out in KAM process. Our method is to find a parameter in the family of systems by translating parameter. Of course, it does not work generally. Recently, the persistence of invariant tori with prescribed frequencies has been developed in [Reference Xu and Lu39, Reference Xu and You41]. Here we prove the persistence of invariant tori, which form a Hölder continuous family with respect to parameters. As showed by proposition 2.3, the weak convexity condition (A1) is sharp to such persistence.
The rest of this paper is organized as follows. In section 2, we state our main results (theorems 2.1, 2.5, 2.6 and 2.8). We describe the quasi-linear iterative scheme, show the detailed construction and estimates for one cycle of KAM steps in section 3. In section 4, we complete the proof of theorem 2.1 by deriving an iteration lemma and showing the convergence of KAM iterations. In section 5, we prove theorem 2.5 , which covers the analytic situation, and is also a special case of theorem 2.1. We also prove theorem 2.8 by directly computing. Finally, the proof of theorem 2.6 can be found in appendix B.
2. Main results
To state our main results, we first need to introduce a few definitions and notations.
(1) Given a domain $D\subset G\times \mathbb{T}^n$, we let $\bar{D}$, $\partial D$ denote the closure and the boundary of D, respectively. $D^o:=\bar{D}\setminus\partial D$ refers to the interior.
(2) We shall use the same symbol $\vert\cdot\vert$ to denote an equivalent vector norm and its induced matrix norm, absolute value of functions, etc, and use $\vert\cdot\vert_D$ to denote the supremum norm of functions on a domain D.
(3) For the perturbation function $P(y,x,\xi)$, which is analytic about y and x and Hölder continuous about ξ with Hölder index β, $0 \lt \beta \lt 1$, we define its norm as follows
\begin{equation*}\vert\|P\|\vert_{D}={|P|_{D}}+\|P\|_{C^\beta}\end{equation*}where
(2.1)\begin{equation}\|P\|_{C^\beta}=\sup_{\xi\neq\zeta,~\xi,\zeta\in {O}}\frac{\vert P(y,x,\xi)-P(y,x,\zeta)\vert}{\vert\xi-\zeta\vert^\beta},~~~\forall (y,x)\in {D}.\end{equation}(4) For any two complex column vectors ξ, η in the same space, $\langle\xi,\eta\rangle$ always stands for $\xi^\top\eta$.
(5) id is the unit map, and Id is the unit matrix.
(6) For a vector-value function f, Df denotes the Jacobian matrix of f, and $J_f=det Df$ its Jacobian determinant.
(7) All Hamiltonians in the sequel are endowed with the standard symplectic structure.
(8) As pointed out in [Reference Pöschel30], the real analyticity of the Hamiltonian $H(y,x)$ about y and x on $G\times{\mathbb{T}^n}$ implies that the analyticity extends to a complex neighbourhood $D(s,r)$ of $G\times{\mathbb{T}^n}$, where $D(s,r)$ is defined for some $0 \lt s,r \lt 1$, with
\begin{equation*}D(s,r):=\{(y,x):|y| \lt s,\vert\textrm{Im}x\vert \lt r\}.\end{equation*}(9) For $\forall \delta \gt 0$, $y_0\in G$, let
\begin{align*} B_\delta(y_0)&=:\{y\in G:\vert y-y_0\vert \lt \delta\},\\ \bar B_\delta(y_0)&=:\{y\in G:\vert y-y_0\vert\leq\delta\}. \end{align*}
We are now ready to state our assumptions. Mainly we consider (1.2), i.e., for any ɛ > 0 small enough, we consider the parameterized family of perturbed Hamiltonian systems
First, we make the following assumptions:
(A0) Fix $\xi_0\in O^o$ such that
(2.2)\begin{align} &\deg\left(\omega(\cdot), O^o, \omega(\xi_0)\right)\neq0. \end{align}(A1) There exists a small neighbourhood $B_\varsigma(\xi_0)\subset O^o$ with ς > 0 and a constant σ > 0 such that
(2.3)\begin{align} \left\vert\omega(\xi)-\omega(\xi_{*})\right\vert\geq \sigma\left\vert\xi-\xi_{*}\right\vert^\beta,~~~~\forall \xi,\xi_{*}\in B_\varsigma(\xi_0), \end{align}where β is defined in (2.1).
(A2) For the given $\xi_0\in O^o$, $\omega(\xi_0)$ satisfies the Diophantine condition
(2.4)\begin{align} \left\vert\langle k,\omega(\xi_0)\rangle\right\vert \gt \frac{\gamma}{\vert k\vert^\tau},~~~k\in{\mathbb{Z}^n\setminus{\{0\}}}, \end{align}where $k=(k_1,\cdots,k_n)$, $\vert k\vert=\left\vert k_1\right\vert+\cdots+\left\vert k_n\right\vert$, γ > 0 and $\tau \gt n-1$.
Then, we have the following main results:
Theorem 2.1 Consider Hamiltonian system (1.2). Assume that $\mathrm{(A0)}$, $\mathrm{(A1)}$ and $\mathrm{(A2)}$ hold. Then there exists a sufficiently small $\varepsilon_0 \gt 0$, for any $0 \lt \varepsilon \lt \varepsilon_0$, there exist $\xi_\varepsilon\in O$ with $\xi_\varepsilon\rightarrow\xi_0$ as $\varepsilon\rightarrow0$ and a symplectic transformation $\Psi_*$ such that
where $e_*$ is a constant, $\bar h_*(y,\xi_{\varepsilon})=O(\vert y\vert^2)$, $P_*=O(\vert y\vert^2)$. Thus the perturbed Hamiltonian system $H(y,x,\xi_\varepsilon,\varepsilon)$ admits an invariant torus with frequency $\omega(\xi_0)$.
Moreover, there is a Hölder continuous family of $\xi_\varepsilon(\hat\xi_0)$ with respect to $\hat\xi_0\in \hat O$, where
such that the perturbed Hamiltonian system $H(y,x,\xi_\varepsilon(\hat\xi_0),\varepsilon)$ admits an invariant torus with frequency $\omega(\hat\xi_0)$, and $\xi_\varepsilon(\hat\xi_0)\rightarrow\hat\xi_0$ as $\varepsilon\rightarrow0$.
Remark 2.2. It should be emphasized that we deal with the degenerate Hamiltonian system in which the frequency mapping and the perturbation are Hölder continuous about parameters in this theorem.
In the following, we will give some examples to state that conditions $\mathrm{(A0)}$ and $\mathrm{(A1)}$ are indispensable, especially for condition $\mathrm{(A1)}$. See below for a counter example:
Proposition 2.3. Consider Hamiltonian system (1.2), for n = 2, with
where $\xi=(\xi_1,\xi_2)$,
$\bar{\omega}(\xi_0)=(\bar{\omega}_1,\bar{\omega}_2)^\top$ satisfies Diophantine condition (2.4), and
Then condition $\mathrm{(A1)}$ fails for any parameter $\xi\in(-\frac{1}{2},\frac{1}{2})\times(-\frac{1}{2},\frac{1}{2})$. Moreover, although the topological degree is not equal to 0, we cannot find ξɛ such that $\xi_\varepsilon\rightarrow\xi_0$ as $\varepsilon\rightarrow0$.
See appendix A for the complete proof.
Remark 2.4. This counter example implies that $\mathrm{(A1)}$ is necessary to ensure $\xi_\varepsilon\rightarrow\xi_0$, as $\varepsilon\rightarrow0$.
Nevertheless, one asks what happens to the frequency mapping in the analytic situation. As a special case of our theorem 2.1, we also obtain the Kolmogorov’s theorem for analytic Hamiltonian systems with degeneracy. This is stated in the following theorem.
Theorem 2.5 Consider real analytic Hamiltonian system (1.1). Fix $\xi_0\in G$ such that $\mathrm{(A0)}$, $\mathrm{(A1)}$ and $\mathrm{(A2)}$ hold for $\omega(\xi)=\nabla h(\xi)$, O = G, and $\beta\geq 1$. Then there exists a sufficiently small positive constant $\varepsilon' \gt 0$ such that if $0 \lt \varepsilon \lt \varepsilon'$, there exists $y_\varepsilon\in G$ such that Hamiltonian system (1.1) at $y=y_\varepsilon$ admits an invariant torus with frequency $\nabla h(\xi_0)$.
This theorem is proved in section 5.1.
Next, we give an example that satisfies conditions $\mathrm{(A0)}$-$\mathrm{(A1)}$. For simplicity we use the action variable y as the parameter ξ.
Theorem 2.6 Consider Hamiltonian system (1.1) with
where $y\in G\subset \mathbb{R}^n$, l is a positive integer, $\omega\in \mathbb{R}^n\setminus{\{0\}}$ satisfies the Diophantine condition (2.4). Then Hamiltonian system (1.1) admits an invariant torus with frequency ω for any small enough perturbation.
The proof can be found in appendix B.
Proposition 2.7. If $h(y)=\langle \omega,y\rangle+\frac{1}{2l+1}\left\vert y\right\vert^{2l+1}$ in Hamiltonian system (1.1), $\omega\in \mathbb{R}^n\setminus{\{0\}}$ satisfies the Diophantine condition (2.4), then the system may not admit invariant torus with frequency ω.
The proof can be found in appendix C.
Above results imply that condition $\mathrm{(A0)}$ is indispensable for n > 1 case. Furthermore, we also prove that for n = 1, the persistence results in theorem 2.6 hold under some weaker conditions, provided that the frequency satisfies Diophantine condition $\mathrm{(A2)}$.
Theorem 2.8 Consider Hamiltonian (1.1), for n = 1, with
where $y\in G=[-1,1]\subset \mathbb{R}^1$, ω satisfies Diophantine condition (2.4).
(1) If $g(y)\in C^{2\ell+1}$, $g'(0)=\cdots=g^{2\ell}(0)=0$, $g^{2\ell+1}(0)\neq 0$, $\ell$ is a positive integer, then the perturbed system admits at least two invariant tori with frequency ω for the small enough perturbation satisfying $\varepsilon P'(y)\, sign (g^{2\ell+1}(0)) \lt 0$; conversely, if $\varepsilon P'(y)\, sign (g^{2\ell+1}(0)) \gt 0$, the unperturbed invariant torus with frequency ω will be destroyed.
(2) If $g(y)\in C^{2\ell+2}$, $g'(0)=\cdots=g^{2\ell+1}(0)=0$, $g^{2\ell+2}(0)\neq 0$, $\ell$ is a positive integer, then the perturbed system admits an invariant tori with frequency ω for any small enough perturbation.
Remark 2.9. We do not know whether the results in theorem 2.8 can be extended to higher dimensions or not.
3. KAM step
In this section, we describe the quasi-linear iterative scheme, show the detailed construction and estimates for one cycle of KAM steps, which is essential to study the KAM theory, see [Reference Chow, Li and Yi10, Reference Han, Li and Yi17, Reference Li and Yi22, Reference Li and Yi23, Reference Pöschel30]. It should be pointed out that in our KAM iteration, we translate the parameter domain; while in the usual KAM iteration, one has to dig out a decreasing series of parameter domains, see [Reference Chow, Li and Yi10, Reference Han, Li and Yi17, Reference Li and Yi23, Reference Pöschel29–Reference Qian, Li and Yang32].
3.1. Description of the 0th KAM step
Given an integer $m\geq\beta+1$, where β is defined in (2.1). Denote $\rho=\frac{1}{2(m+1)}$, and let η > 0 be an integer such that $(1+\rho)^\eta \gt 2$. We define
Consider Hamiltonian (1.2). We first define the following 0th KAM step parameters:
where $0 \lt s_0,\gamma_0,\mu_0\leq 1$, $\tau \gt n-1$, $M^* \gt 0$ is a constant defined as in lemma 3.4, and
Therefore, we can write
where $(y,x,\xi)\in D(s_0,r_0)\times O_0$.
We first prove an important estimate.
Proof. Using the fact $\gamma_0^{n+m+2}=\varepsilon^{\frac{1}{4}}$ and $\left[\log\frac{1}{\mu_0}\right]+1 \lt \frac{1}{\mu_0}$, we have
Moreover, let $\varepsilon_0 \gt 0$ be small enough so that
using the fact that $\mu_0=\varepsilon^{\frac{1}{8\eta(\tau+1)(m+1)}}$, we get
and by (3.4) and $0 \lt \varepsilon \lt \varepsilon_0$,
i.e.,
which implies (3.3).
This completes the proof.
3.2. Induction from the νth KAM step
3.2.1. Description of the νth KAM step
We now define the νth KAM step parameters:
where $\rho=\frac{1}{2(m+1)}$.
Now, suppose that at νth step, we have arrived at the following Hamiltonian:
defined on $D(s_\nu,r_\nu)\times O_{\nu}$ and
The equation of motion associated to Hν is
Except for additional instructions,we will omit the index for all quantities of the present KAM step (at νth step) and use + to index all quantities (Hamiltonians, domains, normal forms, perturbations, transformations, etc.) in the next KAM step (at $(\nu+1)$th step). Using this notation, ξν is abbreviated as ξ, which can easily be confused with the parameter ξ. Therefore, we will not omit the subscript of ξν and continue to denote it as ξν. To simplify the notations, we will not specify the dependence of P, $P_+$ etc. All the constants c 1-c 6 below are positive and independent of the iteration process, and we will also use c to denote any intermediate positive constant which is independent of the iteration process.
Define
3.2.2. Construct a symplectic transformation
We will construct a symplectic coordinate transformation $\Phi_{+}$:
such that it transforms the Hamiltonian ($3.7$) into the Hamiltonian of the next KAM cycle (at $(\nu+1)$th step), i.e.,
where $N_{+}$ and $P_{+}$ have similar properties as N and P respectively on $D(s_{+},r_{+})\times O_+$, and the equation of motion (3.9) is changed into
In the following, we prove (3.12). Let $\Phi_+(y_+,x_+):=\left(\Phi_+^1(y_+,x_+),\Phi_+^2(y_+,x_+)\right)$, by (3.10), we have
Then this together with (3.9) yields
where J is the standard symplectic matrix, i.e.,
This finishes the proof of (3.12).
Next, we show the detailed construction of $\Phi_+$ and the estimates of $P_+$.
3.2.3. Truncation
Consider the Taylor-Fourier series of P:
and let R be the truncation of P of the form
Remark 3.2. As demonstrated in [Reference Pöschel30], truncating y to first order is feasible. When proving theorem 2.1 in this paper, truncating y to first order is permissible (i.e., m = 1), though truncating to higher orders is also valid. However, in proving theorem 2.5, truncating y to higher orders is necessary (i.e., $m\geq\beta+1$); see (5.12) in subsection 5.1. Because we regard the action variable as the parameter in proving theorem 2.5, and the translation of the parameter is equivalent to the translation of the action variable, we need the integer $m\geq\beta+1$ to ensure that the translated action variable y does not run out of the given domain. Here, to make the KAM steps applicable to both theorems, we truncated y to higher orders. In this paper, we denote the order of truncation as m, which provides a more general description of the KAM iteration, clarifying how estimates vary with truncation and demonstrating the role of higher-order terms in the normal form, even though it may make the proof appear somewhat more complicated.
Next, we will prove that the norm of P − R is much smaller than the norm of P by selecting truncation appropriately, see lemma below.
Lemma 3.3. Assume that
Then there is a constant c 1 such that
Proof. Denote
Then
To estimate I, we note by (3.8) that
where the first inequality has been frequently used in [Reference Chow, Li and Yi10, Reference Cong, Küpper, Li and You11, Reference Han, Li and Yi17, Reference Li and Yi22, Reference Pöschel30–Reference Qian, Li and Yang32, Reference Salamon35] and the detailed proof see [Reference Salamon35]. This together with ${\mathbf{(H1)}}$ yields
It follows from (3.8) and (3.16) that
For $\vert p\vert=m+1$, let $\int$ be the obvious antiderivative of $\frac{\partial^{p}}{\partial y^p}$. Then the Cauchy estimate of P − I on Dα yields
Thus,
and therefore,
Next, we estimate $\left\|P-R\right\|_{C^\beta}$. In view of the definition of $\|\cdot\|_{C^\beta}$, for $\forall y,x\in D_\alpha$, we have
where the third inequality follows from Cauchy estimate and the last inequality follows from (3.8).
Similarly, we get
It follows from (3.17), (3.18), (3.19) and (3.20) that (3.13) and (3.14) hold.
This completes the proof.
3.2.4. Homological equation
As usual, we shall construct a symplectic transformation as the time-1 map $\phi_{F}^1$ of the flow generated by a Hamiltonian F to eliminate all resonant terms in R, i.e., all terms
To do so, we first construct a Hamiltonian F of the form
satisfying the equation
where $[R]=\frac{1}{(2\pi)^n}\int_{T^n}R(y,x)dx$ is the average of the truncation R.
Substituting (3.21) into (3.22) yields that
By comparing the coefficients above, we then obtain the following quasi-linear equations:
We declare that the quasi-linear equations (3.23) are solvable under some suitable conditions. The details can be seen in the following lemma:
Lemma 3.4. Assume that
where
Then the quasi-linear equations (3.23) can be uniquely solved on D(s) to obtain a family of functions $f_{k\imath}$ which are analytic in y, Hölder continuous about ξ with Hölder index β, and satisfy the following properties:
for all $\vert\imath\vert\leq m, 0 \lt \vert k\vert\leq K_+, \vert i\vert\leq 2$, where c 2 is a constant.
Proof. For $\forall y\in D(s)$, by $\mathbf{(H2)},\mathbf{(H3)}$, we have
and
which imply that
It follows from (3.25) and $\mathrm{(A2)}$ that for $\xi\in O$, $y\in D(s)$, $0 \lt |k|\leq K_+$,
Hence
is invertible, and
for all $y\in D(s)$, $\xi\in O$, $0 \lt \vert k\vert\leq K_+$, $\vert\imath\vert\leq m$. The analyticity of $\bar h(y)$ and $p_{k\imath}$ implies that $f_{k\imath}$ is analytic with respect to y. It follows from (3.23) that the Hölder continuity of $f_{k\imath}$ with respect to the parameter ξ is determined by $\omega(\xi)$, $\bar{h}$ and $p_{k\imath}$, which are Hölder continuous about parameter ξ with Hölder index β. Let $0 \lt \vert k\vert\leq K_+$. We note by the first inequality of (3.15) and Cauchy estimate that
and
So, by (3.28), (3.29) and (3.30), we get
This completes the proof.
Next, we apply the above transformation $\phi_F^1$ to Hamiltonian H, i.e.,
where
3.2.5. Translation
In this subsection, we will translate the parameter domain O to $O_+$ so that appropriate parameters can be found in $O_+$ to ensure the frequency remains unchanged in the next KAM step. To achieve this, we need to find an appropriate $\xi_+\in O$.
3.2.6. Frequency-preserving
In this subsection, we find the parameter $\xi_+$ such that the new frequency $\omega_+$ at $\xi_+$ is equal to the given frequency $\omega_0(\xi_0)$. Recall the topological degree condition $\mathrm{(A0)}$ and the weak convexity condition $\mathrm{(A1)}$. The former ensures that the parameter $\xi_+$ can be found in the parameter set to keep the frequency unchanged at this KAM step. The latter ensures that the distance between $\xi_+$ and ξ is smaller than the distance between ξ and $\xi_{\nu-1}$, i.e., the sequence of parameters is convergent after an infinite number of iteration steps. The following lemma is crucial to our arguments.
Lemma 3.5. Assume that
There exists $\xi_+\in B_{c\mu^{{1}/{\beta}}}(\xi_\nu)\subset O^o$ such that
Proof. The proof will be completed by an induction on ν. We start with the case ν = 0. It is obvious that $\omega_0(\xi_0)=\omega_0(\xi_0)$. Now assuming that for some ν > 0 we have
where $\omega_i(\xi)=\omega_0(\xi)+\sum_{j=0}^{i-1}p_{01}^j(\xi)$. We need to find $\xi_+$ near ξ such that
In view of the property of topological degree, ${\mathbf{(H4)}}$ and $\mathrm{(A0)}$, we have
i.e., there exists at least a $\xi_+\in O^o$ such that (3.36) holds.
Next, we estimate $\vert\xi_+-\xi_\nu\vert$. (3.14) in lemma 3.3 implies that
i.e.,
According to (3.37) and (3.38), we get
This together with $\mathrm{(A1)}$ and (3.39) yield
Then, by (3.41) and (3.14) in lemma 3.3, we have
which implies $\xi_+\in B_{c\mu^{{1}/{\beta}}}(\xi_\nu)$. From $\xi\in O^o$ in (3.37) and the fact that ɛ is small enough (i.e., µ is small enough), we have $B_{c\mu^{{1}/{\beta}}}(\xi_\nu)\subset O^o$.
This completes the proof.
3.2.7. Estimate on $N_+$
Now, we give the estimate on the next step $N_+$.
Lemma 3.6. There is a constant c 3 such that the following conclusions hold:
3.2.8. Estimate on $\Phi_+$
Recall that F is as in (3.21) with the coefficients and its estimate given by lemma 3.4. Then, we have the following estimate on F.
Lemma 3.7. There is a constant c 4 such that for all $\vert j\vert+\vert i\vert\leq 2$,
Proof. By (3.21) and (3.24), we have
This completes the proof.
Lemma 3.8. Assume that
Then the following conclusions hold.
(1) For all $0\leq t\leq 1$, the mappings
(3.46)\begin{align} \phi_F^t&:D_{\frac{1}{4}\alpha}\rightarrow D_{\frac{1}{2}\alpha} \end{align}are well defined.
(2) $\Phi_+=\phi_F^1:D_+\rightarrow D(s,r).$
(3) There is a constant c 5 such that
\begin{align*} \left\vert\left\|\phi_F^t-id\right\|\right\vert_{\tilde{D}}&\leq c_5\mu\Gamma(r-r_+),\\ \left\vert\left\|D\phi_F^t-Id\right\|\right\vert_{\tilde{D}}&\leq c_5\mu\Gamma(r-r_+),\\ \left\vert\left\|D^2\phi_F^t\right\|\right\vert_{\tilde{D}}&\leq c_5\mu\Gamma(r-r_+). \end{align*}(4)
\begin{align*} \left\vert\left\|\Phi_+-id\right\|\right\vert_{\tilde{D}}&\leq c_5\mu\Gamma(r-r_+),\\ \left\vert\left\|D\Phi_+-Id\right\|\right\vert_{\tilde{D}}&\leq c_5\mu\Gamma(r-r_+),\\ \left\vert\left\|D^2\Phi_+\right\|\right\vert_{\tilde{D}}&\leq c_5\mu\Gamma(r-r_+). \end{align*}
Proof. First, we prove (1). To verify (3.46), we denote $\phi_{F_1}^t$ and $\phi_{F_2}^t$ as the components of $\phi_{F}^t$ in the y and x planes, respectively. Let $X_F=(F_y,-F_x)^\top$ be the vector field generated by F. Then
For any $(y,x)\in D_{\frac{1}{4}\alpha}$, we let $t_*=\sup\{t\in[0,1]:\phi_F^t(y,x)\in D_\alpha\}$. Then for any $0\leq t\leq t_*$, by $(y,x)\in D_{\frac{1}{4}\alpha}$, (3.45) in lemma 3.7, ${\mathbf{(H5)}}$ and ${\mathbf{(H6)}}$, we can get the following estimates:
Thus, $\phi_F^t\in D_{\frac{1}{2}\alpha}\subset D_\alpha$, i.e. $t_*=1$ and (1) holds.
It follows from (1) that (2) holds.
We now prove (3). Using (3.45) in lemma 3.7 and (3.47), we immediately have
By (3.45) in lemma 3.7, (3.47) and Gronwall Inequality, we get
It follows from the induction and a similar argument that we have the estimate on the 2-order derivative of $\phi_F^t$, i.e.,
(4) now follows from (3).
This completes the proof.
3.2.9. Estimate on $P_+$
In the following, we estimate the next step $P_+$.
Lemma 3.9. Assume ${\mathbf{(H1)}}$-${\mathbf{(H6)}}$. Then there is a constant c 6 such that
Moreover, if
then
Proof. By (3.13) and (3.14) in lemma 3.3, (3.45) in lemma 3.7 and lemma 3.8 (3), we have that, for all $0\leq t\leq 1$,
So, by (3.35),
By $\mathbf{(H7)}$, we see that
which implies (3.49).
This completes the proof.
This completes one cycle of KAM steps.
4. Proof of theorem 2.1
4.1. Iteration lemma
In this subsection, we will prove an iteration lemma which guarantees the inductive construction of the transformations in all KAM steps.
Let $r_0,s_0,\alpha_0, \mu_0,H_0,N_0,e_0,\bar h_0,P_0$ be given at the beginning of section 3 and let $D_0=D(s_0,r_0)$, $K_0=0$, $\Phi_0=id$. We define the following sequences inductively for all $\nu=1,2,\cdots$:
Lemma 4.1. Denote
If ɛ is small enough, then the KAM step described on the above is valid for all $\nu=0,1,\cdots$, resulting the sequences
$\nu=1,2,\cdots,$ with the following properties:
(1)
(4.1)\begin{align} \left\vert e_{\nu+1}-e_{\nu}\right\vert&\leq\frac{\mu_*^{\frac{1}{2}}}{2^{\nu}}, \end{align}(4.2)\begin{align} \left\vert e_\nu-e_{0}\right\vert&\leq2\mu_*^{\frac{1}{2}}, \end{align}(4.3)\begin{align} \left\vert\left\|\bar h_{\nu+1}-\bar h_{\nu}\right\|\right\vert_{D(s_{\nu})}&\leq\frac{\mu_*^{\frac{1}{2}}}{2^{\nu}}, \end{align}(4.4)\begin{align} \left\vert\left\|\bar h_\nu-\bar h_{0}\right\|\right\vert_{D(s_\nu)}&\leq2\mu_*^{\frac{1}{2}}, \end{align}(4.5)\begin{align} \left\vert\left\|P_\nu\right\|\right\vert_{D(s_\nu,r_\nu)}&\leq\frac{\mu_*^{\frac{1}{2}}}{2^{\nu}}, \end{align}(4.6)\begin{align} \left\vert\xi_{\nu+1}-\xi_{\nu}\right\vert&\leq\left(\frac{\mu_*^{\frac{1}{2}}}{2^{\nu}}\right)^{\frac{1}{\beta}}. \end{align}(2) $\Phi_{\nu+1}:\tilde{D}_{\nu+1}\rightarrow \tilde{D}_{\nu}$ is symplectic, and
(4.7)\begin{align} \left\vert\left\|\Phi_{\nu+1}-id\right\|\right\vert_{\tilde{D}_{\nu+1}}\leq\frac{\mu_*^{\frac{1}{2}}}{2^{\nu}}. \end{align}Moreover, on $D_{\nu+1}$,
\begin{equation*} H_{\nu+1}=H_\nu\circ\Phi_{\nu+1}=N_{\nu+1}+P_{\nu+1}. \end{equation*}
Proof. The proof amounts to the verification of $\mathbf{(H1)}$-$\mathbf{(H7)}$ for all ν. For simplicity, we let $r_0=1$. It follows from ɛ small enough that µ 0 is small. So, we see that $\mathbf{(H2)}$, $\mathbf{(H4)}$-$\mathbf{(H7)}$ hold for ν = 0. From (3.2), $\mathbf{(H3)}$ holds for ν = 0. According to the definition of µν, we see that
Let $\zeta\gg1$ be fixed and µ 0 be small enough so that
Then
Denote
We notice that
Since
it is obvious that if ζ is large enough, then
which implies that $\mathbf{(H7)}$ holds for all $\nu\geq1$, and
By (4.11) and (4.12), it is easy to verify that $\mathbf{(H5)}$ and $\mathbf{(H6)}$ hold for all $\nu\geq1$ as ζ is large enough and ɛ is small enough.
By (3.14) in lemma 3.3 and (4.10), we have
which implies $\mathbf{(H4)}$.
To verify $\mathbf{(H3)}$, we observe by (4.8) and (4.10) that
as ζ is large enough. Then
which verifies $\mathbf{(H3)}$ for all $\nu\geq1$.
Let $\zeta^{1-\rho}\geq2$ in (4.9), (4.10). We have that for all $\nu\geq1$
The verification of $\mathbf{(H2)}$ follows from $(4.15)$ and an induction application of $(3.44)$ in lemma 3.6 for all $\nu=0,1,\cdots.$
Since $\left(1+\rho\right)^\eta \gt 2$, we have
It follows from above that
as µ is small, which is ensured by making ɛ small. Thus,
i.e. $\mathbf{(H1)}$ holds.
Above all, the KAM steps described in section 3 are valid for all ν, which give the desired sequences stated in the lemma.
Now, (4.1) and (4.3) follow from lemma 3.6, (4.15) and (4.17); by adding up (4.1) and (4.3) for all $\nu=0,1,\cdots$, we can get (4.2) and (4.4); (4.5) follows from (3.49) in lemma 3.9 and (4.15); (4.6) follows from (3.42) in lemma 3.6 and (4.15); (2) follows from lemma 3.8.
This completes the proof.
4.2. Convergence
The convergence is standard. For the sake of completeness, we briefly give the framework of proof. Let
By lemma 4.1, we have
$\nu=0,1,\cdots,$ where $\Psi^0=id$, $\omega_\nu(\xi)=\omega_0(\xi)+\sum_{j=0}^{\nu-1}p_{01}^j\left(\xi\right)$. Using (4.7) and the identity
it is easy to verify that $\Psi^\nu$ is uniformly convergent and denote the limitation by $\Psi^\infty$.
In view of lemma 4.1, it is obvious to see that eν, $\bar{h}_\nu$ and ξν converge uniformly about ν, and denote its limitation by $e_\infty$, $\bar{h}_\infty$ and $\xi_\infty$. By lemma 3.5, we have
Taking limits at both sides of (4.18), we get
Then, on $D(\frac{s_0}{2})$, Nν converge uniformly to
Hence, on $D\left(\frac{s_0}{2},\frac{r_0}{2}\right)$,
converge uniformly to
Since
by (4.15), we have that it converges to 0 as $\nu\rightarrow\infty$. So, on $D(0,\frac{r_0}{2})$,
Thus, for the given $\xi_0\in O$, the Hamiltonian
admits an analytic, quasi-periodic, invariant n-torus $\mathbb{T}^n\times\{0\}$ with the Diophantine frequency $\omega(\xi_0)$, which is the corresponding unperturbed toral frequency.
Recall
For given $\hat\xi_0,\tilde\xi_0\in\hat O$, similar to (4.19), we obtain
Then
i.e.,
It follows from the Hölder continuity of ω and (4.22) that $\xi_\infty(\hat\xi_0)$ is Hölder continuous about $\hat\xi_0\in \hat O$. Using the above proof process and replacing ξ 0 by $\hat\xi_0$, we can get an invariant torus with the Diophantine frequency $\omega(\hat\xi_0)$.
5. Proof of theorem 2.5 and theorem 2.8
First, we briefly outline the proof framework of theorem 2.5 as it follows the KAM step in section 3. Here, we mainly highlight two major differences from the proof of theorem 2.1. The first difference is that we utilize homotopy invariance and excision of the topological degree to keep the frequency unchanged during the iteration process, rather than selecting parameters. This is because we consider a Hamiltonian, not a family of Hamiltonians. The second difference is that the transformation is defined on a smaller domain because we treat the action variable as a parameter, and the translation of the parameter is equivalent to the action variable’s. We need the integer $m\geq\beta+1$ to ensure that the translated action variable y does not run out of the given domain.
5.1. Proof of theorem 2.5
In this section, we will describe the translation of action variable and state how the frequency can be preserved in the iterative process, which are different from subsection 3.2.6.
Let $\xi_0\in{G}$ be fixed as statement $\mathrm{(A0)}$. The Taylor expansion of Hamiltonian (1.1) about ξ 0 reads
where $e_0=h(\xi_0)$, $\omega_0(\xi_0)=\nabla h(\xi_0)$, $\bar{h}(y-\xi_0)=O(\left\vert y-\xi_0\right\vert^2)$. Using the transformation $(y-\xi_0)\rightarrow y$ in the above, we have
where (y, x) lies in a complex neighbourhood $D(s,r)$. Denote
For convenience, we will omit the index for all quantities of the present KAM step (at νth step) and use + to index all quantities in the next KAM step (at $(\nu+1)$th step). Now, suppose that at νth step, we have arrived at the following real analytic Hamiltonian:
First, we shall construct a symplectic transformation as the time-1 map $\phi_{F}^1$ such that it transforms the Hamiltonian H into the following form:
The concrete details see subsection 3.2.4
Next, we will construct a translation so as to keep the frequency unchanged. Consider the translation
where $\xi_+$ is to be determined. Let
Then
where
As in subsection 3.2.6, we will show that the frequency can be preserved in the iteration process. The following lemma is crucial to our arguments.
Lemma 5.1. There exists $\xi_+\in B_{s\mu^{{1}/{\beta}}}(\xi)$ such that
Proof. The proof will be completed by an induction on ν. We begin with the case ν = 0. It is obvious that $\omega_0(\xi_0)=\omega_0(\xi_0)$. Now supposing that for some $\nu\geq0$ we have
where $i=1,2,\cdots,\nu.$ Then, we need to find $\xi_+$ near ξ such that $\omega_+(\xi_+)=\omega(\xi)$. In view of (5.6), we observe that
We split
Consider homotopy $H_t(y):[0,1]\times G\rightarrow \mathbb{R}^n$,
For any $y\in\partial G$, $t\in[0,1]$, by $\mathrm{(A1)}$, we have that
where $\delta:=\min\{\vert y-\xi_0\vert, \forall y\in\partial G\}$.
So, it follows from homotopy invariance and $\mathrm{(A0)}$ that
We note by $\mathrm{(A1)}$, (5.9) and (5.10) that for any $y\in G\backslash B_{s\mu^{{1}/{\beta}}}(\xi)$,
where the last equality follows from $m\geq \beta+1$.
Hence, by excision, we have that
i.e., there exists at least a $\xi_+\in B_{s\mu^{{1}/{\beta}}}(\xi)$, such that $H_1(\xi_+)=0,$ i.e.,
which implies (5.8).
This completes the proof.
In the following, we prove
Recall that $m\geq\beta+1$ and $\alpha=\mu^{\frac{1}{m+1}}$, we have
For $\forall (y,x)\in D_{\frac{1}{8}\alpha}$, we note by $\xi_+\in B_{s\mu^{{1}/{\beta}}}(\xi)$ in lemma 5.1 and (5.13) that
which implies (5.12).
Next, we prove theorem 2.8 by a direct method.
5.2. Proof of theorem 2.8
(1) The unperturbed motion of (1.1) is described by the equation
The flow is $x=h'(y)t+x_0,y\in G$, where x 0 is an initial condition. Notice that
i.e., h(y) is degenerate at $\xi_0=0$. Obviously, by simple calculation, we get
i.e., $\mathrm{(A0)}$ fails, then theorem 2.6 is not applicable.
Note that the perturbed motion equation is
The flow is $x=\left(h'(y)+\varepsilon P'(y)\right)t+x_1,~y\in G$, where x 1 is an initial condition. To ensure the frequency is equal to $h'(0)$, we need to find a solution of the following equation in G:
i.e.,
Notice that the Taylor expansion of $g'(y)$ at $\xi_0=0$ is
then the equation (5.14) is equivalent to
which is solvable provided that $\varepsilon P'(y)\, sign \left(g^{2\ell+1}(0)\right) \lt 0$. So the perturbed system admits at least two invariant tori with frequency $\omega=h'(0)$ for the small enough perturbation satisfying $\varepsilon P'(y)\, sign \left(g^{2\ell+1}(0)\right) \lt 0$. Conversely, if $\varepsilon P'(y)\, sign \left(g^{2\ell+1}(0)\right) \gt 0$, the unperturbed invariant torus with frequency $\omega=h'(0)$ will be destroyed.
(2) Note that h(y) is degenerate in $\xi_0=0$. Obviously, by simple calculation, we get
Then, by theorem 2.6, the above persistence result hold. In addition, we can also directly prove this result. Similarly, we need to solve the following equation in G:
i.e.,
Notice that the Taylor expansion of $g'(y)$ at $\xi_0=0$ is
then the equation (5.15) is equivalent to
whose solution always exists in G for any small enough perturbation. Hence, the perturbed system admits an invariant torus with frequency $\omega=h'(0)$ for any small enough perturbation.
Acknowledgements
The authors would like to thank the editors and referees for their valuable suggestions and comments. The first author (Jiayin Du) is supported by the Fundamental Research Funds for the Central Universities (Grant number 2412024QD003). The second author (Yong Li) is supported by National Basic Research Program of China (Grant number 2013CB8-34100), National Natural Science Foundation of China (Grant numbers 11571065, 11171132, and 12071175), and Natural Science Foundation of Jilin Province (Grant number 20200201253JC).
Appendix A. Proof of proposition 2.3
Proof. Obviously, for $\forall \xi\in (-1,1)\times(-1,1)$,
and for $\forall \xi\in\partial(-1,1)\times(-1,1),$
Using Borsuk’s theorem in [Reference Motreanu, Motreanu and Papageorgiou27], we have
i.e.,
i.e., $\mathrm{(A0)}$ holds. For $\xi,\xi_*\in\left[-\frac{1}{2},\frac{1}{2}\right]$, and $\xi\neq\xi_*$, we have
but
which shows that $\mathrm{(A1)}$ fails. Note that the flow of unperturbed motion equation is
where x 0 is an initial condition, and the flow of perturbed motion equation is
In order to keep the frequency $\omega(0)=\bar\omega$ unchanged, we have to solve the following equation
i.e.
which implies that the second component ξ 2 of solution ξ is discontinuous and alternately appears on $\left(-1,-\frac{1}{2}\right)$ and $\left(\frac{1}{2},1\right)$ as $\varepsilon\rightarrow0_+$. Hence we cannot find a solution ξɛ such that $\xi_\varepsilon\rightarrow\xi_0$ as $\varepsilon\rightarrow0$.
Appendix B. Proof of theorem 2.6
Proof. Notice that
For $0 \lt \delta \lt 1$, $B_\delta(0)$ denotes the open ball centred at the origin with radius δ. We have that $\nabla h(y)-\nabla h(0)$ is odd and unequal to zero on $\partial B_\delta(0)$, i.e.,
It follows from Borsuk’s theorem in [Reference Motreanu, Motreanu and Papageorgiou27] that,
Obviously, there exist $\sigma=\frac{\min_{y\in B_\delta(0)}\{{(y+\varrho)\vert y+\varrho\vert^{2\ell}-y\vert y\vert^{2\ell}}\}}{2\varrho^{2\ell+1}}$ and $L=2l+1$ such that
where ϱ > 0, $B_\varrho(y_*)\subset B_\delta(0)$. So, by theorem 2.5, the perturbed system admits an invariant torus with frequency ω for any small enough perturbation.
Appendix C. Proof of proposition 2.7
Proof. Let $\varepsilon P=\varepsilon y, \varepsilon \gt 0$. Notice that for $y\in G\subset\mathbb{R}^1$,
which implies that the Hamiltonian H is degenerate at y = 0. By the definition of degree, we have for $0 \lt \delta \lt 1$
i.e., $\mathrm{(A0)}$ fails. Then, theorem 2.5 cannot be used to prove the persistence result of keeping frequency unchanged.
Note that the flow of unperturbed motion equation at y = 0 is
where x 0 is an initial condition, and the flow of perturbed motion equation is
In order to preserve frequency ω, we need to solve $y^{2\ell}+\varepsilon=0$ in G, which has no real solution in G. Hence, the persistence result of keeping frequency unchanged fails.