1. Introduction and statement of results
A foundational result in Diophantine approximation is Dirichlet's approximation theorem, which asserts that for every real number $\alpha$ there are infinitely many coprime solutions $(p,q)$ to the inequality
It is well known that this result is optimal up to constant factors for numbers $\alpha$ whose partial quotients in the continued fraction representation are bounded (so-called badly approximable numbers). Metric number theory asks to what extent (1) can be improved for typical reals $\alpha$, in the sense that the exceptional set has vanishing Lebesgue measure.
One of the fundamental results of metric Diophantine approximation is Khintchine's theorem [Reference KhintchineKhi24]. Let $\psi (q)$ be a non-negative sequence, and suppose that $q \psi (q)$ is non-increasing. Then the inequality
has infinitely many integer solutions $(p,q)$ for almost all real numbers $\alpha$, provided that the series $\sum _{q=1}^\infty \psi (q)$ diverges. In contrast, inequality (2) has only finitely many solutions for almost all $\alpha$ if this series converges. Very roughly speaking, this says that for typical reals the Dirichlet approximation theorem can be improved by a factor of logarithmic order. By periodicity, it is sufficient to consider $\alpha \in [0,1]$. It can easily be seen that Khintchine's theorem addresses the question whether the set system
contains a given real $\alpha$ for infinitely or only finitely many values of $q$. If we assume that $\psi (q) \leq 1/2$ (as we will throughout this paper, to avoid degenerate situations), then the measure of such a set is exactly $2\psi (q)$. Thus, the ‘only finitely many’ part of Khintchine's theorem is a straightforward application of the convergence part of the Borel–Cantelli lemma. The ‘infinitely many’ part of the theorem, however, is much more delicate since the divergence part of the Borel–Cantelli lemma requires some form of stochastic independence. The purpose of the monotonicity condition in the statement of Khintchine's theorem is to guarantee this stochastic independence property of the set system.
Duffin and Schaeffer [Reference Duffin and SchaefferDS41] showed that Khintchine's theorem generally fails without the monotonicity condition. More precisely, they constructed a function $\psi$ which is supported on a set of very smooth integers (having a large number of small prime factors), such that $\sum _{q=1}^{\infty } \psi (q)$ diverges, but for almost all $\alpha$ there are only finitely many solutions to (2). From a probabilistic perspective, the counterexample of Duffin and Schaeffer exploits the lack of stochastic independence in the set system, by constructing a special configuration where the overlaps between different sets of the system are too large; the crucial point here is that a fraction $p/q$ can have many different representations as a quotient of integers (as long as non-reduced representations are allowed), and thus may appear in many different elements of the set system.
Duffin and Schaeffer suggested that this lack of independence could be overcome by switching to the coprime setting. More precisely, the Duffin–Schaeffer conjecture asserted that for almost all $\alpha$ there are infinitely many coprime solutions $(p,q)$ to (2) if and only if the series $\sum _{q=1}^\infty \varphi (q) \psi (q)/q$ diverges, where $\varphi$ denotes the Euler totient function. Let
Then, writing $\lambda$ for the Lebesgue measure and again assuming that $\psi (q) \leq 1/2$ for all $q$, we have
Thus, the ‘only finitely many’ part of the Duffin–Schaeffer conjecture is again a direct consequence of the convergence part of the Borel–Cantelli lemma. However, the divergence part of the Duffin–Schaeffer conjecture has resisted resolution for many decades. After important contributions of Gallagher [Reference GallagherGal61], Erdős [Reference ErdősErd70], Vaaler [Reference VaalerVaa78], Pollington and Vaughan [Reference Pollington and VaughanPV90], and Beresnevich and Velani [Reference Beresnevich and VelaniBV06], the Duffin–Schaeffer conjecture was finally solved in full generality by Koukoulopoulos and Maynard [Reference Koukoulopoulos and MaynardKM20] in 2020. Their argument relies on an ingenious construction of what they call ‘GCD graphs’. This allows them to implement a step-by-step quality increment strategy until they finally arrive at a situation where they can completely control the divisor structure which is at the heart of the problem. The final, number-theoretic, input is an ‘anatomy of integers’ statement that quantifies the observation that there are only few integers that have many small prime factors.
In the present paper, we prove a quantitative version of the Koukoulopoulos–Maynard theorem. Their result states that there are infinitely many coprime solutions to (2) for almost all $\alpha$ if the sum of measures diverges. We show that for almost all $\alpha$ the number of solutions in fact grows proportionally to the sum of measures.
Theorem 1 Let $\psi :~\mathbb {N} \to [0,1/2]$ be a function such that $\sum _{q=1}^{\infty } {\varphi (q) \psi (q)}/{q}=\infty$. Write $S(Q)=S(Q,\alpha )$ for the number of coprime solutions $(p,q)$ to the inequality
and let
Let $C>0$ be arbitrary. Then, for almost all $\alpha$,
It is not clear to what extent the error term in the theorem can be improved. It seems to us that any result which contains a power saving, that is, has a multiplicative error of order $(1 + O(\Psi (Q)^{-\varepsilon }))$ for some $\varepsilon >0$, would require a substantial improvement of the argument in the present paper. By analogy with other results from metric number theory it is reasonable to assume that Theorem 1 actually holds with an error term $(1 + O(\Psi (Q)^{-1/2+\varepsilon }))$ for any $\varepsilon >0$, and probably even $(1 + O(\Psi (Q)^{-1/2} (\log \Psi (Q))^c))$ for some appropriate $c$. We note in passing that very precise metric estimates for the asymptotic order of $S(Q)$ are known when an extra monotonicity assumption is imposed upon $\psi$, in the spirit of Khintchine's original result; see, for example, Chapter 3 of [Reference PhilippPhi71] and Chapter 4 of [Reference HarmanHar98]. However, from a technical perspective, the problem is of a very different nature when this extra monotonicity assumption is made. The results for the monotonic case imply as a corollary that Theorem 1 above cannot hold in general with a multiplicative error of order $(1 + O(\Psi (Q)^{-1/2}))$ or less.
The key problem in the metric theory of approximations by reduced fractions is to control the measure of the overlaps $\mathcal {A}_q \cap \mathcal {A}_r$ in some averaged sense. Pairwise independence $\lambda (\mathcal {A}_q \cap \mathcal {A}_r) = \lambda (\mathcal {A}_q) \lambda (\mathcal {A}_r)$ would allow a direct application of the second Borel–Cantelli lemma, but it turns out that $\lambda (\mathcal {A}_q \cap \mathcal {A}_r)$ can exceed $\lambda (\mathcal {A}_q) \lambda (\mathcal {A}_r)$ by a factor as large as $\log \log (qr)$ for some configurations of $q,r,\psi$. Such an exceedingly large overlap can happen if there are many small prime factors dividing $q$ but not dividing $r$, or vice versa, and if simultaneously the greatest common divisor of $q$ and $r$ lies in a certain critical range (which is determined by the values of $\psi (q)$ and $\psi (r)$). The crucial point then is to show that such large extra factors appear only for a small number of pairs $q,r$. Consider the quotient
Without imposing an absolute lower bound on $\Psi (Q)$, this quotient can be arbitrarily large. The main breakthrough of Koukoulopoulos and Maynard was to prove that
This property is called quasi-independence on average, and is sufficient for an application of the second Borel–Cantelli lemma (in the Erdős–Rényi formulation of the lemma); cf. [Reference Beresnevich and VelaniBV23]. In the present paper, we show that even more is true: we have
Thus, the set system $(\mathcal {A}_q)_{q \geq 1}$ moves towards pairwise independence on average as the total mass of the set system (the sum of measures of the approximation sets) tends towards infinity. Since we consider this fact, which is the key ingredient in our proof of Theorem 1, to be very interesting in its own right, we state it below as a separate theorem.
Theorem 2 Let $\psi :~\mathbb {N} \to [0,1/2]$ be a function. Let the sets $\mathcal {A}_q$, $q=1,2,\dots$, be defined as in (3), and let $\Psi (Q)$ be defined as in (4). Let $C>0$ be arbitrary. For any $Q \in \mathbb {N}$ such that $\Psi (Q) \ge 3$, we have
with an implied constant depending only on $C$.
The rest of this paper is organized as follows. In § 2 we show how Theorem 2 implies Theorem 1. The following seven sections are concerned with the proof of Theorem 2. Section 3 contains an estimate of the measure of the overlap $\mathcal {A}_q \cap \mathcal {A}_r$ for given $q$ and $r$. This estimate exploits information on the divisor structure of $q$ and $r$ in order to bound the difference between $\lambda \big (\mathcal {A}_q \cap \mathcal {A}_r\big )$ and $\lambda (\mathcal {A}_q) \lambda (\mathcal {A}_r)$, thus addressing the issue of the ‘stochastic dependence’ between $\mathcal {A}_q$ and $\mathcal {A}_r$. In § 4 we reduce Theorem 2 to two second-moment bounds. Section 5 contains a brief introduction to the ‘GCD graph’ machinery developed by Koukoulopoulos and Maynard [Reference Koukoulopoulos and MaynardKM20]. In § 6 we show how the second-moment bounds follow from the existence of a ‘good’ GCD subgraph. In the final two sections, we establish the existence of such a good GCD subgraph, using a modification of the iteration procedure of [Reference Koukoulopoulos and MaynardKM20]. Our argument requires a careful balancing of the ‘quality gain’ against the potential ‘density loss’ coming from this iterative procedure, in such a way that information on the ‘anatomy of integers’ can be exploited beyond a certain threshold. This threshold is determined by the order of the error terms coming from sieve theory (which translate into the error terms of the overlap estimate in § 3).
For the rest of the paper, $\psi : \mathbb {N} \to [0,1/2]$ is an arbitrary function, $\mathcal {A}_q$, $q \in \mathbb {N}$, is as in (3), and $\Psi (Q)$, $Q \in \mathbb {N}$, is as in (4).
2. Proof of Theorem 1
Let $C>4$ be fixed, and assume that Theorem 2 holds. Let $\mathbb {1}_A$ denote the indicator function of a set $A$. Formulated in probabilistic language, Theorem 2 controls the variance of the random variables $\mathbb {1}_{\mathcal {A}_1}, \dots, \mathbb {1}_{\mathcal {A}_Q}$, and we obtain
Define
and let
By Chebyshev's inequality and (5), we have
Since we assumed that $C>4$, we have $\sum _{k=1}^\infty \lambda (\mathcal {B}_k) < \infty$, and the Borel–Cantelli lemma implies that almost all $\alpha$ are contained in at most finitely many sets $\mathcal {B}_k$. Thus, for almost all $\alpha$,
holds for all $k \geq k_0(\alpha )$. Clearly, for any $Q \geq 3$ there exists a $k$ such that $Q_k \leq Q < Q_{k+1}$, which also implies that
Since $\psi \leq 1/2$ by assumption, we have $\Psi (Q_k) \in \big [e^{k^{1/\sqrt {C}}},e^{k^{1/\sqrt {C}}} + 1/2\big ]$, and so
From the previous three formulas and the triangle inequality, we deduce that for almost all $\alpha$ there exists a $Q_0 = Q_0(\alpha )$ such that for all $Q \geq Q_0$,
As $C$ can be chosen arbitrarily large, this proves Theorem 1.
3. The overlap estimate
In this section we develop a new estimate for the measure of the overlaps $\mathcal {A}_q \cap \mathcal {A}_r$. For the rest of the paper, let
The standard bound for the measure of $\mathcal {A}_q \cap \mathcal {A}_r$ is due to Pollington and Vaughan [Reference Pollington and VaughanPV90]: for any $q \neq r$,
with an absolute implied constant. Clearly, because of the presence of the implied constant this standard bound cannot be sufficient to deduce Theorem 2. Below we will use a more refined argument from sieve theory which allows us to isolate a main term, and prove an upper bound of the form
with an error term that becomes small if there are not too many small primes which divide $q$ and $r$ with different multiplicities (see Lemma 5 below for details).
The following lemma is called the fundamental lemma of sieve theory. We state it in the formulation of [Reference KoukoulopoulosKou19, Theorem 18.11].
Lemma 3 (Fundamental lemma of sieve theory)
Let $(a_n)_{n \geq 1}$ be non-negative reals, such that $\sum _{n=1}^\infty a_n < \infty$. Let $\mathcal {P}$ be a finite set of primes, and write $P = \prod _{p \in \mathcal {P}} p$. Set $y = \max \mathcal {P}$ and $A_d = \sum _{n \equiv 0 \mod d} a_n$. Assume that there exist a multiplicative function $g$ such that $0 \le g(p) < p$ for all $p \in \mathcal {P}$, a real number $x$, and positive constants $\kappa,C$ such that
and
Then, uniformly in $u \geq 1$, we have
We will also need an estimate for the order of the partial sums of a particular multiplicative function.
Lemma 4 Let $\mathcal {P}$ be a set of odd primes, and define
Then, for any $x \ge 2$,
where the implied constant is absolute.
Proof. Define $g(n) = \sum _{d \mid n} \mu (d) f(n/d)$, where $\mu$ is the Möbius function. Note that $f$ and $g$ are multiplicative functions. For $p \in \mathcal {P}$,
whereas for $p \not \in \mathcal {P}$, we have $g(p^m)=0$ for all $m \geq 1$. By the definition of $g$,
Here
and it remains to estimate the error term in (8).
Note that $p^m g(p^m) \le p/(p-2) \le 3$ for all prime powers $p^m$. Hence, by a general upper bound for the order of partial sums of multiplicative functions (see, for example, [Reference KoukoulopoulosKou19, Theorem 14.2]), the partial sums of $dg(d)$ satisfy
In particular, $\sum _{x \le d \le 2x} g(d)/d \ll x^{-2} \sum _{d \le 2x}\,d g(d) \ll x^{-1}$, and the first error term in (8) is $x\sum _{d>x} g(d)/d \ll 1$. Further, $\sum _{x \le d \le 2x} g(d) \le x^{-1} \sum _{d \le 2x} d g(d) \ll 1$, and the second error term in (8) is $\sum _{d \le x} g(d) \ll \log x$, as claimed. All implied constants are absolute.
Lemma 5 (Overlap estimate)
Let $\psi :~\mathbb {N} \to [0,1/2]$ be a function and $\mathcal {A}_q$, $q=1,2,\dots,$ be defined as in (3). For any positive integers $q \neq r$ and any reals $u \ge 1$ and $T \ge 2$, we have
with an absolute implied constant, where $D=D(q,r)$ is as in (6). In particular, for any $C \ge 1$,
with an implied constant depending only on $C$, where
Proof. We follow the general strategy of Pollington and Vaughan in [Reference Pollington and VaughanPV90, § 3]. If $D<1/2$, then $\psi (q)/q+\psi (r)/r<1/\mathrm {lcm}(q,r)$, hence $\mathcal {A}_q \cap \mathcal {A}_r = \emptyset$, and the claim trivially holds. We may thus assume throughout the rest of the proof that $D \ge 1/2$.
We set
and define the piecewise linear function
We can express the measure of $\mathcal {A}_q \cap \mathcal {A}_r$ as
For any prime $p$, let $u = u(p,q)$ and $v = v(p,r)$ be defined by $q = \prod _p p^u$ and $r = \prod _p p^v$, and let
If $q,r \geq 2$, we have
which follows from the assumption that $\psi (q), \psi (r) \leq \frac {1}{2}$.
Following the argument on p. 195 of [Reference Pollington and VaughanPV90] (an application of the Chinese remainder theorem, together with a simple counting argument) thus leads for $q,r \geq 2$ to
and the formula above follows immediately also in the case $q = 1$ or $r = 1$.
Assume first that $l$ is odd. By rewriting the right-hand side of the previous formula we see that $\lambda (\mathcal {A}_q \cap \mathcal {A}_r)$ equals
We now find an upper bound for this expression. First, we replace the condition $\text{gcd} (c,n)=1$ by $\text{gcd} (c,n^*)=1$, where $n^*$ denotes the $T$-smooth part of $n$ (i.e. $n^*=\prod _{p \le T,~u \neq v}p^{\max (u,v)}$). Next, we fix a large positive integer $K$, and divide $[\Delta -\delta, \Delta +\delta ]$ into $K$ subintervals of equal length. Observe that the piecewise constant function
satisfies $w(y) \leq w^*(y)$ for all $y \ge 0$. Therefore $\lambda (\mathcal {A}_q \cap \mathcal {A}_r)$ is bounded above by
Here
Now fix $k \in \{0,\dots,K-1\}$, and set
and $a_c = 0$ for $c > \ln (\Delta + \delta - 2k \delta /K)$. Note that for $d \mid n^*$ we have $a_{dc} = a_c$ as long as $dc \leq \ln (\Delta + \delta - 2k \delta /K)$. By Lemma 4, for any $d \mid n^*$ we thus have
We have
and by an application of Lemma 3 (with $\mathcal {P}$ the set of prime divisors of $n^*$, $\max \mathcal {P} \le T$ and $|r_d| \ll \log (D+2)$) this is
Since
formula (11) thus yields that $\lambda (\mathcal {A}_q \cap \mathcal {A}_r)$ is bounded above by
Letting $K \to \infty$, and using $D = \Delta l n$ and $\varphi (n^*)/n^* \ge \prod _{p \le T} (1-1/p) \gg 1/\log T$, we obtain
Finally, observe that
This establishes (9) for odd $l$.
Assume next that $l$ is even. Then
and similarly to before we obtain that $\lambda (\mathcal {A}_q \cap \mathcal {A}_r)$ equals
The rest of the proof for odd $l$ applies mutatis mutandis to even $l$. This completes the proof of (9).
Given $C \ge 1$, let us choose
One can readily check that $u^{-u/2} \le (\log (D+100))^{-C}$. Using $4/\log \log \log 100 <10$, we also have $T^u \le (D+100)^{1/2} (\log (D+100))^{10C}$, hence
is negligible compared to $(\log (D+2))^{-C}$.
4. Second-moment bounds
In this section we show how two second-moment bounds, stated as Propositions 6 and 7 below, together with the overlap estimate in Lemma 5, imply Theorem 2. These propositions should be compared to the second-moment bound of Koukoulopoulos and Maynard [Reference Koukoulopoulos and MaynardKM20, Proposition 5.4], which, together with the overlap estimate of Pollington and Vaughan in (7), implies the Duffin–Schaeffer conjecture.
Let $D(q,r)$ be as in (6). For the sake of readability, let
and
Proposition 6 For any $Q \in \mathbb {N}$ and any real $t \ge 1$, the set
satisfies
with an absolute implied constant.
Proposition 7 Let $C \ge 1$ be arbitrary. For any $Q \in \mathbb {N}$ and any real $t \ge 1$, the set
satisfies
with an implied constant depending only on $C$.
We now present the proof of Theorem 2, assuming Propositions 6 and 7.
Proof of Theorem 2 Let $Q \in \mathbb {N}$ be such that $\Psi (Q) \ge 3$. We may assume that $C>0$ is greater than any prescribed absolute constant. Let $K>1$ be a large constant in terms of $C$, to be chosen.
We partition the index set $[1,Q]^2$ into the sets
The contribution of $\mathcal {E}^{1}$ is clearly negligible:
Now we consider $\mathcal {E}^{2}$. For any $(q,r) \in \mathcal {E}^{2}$, the condition $L_{F(\Psi (Q))}(q,r) \le 1$, together with Mertens’s theorem, ensures that
In the last step we used the rough estimate $F(x) \le x$ for all $x \ge 3$. The overlap estimate (Lemma 5) thus shows that for any $(q,r) \in \mathcal {E}^{2}$,
Applying Proposition 6 with $t=(\log \Psi (Q))^C$ leads to
Next we consider $\mathcal {E}^{3}$. For any $(q,r) \in \mathcal {E}^{3}$, let $j(q,r)$ be the maximal integer $j$ such that $L_{F(\exp \exp (j))}(q,r) >1$; note that, by construction, $j(q,r) \ge \lfloor \log \log \Psi (Q) \rfloor$. Let $(q,r) \in \mathcal {E}^{3}$ with $j(q,r)=j$. By definition, $L_{F(\exp \exp (j+1))}(q,r) \le 1$, hence Mertens’s theorem implies
Thus, the overlap estimate gives
and applying Proposition 7 with $t=\exp \exp (j)$ leads to
In the last step we used the fact that $F(x)$ increases faster than any power of $\log x$.
Now we consider $\mathcal {E}^{4}$. For any $(q,r) \in \mathcal {E}^{4}$,
The overlap estimate thus gives
hence
Finally, we consider $\mathcal {E}^{5}$. Set $\kappa = \frac {1}{100} (e/C)^C$. For any $(q,r) \in \mathcal {E}^{5}$, let $i(q,r)$ be the maximal integer $i$ such that
Note that
therefore
Here
Let $(q,r) \in \mathcal {E}^{5}$ such that $i(q,r)=i$. By definition,
hence Mertens’s theorem shows that
The overlap estimate thus gives
Another application of Mertens’s theorem, this time with the error term $O((\log x)^{-C})$ due to Landau [Reference LandauLan09, p. 201], leads to
In the last step we used the facts that $h(x):=\log \log F (\kappa \exp \exp (x))$ satisfies $h'(x) \ll 1$, and
Choosing $K>1$ large enough in terms of $C$, it follows that
hence $D(q,r) \le \kappa \exp \exp ({i}/{(\log \Psi (Q))^C})$. Applying Proposition 7 with $t=\kappa \exp \exp ({i}/{(\log \Psi (Q))^C})$ thus leads to
and by summing over all possible values of $i$,
Combining formulas (14)–(18) shows that
as claimed.
5. GCD graphs: notation and basic properties
The proof of the Duffin–Schaeffer conjecture given by Koukoulopoulos and Maynard in [Reference Koukoulopoulos and MaynardKM20] is based on a concept called ‘GCD graphs’, which they introduced in that paper. Very roughly speaking, a GCD graph encodes information on the divisor structure of a set of integers. To each GCD graph a ‘quality’ can be assigned, and the key argument in [Reference Koukoulopoulos and MaynardKM20] is that one can iteratively pass to subgraphs of the original GCD graph in such a way that in each step the quality increases and/or the divisor structure becomes more regular. At the end of this procedure, one has a graph that has either particularly high quality or a very regular divisor structure. High quality directly implies that the density of the edge set, essentially controlling the influence of the bad pairs $(q,r)$ in such sets as $\mathcal {E}^{1},\ldots,\mathcal {E}^{5}$ of the previous section, is small, leading to the desired result. If one cannot achieve high quality, then one obtains a GCD subgraph that has perfect control of the divisor structure of the underlying set of integers; in this case, results on the ‘anatomy of integers’ can be used to show that the problematic factor
in the overlap estimate can only be large for a very small proportion of pairs $(q,r)$, again leading to the desired result.
We do not give a fully detailed presentation of the notion of a GCD graph here, and refer the reader to § 6 of [Reference Koukoulopoulos and MaynardKM20] instead. However, for the convenience of the reader, we will recall the basic definitions and some of the basic properties of GCD graphs.
A GCD graph is a septuple $G = (\mu,\mathcal {V},\mathcal {W},\mathcal {E},\mathcal {P},f,g)$, for which the following properties hold.
(a) $\mu$ is a measure on $\mathbb {N}$ for which $\mu (n)<\infty$ for all $n$. This measure is extended to $\mathbb {N}^2$ by defining
\[ \mu(\mathcal{N}) = \sum_{(n_1,n_2) \in \mathcal{N}} \mu(n_1) \mu(n_2), \quad \mathcal{N} \subseteq \mathbb{N}^2. \](b) The vertex sets $\mathcal {V}$ and $\mathcal {W}$ are finite sets of positive integers.
(c) The edge set $\mathcal {E}$ is a subset of $\mathcal {V} \times \mathcal {W}$.
(d) $\mathcal {P}$ is a set of primes.
(e) $f$ and $g$ are functions from $\mathcal {P}$ to $\mathbb {Z}_{\geq 0}$ such that for all $p \in \mathcal {P}$:
(i) $p^{f(p)} \mid v$ for all $v \in \mathcal {V}$ and $p^{g(p)} \mid w$ for all $w \in \mathcal {W}$;
(ii) if $(v,w) \in \mathcal {E}$, then $p^{\min (f(p),g(p))} \parallel \text{gcd} (v,w)$;
(iii) if $f(p) \neq g(p)$, then $p^{f(p)} \parallel v$ for all $v \in \mathcal {V}$ and $p^{g(p)} \parallel w$ for all $w \in \mathcal {W}$.
For two GCD graphs $G = (\mu,\mathcal {V},\mathcal {W},\mathcal {E},\mathcal {P},f,g)$ and $G' = (\mu ',\mathcal {V}',\mathcal {W}',\mathcal {E}',\mathcal {P}',f',g')$ we say that $G'$ is a GCD subgraph of $G$, and write $G' \preceq G$, if
and if $f$ (respectively, $g$) coincides with $f'$ (respectively, $g'$) on $\mathcal {P}$.
For given $\mathcal {V}$ and $k \geq 0$ we define $\mathcal {V}_{p^k} = \{v \in \mathcal {V}:~p^k \parallel v\}$. We write $\mathcal {E}_{p^k,p^\ell } = \mathcal {E} \cap (\mathcal {V}_{p^k} \times \mathcal {W}_{p^\ell })$. It turns out that for $p \not \in \mathcal {P}$, the GCD graph
is a GCD subgraph of $G$ (where $f_{p^k}$ and $g_{p^\ell }$ are defined in such a way that they respectively coincide with $f$ and $g$ on $\mathcal {P}$, and $f_{p^k}(p)=k$ and $g_{p^{\ell }}(p)=\ell$).
For a GCD graph $G = (\mu,\mathcal {V},\mathcal {W},\mathcal {E},\mathcal {P},f,g)$ we make the following definitions.
(i) The edge density
\[ \delta(G) = \frac{\mu(\mathcal{E})}{\mu(\mathcal{V}) \mu(\mathcal{W})}, \]provided that $\mu (\mathcal {V}) \mu (\mathcal {W}) \neq 0$. If $\mu (\mathcal {V}) \mu (\mathcal {W}) = 0$, we define $\delta (G)$ to be $0$.(ii) The neighborhood sets
\[ \Gamma_G(v) = \left\{ w \in \mathcal{W}:~(v,w) \in \mathcal{E} \right\}, \quad v \in \mathcal{V}, \]and\[ \Gamma_G(w) = \{v \in \mathcal{V}:~(v,w) \in \mathcal{E} \}, \quad w \in \mathcal{W}. \](iii) The set $\mathcal {R}(G)$ of primes that have not (yet) been accounted for in the GCD graph:
\[ \mathcal{R}(G) = \{ p \not\in \mathcal{P}:~\exists (v,w) \in \mathcal{E} \text{ such that } p \mid \text{gcd}(v,w) \}. \](iv) The quality
\[ q(G) = \delta (G)^{10} \mu(\mathcal{V}) \mu(\mathcal{W}) \prod_{p \in \mathcal{P}} \frac{p^{|f(p)-g(p)|}}{\big(1 - \mathbb{1}_{f(p)=g(p) \geq 1}/p\big)^2 \big(1 - p^{-31/30} \big)^{10}}. \]
This notion of quality of a GCD graph is an ad-hoc definition, which turns out to serve the required purpose for the argument of [Reference Koukoulopoulos and MaynardKM20]. We refer to [Reference Koukoulopoulos and MaynardKM20] for the heuristic reasoning which led to this particular definition. It is possible that a modified notion of quality would be better suited to the argument in the present paper. However, we prefer to stick to the original definition of quality from [Reference Koukoulopoulos and MaynardKM20], since this allows us to directly use a large part of the iteration procedure from [Reference Koukoulopoulos and MaynardKM20] without the need to adapt it to a modified framework.
We also introduce
This should be compared to the sets $\mathcal {R}^{\sharp }(G)$ and $\mathcal {R}^{\flat }(G)$ used in [Reference Koukoulopoulos and MaynardKM20], the latter of which is defined analogous to our $\mathcal {R}^{\unicode{x266B} }(G)$ but with $1-10^{40}/p$ instead of $1-1/\sqrt {p}$. Finally, we define
Among the basic properties of GCD graphs are the facts that $G_1 \preceq G_2$ and $G_2 \preceq G_3$ together imply $G_1 \preceq G_3$ (transitivity), and that $G_1 \preceq G_2$ implies $\mathcal {R}(G_1) \subseteq \mathcal {R}(G_2)$. However, in general $G_1 \preceq G_2$ does not imply that $\mathcal {R}^{\unicode{x266B} }(G_1) \subseteq \mathcal {R}^{\unicode{x266B} }(G_2)$.
6. Good GCD subgraphs
In this section we state two results on the existence of a ‘good’ GCD subgraph of an arbitrary GCD graph with trivial multiplicative data (i.e. $\mathcal {P}=\emptyset$) in the form of Propositions 8 and 9 below; these should be compared to [Reference Koukoulopoulos and MaynardKM20, Proposition 7.1]. We then show how Proposition 6 (respectively, 7) follows from Proposition 8 (respectively, 9).
Proposition 8 Let $G=(\mu,\mathcal {V},\mathcal {W},\mathcal {E},\emptyset,f_\emptyset,g_\emptyset )$ be a GCD graph with trivial set of primes and edge density $\delta (G)>0$. Then there exists a GCD subgraph $G' = (\mu,\mathcal {V}',\mathcal {W}',\mathcal {E}',\mathcal {P}',f',g')$ of $G$ such that the following assertions hold.
(a) $\mathcal {R} (G') = \emptyset$.
(b) For all $v \in \mathcal {V}'$, we have $\mu (\Gamma _{G'}(v)) \geq ({9 \delta (G')}/{10}) \mu (\mathcal {W}')$.
(c) For all $w \in \mathcal {W}'$, we have $\mu (\Gamma _{G'}(w)) \geq ({9 \delta (G')}/{10}) \mu (\mathcal {V}')$.
(d) $q(G') \gg q(G)$ with an absolute implied constant.
Proposition 9 Let $G= (\mu,\mathcal {V},\mathcal {W},\mathcal {E},\emptyset,f_{\emptyset },g_{\emptyset })$ be a GCD graph with trivial set of primes, and let $C \ge 1$. Assume that
with some $t \ge 1$ sufficiently large in terms of $C$. Then there exists a GCD subgraph $G' = (\mu,\mathcal {V}',\mathcal {W}',\mathcal {E}',\mathcal {P}',f',g')$ of $G$ such that the following assertions hold.
(a) $\mathcal {R}(G') = \emptyset$.
(b) For all $v \in \mathcal {V}'$, we have $\mu (\Gamma _{G'}(v)) \geq ({9\delta (G')}/{10})\mu (\mathcal {W}')$.
(c) For all $w \in \mathcal {W}'$, we have $\mu (\Gamma _{G'}(w)) \geq ({9\delta (G')}/{10})\mu (\mathcal {V}')$.
(d) One of the following assertions holds.
(i) $q(G') \gg t^3 q(G)$ with an implied constant depending only on $C$.
(ii) $q(G') \gg q(G)$ with an implied constant depending only on $C$, and for any $(v,w) \in \mathcal {E}'$, if we write $v = v'\prod _{p \in \mathcal {P}'}p^{f'(p)}$ and $w = w'\prod _{p \in \mathcal {P}'}p^{g'(p)}$, then $L_{F(t)}(v',w') \geq {1}/{2F(t)^{1/4}}$.
Proof of Proposition 6 Let $\psi : \mathbb {N} \to [0,1/2]$ be a function, let $Q \in \mathbb {N}$ and let $t \ge 1$. Consider the GCD graph $G=(\mu, \mathcal {V}, \mathcal {W}, \mathcal {E}, \emptyset, f_{\emptyset }, g_{\emptyset })$ with the measure $\mu (v)= {\varphi (v)\psi (v)}/{v}$, the vertex sets $\mathcal {V}=\mathcal {W}=[1,Q]^2$, and the edge set
Note that $\mu (\mathcal {V})=\mu (\mathcal {W})=\Psi (Q)/2$. In the language of GCD graphs, the claim of Proposition 6 can equivalently be written as $\mu (\mathcal {E}) \ll \Psi (Q)^2 /t^{1/5}$, that is, $\delta (G) \ll t^{-1/5}$.
By Proposition 8, there exists a GCD subgraph $G'=(\mu, \mathcal {V}', \mathcal {W}', \mathcal {E}', \mathcal {P}', f', g')$ of $G$ having properties (a)–(d) of the proposition. Following the steps in [Reference Koukoulopoulos and MaynardKM20, Proof of Proposition 6.3 assuming Proposition 7.1], from properties (a)–(c) we deduce $q(G') \ll \Psi (Q)^2/t^2$. Since $G$ has trivial set of primes, by the definition of quality and property (d),
Therefore $\delta (G) \ll t^{-1/5}$, as claimed.
For the proof of Proposition 7 we will need the following fact about the ‘anatomy of integers’; compare this to [Reference Koukoulopoulos and MaynardKM20, Lemma 7.3], which is a similar result for a fixed value of $c$ on the right-hand side, rather than allowing $c \to 0$ as, in view of Lemma 5 above, will be necessary for our application.
Lemma 10 For any real $x,t \geq 1$ and $c>0$,
with an absolute implied constant.
Proof. An application of the Markov inequality gives
Now let $f$ be the multiplicative function defined at prime powers as $f(p^m) = e^{100 t/p}$ if $p \ge t$, and $f(p^m)=1$ if $p< t$. Note that $f(p^m) \le e^{100}$ at all prime powers. Hence, by [Reference KoukoulopoulosKou19, Theorem 14.2] the partial sums of $f$ satisfy
where the implied constants are absolute.
Proof of Proposition 7 Let $\psi : \mathbb {N} \to [0,1/2]$ be a function, let $Q \in \mathbb {N}$ and let $t \ge 1$. Consider the GCD graph $G=(\mu, \mathcal {V}, \mathcal {W}, \mathcal {E}, \emptyset, f_{\emptyset }, g_{\emptyset })$ with the measure $\mu (v)= {\varphi (v)\psi (v)}/{v}$, the vertex sets $\mathcal {V}=\mathcal {W}=[1,Q]^2$, and the edge set
Note that $\mu (\mathcal {V})=\mu (\mathcal {W})=\Psi (Q)/2$. In the language of GCD graphs, the claim can equivalently be written as $\mu (\mathcal {E}) \ll \Psi (Q)^2/F(t)^{1/2}$, that is, $\delta (G) \ll F(t)^{-1/2}$. We may assume in the sequel that $\delta (G) \ge F(t)^{-1/2}$ and that $t$ and $F(t)$ are large enough in terms of $C$, since otherwise the claim trivially holds.
By Proposition 9, there exists a GCD subgraph $G'=(\mu, \mathcal {V}', \mathcal {W}', \mathcal {E}', \mathcal {P}', f', g')$ of $G$ having properties (a)–(d) of the proposition. Let $a=\prod _{p \in \mathcal {P}'}p^{f'(p)}$ and $b=\prod _{p \in \mathcal {P}'}p^{g'(p)}$. By the definition of a GCD graph, $a \mid v$ for all $v \in \mathcal {V}'$ and $b \mid w$ for all $w \in \mathcal {W}'$. Since $\mathcal {R}(G')=\emptyset$, we also have $\text{gcd} (v,w)=\text{gcd} (a,b)$ for all $(v,w) \in \mathcal {E}'$. Following the steps in [Reference Koukoulopoulos and MaynardKM20, Proof of Proposition 6.3 assuming Proposition 7.1], we deduce from properties (a)–(c) of Proposition 9 that
where $w_0=\max \mathcal {W}'$ and $v_{\max }(w)=\max \{ v \in \mathcal {V}' \, : \, (v,w) \in \mathcal {E}' \}$.
Assume first that $G'$ satisfies property (d)(i) in Proposition 9, that is, $q(G') \gg t^3 q(G)$. Since $G$ has trivial set of primes, by the definition of quality and (19) we obtain
Therefore $\delta (G) \ll t^{-1/10} \ll F(t)^{-1/2}$, as claimed.
Assume next that $G'$ satisfies property d)(ii) in Proposition 9, that is, $q(G') \gg q(G)$, and for any $(v,w) \in \mathcal {E}'$, if we write $v = av'$ and $w = bw'$, then $L_{F(t)}(v',w') \geq {1}/{2F(t)^{1/4}}$. Note that here $\text{gcd} (v',w')=1$. As in the first case, we have
For the sake of readability, define $R_s(n)=\sum _{p \mid n,~p \ge s} 1/p$ for any $n \in \mathbb {N}$ and $s \ge 1$. Then $1/(2F(t)^{1/4}) \le L_{F(t)}(v',w') = R_{F(t)}(v') + R_{F(t)}(w')$ implies that $R_{F(t)}(v') \ge 1/(4F(t)^{1/4})$ or $R_{F(t)}(w') \ge 1/(4F(t)^{1/4})$. The previous formula thus shows that $\delta (G)^{10} \ll S_1+S_2$ with
An application of Lemma 10 with $x=v_{\max }(bw')/a$ and $c=1/(4F(t)^{1/4})$ yields
Another application of Lemma 10 with $x=w_0/b$ and $c=1/(4F(t)^{1/4})$ similarly yields
Therefore $\delta (G) \ll (S_1+S_2)^{1/10} \ll t^{-10} \ll F(t)^{-10}$, as claimed.
7. Four technical lemmas
In this section we state four lemmas on GCD subgraphs, and show that Propositions 8 and 9 follow from these four lemmas. The key technical improvement in comparison with the iteration argument of [Reference Koukoulopoulos and MaynardKM20] is in Lemma 11 below, which more carefully balances the quality gain versus the potential density loss of the iteration procedure. The ratio of quality gain to density loss which is necessary for the proof of Theorem 2 is determined by the range of admissible parameters $u$ and $A$ in Lemma 5, and what Lemma 11 provides is just enough for a successful completion of the proof. Lemma 12, which should be compared to [Reference Koukoulopoulos and MaynardKM20, Lemma 8.4], and Lemma 13 follow from results in [Reference Koukoulopoulos and MaynardKM20] in a more or less straightforward way. Finally, for the convenience of the reader, we cite [Reference Koukoulopoulos and MaynardKM20, Lemma 8.5] in the form of Lemma 14.
Lemma 11 Let $G = (\mu,\mathcal {V},\mathcal {W},\mathcal {E},\emptyset,f_{\emptyset },g_{\emptyset })$ be a GCD graph with trivial set of primes and $\delta (G) > 0$. Let $C \ge 1$, and let $t \ge 1$ be sufficiently large in terms of $C$. Then there exists a GCD subgraph $G' \preceq G$ such that $\mathcal {R}^{\unicode{x266B} }(G') = \emptyset$, and at least one of the following two statements hold.
(a) $q(G') \ge t^3 q(G)$.
(b) $q(G') \gg q(G)$, $ {\delta (G')}/{\delta (G)} \ge {1}/{F(t)^{1/4}} ,\quad \lvert \mathcal {P}_{\mathrm {diff}}(G')\rvert \le \log t$ with an implied constant depending only on $C$.
Lemma 12 Let $G= (\mu,\mathcal {V},\mathcal {W},\mathcal {E},\mathcal {P},f,g)$ be a GCD graph. Assume that
with a sufficiently large $s \ge 1$. Then there exists a GCD subgraph $G' = (\mu,\mathcal {V},\mathcal {W},\mathcal {E}',\mathcal {P},f,g)$ of $G$ such that
Lemma 13 Let $G=(\mu,\mathcal {V},\mathcal {W},\mathcal {E},\mathcal {P},f,g)$ be a GCD graph with $\delta (G) > 0$. Then there exists a GCD subgraph $G'=(\mu, \mathcal {V}', \mathcal {W}', \mathcal {E}', \mathcal {P}', f', g')$ of $G$ such that
with an absolute implied constant.
Lemma 14 [Reference Koukoulopoulos and MaynardKM20, Lemma 8.5]
Let $G= (\mu,\mathcal {V},\mathcal {W},\mathcal {E},\mathcal {P},f,g)$ be a GCD graph with $\delta (G) > 0$. Then there exists a GCD subgraph $G' = (\mu,\mathcal {V},\mathcal {W},\mathcal {E}',\mathcal {P},f,g)$ of $G$ such that the following assertions hold.
(a) $q(G') \geq q(G)$.
(b) $\delta (G') \geq \delta (G)$.
(c) For all $v \in \mathcal {V}'$ and $w \in \mathcal {W}'$, we have
\[ \mu(\Gamma_{G'}(v)) \geq \frac{9\delta(G')}{10}\mu(\mathcal{W}') \quad \textrm{and} \quad \mu(\Gamma_{G'}(w)) \geq \frac{9\delta(G')}{10}\mu(\mathcal{V}'). \]
We now show how Lemmas 11–14 imply Propositions 8 and 9.
Proof of Proposition 8 Apply Lemma 13 to $G$ to obtain a GCD subgraph $G^{(1)} \preceq G$ with $\mathcal {R}(G^{(1)})= \emptyset$ and $q(G^{(1)}) \gg q(G)$, satisfying properties (a) and (d). Next, apply Lemma 14 to $G^{(1)}$ to obtain a GCD subgraph $G^{(2)} \preceq G^{(1)}$ which additionally satisfies properties (b) and (c).
Proof of Proposition 9 We follow [Reference Koukoulopoulos and MaynardKM20, Proof of Proposition 7.1], although the ordering of the different stages needs to be changed. It suffices to prove the existence of a GCD subgraph which satisfies properties (a) and (d). Indeed, applying Lemma 14 to such a subgraph, we obtain a GCD subgraph that satisfies all required properties (a)–(d).
We start by applying Lemma 11 to $G$, and obtain a GCD subgraph $G^{(1)} \preceq G$ such that $\mathcal {R}^{\unicode{x266B} }(G^{(1)}) = \emptyset$ and $G^{(1)}$ satisfies at least one of the following properties:
(A) $q(G^{(1)}) \ge t^3 q(G)$;
(B) $q(G^{(1)}) \gg q(G)$, $ {\delta (G^{(1)})}/{\delta (G)} \ge {1}/{F(t)^{1/4}}$, $\lvert \mathcal {P}_{\text {diff}}(G^{(1)})\rvert \le \log t$.
We distinguish between two cases depending on whether (A) or (B) is satisfied.
Case (A). Assume that $q(G^{(1)}) \ge t^3 q(G)$. We apply Lemma 13 to obtain a GCD subgraph $G^{(2A)} \preceq G^{(1)}$ with $\mathcal {R}(G^{(2A)}) = \emptyset$ and $q(G^{(2A)}) \gg q(G^{(1)})$. Then $G^{(2A)}$ satisfies properties (a) and (d)(i) in Proposition 9. This finishes the proof for case (A).
Case (B). Assume that $q(G^{(1)}) \gg q(G)$, $ {\delta (G^{(1)})}/{\delta (G)} \ge {1}/{F(t)^{1/4}}$, $\lvert \mathcal {P}_{\text {diff}}(G^{(1)})\rvert \le \log t$. First, we remove the effect of the large primes in $\mathcal {R}(G^{(1)})$ on $L_{F(t)}(v,w)$. By the assumption $\delta (G) \ge 1/ F(t)^{1/2}$, we have $\delta (G^{(1)}) \ge 1/F(t)^{1/4}$. We can thus apply Lemma 12 to $G^{(1)}$ with $s = F(t)$ to obtain a GCD subgraph $G^{(2B)} \preceq G^{(1)}$ with edge set $\mathcal {E}^{(2B)}$ such that
Now we remove the contribution of the primes in $\mathcal {P}_{\text {diff}}(G^{(1)})$. Using $\lvert \mathcal {P}_{\text {diff}}(G^{(1)})\rvert \le \log t$, we obtain that for any $(v,w) \in \mathcal {E}^{(2B)}$,
for large enough $t$. Hence, for any $(v,w) \in \mathcal {E}^{(2B)}$,
Finally, we apply Lemma 13 to $G^{(2B)}$ to obtain a GCD subgraph $G^{(3B)} \preceq G^{(2B)}$ such that
Thus, $G^{(3B)}$ satisfies property (a) in Proposition 9. Following the steps in stage 4b of [Reference Koukoulopoulos and MaynardKM20, Proof of Proposition 7.1], we deduce from (20) that $G^{(3B)}$ satisfies property (d)(ii) as well. This finishes the proof for case (B).
8. Proof of Lemmas 12 and 13
Proof of Lemma 12 Define
Following the steps in [Reference Koukoulopoulos and MaynardKM20, Proof of Lemma 8.4], from the assumptions $\mathcal {R}^{\unicode{x266B} }(G) = \emptyset$ and $\delta (G) \ge 1/s^{1/4}$ we deduce that
for large enough $s$. Consider the edge set
An application of the Markov inequality gives
that is, $\mu (\mathcal {E'}) \geq \frac {24}{25}\mu (\mathcal {E})$. By the definition of quality, the GCD subgraph $G' := (\mu,\mathcal {V},\mathcal {W},\mathcal {E}',\mathcal {P},f,g)$ thus satisfies
Further, for any $(v,w) \in \mathcal {E}'$ we have
as claimed.
To prove Lemma 13, we will iteratively apply the following two propositions.
Proposition 15 Let $G=(\mu,\mathcal {V},\mathcal {W},\mathcal {E},\mathcal {P},f,g)$ be a GCD graph with $\delta (G) > 0$. Then there is a GCD subgraph $G'=(\mu, \mathcal {V}', \mathcal {W}', \mathcal {E}', \mathcal {P}', f',g')$ of $G$ such that
Proof. This is a slight modification of [Reference Koukoulopoulos and MaynardKM20, Proposition 8.3], the only difference being that in our formulation the set $\mathcal {P}$ can be non-empty. The proof given in [Reference Koukoulopoulos and MaynardKM20] actually covers the formulation stated above, since it only relies on the iterative application of [Reference Koukoulopoulos and MaynardKM20, Lemma 13.2], which holds for GCD graphs with an arbitrary set of primes.
Proposition 16 Let $G=(\mu,\mathcal {V},\mathcal {W},\mathcal {E},\mathcal {P},f,g)$ be a GCD graph with $\delta (G) > 0$ such that $\emptyset \neq \mathcal {R}(G) \subseteq \{p > 10^{2000}\}$. Then there is a GCD subgraph $G'=(\mu, \mathcal {V}', \mathcal {W}', \mathcal {E}', \mathcal {P}', f',g')$ of $G$ such that
Proof. This follows directly from [Reference Koukoulopoulos and MaynardKM20, Propositions 8.1 and 8.2].
Proof of Lemma 13 First, we apply Proposition 15 to obtain a GCD subgraph $G^{(1)} \preceq G$ with
If $\mathcal {R}(G^{(1)}) = \emptyset$, we are done. Otherwise, we apply Proposition 16 to obtain a GCD subgraph $H_1 \preceq G^{(1)}$ with $\mathcal {R}(H_1) \subsetneq \mathcal {R}(G^{(1)})$ and $q(H_1) \geq q(G^{(1)}).$ By iterating this argument, we obtain a chain of GCD subgraphs $G^{(1)} \succeq H_1 \succeq H_2 \succeq \cdots$ with
Since $\mathcal {R}(G^{(1)})$ is a finite set, we arrive after finitely many steps at a GCD subgraph $G' \preceq G$ with $\mathcal {R}(G') = \emptyset$ and $q(G') \geq q(G^{(1)}) \gg q(G)$. Furthermore, we have $\mathcal {P}' \subseteq \mathcal {P} \cup \mathcal {R}(G)$ since this property is preserved at each step.
9. Quality increment versus density loss
The goal of this section is to prove Lemma 11. We start with three preliminary results.
Lemma 17 Let $G=(\mu,\mathcal {V},\mathcal {W},\mathcal {E},\mathcal {P},f,g)$ be a GCD graph with $\delta (G)>0$, let $p \in \mathcal {R}(G)$, and let
Then there exists a pair of non-negative integers $(k,l)=(k_p,l_p)$ such that $\alpha _k, \beta _l >0$, and
Proof. This follows from a straightforward modification of the proof of [Reference Koukoulopoulos and MaynardKM20, Lemma 12.1], replacing the estimate $\frac {1}{1000} \sum _{|j|\geq 1} 2^{-|j|/20} \leq \frac {1}{10}$ by $\sum _{|j| \geq 1} {1}/{40 j^2} \leq \frac {1}{10}$ in one of the steps.
Lemma 18 Let $\alpha _k,\beta _k,\alpha _l,\beta _l \in [0,1]$ with $\alpha _k, \beta _l >0$ be such that $\alpha _k + \alpha _l \leq 1$ and $\beta _k + \beta _l \leq 1$, and let
If $\min \{\alpha _k,\beta _k\} \leq 1-R$ and $\min \{\alpha _{l},\beta _{l}\} \leq 1-R$ with some $R \in \left [0,1/\sqrt {2} \right ]$, then $ {S^2}/{\alpha _k\beta _{l}} \geq {R}/{2}$.
Proof. Clearly,
Since the conditions of the lemma and $S$ are invariant under switching $\alpha _k$ with $\beta _l$ and $\alpha _l$ with $\beta _k$, respectively, we may assume that $\alpha _k \geq \beta _{l}$.
Assume first that $\alpha _k \le 1/2$. Then $\beta _l \le 1/2$ as well, hence
Therefore $S^2/(\alpha _k \beta _l) \ge 4>R/2$, as claimed.
Assume next that $\alpha _k>1/2$. Formula (21) then gives
If $\beta _k \le 1-R$, then $R \le 1-\beta _k \le S^2/(\alpha _k \beta _l)$, as claimed. If $\beta _k>1-R>1/4$, then by the assumption $\min \{ \alpha _k, \beta _k \} \le 1-R$ we have $\alpha _k \le 1-R$, and we similarly deduce
which finishes the proof of the statement.
The following lemma is a variant of [Reference Koukoulopoulos and MaynardKM20, Lemma 12.2].
Lemma 19 Consider a GCD graph $G=(\mu,\mathcal {V},\mathcal {W},\mathcal {E},\mathcal {P},f,g)$ with $\delta (G)>0$ and a prime $p \in \mathcal {R}^{\unicode{x266B} }(G)$. Let $(k,l)=(k_p,l_p)$ be a pair of non-negative integers which satisfies the conclusion of Lemma 17. Then there is a GCD subgraph $G'=(\mu,\mathcal {V}',\mathcal {W}',\mathcal {E}',\mathcal {P}',f',g')$ of $G$ with $\mathcal {P}' = \mathcal {P} \cup \{p\}$ and $\mathcal {R}(G') \subseteq \mathcal {R}(G) \backslash \{p\}$ such that
and
Proof. We claim that $G'= G_{p^k, p^l}$ satisfies all required properties. Note that $\mathcal {P}' = \mathcal {P} \cup \{p\}$ and $\mathcal {R}(G') \subseteq \mathcal {R}(G) \backslash \{p\}$ hold by the definition of $G_{p^k, p^l}$. If $k = l$, then by Lemma 17 and the definition of quality,
and
as claimed. Let $S$ be as in Lemma 18. If $k \neq l$, then by Lemma 17 together with (21),
Furthermore,
The assumption $p \in \mathcal {R}^{\unicode{x266B} }(G)$ ensures that $\min \{ \alpha _k, \beta _k \} \leq 1 -1/\sqrt {p}$ and $\min \{ \alpha _l, \beta _l \} \leq 1 - 1/\sqrt {p}$. Hence, we can apply Lemma 18 with $R = 1/\sqrt {p}$, which shows that
as claimed.
Proof of Lemma 11 We apply Lemma 19 iteratively to $G$ until we obtain a GCD subgraph $G'=(\mu, \mathcal {V}', \mathcal {W}', \mathcal {E}', \mathcal {P}', f', g')$ of $G$ such that $\mathcal {R}^{\unicode{x266B} }(G')=\emptyset$. Note that each prime $p$ is used at most once, and $\mathcal {P}'$ is precisely the set of primes to which Lemma 19 was applied. For each $p \in \mathcal {P}'$, let $(k_p,l_p)$ be the pair of non-negative integers with which Lemma 19 is applied.Footnote 1 Since the original graph $G$ had an empty set of primes, we have $\mathcal {P}_{\textrm {diff}}(G') = \{ p \in \mathcal {P}' \, : \, k_p \neq l_p \}$. By Lemma 19, the resulting graph $G'$ satisfies
In particular,
Fix $C \ge 1$, and let $t \ge 1$ be large enough in terms of $C$. Let $N=|\mathcal {P}_{\textrm {diff}}(G')|$, and, for the sake of readability, in the sequel let $\log _i$ denote the $i$-fold iterated logarithm. It will be enough to show that if $q(G')< t^3 q(G)$ (i.e. property (a) does not hold), then $\delta (G')/\delta (G) \ge 1/F(t)^{1/4}$, and $N \le \log t$ (i.e. property (b) holds). The latter follows easily from (22) and $q(G')< t^3 q(G)$:
Hence, $N \ll (\log t)/\log _2 t$, and, in particular, $N \le \log t$ for large enough $t$, as claimed. It remains to show that $q(G')< t^3 q(G)$ implies $\delta (G')/\delta (G) \ge 1/F(t)^{1/4}$.
Let $Y=\{ p \in \mathcal {P}_{\mathrm {diff}}(G') \, : \, |k_p-l_p| \ge \log _3 t \}$. Bounding the sum term by term gives
On the other hand, (22) and the assumption $q(G')< t^3 q(G)$ lead to
hence $|Y| \ll (\log t)/(\log _2 t \log _3 t)$. The previous formula also shows that $\sum _{p \in Y} |k_p-l_p| \ll \log t$. An application of the inequality of arithmetic and geometric means thus yields
The previous formula and (23) thus give
Hence, $-\log (\delta (G')/\delta (G)) \le \frac {1}{4} \log F(t)$ for large enough $t$, that is, $\delta (G')/\delta (G) \ge 1/F(t)^{1/4}$, and we obtain the desired result.
Acknowledgements
CA is supported by the Austrian Science Fund (FWF), projects F-5512, I-3466, I-4945, I-5554, P-34763, P-35322 and Y-901. BB is supported by the Austrian Science Fund (FWF), project F-5510. We wish to thank the referee for a very careful reading of our paper and for many helpful comments.