1 Introduction
Galois theory for lattice polytopes
A problem of enumerative geometry asks how many geometric objects satisfy a generic geometric constraint in a given space of constraints $P$ . Galois theory for this enumerative problem studies how the solutions of this problem permute as the constraint runs along loops in $P$ . In the last decade, particularly strong results were obtained in Galois theory of Schubert calculus; see [Reference Sottile and WhiteSW15] and references therein.
We develop Galois theory in the same vein for another well-known enumerative problem, the Kouchnirenko–Bernstein theorem, counting the solutions of a system of generic polynomial equations composed of a given finite collection of monomials. More accurately, let us identify points $a=(a_{1},\ldots ,a_{n})\in \mathbb{Z}^{n}$ with monomials $x^{a}=x_{1}^{a_{1}}\ldots \,x_{n}^{a_{n}}$ ; then every finite set of monomials $A\subset \mathbb{Z}^{n}$ gives rise to the space of Laurent polynomials $\mathbb{C}^{A}=\{\sum _{a\in A}c_{a}x^{a},\,c_{a}\in \mathbb{C}\}$ , supported at $A$ . These polynomials are defined as functions on the complex torus $(\mathbb{C}\setminus 0)^{n}$ .
Theorem 1.1 (Kouchnirenko–Bernstein [Reference BernsteinBer75]).
For every collection of finite sets $A=(A_{1},\ldots ,A_{n})$ in $\mathbb{Z}^{n}$ , there exists a proper exceptional algebraic set $B_{A}\subset \mathbb{C}^{A}=\mathbb{C}^{A_{1}}\oplus \cdots \oplus \mathbb{C}^{A_{n}}$ , such that the number of common roots $x\in (\mathbb{C}\setminus 0)^{n}$ of a system of polynomial equations $f_{1}(x)=\cdots =f_{n}(x)=0$ for every tuple of polynomials $(f_{1},\ldots ,f_{n})\in \mathbb{C}^{A}$ outside $B_{A}$ equals the lattice mixed volume of (the convex hulls of) $A_{1},\ldots ,A_{n}$ .
In the setting of the Kouchnirenko–Bernstein theorem, denote the mixed volume by $V$ . Then every loop in $\mathbb{C}^{A}\setminus B_{A}$ , pointed at some tuple $f=(f_{1},\ldots ,f_{n})$ , defines a permutation of the roots of $f=0$ . For all loops in $\mathbb{C}^{A}\setminus B_{A}$ , these permutations form a subgroup of the group $S_{V}$ of all permutations of the $V$ roots of $f=0$ . This subgroup will be called the monodromy group of the general system of polynomial equations supported at $A$ and denoted by $G_{A}$ .
We shall be interested in the following two problems:
(I) Compute $G_{A}$ .
(II) Classify solvable tuples $A$ , for which the multivalued function $\mathbb{C}^{A}\setminus B_{A}\rightarrow (\mathbb{C}\setminus 0)^{n}$ , assigning the roots of the system $f=0$ to an element $f\in \mathbb{C}^{A}\setminus B_{A}$ , can be expressed by radicals.
The first problem helps to solve the second one, because a solvable tuple $A$ has a solvable monodromy group $G_{A}$ (see, for example, [Reference KhovanskiiKho15]).
Example 1.2. For $n=1$ and $A=A_{1}=\{0,1,\ldots ,d\}$ , the problems above ask (I) for the monodromy of the generic univariate polynomial $c_{d}x^{d}+c_{d-1}x^{d-1}+\cdots +c_{0}$ and (II) for the expression of its roots by radicals in terms of the coefficients $c_{0},c_{1},\ldots ,c_{d}$ . It is classically known that the monodromy $G_{A}$ equals $S_{d}$ , and thus the general equation of degree $d$ is solvable for $d\leqslant 4$ .
For arbitrary $n$ , the second problem, though not the first one, can be reduced without loss of generality to reduced irreducible tuples $A=(A_{1},\ldots ,A_{n})$ in the sense of the following Definition 1.3. Thus, the subsequent Theorem 1.5 leads to a complete solution of problem (II), and seems to be a natural first step towards the solution of problem (I).
Definition 1.3. (1) A tuple of finite sets $A_{1},\ldots ,A_{k}$ in $\mathbb{Z}^{n}$ is said to be reduced, if they cannot be shifted to the same proper sublattice of $\mathbb{Z}^{n}$ .
(2) A tuple of finite sets $A_{1},\ldots ,A_{k}$ in $\mathbb{Z}^{n}$ is said to be irreducible (respectively, linearly independent), if it is impossible to shift all but $m$ (respectively, $m-1$ ) of them to the same sublattice of codimension $m$ , for $m>0$ .
Remark 1.4. (1) Mind the difference between reduced and reducible (i.e. non-irreducible).
(2) Similar conditions were introduced by various authors for particular values of $n-k$ (cf., for instance, essential tuples in [Reference SturmfelsStu94] for $k=n+1$ ). We prefer the terms ‘linearly independent’, ‘reduced’ and ‘irreducible’ (introduced in [Reference KhovanskiiKho78] and [Reference Esterov and GusevEG16] for $k=n$ ), because discriminants and other geometric objects, related to the system of equations $f=0$ for the general tuple $f\in \mathbb{C}^{A}$ , tend to be reduced and irreducible in the sense of algebraic geometry if the tuple $A=(A_{1},\ldots ,A_{k})$ has the property of the same name. See Remark 3.17 and Theorem 3.21 for some instances of this correspondence.
Theorem 1.5. If $A=(A_{1},\ldots ,A_{n})$ is a reduced irreducible tuple, then the monodromy group $G_{A}$ equals the symmetric group $S_{V}$ .
The proof is given at the end of this section.
Systems of equations, solvable by radicals
Since $S_{V}$ is not solvable for $V>4$ , Theorem 1.5 implies the following corollary.
Corollary 1.6 [Reference Esterov and GusevEG16, Conjecture 1].
For a reduced irreducible tuple $(A_{1},\ldots ,A_{n})$ , the general system of equations supported at $(A_{1},\ldots ,A_{n})$ is solvable by radicals if and only if it has at most four solutions, that is, the lattice mixed volume of $A_{1},\ldots ,A_{n}$ does not exceed $4$ .
This fact actually gives the inductive classification of all solvable tuples $A=(A_{1},\ldots ,A_{n})$ .
Classification 1.7. (0) We can and will assume without loss of generality that every $A_{i}$ contains 0. Indeed, otherwise shift $A_{i}$ by a vector $-a_{i},\,a_{i}\in A_{i}$ , to a set $\widetilde{A}_{i}$ containing $0$ . Now, instead of polynomials $f_{i}\in \mathbb{C}^{A_{i}}$ , we can study polynomials $f_{i}(x)/x^{a_{i}}\in \mathbb{C}^{\widetilde{A}_{i}}$ , because they have the same roots as $f_{i}$ .
(1) We can and will assume that $A$ is reduced. Indeed, otherwise $A_{i}$ is the image of $B_{i}$ under a lattice embedding $j:\mathbb{Z}^{n}\rightarrow \mathbb{Z}^{n}$ for a reduced tuple $B=(B_{1},\ldots ,B_{n})$ , and we have the following fact: the solvability of $B$ is equivalent to the solvability of $A$ .
Proof. Consider the surjection of complex tori $h:(\mathbb{C}\setminus 0)^{n}\rightarrow (\mathbb{C}\setminus 0)^{n}$ , corresponding to the embedding $j$ of their character lattices, so that $h(x)^{b}=x^{j(b)}$ for $x\in (\mathbb{C}\setminus 0)^{n}$ and $b\in \mathbb{Z}^{n}$ . Then every tuple of polynomials $f\in \mathbb{C}^{A}$ has the form $f(x)=g(h(x)),\,g\in \mathbb{C}^{B}$ . Since $h$ is invertible by radicals, it follows that $f=0$ and $g=0$ are solvable by radicals simultaneously.◻
(2) We can and will assume that $A$ is irreducible. Otherwise, up to reordering, the sets $A_{1},\ldots ,A_{k},\,0<k<n$ , belong to the same $k$ -dimensional plane $L\subset \mathbb{Z}^{n}$ , and, denoting the tuple of the images of the other $A_{i}$ under the projection $\mathbb{Z}^{n}\rightarrow \mathbb{Z}^{n}/L$ by $A^{\prime \prime }$ , we have the following fact: the solvability of $A$ is equivalent to the solvability of the smaller-dimensional tuples $A^{\prime }=(A_{1},\ldots ,A_{k})$ and $A^{\prime \prime }$ .
Proof. Note that upon an appropriate automorphism of $(\mathbb{C}\setminus 0)^{n}$ , the polynomial $f_{i}\in \mathbb{C}^{A_{i}}$ depends only on the first $k$ coordinates for $i\leqslant k$ , so, substituting these coordinates with a solution of $f_{1}=\cdots =f_{k}=0$ in the system of equations $f_{k+1}=\cdots =f_{n}=0$ , we obtain a system of the form $g=0,\,g\in \mathbb{C}^{A^{\prime \prime }}$ . Thus solving a generic system $f=0$ supported in $A$ amounts to solving a generic system $f_{1}=\cdots =f_{k}=0$ supported in $A^{\prime }$ and a system $g=0$ , which is also generic in $\mathbb{C}^{A^{\prime \prime }}$ in the sense that assigning $g$ to $f$ is a dominant map $\mathbb{C}^{A}\rightarrow \mathbb{C}^{A^{\prime \prime }}$ .◻
(3) Finally, a reduced and irreducible tuple $A$ is solvable if and only if the lattice mixed volume of $A_{1},\ldots ,A_{n}$ does not exceed $4$ (by Corollary 1.6).
This algorithm reduces the classification of solvable systems of equations to the classification of irreducible mixed volume $4$ tuples of lattice sets. The latter classification is given in [Reference Esterov and GusevEG16] in dimension 2, and is, moreover, finite in every dimension; see Theorem 1.11 below for details.
Remark 1.8. In the same way, the classification of systems of equations solvable by $k$ -radicals in the sense of [Reference KhovanskiiKho15] (i.e. those that can be reduced to solving univariate polynomial equations of degree at most $k$ ) is reduced to the classification of tuples of lattice sets of mixed volume at most $k$ .
Example 1.9. For $n=2$ , if a reduced consistent general system of equations is solvable by radicals, then its Newton polygons either have lattice mixed volume at most $4$ (there are 14 such maximal pairs up to automorphisms of $\mathbb{Z}^{2}$ ; see [Reference Esterov and GusevEG16]), or equal a segment $I$ of lattice length at most $4$ and an arbitrary polygon $P$ , whose support lines parallel to $I$ are at the lattice distance not exceeding $4$ from each other.
Classification of small polytopes
Each of the infinitely many pairs $(I,P)$ in the preceding example has mixed volume at most 16, due to the following fact. We denote the lattice mixed volume of the convex hulls of $A_{1},\ldots ,A_{n}$ by $\operatorname{MV}(A_{1},\ldots ,A_{n})$ .
Theorem 1.10. Let $B_{1},\ldots ,B_{N}$ be lattice sets in $\mathbb{Z}^{N}$ and $A_{1},\ldots ,A_{M}$ in $\mathbb{Z}^{N}\,\oplus \,\mathbb{Z}^{M}$ . Then $\operatorname{MV}(A_{1},\ldots ,A_{M},B_{1},\ldots ,B_{N})=\operatorname{MV}(pA_{1},\ldots ,pA_{M})\operatorname{MV}(B_{1},\ldots ,B_{N})$ , where $p:\mathbb{Z}^{N}\oplus \mathbb{Z}^{M}\rightarrow \mathbb{Z}^{M}$ is the standard projection.
This well-known fact admits an especially simple proof in the spirit of Classification 1.7(2) ([Reference EsterovEst06]; see, for example, [Reference Steffens and TheobaldST10, Lemma 4] for a geometric proof).
An algebraic proof.
For $f_{i}\in \mathbb{C}^{A_{i}}$ and $g_{j}\in \mathbb{C}_{j}^{B}$ , every solution of the system $f=g=0$ is of the form $(x_{0},y_{0})$ , where $x_{0}\in (\mathbb{C}\setminus 0)^{N}$ is a solution of the system $g=0$ and $y_{0}\in (\mathbb{C}\setminus 0)^{M}$ is a solution of the system $f(x_{0},y)=0$ . For generic $f$ and $g$ , the number of solutions of the three mentioned systems equals the three lattice mixed volumes in the statement by the Kouchnirenko–Bernstein theorem.◻
This reduces the infinite classification of tuples with small mixed volume to the classification of irreducible tuples, which is already finite.
Theorem 1.11. For every $n$ and $V$ , there are finitely many irreducible tuples $(A_{1},\ldots ,A_{n})$ in $\mathbb{Z}^{n}$ of mixed volume $V$ , up to automorphisms of $\mathbb{Z}^{n}$ and shifts of the sets.
The proof is given in § 2. Moreover, if we restrict our attention to the unmixed case, where $A_{1}=\cdots =A_{n}=A$ , the classification becomes essentially finite across all dimensions: it was shown in [Reference Esterov and GusevEG16] that every reduced $A\subset \mathbb{Z}^{n}$ of lattice volume $4$ can be obtained from 34 ‘elementary’ configurations of dimension at most $6$ by affine automorphisms of $\mathbb{Z}^{n}$ and constructing cones over lattice sets in the following sense.
Definition 1.12. The cone over $B\subset \mathbb{Z}^{m}$ is the set $c(B)=\{0\}\cup (B\times \{k\})\subset \mathbb{Z}^{m+1}$ .
Remark 1.13. (1) The same is true for every value of the volume, as shown in [Reference Hofscheier, Katthän and NillHKN18, Corollary 3.1] (although, starting from volume $5$ , the classification of non-cones seems to be incomprehensibly large).
(2) In the notation of the preceding definition, the solution by radicals of the system $f_{0}=\cdots =f_{m}=0$ supported at the cone $c(B)$ can be reduced to the solution by radicals of the system $g_{1}=\cdots =g_{m}=0$ supported at its base $B$ , by setting $g_{i}(x)=f_{i}(x)/f_{i,0}-f_{0}(x)/f_{0,0}$ , where $f_{i,0}$ is the constant term of $f_{i}$ . Thus the solution by radicals of all solvable unmixed systems of arbitrarily many variables reduces to the 34 elementary ones listed in [Reference Esterov and GusevEG16].
(3) The classification of the 34 non-cones of volume $4$ in [Reference Esterov and GusevEG16] includes only reduced ones (or spanning ones, in terms of [Reference Hofscheier, Katthän and NillHKN18]), because this suits the needs of Corollary 1.6. The classification of all (possibly non-reduced) non-cones of volume $4$ is also possible, but is more complicated and not finite due to empty simplices; see [Reference Hibi and TsuchiyaHT17].
Monodromy of reducible systems of equations
In contrast to the problem of solvability, the computation of the monodromy of an arbitrary tuple cannot easily be reduced to the case of reduced irreducible tuples. We formulate a conjecture regarding non-reduced tuples and show by an example that the case of reducible tuples is yet more complicated (so that we do not even make any predictions).
Conjecture 1.14. In the setting of step (1) of Classification 1.7, if the tuple $B$ is reduced and irreducible of mixed volume $d$ , then the monodromy group $G_{A}$ equals the wreath product of $\operatorname{coker}j$ and $S_{d}$ acting on $\{1,\ldots ,d\}$ .
Remark 1.15. We now explain why $G_{A}$ obviously embeds into this wreath product, so the problem is whether the embedding is actually an isomorphism. In the notation of part (1) of Classification 1.7, the roots of $f=0$ split into the fibres of the surjection $h:\{f=0\}\rightarrow \{g=0\}$ . All fibres are cosets of the subgroup $\operatorname{coker}j\subset (\mathbb{C}\setminus 0)^{n}$ , and every monodromy permutation of the set $\{f=0\}$ ‘respects $j$ ’, that is to say, it sends every fibre into a fibre, preserving its $(\operatorname{coker}j)$ -torsor structure. In particular, the group $G_{A}$ is contained in the group of all permutations respecting $j$ , and the latter is exactly the wreath product sought.
Example 1.16. If the tuple $A$ is as shown on the left in Figure 1, then $G_{A}$ is obviously equal to $V_{4}\subset S_{4}$ , generated by $(12)(34)$ and $(13)(24)$ . However, if the tuple $A$ is as shown on the right, then its Cayley discriminant (Definition 3.12) has codimension 1, so a small loop around this discriminant corresponds to a transposition in $G_{A}$ (see Remark 3.26), thus the group is strictly greater than $V_{4}$ (actually, it equals $D_{8}$ ). This is despite, in the notation of step (2) of Classification 1.7, the groups $G_{A^{\prime }}$ and $G_{A^{\prime \prime }}$ are the same (equal to $S_{2}$ ) for both examples. Thus $G_{A}$ is not defined solely by $G_{A^{\prime }}$ and $G_{A^{\prime \prime }}$ .
Nevertheless, we can confirm in our setting the ‘symmetric or imprimitive’ dichotomy, conjectured in [Reference Sottile and WhiteSW15] for Schubert enumerative problems, modulo one obvious exclusion.
Example 1.17. Let $B$ and $C$ be tuples of finite sets of lattice mixed volume 1 in $\mathbb{Z}^{k}$ and $\mathbb{Z}^{m}$ respectively, $k>0,\,m\geqslant 0$ (see [Reference Esterov and GusevEG15] or § 2 below for the classification of such tuples), and let $j:\mathbb{Z}^{k}\rightarrow \mathbb{Z}^{k}\oplus \mathbb{Z}^{m}$ send $v$ to $(pv,0)$ for some odd prime $p$ . Let $P^{\prime }$ be the tuple $j(B)$ in $\mathbb{Z}^{k}\oplus \mathbb{Z}^{m}$ , and let $P^{\prime \prime }$ be a tuple of $m$ sets in $\mathbb{Z}^{k}\oplus \mathbb{Z}^{m}$ whose projections to $\mathbb{Z}^{m}$ form the tuple $C$ . Then the mixed volume of a tuple $P=(P^{\prime },P^{\prime \prime })$ equals $p$ , and, moreover, by Remark 1.15, the monodromy group $G_{P}$ is a subgroup of $\mathbb{Z}/p\mathbb{Z}$ , that is, equals $\mathbb{Z}/p\mathbb{Z}$ or the trivial group, of which the former is primitive and the latter is not. Actually one can check that $G_{P}$ always equals $\mathbb{Z}/p\mathbb{Z}$ in accordance with Conjecture 1.14 (which is obvious in the one-dimensional case, that is, for the equation $c_{p}x^{p}+c_{0}=0$ , corresponding to $P_{1}=\{0,p\}\subset \mathbb{Z}^{1}$ , and less obvious in general).
A tuple that can be identified with $P$ by an isomorphism of lattices will be called a prime tuple.
Definition 1.18. A tuple of sets $A=(A_{1},\ldots ,A_{n})$ in $\mathbb{Z}^{n}$ is said to be numerically non-reduced, if there exist sets $B_{1},\ldots ,B_{k}$ in $\mathbb{Z}^{k}$ and an embedding $j:\mathbb{Z}^{k}\rightarrow \mathbb{Z}^{n}$ , such that the lattice mixed volume of $B_{1},\ldots ,B_{k}$ is greater than 1, the embedding is not saturated (i.e. $\mathbb{Z}^{n}/j(\mathbb{Z}^{k})$ is not free), and $j(B_{1}),\ldots ,j(B_{k})$ coincide with $k$ of the sets $A_{1},\ldots ,A_{n}$ up to a shift.
The tuple $A$ is said to be numerically reducible, if a quantity $k<n$ of $A_{i}$ can be shifted to a $k$ -dimensional sublattice $L$ such that the lattice mixed volumes of both $A^{\prime }=$ (the tuple of $A_{i}$ shifted to $L$ ) and $A^{\prime \prime }=$ (the tuple of the images of the rest of the $A_{i}$ under the projection $\mathbb{Z}^{n}\rightarrow \mathbb{Z}^{n}/L$ ) are greater than 1.
The name is chosen because the mixed volume $V$ of the tuple $A$ equals the product of the mixed volumes of $A^{\prime }$ and $A^{\prime \prime }$ by Theorem 1.10.
Theorem 1.19. For every non-prime tuple $A$ (see Example 1.17), the monodromy group $G_{A}$ is the symmetric group $S_{V}$ if the tuple $A$ is numerically reduced and irreducible, and is imprimitive otherwise.
Proof. If the tuple $A$ of subsets of $\mathbb{Z}^{n}$ is numerically non-reduced, then, in the notation of Definition 1.18, let $h:(\mathbb{C}\setminus 0)^{n}\rightarrow (\mathbb{C}\setminus 0)^{k}$ be the surjection of tori, corresponding to the embedding $j:\mathbb{Z}^{k}\rightarrow \mathbb{Z}^{n}$ of their character lattices so that $h(x)^{b}=x^{j(b)}$ for all $x\in (\mathbb{C}\setminus 0)^{n}$ and $b\in \mathbb{Z}^{k}$ . Then every system of equations $f(x)=0,\,f\in \mathbb{C}^{A}$ contains a subsystem of the form $g(h(x))=0,\,g\in \mathbb{C}^{B}$ . By Remark 1.15, the fibres of the surjection $h:\{f=0\}\rightarrow \{g=0\}$ are blocks of the monodromy action of $G_{A}$ . The number and size of the blocks are greater than 1, because the mixed volume of $B_{1},\ldots ,B_{k}$ is greater than 1, and $j$ is not saturated.
If $A$ is numerically reducible, then, in the notation of Definition 1.18, upon an appropriate automorphism of $(\mathbb{C}\setminus 0)^{n}$ and reordering the tuple, we may assume that $A_{1},\ldots ,A_{k}$ are contained in the first $k$ -dimensional coordinate plane $L\subset \mathbb{Z}^{n},\,0<k<n$ , and the mixed volumes $V^{\prime }$ and $V^{\prime \prime }$ of both $A^{\prime }=(A_{1},\ldots ,A_{k})$ and $A^{\prime \prime }=$ (the images of $A_{k+1},\ldots ,A_{n}$ in $\mathbb{Z}^{n}/L$ ) are greater than 1. In this case, every common root of a generic tuple of polynomials $f=(f_{1},\ldots ,f_{n})\in \mathbb{C}^{A}$ is of the form $(x^{\prime },x^{\prime \prime })$ , where $x^{\prime }\in (\mathbb{C}\setminus 0)^{k}$ is one of the $V^{\prime }$ roots of the system $f^{\prime }=(f_{1},\ldots ,f_{k})$ . In particular, the fibres of the projection $\{f=0\}\rightarrow \{f^{\prime }=0\}$ are $V^{\prime }>1$ blocks of size $V^{\prime \prime }>1$ for the action of the monodromy group $G_{A}$ , so this action is imprimitive.
If the tuple $A$ is numerically reduced, numerically irreducible and not prime, then it is reduced. So, if $A$ is irreducible in this case, then $G_{A}$ is symmetric by Theorem 1.5.
Thus, it remains to consider reducible $A$ that is numerically reduced, numerically irreducible and not prime. In this case, in the notation of part (2) of Classification 1.7, the tuples $A^{\prime }$ and $A^{\prime \prime }$ are also numerically reduced, numerically irreducible and not prime, and the mixed volume of one of them equals $1$ . Thus $G_{A}$ equals the monodromy group of the other one, which is symmetric by induction on the dimension.◻
Structure of the paper
In § 2 we prove and discuss Theorem 1.11. The rest of the paper is devoted to the proof of Theorem 1.5. In § 3 we reduce the assumption of irreducibility to a more general notion of dual effectiveness (antonym to dual defectiveness; see Definition 3.14 below).
Theorem 1.20. A reduced irreducible tuple of $n$ sets in $\mathbb{Z}^{n}$ is dual effective unless, upon an automorphism of the lattice, all of its sets can be shifted to the standard simplex (i.e. the system of equations is essentially linear).
For the proof, see Corollary 3.23. Besides the relation to Galois theory, this result may be important as an illustration of a new approach to dual defectiveness in the toric setting, independent of the known ones [Reference Di RoccoDiR06, Reference Dickenstein, Feichtner and SturmfelsDFS07, Reference Cattani and CurranCC07, Reference EsterovEst18, Reference Furukawa and ItoFI16, Reference ForsgårdFor17].
Remark 1.21. (1) In the case of full-dimensional tuples, Theorem 1.20 was deduced from [Reference Furukawa and ItoFI16] in [Reference Borger and NillBN18], settling the conjecture from [Reference Cattani, Cueto, Dickenstein, Di Rocco and SturmfelsCCDDS13]. Our proof is independent of [Reference Furukawa and ItoFI16], and it would be important to extend the technique of [Reference Borger and NillBN18] from full-dimensional tuples to irreducible ones.
(2) It would be important to drop the irreducibility assumption and completely classify dual defective tuples in various senses (see Remark 3.16), as Example 1.16 suggests.
Theorem 1.22. If $A$ is a reduced dual effective tuple, then the monodromy $G_{A}$ contains a transposition.
Roughly speaking, the transposition is produced by running a small loop around the discriminant; see Theorem 3.25 for the proof and Theorem 3.27 for a possible generalization to non-square systems of equations.
Theorem 1.23. If $A$ is a reduced irreducible tuple, then the monodromy $G_{A}$ is doubly transitive.
The proof is standard and is given in § 4.
Proof of Theorem 1.5.
Unless the system of equations generically has one solution (satisfying $G_{A}=S_{1}$ ), Theorem 1.20 ensures that the tuple is dual effective, so the monodromy contains a transposition by Theorem 1.22. Since it is also doubly transitive by Theorem 1.23, it coincides with the symmetric group.◻
2 Lattice polytopes of small mixed volume
Theorem 2.1 [Reference Lagarias and ZieglerLZ91].
For any $n$ , there are finitely many convex lattice polytopes of a given lattice volume in $\mathbb{Z}^{n}$ up to affine automorphisms of the lattice.
Theorem 2.2 (Minkowski [Reference MinkowskiMin11]).
A tuple is linearly dependent if and only if its mixed volume equals $0$ .
Proof of Theorem 1.11.
Tuples $(B_{1},B_{1},B_{3},\ldots ,B_{N})$ and $(B_{2},B_{2},B_{3},\ldots ,B_{N})$ are said to be AF-descendants of $(B_{1},B_{2},B_{3},\ldots ,B_{N})$ , if both of them are linearly independent. If the tuple $B^{\prime }$ is the AF-descendant of $B$ , then, by the Aleksandrov–Fenchel inequality and Theorem 2.2, we have
Every linearly independent tuple $B$ that entirely consists of sets contained in the irreducible tuple $A$ , can be obtained from $A$ by taking a sequence of AF-descendants $A^{\prime },A^{\prime \prime },\ldots ,A^{(k)}=B$ . Applying the inequality (*) to this sequence, we conclude that if all sets of the tuple $B$ are contained in the irreducible tuple $A$ , then
Note that (**) trivially holds also for linearly dependent tuples $B$ by Theorem 2.2.
We can now estimate the lattice volume of the Minkowski sum $A_{1}+\cdots +A_{N}$ as follows. Write it as $\operatorname{MV}(A_{1}+\cdots +A_{N},\ldots ,A_{1}+\cdots +A_{N})$ , open the brackets and estimate every term by the inequality (**). As a result, for every irreducible tuple $(A_{1},\ldots ,A_{N})$ of mixed volume $V$ , the volume of the Minkowski sum $A_{1}+\cdots +A_{N}$ is at most $N^{N}V^{2^{N}}\!$ , so by Theorem 2.1 there are finitely many possibilities for $A_{1}+\cdots +A_{N}$ and hence for $(A_{1},\ldots ,A_{N})$ .◻
Remark 2.3. It would be interesting to obtain a sharper estimate on the volume of $A_{1}+\cdots +A_{N}$ in terms of the mixed volume of an irreducible tuple $(A_{1},\ldots ,A_{N})$ .
The classification of irreducible tuples is known only up to mixed volume 4 in dimension 2 (see [Reference Esterov and GusevEG16]), and up to mixed volume 1 in arbitrary dimension.
Corollary 2.4 (Minkowski).
The unique irreducible tuple of mixed volume $0$ is a point in $\mathbb{Z}^{1}$ .
Theorem 2.5 [Reference Esterov and GusevEG15].
The unique (up to automorphisms of the lattice and shifts of polytopes) maximal (by inclusion) irreducible tuple of lattice polytopes of mixed volume $1$ in $\mathbb{Z}^{N}$ is the tuple of $N$ copies of the standard simplex.
3 Discriminants and dual defectiveness
Mixed resultants
Let $A=(A_{0},\ldots ,A_{n})$ be a tuple of finite sets in $\mathbb{Z}^{n}$ .
Definition 3.1. The $A$ -resultant $R_{A}$ is the closure of the set of all tuples of polynomials $f=(f_{0},\ldots ,f_{n})\in \mathbb{C}^{A}$ that have a common root $f_{0}(x)=\cdots =f_{n}(x)=0,\,x\in (\mathbb{C}\setminus 0)^{n}$ .
Example 3.2. For $n=1$ , the set $R_{A}$ is the zero locus of the classical Sylvester resultant.
Theorem 3.3 ([Reference EsterovEst07, Theorem 2.26]; see also [Reference SturmfelsStu94] for the first part of the statement).
If $A$ is irreducible, then the resultant $R_{A}$ is a non-empty irreducible hypersurface, and a generic tuple $f\in R_{A}$ has a unique common root in $(\mathbb{C}\setminus 0)^{n}$ .
Gelfand–Kapranov–Zelevinsky discriminants
Let $A\subset \mathbb{Z}^{n}$ be a finite set.
Definition 3.4 [Reference Gelfand, Kapranov and ZelevinskyGKZ94].
The $A$ -discriminant $D_{A}$ is the closure of the set of all polynomials $f\in \mathbb{C}^{A}$ that have a singular root $f(x)=0,\,df(x)=0,\,x\in (\mathbb{C}\setminus 0)^{n}$ .
Example 3.5. For $n=1$ , the set $D_{A}$ is the zero locus of the classical discriminant.
Definition 3.6. The tuple $A$ is said to be dual defective if $D_{A}$ is not a hypersurface, and dual effective otherwise.
This is equivalent to the fact that the projectively dual variety to the toric variety $X_{A}$ is not a hypersurface (hence the name). The study of dual defective projective varieties is a classical topic in algebraic geometry [Reference EinEin86]. In particular, there is an extensive literature on the classification of dual defective lattice sets; see [Reference Di RoccoDiR06, Reference EsterovEst18] and [Reference Furukawa and ItoFI16] for some of the most explicit answers (the first one is for the case of smooth toric varieties).
Example 3.7. The set $A=\{(00),(10),(20),(01)\}\subset \mathbb{Z}^{2}$ is defective.
Theorem 3.8 [Reference Gelfand, Kapranov and ZelevinskyGKZ94].
If a dual effective $A$ cannot be shifted to a proper sublattice of $\mathbb{Z}^{n}$ , then a generic polynomial $f\in D_{A}$ has a unique singular root $x\in (\mathbb{C}\setminus 0)^{n}$ , and the Hessian of $f$ at this root is non-degenerate.
If $A$ is dual effective, then the set $D_{A}$ is the zero locus of a unique irreducible integer polynomial on $\mathbb{C}^{A}$ (up to the choice of sign). This polynomial is also called the $A$ -discriminant. The coefficients $c_{a},a\in A$ , of the general Laurent polynomial $\sum _{a\in A}c_{a}x^{a}$ in $\mathbb{C}^{A}$ form the natural system of coordinates in $\mathbb{C}^{A}$ , and we shall consider the $A$ -discriminant as a polynomial of $c_{a},a\in A$ .
Lemma 3.9 [Reference EsterovEst10, Lemma 2.21].
For every dual effective $A\subset \mathbb{Z}^{n}$ and every $a\in A$ , the $A$ -discriminant has positive degree in $c_{a}$ .
Remark 3.10. For every $B\subset A$ , there is a natural forgetful projection $\mathbb{C}^{A}\rightarrow \mathbb{C}^{B}$ , sending $\sum _{a\in A}c_{a}x^{a}$ to $\sum _{a\in B}c_{a}x^{a}$ , and we shall denote the preimage of $D_{B}$ under this map also by $D_{B}$ .
Corollary 3.11. If $A$ is dual effective, then $D_{A}\neq D_{B}$ for every $B\subsetneq A$ .
Discriminants of systems of equations
For a tuple $A=(A_{1},\ldots ,A_{k})$ of finite sets in $\mathbb{Z}^{n},\,2\leqslant k\leqslant n$ , the concept of the discriminant of the system of equations supported at $A$ is ambiguous. We introduce three different versions of this notion that appear in the literature, and it will be important for us that all of them coincide for irreducible tuples. Denote the standard basis in $\mathbb{Z}^{k}$ by $e_{1},\ldots ,e_{k}$ , and, for every $I\subset \{1,\ldots ,k\}$ , let $A_{I}$ be the Cayley configuration $\bigcup _{i\in I}\{e_{i}\}\times A_{i}\subset \mathbb{Z}^{k}\times \mathbb{Z}^{n}$ . For every $f\in \mathbb{Z}^{A}$ , let $f_{I}$ be the polynomial $\sum _{i\in I}\unicode[STIX]{x1D706}_{i}f_{i}(x)\in \mathbb{C}^{A_{I}}$ of variables $\unicode[STIX]{x1D706}=(\unicode[STIX]{x1D706}_{1},\ldots ,\unicode[STIX]{x1D706}_{k})\in (\mathbb{C}\setminus 0)^{k}$ and $x\in (\mathbb{C}\setminus 0)^{n}$ .
Definition 3.12. (1) The naive $A$ -discriminant [Reference EsterovEst10] is the closure of the set of all tuples $f\in \mathbb{C}^{A}$ having a singular common root $x\in (\mathbb{C}\setminus 0)^{n}$ (so that $f_{1}(x)=\cdots =f_{k}(x)=0$ and $df_{1}(x),\ldots ,df_{k}(x)$ are linearly dependent).
(2) The mixed $A$ -discriminant ([Reference Cattani, Cueto, Dickenstein, Di Rocco and SturmfelsCCDDS13] for $k=n$ ) is the closure of the set of all tuples $f\in \mathbb{C}^{A}$ having a non-degenerate singular common root $x\in (\mathbb{C}\setminus 0)^{n}$ (i.e. a singular common root such that no proper subtuple of $df_{1}(x),\ldots ,df_{k}(x)$ is linearly dependent).
(3) The Cayley $A$ -discriminant [Reference EsterovEst10] is the image of the discriminant $D_{A_{\{1,\ldots ,k\}}}\subset \mathbb{C}^{A_{\{1,\ldots ,k\}}}$ under the natural isomorphism $\mathbb{C}^{A_{\{1,\ldots ,k\}}}\rightarrow \mathbb{C}^{A}$ inverse to sending every $f$ to $f_{\{1,\ldots ,k\}}$ .
All of these sets obviously coincide for the Gelfand–Kapranov–Zelevinsky case $k=1$ . However, for $k>1$ (including $k=n$ ), they may be pairwise different [Reference Cattani, Cueto, Dickenstein, Di Rocco and SturmfelsCCDDS13, Example 1.2] and have irreducible components of different dimensions [Reference EsterovEst10, Example 2.25]. Nevertheless, this difference disappears for irreducible tuples.
Theorem 3.13. If $A$ is irreducible, the three discriminant sets of Definition 3.12 coincide up to irreducible components of codimension greater than $1$ .
Proof. If the Cayley discriminant has codimension greater than 1, then so do the naive discriminant by [Reference EsterovEst10, Theorem 2.31] and the mixed discriminant (as its subset).
To study the opposite case, define $\unicode[STIX]{x1D6F4}_{\{j_{1},\ldots ,j_{q}\}}$ as the set of all tuples $f=(f_{1},\ldots ,f_{k})\in \mathbb{C}^{A}$ such that $f(x)=0$ for some $x\in (\mathbb{C}\setminus 0)^{n}$ and $\sum _{i}\unicode[STIX]{x1D706}_{i}df_{q_{i}}(x)=0$ for some $(\unicode[STIX]{x1D706}_{1},\ldots ,\unicode[STIX]{x1D706}_{q})\in (\mathbb{C}\setminus 0)^{q}$ [Reference EsterovEst10, Definition 2.33].
If the Cayley discriminant has codimension 1, then this hypersurface $H$ is the only codimension 1 component of the naive discriminant by [Reference EsterovEst10, Theorem 2.31] and the only codimension 1 set of the form $\unicode[STIX]{x1D6F4}_{J}$ (namely, the one corresponding to $J=\{1,\ldots ,k\}$ ) by [Reference EsterovEst10, Lemma 2.34]. The latter fact implies that a singular common root of a generic tuple $f\in H$ is non-degenerate (because the linear dependence of its differentials $df_{j}$ for $j\in J^{\prime }\neq \{1,\ldots ,k\}$ would imply that $\unicode[STIX]{x1D6F4}_{J^{\prime }}=\unicode[STIX]{x1D6F4}_{J}$ also has codimension 1). Thus $H$ is also a codimension 1 component of the mixed discriminant, and the latter has no other codimension 1 components, because it is contained in the naive discriminant.◻
Dual defectiveness of systems of equations
Definition 3.14. By Theorem 3.13, for an irreducible tuple $A$ , we can denote the common hypersurface components of the three discriminant sets of Definition 3.12 by $D_{A}$ , and call this hypersurface the $A$ -discriminant. The irreducible tuple $A$ is said to be dual defective if $D_{A}$ is empty, and dual effective otherwise.
If the tuple $A$ consists of one set $A_{1}\subset \mathbb{Z}^{n}$ , then it is irreducible, and its dual defectiveness is the same property as in Definition 3.6.
Conjecture 3.15. For irreducible tuples, the three discriminant sets of Definition 3.12 coincide completely, that is, they are the same irreducible set.
Remark 3.16. The interrelation between the three notions of the discriminant in Definition 3.12 is not yet completely understood for reducible tuples, particularly concerning the higher-codimension components. As a consequence, the notion of dual defectiveness for reducible tuples splits into several non-equivalent versions, looking for the non-existence of codimension 1 components and/or existence of higher-codimension components in any of the three notions of the discriminant. It would be important to understand how these numerous versions are related.
Remark 3.17. As we have observed, the irreducibility of the tuple $A$ implies the irreducibility of the codimension 1 part of the naive $A$ -discriminant. On the other hand, the codimension 1 part of the naive $A$ -discriminant tends to be reducible if $A$ is reducible; see [Reference EsterovEst10, Lemma 2.34]. The situation with reduced tuples is similar: the codimension 1 components $D_{i}$ of the naive $A$ -discriminant come with natural multiplicities equal to the number of singular roots of the system $f=0$ for a generic tuple $f\in D_{i}$ . By Theorem 3.8, an irreducible tuple $A$ is reduced if and only if $D_{A}$ is reduced in the sense of the aforementioned multiplicity (see [Reference EsterovEst13] for the computation of the multiplicities for non-reduced and reducible tuples).
Lemma 3.18. An irreducible tuple $A$ is dual effective if and only if some $f\in \mathbb{C}^{A}$ has an isolated singular root.
Proof. If the tuple $f=(f_{1},\ldots ,f_{k})$ has an isolated singular root $x$ , then the set of tuples in $\mathbb{C}^{A}$ that have a singular root contains a hypersurface in a small neighbourhood of $f$ . Indeed, the projection $\unicode[STIX]{x1D70B}$ of the incidence set $\{(\tilde{x},\tilde{f})\mid \tilde{f}(\tilde{x})=0\}\subset (\mathbb{C}\setminus 0)^{n}\times \mathbb{C}^{A}$ to $\mathbb{C}^{A}$ has the critical set $C$ of dimension one less than $\mathbb{C}^{A}$ . Since $x$ is an isolated singular root of $f$ , the fibres of the projection $\unicode[STIX]{x1D70B}:C\rightarrow \mathbb{C}^{A}$ near $(x,f)$ are finite, thus the image of $C$ contains a hypersurface passing through $f$ . Thus, according to the naive version of the definition of the discriminant (see Definition 3.12), $D_{A}$ contains a non-empty hypersurface.
To prove the statement in the other direction, recall that $f_{\{1,\ldots ,k\}}$ is a homogeneous polynomial in the variables $\unicode[STIX]{x1D706}_{1},\ldots ,\unicode[STIX]{x1D706}_{k}$ , so the equation $f_{\{1,\ldots ,k\}}=0$ defines a subset in $\mathbb{C}\mathbb{P}^{k-1}\,\times \,(\mathbb{C}\setminus 0)^{n}$ . Denote the image of the torus $(\mathbb{C}\setminus 0)^{k}$ under the projection $\mathbb{C}^{k}\rightarrow \mathbb{C}\mathbb{P}^{k-1}$ by $T$ .
In this notation, if the tuple $A$ is dual effective, then so is $A_{\{1,\ldots ,k\}}$ , then, by Theorem 3.8, a generic polynomial $f_{\{1,\ldots ,k\}}$ in it has a unique (and thus isolated) singular root in $T\times (\mathbb{C}\setminus 0)^{n}$ , then so does the tuple $f\in \mathbb{C}^{A}$ .◻
Proof of Theorem 1.20
For a finite set $A\subset \mathbb{Z}^{n}$ , let $\mathbb{C}\mathbb{P}^{A}$ be the projective space with the homogeneous coordinates $z_{a},\,~a\in A$ , and let $m=m_{A}:(\mathbb{C}\setminus 0)^{n}\rightarrow \mathbb{C}\mathbb{P}^{A}$ be the monomial map such that $m_{A}(x)$ has coordinates $z_{a}=x^{a}$ .
Definition 3.19. The $A$ -image of an algebraic set $V\subset (\mathbb{C}\setminus 0)^{n}$ is the image of $m_{A}(V)$ in $\mathbb{C}\mathbb{P}^{A}$ .
Remark 3.20. The $A$ -image is usually not closed. In what follows, whenever we discuss its degree and irreducibility, we refer to the corresponding properties of its closure. On the other hand, its projectively dual set is defined as the set of all tangent hyperplanes to its smooth points, and is usually also not closed.
Theorem 3.21. Let $A=(A_{1},\ldots ,A_{k})$ and $A^{\prime }=(A_{2},\ldots ,A_{k})$ be tuples of finite sets in $\mathbb{Z}^{n},~n\geqslant k$ , and let $M$ be the $A_{1}$ -image of the complete intersection $f=0$ for a generic tuple of polynomials $f=(f_{2},\ldots ,f_{k})\in \mathbb{C}^{A^{\prime }}$ .
(1) If $A$ is irreducible, then $f=0$ and $M$ are irreducible.
(2) If, besides being irreducible, $A$ is reduced, then $M$ is also reduced (in the sense that the map $m=m_{A_{1}}:\{f=0\}\rightarrow M$ has degree $1$ ).
(3) Assume that $A$ is reduced and irreducible. Then the degree of $M$ is greater than $1$ unless the sets $A_{1},\ldots ,A_{k}$ can be shifted to the same lattice simplex of lattice volume $1$ .
(4) Assume that $A$ is reduced and irreducible. Then $A$ is dual defective if and only if $M$ is dual defective (i.e. its projectively dual set has codimension greater than $1$ ). Moreover, if $A$ and $M$ are dual effective, then a generic tuple of polynomials in the discriminant $D_{A}$ has a unique singular root, and this root is non-degenerate.
Remark 3.22. (1) We shall apply this lemma for $k=n$ , in which case by the non-degenerate singular root we mean just the root of multiplicity 2. However, part (4) makes sense for arbitrary $k\leqslant n$ . In this case a root $x$ of a system of equations $g=0$ is said to be singular non-degenerate, if $g=0$ defines an isolated singularity of a complete intersection in a neighbourhood of $x$ , and its Milnor number equals 1 (see [Reference LooijengaLoo84]).
(2) Part (1) for $k<n$ actually occurs and will be proved under a strictly weaker assumption that we call coirreducibility (cf. Definition 1.3): no $m$ sets of the tuple $A^{\prime }$ can be shifted to the same $m$ -dimensional sublattice.
Proof. If $A$ is irreducible and $k\leqslant n$ , then the tuple $A^{\prime }$ is coirreducible (in the sense of Remark 3.22(2)), so part (1) follows from [Reference KhovanskiiKho16] for $f=0$ and thus also for $M$ .
We shall now assume without loss of generality that $0\in A_{1}$ , because all properties of $A$ mentioned in the statement are invariant under parallel translations. For every linear form $l$ on $\mathbb{C}\mathbb{P}^{A_{1}}$ , denote the rational function $l/z_{0}$ by $\tilde{l}$ . In this notation, assigning the function $f_{1}(x)=\tilde{l}(m(x))$ to a form $l$ (or, in coordinates, assigning the polynomial $f_{1}(x)=\sum _{a\in A_{1}}c_{a}x^{a}$ to the form $l(z)=\sum _{a\in A_{1}}c_{a}z_{a}$ ), we establish an isomorphism between $\mathbb{C}^{A_{1}}$ and the space of linear forms on $\mathbb{C}\mathbb{P}^{A_{1}}$ .
Assume towards a contradiction that part (2) does not hold. Then, for generic linear forms $l_{1},\ldots ,l_{n-k+2}$ such that the plane $l_{\bullet }=0$ intersects $M$ , an intersection point would have more than one preimage in $f=0$ , that is, a generic tuple of polynomials
supported at the tuple
would have more than one common root. This would contradict Theorem 3.3, because irreducibility of $A$ implies irreducibility of $B$ .
In the setting of part (3), we may assume without loss of generality by Theorem 2.5 that the tuple
has mixed volume greater than 1, because it is reduced and irreducible. Then the degree of $M$ is greater than 1, because it equals the number of intersections of $V$ with a generic plane $l_{1}=\cdots =l_{n-k+1}=0$ , that is, the number of common roots of a generic tuple of polynomials $(\tilde{l}_{1}(m(\cdot )),\ldots ,\tilde{l}_{n-k+1}(m(\cdot )),f_{2},\ldots ,f_{k})\in \mathbb{C}^{B^{\prime }}$ , which equals the mixed volume of $B^{\prime }$ by the Kouchnirenko–Bernstein theorem.
It remains to prove part (4). If $M$ is dual effective, then the hyperplane $l=0$ , corresponding to a smooth point of the projectively dual variety, is tangent to $M$ at a unique point $z$ , and the tangency is non-degenerate (in the sense that the restriction of $\tilde{l}$ to $M$ has the non-degenerate Hessian at $z$ ). Then the restriction of the polynomial $f_{1}(x)=\tilde{l}(m(x))$ to the complete intersection $f=(f_{2},\ldots ,f_{k})=0$ has a unique and non-degenerate singular root, then the resulting tuple $(f_{1},f_{2},\ldots ,f_{k})$ has a unique and non-degenerate singular root. By Lemma 3.18, this implies that $A$ is dual effective. The other direction is proved in the same way.◻
Corollary 3.23 (Theorem 1.20 refined).
A reduced irreducible tuple of sets $A=(A_{1},\ldots ,A_{n})$ in $\mathbb{Z}^{n}$ is dual effective unless, upon an automorphism of the lattice, all of its sets can be shifted to the standard simplex. Moreover, in this case a generic tuple $f\in D_{A}$ has a unique multiple root, and this root has multiplicity $2$ .
Proof. By Theorem 3.21(1)–(3), the closure of $M$ is a reduced irreducible curve of degree greater than 1. Since every such curve is dual effective, the sought statement follows from Theorem 3.21(4).◻
Remark 3.24. Excluding the notion of the projectively dual variety from this reasoning, we can describe more explicitly the picture in $\mathbb{C}\mathbb{P}^{A_{1}}$ corresponding to a minimally degenerate system of equations as follows. Taking a generic tuple $(f_{2},\ldots ,f_{n})\in \mathbb{C}^{A^{\prime }}$ , the curve
is reduced, irreducible and not a line. Thus, a generic tangent hyperplane $\sum _{a\in A_{1}}c_{a}z_{a}=0$ to $M$ has a simple tangency and is transversal to $M$ at the other intersection points. Then the system of equations $\sum _{a\in A_{1}}c_{a}x^{a}=f_{2}(x)=\cdots =f_{n}(x)=0$ has one root of multiplicity 2, and the other roots are of multiplicity 1.
Proof of Theorem 1.22
We first need an explicit construction of the exceptional set $B_{A}$ in the Kouchnirenko–Bernstein Theorem 1.1.
The restriction of a linear function $v:\mathbb{R}^{n}\rightarrow \mathbb{R}$ to a finite set $A\subset \mathbb{Z}^{n}$ takes its maximal value at certain points of $A$ . The set of all such points will be denoted by $A^{v}$ . For a tuple $A=(A_{1},\ldots ,A_{k})$ , denote the tuple $(A_{1}^{v},\ldots ,A_{k}^{v})$ by $A^{v}$ , and the naive discriminant of $A^{v}$ (see Definition 3.12) by $D_{v}$ . We shall consider $D_{v}$ as a subset of $\mathbb{C}^{A}$ in the sense of Remark 3.10. The set
is algebraic, because there are only finitely many distinct algebraic sets among $D_{v},~v\in \mathbb{Z}^{n}$ . More specifically, write $u\sim v$ if $A^{u}=A^{v}$ ; then this equivalence relation splits $\mathbb{R}^{n}$ into finitely many relatively open polyhedral cones. These cones form a fan $\unicode[STIX]{x1D6F4}$ (see, for example, [Reference FultonFul93]), and $A^{v}$ and $D_{v}$ depend only on the cone $C\in \unicode[STIX]{x1D6F4}$ containing $v$ . So we shall also denote $A^{v}$ and $D_{v}$ by $A^{C}$ and $D_{C}$ , respectively.
We claim that the set $B$ can be taken as the exceptional set $B_{A}$ in Theorem 1.1 in the following strong sense. Denote the incidence set
by $E$ and its projection to $\mathbb{C}^{A}$ by $\unicode[STIX]{x1D70B}$ .
Theorem 3.25 (Theorem 1.22 refined).
Let the tuple $A=(A_{1},\ldots ,A_{n})$ , the set $B$ and the projection $\unicode[STIX]{x1D70B}$ be as above with $k=n$ .
(1) The projection $\unicode[STIX]{x1D70B}$ is a covering outside the set $B$ . In particular, every $f\in \mathbb{C}^{A}\setminus B$ has exactly $\operatorname{MV}(A)$ roots, and the group $G_{A}$ is the monodromy group of this covering.
(2) If $A$ is reduced and dual effective, then, for a generic $f\in D_{A}$ :
– the system $f=0$ has a unique singular root $x\in (\mathbb{C}\setminus 0)^{n}$ , and its multiplicity is $2$ ;
– we have $f\notin D_{v}$ for every non-zero $v:\mathbb{R}^{n}\rightarrow \mathbb{R}$ .
(3) For such $f$ , let $F:(\mathbb{C},0)\rightarrow (\mathbb{C}^{A},f)$ be a germ of a smooth curve transversal to $D_{A}$ . Then the monodromy of the covering from part (1) along the loop $F(\unicode[STIX]{x1D700}\exp (2\unicode[STIX]{x1D70B}it))$ for small $\unicode[STIX]{x1D700}>0$ is a transposition.
Remark 3.26. Instead of assuming dual effectiveness in part (2), it is enough to assume that the Cayley configuration $A_{\{1,\ldots ,n\}}$ is dual effective, and then a small loop around the Cayley discriminant $D_{A_{\{1,\ldots ,n\}}}$ still gives a transposition; see Theorem 3.27 below for this and some other generalizations.
Proof of part (2).
The first statement follows from Corollary 3.23, the second one from Corollary 3.11 applied to the Cayley discriminant $D_{A_{\{1,\ldots ,n\}}}$ .◻
Proof of parts (1) and (3).
Choose a unimodular simplicial fan $\unicode[STIX]{x1D6F4}^{\prime }$ , subdividing $\unicode[STIX]{x1D6F4}$ (see [Reference Kempf, Knudsen, Mumford and Saint-DonatKKMS73] for its existence), and consider the corresponding smooth toric variety $X\supset (\mathbb{C}\setminus 0)^{n}$ . Every cone $C\in \unicode[STIX]{x1D6F4}^{\prime }$ corresponds to an orbit $O_{C}\subset X$ , and, for $C\neq \{0\}$ , the closure of the incidence set $E$ in $X\times C^{A}$ contains a point of the form $(x,f)\in O_{C}\times \mathbb{C}^{A}$ only if $f\in D_{C}$ . In particular, if $f\notin D_{v}$ for every non-zero $v:\mathbb{R}^{n}\rightarrow \mathbb{R}$ , then, for a small neighbourhood $U\ni f$ , its preimage $V=\unicode[STIX]{x1D70B}^{-1}(U)$ is disjoint from the orbits $O_{C},\,C\neq \{0\}$ , that is, the restriction $\unicode[STIX]{x1D70B}:V\rightarrow U$ is proper. Now consider two cases, corresponding to the setting of part (1) and part (3), respectively: $f\notin D_{A}$ and $f\in D_{A}$ .
If $f\notin D_{A}$ , then the restriction $\unicode[STIX]{x1D70B}:V\rightarrow U$ also has no critical points (this claim makes sense, because $E$ is smooth), so it is a trivial covering, and part (1) is proved.
If $f\in D_{A}$ has a unique multiple root $x$ , and this root has multiplicity 2, then the local degrees of $\unicode[STIX]{x1D70B}$ at the point $(x,f)$ and at the other points of the fibre $\unicode[STIX]{x1D70B}^{-1}(f)$ equal 2 and 1, respectively. Thus $\unicode[STIX]{x1D70B}$ has an ${\mathcal{A}}_{1}$ singularity at $(x,f)$ and no singularities at other points of the fibre $\unicode[STIX]{x1D70B}^{-1}(f)$ , that is, $\unicode[STIX]{x1D70B}(z_{1},z_{2},\ldots ,z_{N})=(z_{1}^{2},z_{2},\ldots ,z_{N})$ in suitable local coordinates $(z_{1},\ldots ,z_{N})$ on $T$ near $(x,f)$ . In particular, the monodromy along a small loop around the origin in the complex line $z_{2}=\cdots =z_{N}=0$ is a transposition.◻
Monodromy of non-square systems of equations
We outline a generalization of Theorem 3.25 to some reducible tuples $A$ and to the case $k<n$ in order to clarify what happens in examples similar to Example 1.16 and what could be a natural counterpart of the topic of this paper for non-square systems of equations.
Theorem 3.27. Let $A=(A_{1},\ldots ,A_{k})$ , $B$ and $\unicode[STIX]{x1D70B}$ be as above with arbitrary $k\leqslant n$ .
(1) The projection $\unicode[STIX]{x1D70B}$ in a locally trivial fibration outside the set $B$ . Moreover, $B$ is the minimal closed set with this property. In particular, every loop in the complement to $B$ gives rise to the monodromy automorphism in the cohomology $H$ of the fibre of this fibration.
(2) The set $B$ is a hypersurface unless $m+2$ of the sets in the tuple $A$ can be shifted to the same $m$ -dimensional plane (in which case the aforementioned fibration is empty).
(3) For a reduced tuple $A$ , whose Cayley discriminant (Definition 3.12) is a hypersurface, and for a generic $f$ in the Cayley discriminant:
– the system $f=0$ has a unique singular root $x\in (\mathbb{C}\setminus 0)^{n}$ , and this singular root is non-degenerate;
– we have $f\notin D_{v}$ for every non-zero $v:\mathbb{R}^{n}\rightarrow \mathbb{R}$ .
(4) For such $f$ , let $F:(\mathbb{C},0)\rightarrow (\mathbb{C}^{A},f)$ be a germ of a smooth curve transversal to $D_{A}$ . Then the $\unicode[STIX]{x1D701}$ -function of the monodromy transformation from part (1), corresponding to the loop $F(\unicode[STIX]{x1D700}\exp (2\unicode[STIX]{x1D70B}it))$ for small $\unicode[STIX]{x1D700}>0$ , has the form $t^{2}-1$ .
Parts (1) and (2) follow from [Reference EsterovEst13, Theorems 1.1 and 1.4]. The first statement of part (3) follows from the fact that the Cayley discriminant is a component of multiplicity 1 in the Euler discriminant $E_{A}$ ; see [Reference EsterovEst13, Proposition 1.11]. (This works in particular for $k=n$ , but we preferred to give a more straightforward proof of Theorem 3.25 in that case.) The rest is proved in the same way as for $k=n$ in Lemma 3.25.
Remark 3.28. In particular, the correspondence from Theorem 3.27(1) maps the fundamental group of the complement of $B$ to the group $\operatorname{GL}(H)$ . The image $G_{A}$ is the monodromy group of the (non-square) system of equations supported at the tuple $A$ . The results of the present paper give some hope that $G_{A}$ can be quite explicitly described in terms of $A$ at least for reduced irreducible $A$ . This important study has been recently initiated in the simplest non-square case, $(k,n)=(1,2)$ ; see [Reference Cretois and LangCL18, Reference Cretois and LangCL17, Reference SalterSal17].
4 Double transitivity of monodromy
Consider a morphism $\unicode[STIX]{x1D70B}$ of an algebraic set $E$ to an irreducible algebraic set $C$ as an abstract enumerative problem: regard a point $z\in C$ as an incidence condition, and the points of its fibre $\unicode[STIX]{x1D70B}^{-1}(z)$ as the solutions of the enumerative problem with a given incidence condition. The enumerative problem is said to be well posed if its generic fibre is finite. In this case, there exists a Zariski open set $U\subset C$ such that $\unicode[STIX]{x1D70B}$ is a covering over $U$ . The monodromy group of this covering does not depend on the choice of $U$ and is called the monodromy group of the enumerative problem.
Example 4.1. The enumerative problem of the present paper falls into this scheme, if we define
and denote the projection of $E$ to $C=\mathbb{C}^{A}$ by $\unicode[STIX]{x1D70B}$ . For every tuple $A=(A_{1},\ldots ,A_{n})$ , it is well posed by Theorem 3.25(1).
Let us recall a classically known geometric criterion for the double transitivity of the monodromy of the abstract enumerative problem $\unicode[STIX]{x1D70B}:E\rightarrow C$ . Although its versions can be found in [Reference Sottile and WhiteSW15] and other relevant works, we shall recall the proof to keep the story self-contained. Consider the fibre square
and its projection $\unicode[STIX]{x1D70B}_{2}:E_{2}\rightarrow C$ , sending $(x,y)$ to $\unicode[STIX]{x1D70B}(x)=\unicode[STIX]{x1D70B}(y)\in C$ . If the enumerative problem is well posed, that is, for a certain Zariski open $U\subset C$ , its preimage $V=\unicode[STIX]{x1D70B}^{-1}(U)$ defines a covering $\unicode[STIX]{x1D70B}:V\rightarrow U$ , then the fibre square $V_{2}=\unicode[STIX]{x1D70B}_{2}^{-1}(U)$ also defines a covering $\unicode[STIX]{x1D70B}_{2}:V_{2}\rightarrow U$ . Note that the diagonal $D=\{(x,x)\mid x\in U\}\subset V_{2}$ is an irreducible component of $V_{2}$ .
Theorem 4.2. The monodromy of the well-posed enumerative problem $\unicode[STIX]{x1D70B}:E\rightarrow C$ is doubly transitive if and only if $V_{2}$ has at most one irreducible component different from $D$ .
Proof. If $D=V_{2}$ , then the monodromy is trivial. Otherwise, let $F$ be the second component of $V_{2}$ . In order to prove the double transitivity, we should take two pairs of distinct points $(x,y)$ and $(x^{\prime },y^{\prime })$ in the fibre $\unicode[STIX]{x1D70B}^{-1}(z)$ of a point $z\in U$ and construct a loop in $U$ such that the monodromy along this loop sends $x$ to $x^{\prime }$ and $y$ to $y^{\prime }$ . Since neither $(x,y)$ nor $(x^{\prime },y^{\prime })$ is contained in $D$ , both of them are contained in $F$ . Since $F$ is irreducible, these two points can be connected with a path $\unicode[STIX]{x1D6FE}$ . Then $\unicode[STIX]{x1D70B}_{2}(\unicode[STIX]{x1D6FE})$ is the loop sought.◻
Corollary 4.3. Let $\unicode[STIX]{x1D70B}:E\rightarrow C$ be a well-posed enumerative problem. If at most one irreducible component of $E_{2}$ besides the diagonal $D$ has the same dimension as $D$ , then the monodromy is doubly transitive.
Proof of Theorem 1.23.
The idea is to apply Corollary 4.3 to the setting of Example 4.1. In this case we have
In order to prove that $G_{A}$ is doubly transitive, it is enough to prove that $E_{2}$ has at most one more irreducible component $F$ of dimension $N=\dim \mathbb{C}^{A}$ . We shall prove it by counting the dimension of fibres of the projection $p:E_{2}\rightarrow S$ . Every such fibre is a vector subspace of $\mathbb{C}^{A}$ , but different fibres may have different dimension. Namely, assuming for convenience without loss of generality that every $A_{i}$ contains 0, the fibre $p^{-1}(x,y)$ is given in $\mathbb{C}^{A}$ by
independent linear equations, where $d_{x,y}$ is the number of $A_{i}$ such that $x^{a}=y^{a}$ for all $a\in A_{i}$ . Indeed, since $0\in A_{i}$ , the linear equations $f_{i}(x)=f_{i}(y)=0$ on the element $f=(f_{1},\ldots ,f_{i},\ldots ,f_{n})\in \mathbb{C}^{A}$ are dependent if and only if they coincide and if and only if $x^{a}=y^{a}$ for all $a\in A_{i}$ , so (*) follows.
This implies that $\dim p^{-1}(x,y)$ is the same for all $(x,y)$ in the set $U_{L}$ , defined as follows:
Namely, if $(x,y)\in U_{L}$ , then, by (*), the fibre $p^{-1}(x,y)$ is given in $\mathbb{C}^{A}$ by
independent linear equations, where $d_{L}$ is the number of $A_{i}$ contained in $L$ .
Therefore, denoting the preimage of $U_{L}$ in $E_{2}$ by $E_{L}$ , we conclude by (**) that $p:E_{L}\rightarrow U_{L}$ is a vector bundle of rank $N-2n+d_{L},\,N=\dim \mathbb{C}^{A}$ . Moreover, since $\dim U_{L}=2n-\dim L$ , we conclude that $\dim E_{L}=N+d_{L}-\dim L$ .
Since the tuple $A$ is reduced and irreducible, we have
unless $L=\mathbb{Z}^{n}$ , or $L$ contains no $A_{i}$ at all. In the latter cases, $E_{L}$ equals the diagonal $D\subset E_{2}$ or one more $N$ -dimensional subset $F\subset E_{2}$ (independent of $L$ ), respectively. Since $E_{2}$ is covered by $E_{L}$ as $L$ runs over all sublattices, we have proved that it has two $N$ -dimensional components, so that Corollary 4.3 applies.◻
Acknowledgements
I am grateful to Christopher Borger and Benjamin Nill, whose proof [Reference Borger and NillBN18] of a conjecture from [Reference Cattani, Cueto, Dickenstein, Di Rocco and SturmfelsCCDDS13] contributed to working out the present approach to the conjecture from [Reference Esterov and GusevEG16], and to Yuri Burman and the referee for valuable remarks.