1 Introduction
Russell and Whitehead (henceforth, R&W) adopted a conception of the natural numbers akin to the following:Footnote 1
-
Numbers as Cardinalities ( NaC ): The natural numbers are the finite cardinalities.
A property of properties is a finite cardinality just in case it is the property of having (exactly) n instances, for some natural number n. Thus, according to NaC, the natural numbers are properties of a special kind, rather than individuals. Through NaC and related theses R&W [Reference Whitehead and Russell80] offered characterizations of the arithmetical primitives in their Ramified Type Theory ( $\mathtt {RTT}$ ) and famously interpreted Second-Order Peano Arithmetic $(\mathtt {PA}_{\mathtt {2}})$ in it.Footnote 2 Since ${\mathtt {RTT}}$ ’s primitives all seem to have some claim to being purely logical notions, as they consist of Boolean connectives and quantifiers, all typically regarded as logical, R&W’s result promised to establish the logical truth of all of $\mathtt {PA}_{\mathtt {2}}$ ’s theorems, and that arithmetic is a part of logic.Footnote 3
R&W’s interpretation of $\mathtt {PA}_{\mathtt {2}}$ in $\mathtt {RTT}$ essentially appealed to the following principle:Footnote 4
-
Axiom of Infinity ( InfAx ): There are infinitely many individuals.
So, their result shows that $\mathtt {PA}_{\mathtt {2}}$ ’s theorems are all logically true only if InfAx is itself logically true. Yet, arguably, and as Russell [Reference Russell61, p. 141] himself acknowledged, InfAx is neither necessarily true nor a priori knowable.Footnote 5 Hence, on common conceptions of logical truths as necessary (or a priori knowable), InfAx arguably fails to count as logically true. For this reason it is nowadays thought that R&W’s type-theoretic interpretation of $\mathtt {PA}_{\mathtt {2}}$ has failed to establish arithmetic’s logicality.
Here, my main aim is to present and partially defend finitary upper arithmetical logicism (upper), a novel form of logicism about arithmetic. In a nutshell, according to upper arithmetical entities and relations are all higher-type entities—and, in particular, NaC is true—and arithmetic is reducible to higher-order modal logic under a higher-orderist interpretation (I’ll explain what is meant by ‘higher-orderist interpretation’ in Section 2). This is why upper is ‘upper.’ A further essential commitment of upper is to the necessary truth (and, indeed, the logicality) of the following thesis:
-
Finitary plenitude (FP) : All finite cardinalities of properties of individuals could have been instantiated.
Finitary plenitude is a principle of plenitude in the sense that it is directly concerned with what is possible.Footnote 6 Notably, upper is ‘finitary’ in that it is not committed to InfAx, or even to its possible truth. So, even if upper is itself committed to NaC, it is significantly different from R&W’s original approach. A further difference is that upper makes essential use of modal resources, which were absent from R&W’s theory. These, it will be argued in Section 2, go naturally with a higher-orderist interpretation of type theory.
The partial defense of upper here offered will be premised on the following results:
-
[Int] $\mathtt {PA}_{\mathtt {2}}$ is interpretable in the very weak system $\mathtt {KQL}_{\mathtt {C}}^{\mathtt {FP}}$ —an extension of the modal logic $\mathtt {K}$ with a negative free type theory, a contingentist-friendly comprehension principle, and principle FP.
-
[Nec] It is provable in $\mathtt {S5PQL}_{\mathtt {C}}$ —a plural as well as modal extension of $\mathtt {KQL}_{\mathtt {C}}^{\mathtt {FP}}$ (with the axioms of the modal logic $\mathtt {S5}$ ), but without FP—that the truths of pure arithmetic are all metaphysically necessary.
Owing to FP’s platitudinous character, which contrasts with InfAx’s controversiality, [Int] in particular constitutes a significant improvement on R&W’s interpretability result. It promises to deliver a sound and NaC-based logicist foundation for $\mathtt {PA}_{\mathtt {2}}$ .
Upper’s core theses are presented in Section 2. In Section 3 are characterized the languages and deductive systems of relevance for the remainder of the paper. The proofs of [Int] and [Nec] are outlined in, respectively, Sections 4 and 5. In Section 6 is defended the metaphysical necessity of both FP and all of $\mathtt {S5PQL}_{\mathtt {SA,C}}$ ’s theorems—an extension of $\mathtt {S5PQL}_{\mathtt {C}}$ with instances of serious actualism, the thesis that standing in a relation implies being something.Footnote 7 In Section 7 upper is briefly compared to prima facie related philosophies of arithmetic—specifically, Hellman’s [Reference Hellman31] modal structuralism, Hodes’s [Reference Hodes32, Reference Hodes33] alternative theory, Roeper’s [Reference Roeper60] vindication of logicism, and Linnebo and Shapiro’s [Reference Linnebo and Shapiro46] arithmetical potentialism—and Section 8 concludes.
[Int] is proven in Appendix A, the necessity of $\mathrm {FP}$ in Appendix B, and [Nec] in Appendix C. In Appendix D is shown that, in the deductive systems adopted in the paper, the rules of necessitation and universal generalization can be dispensed with if the necessitation and the universal generalization of any axiom is itself an axiom. In Appendix E is shown that generation, a watered-down version of Lewis’s [Reference Lewis43] principle of recombination, implies FP, and some results concerning the relationship between InfAx and FP established. And in Appendix F is characterized a model theory fitting upper, and a structure is presented which witnesses the joint consistency of the system $\mathtt {S5PQL}_{\mathtt {SA,C}}$ (the result of adding serious actualism to $\mathtt {S5PQL}_{\mathtt {C}}$ ) with FP, contingentism at all orders, and the necessary falsehood of InfAx.
2 Finitary upper arithmetical logicism
InfAx’s sole role in R&W interpretation of $\mathtt {PA}_{\mathtt {2}}$ is to ensure SIC’s derivability in $\mathtt {RTT}$ :
-
Successor is cardinality injective (SIC): Distinct finite cardinalities have distinct successors.
Here, the successor of a property of properties n is the property of properties $\mathbf {S}[n]$ of having exactly one more instance than any instance of property n. The importance of R&W’s result is that SIC is their interpretation of the following axiom of $\mathtt {PA}_{\mathtt {2}}$ :
-
Successor is injective (SI): Distinct natural numbers have distinct successors.
SIC is not derivable in (that system which results from removing from $\mathtt {RTT}$ ’s axioms). Further, and partly due to R&W’s commitment to the identity of coextensional properties, it is derivable in that SIC implies InfAx.
As it turns out, R&W’s interpretability result can be improved by abandoning the principle that coextensional properties are identical—after all, some coextensive properties could have failed to have been coextensive—and replacing InfAx by the weaker principle FP, which also guarantees SIC’s derivability. Since FP’s platitudinous character makes it reasonable to think that it is logically true, the improved interpretability result paves the way for a defense of upper.
In this section will be presented upper’s core theses. While not all of them will be defended in the paper, they are nonetheless relevant as they provide a fuller picture of the sort of logicist view that the results here presented can be seen as supporting. Two of upper’s component theses are the following:
-
Necessity of FP: FP is necessarily true.
-
Higher-orderism: Predicates have a semantic role distinct from those of individual and of plural terms. Quantification into predicate position is legitimate and irreducible to first-order singular or to plural quantification.
Some reasons in favor of necessity of FP will be given in Section 6. As to higher-orderism, it specifies upper’s intended interpretation of the type-theoretic language. Prima facie, if individual and plural terms had the same semantic role as that of predicates, then they would be substitutable for predicates in meaningful sentences without loss of meaningfulness. But they are not. For instance, ‘John runs’ is meaningful, whereas ‘John Mary’ is meaningless. Similarly, predicates are not substitutable for individual or plural terms without loss of meaningfulness. For instance, ‘John runs’ is meaningful, but ‘swims runs’ is meaningless.Footnote 8 Accordingly, from the standpoint of higher-orderists, while individual and plural terms refer (respectively, singularly and plurally), this is not the role of predicates. Following Trueman’s [Reference Trueman79, chaps. 2–7] gloss (see also [Reference Button and Trueman9, sec. 5.3]), the semantic role of predicates may be said to be that of ‘saying things’ of what they are predicated of. For instance, ‘swims’ says of any individual that it swims, and ‘everything’ says of anything said of individuals that it is had by all of them.Footnote 9
Moreover, quantification into predicate position is to be understood on its own terms—not as restricted first-order singular or plural quantification over individuals, sets, classes, etc. While individual variables take subject position, are bound by first-order quantifiers, and have a semantic role akin to that of individual terms, predicate variables take predicate position, are bound by higher-order quantifiers, and have a semantic role akin to that of predicates.
What has been offered here is nothing but a very brief gloss on the higher-orderist interpretation of typed languages. Indeed, from the standpoint of higher-orderists, an appropriate semantic theory for a typed language is to be formulated not in set theory (by assigning sets as the semantic values of predicates), but rather in a (more expressive) type-theoretic language, by assigning entities of distinct types as the semantic values of predicates of distinct types (see, e.g., [Reference Williamson81, pp. 453–454]).
According to higher-orderists natural language talk of ‘properties’ and ‘relations’ is to be understood with a grain of salt since expressions such as ‘the property of swimming’‘ and ‘the membership relation’ take subject rather than predicate position. For this reason, in this paper I engage in such talk merely for the sake of readability. Defenses of higher-orderism can be found in, among others, [Reference Frege23, Reference Prior55, Reference Rayo and Yablo59, Reference Wright, Leng, Paseau and Potter84], [Reference MacBride and Smith47, sec. 19.2.2], [Reference Williamson81, pp. 254–261], and [Reference Jones39, Reference Trueman79].Footnote 10 Here is not the place to further defend it.
The adoption of higher-orderism naturally leads to distinctive views on the interaction between modality and higher-order quantification. While sets and pluralities are commonly thought to (i) have their instances essentially, and (ii) be extensionally individuated, in general the values of higher-order variables have their instances inessentially and are not extensionally individuated.Footnote 11 Indeed, in Appendix F is offered a model theory for modal type theory on which, in general, the values of higher-order variables fail to satisfy (i) and (ii).
Since InfAx is derivable from the conjunction of FP with any of (i) and (ii) already in somewhat weak modal type-theoretic systems,Footnote 12 there would be no real gain in adopting FP rather than InfAx, unless it was thought that the values of higher-order variables did not satisfy (i) nor (ii). Higher-orderism is thus an important aspect of upper as it makes it reasonable to think that the values of higher type variables do not satisfy (i) nor (ii), and so that a commitment to FP does not collapse in a commitment to InfAx.
Upper’s third and fourth core theses are the following:
-
Logicality: The metaphysically necessary truths formulated in a pure modal and type-theoretic language are all logically true.
-
Higher-type arithmetical ontology: Natural numbers are finite cardinalities, and the Russellian characterization of arithmetic’s primitives—natural number, zero and successor—are all true.Footnote 13
Logicality is a joint consequence of (i) the popular view (see, e.g., [Reference Shapiro and Schirn64, Reference Shapiro and Shapiro65]) that the logical truths include the necessary truths formulated solely in terms of logical expressions; and (ii) that among the latter are all of pure modal type theory’s primitives. Notoriously, these include the necessity operator, which did not figure in the logicist theories of Frege, Whitehead, and Russell, and Hale and Wright.Footnote 14 Alas, since a defense of the metaphysical necessity operator’s logicality, and so of logicality, would require separate treatment, it will be left for another occasion.Footnote 15
Higher type arithmetical ontology’s conception of the arithmetical entities is directly connected to what has been called the applications constraint and, more specifically, to Frege’s constraint. Roughly, according to Frege’s constraint the canonical applications of mathematical entities to the characterization of the world must be contained within their nature.Footnote 16 Arguably, the application of properties and relations to the characterization of the world is contained within their nature. That is, part of what it is to be a property is to have application conditions—conditions under which entities instantiate or fail to instantiate it—and those conditions will be essential to the properties that they are application conditions of.Footnote 17
Now, conceiving of the natural numbers as properties of properties does not, on its own, imply the satisfaction of Frege’s constraint. For satisfaction of Frege’s constraint requires that the canonical applications of natural numbers be contained within the properties claimed to be natural numbers. Still, NaC accounts for the application of natural numbers to counting—arguably, their canonical application—and it does so in the most straightforward way. For the result of counting is the attribution of a cardinality property. So, it would appear that higher-type arithmetical ontology does imply the satisfaction of Frege’s constraint.Footnote 18
Relatedly, NaC accounts for our use of natural number expressions as determiners—as in, e.g., ‘Mars has two moons’—as these are commonly thought to consist of properties of (and relations between) properties. Now, a number of philosophers (perhaps most notably, Frege) have held that the use of number expressions in subject position requires viewing numbers as individuals rather than properties. Unsurprisingly, I reject that this is so, though a sustained defense of the view that natural numbers are properties rather than individuals lies beyond the scope of the present paper. Suffice it to say for the moment that proponents of the view that natural numbers are individuals likewise face the challenge of accounting for the use of natural number expression as determiners, and that NaC might afford the means of accounting for our natural language use of natural number expressions in subject position provided that this use is derived from their use as determiners—a view argued for in, e.g., [Reference Hofweber36, chap. 2].Footnote 19
Finally, the appeal to modal resources fits naturally with higher-type arithmetical ontology and the canonical application of the natural numbers to counting. For besides considering what finite cardinality a given property actually falls under, we also commonly consider those that it could have fallen under. For instance, it is not only true that Mars has two moons, but also that it could have had more.
Importantly, it will be a commitment of higher type arithmetical ontology that natural number-talk is ambiguous across types. For each type $\tau $ , there is a zero $_{\langle \langle \tau \rangle \rangle }$ which applies to properties of type $\tau $ -entities.Footnote 20 Mutatis mutandis for talk of ‘successor’ and ‘natural number’. Proponents of upper thus regard $\mathtt {PA}_{\mathtt {2}}$ as ambiguous between different theories $\mathtt {PA}_{\mathtt {2},\tau }$ , for each type $\tau $ , whose apparently first-order quantifiers in fact range over properties of properties of type $\tau $ entities.
This kind of type ambiguity fits naturally with higher-orderism. From a higher-orderist standpoint, talk of identity, equinumerosity,Footnote 21 and cardinality is itself ambiguous across types. To give just one example, there is not only a cardinality property of being instantiated by exactly three individuals, which applies to properties of individuals, but also, for each type, the property of being instantiated by exactly three entities of that type, a property which applies to properties applicable to entities of that type. The ambiguity of natural number talk is thus of a kind already familiar to higher-orderists. And it naturally follows, given higher-orderism, from a conception of natural numbers as finite cardinalities.
upper’s remaining core theses are the following:
-
Interpretability of Arithmetic: There is a deductive system formulated in a pure modal and type-theoretic language and including FP among its axioms whose theorems are necessary truths, and include the higher type arithmetical ontology-respecting translations of all theorems of $\mathtt {PA}_{\mathtt {2}}$ .Footnote 22
-
Necessity of Arithmetic: The purely arithmetical truths expressible in the language of $\mathtt {PA}_{\mathtt {2}}$ are all metaphysically necessary truths also expressible in a pure modal type theory.
Interpretability of arithmetic—alongside with higher type arithmetical ontology and logicality—establishes that every theorem of $\mathtt {PA}_{\mathtt {2},\tau }$ is logically true, for every type $\tau $ . This is a pleasing result from a logicist standpoint insofar as $\mathtt {PA}_{\mathtt {2}}$ is a canonical second-order system for arithmetic.
Now, Gödel’s incompleteness theorems appear to pose an important challenge to any logicist project as they establish that no (effectively axiomatizable and consistent) deductive system contains as theorems all arithmetical truths expressible in the language of $\mathtt {PA}_{\mathtt {2}}$ . As a consequence, interpretability of arithmetic, higher type arithmetical ontology and logicality fail to imply (on their own) that all arithmetical truths expressible in the language of $\mathtt {PA}_{\mathtt {2}}$ are logically true.
In my view, logicists should see Gödel’s incompleteness theorems as revealing not that some arithmetical truths are not logically true, but rather that no (effectively axiomatizable) deductive system contains as theorems all logical truths. For this reason, a successful logicist account of a mathematical field F will ideally involve not only an interpretability result showing how to derive a canonical (true) deductive theory of F by purely logical means, but also a sound argument for the logical truth of the logicist interpretation, in purely logical terms, of every truth of F. In the case of upper, this second component consists in a defense of necessity of arithmetic.Footnote 23 For necessity of arithmetic, higher-type arithmetical ontology and logicality do jointly imply that all arithmetical truths expressible in the language of $\mathtt {PA}_{\mathtt {2}}$ are logically true. Summing up, the following are all of upper’s core theses.
-
Upper: The conjunction of (i) necessity of FP; (ii) higher-orderism; (iii) logicality; (iv) higher-type arithmetical ontology; (v) interpretability of arithmetic; and (vi) necessity of arithmetic.
The partial defense here offered of upper will only be concerned with the theses of necessity of FP, interpretability of arithmetic and necessity of arithmetic. I hope to offer defenses of logicality and higher-type arithmetical ontology in another occasion.
3 Typed modal systems
The typed modal systems of interest in the remainder of the paper are formulated in either the language $\mathrm {MT}$ , upper’s official pure typed logical language, or else in $\mathrm {PMT}$ , an extension of $\mathrm {MT}$ with plural quantification. The set of types is the following:
Definition 1 (Types).
The set $\mathscr {T}$ of types is the smallest set such that e is a type, and, for every positive integer n, if $\tau ^1,\ldots ,\tau ^n$ are all types, then $\langle \tau ^1,\ldots ,\tau ^n\rangle $ is a type.
The variables of $\mathrm {PMT}$ come in two kinds: singular variables and plural variables.
Definition 2 (Variables).
For each type $\tau $ , $\mathrm {Var}^{s}_{\tau }$ , the set of singular type $\tau $ -variables, is the set $\{x^i_{\tau }:i\in \mathbb {N}\}$ , and $\mathrm {Var}^{p}_{\tau }$ , the set of plural type $\tau $ -variables, is the set $\{xx^i_{\tau }:i\in \mathbb {N}\}$ , for each type $\tau $ .
Terms and formulae of $\mathrm {PMT}$ are simultaneously defined as follows:
Definition 3 (Terms and formulae of $\mathrm {PMT}$ ).
For each type $\tau $ , the sets $\mathrm {PMT}^{s}_{\tau }$ and $\mathrm {PMT}_{\mathrm {wff}}$ of, respectively, singular type $\tau $ -terms and formulae of $\mathrm {PMT}$ , are the smallest sets such that:
-
– Atomic singular terms: (i) $\mathrm {Var}^{s}_{\tau }\subseteq \mathrm {PMT}^{s}_{\tau }$ ; (ii) $\ulcorner =_{\tau }\urcorner \in \mathrm {PMT}^{s}_{\tau }$ whenever $\tau $ is the type $\langle \sigma ,\sigma \rangle $ , for any type $\sigma $ .
-
– Formulae: For all types $\tau ^1,\ldots ,\tau ^n$ and $\sigma $ : (iii) if $\beta \in \mathrm {PMT}^{s}_{\langle \tau ^1,\ldots ,\tau ^n\rangle }$ and $\alpha ^i\in \mathrm {PMT}^{s}_{\tau ^i}$ , for each i such that $1\leq i\leq n$ , then $\ulcorner \beta (\alpha ^1,\ldots ,\alpha ^n)\urcorner \in \mathrm {PMT}_{\mathrm {wff}}$ ; (iv) if $\{\varphi ,\psi \}\subseteq \mathrm {PMT}_{\mathrm {wff}}$ , then $\{\ulcorner \neg \varphi \urcorner ,\ulcorner \varphi \wedge \psi \urcorner ,\ulcorner \Box \varphi \urcorner \}\subseteq \mathrm {PMT}_{\mathrm {wff}}$ ; (v) if $v\in \mathrm {Var}^{s}(\sigma )$ and $\varphi \in \mathrm {PMT}_{\mathrm {wff}}$ , then $\ulcorner \forall v\varphi \urcorner \in \mathrm {PMT}_{\mathrm {wff}}$ ; (vi) If $vv\in \mathrm {Var}^{p}_{\sigma }$ and $\varphi \in \mathrm {PMT}_{\mathrm {wff}}$ , then $\ulcorner \forall vv\varphi \urcorner \in \mathrm {PMT}_{\mathrm {wff}}$ ; (vii) if $\alpha \in \mathrm {PMT}^{s}_{\sigma }$ and $vv\in \mathrm {Var}^{p}_{\sigma }$ , then $\ulcorner \alpha \prec vv\urcorner \in \mathrm {PMT}_{\mathrm {wff}}$ ; (viii) if $vv,uu\in \mathrm {Var}^{p}_{\sigma }$ , then $\ulcorner vv=uu\urcorner \in \mathrm {PMT}_{\mathrm {wff}}$ .
-
– Complex predicates: (ix) If $\tau $ is the type $\langle \tau ^1,\ldots ,\tau ^n\rangle $ , $v^1\in \mathrm {Var}^{s}_{\tau ^1}, \ldots ,v^n\in \mathrm {Var}^{s}_{\tau ^n}$ are distinct variables, and $\varphi \in \mathrm {PMT}_{\mathrm {wff}}$ , then $\ulcorner \lambda v^1\ldots v^n\varphi \urcorner \in \mathrm {PMT}^{s}_{\tau }$ , for all types $\tau ^1,\ldots ,\tau ^n$ .
Definition 4 (Variables, terms and formulae of $\mathrm {MT}$ ).
For each type $\tau $ , $\mathrm {MT}$ ’s type $\tau $ -variables are $\mathrm {PMT}$ ’s singular type $\tau $ -variables, $\mathrm {MT}$ ’s type $\tau $ -terms are those type $\tau $ -terms of $\mathrm {PMT}$ in which no plural variables occur, and $\mathrm {MT}$ ’s formulae are those formulae of $\mathrm {PMT}$ in which no plural variables occur.
Loosely, an entity will be said to be of a given type when the terms that stand for entities of its kind are of that type. Type e-singular terms stand for individuals—entities of which it is nonsensical to say that they are instantiated as well as that they are not.Footnote 24 Roughly, type $\langle \tau ^1,\ldots ,\tau ^n\rangle $ -singular terms stand for relations between entities of, respectively, types $\tau ^1$ , $\ldots $ , $\tau ^n$ . Moreover, whereas a type $\tau $ -singular variable ranges over type $\tau $ -entities, a type $\tau $ -plural variable ranges over pluralities of type $\tau $ -entities.Footnote 25
The Boolean connectives ‘ $\neg $ ’ and ‘ $\wedge $ ’ are given their standard reading, ‘ $\Box $ ’ expresses metaphysical necessity, and each complex predicate $\ulcorner \lambda v^1\ldots v^n\varphi \urcorner $ stands for the type $\langle \tau ^1,\ldots ,\tau ^n\rangle $ relation of being $v^1$ , $\ldots $ , $v^n$ such that $\varphi $ . There are no complex terms standing for pluralities. Intuitively, $\ulcorner \alpha \prec vv\urcorner $ states that $\alpha $ is one of the $vv$ , and $\ulcorner vv=uu\urcorner $ states that the $vv$ are nothing but the $uu$ . The remaining Boolean connectives, the existential quantifier and the possibility operator are all contextually defined in the usual manner. Further, $\alpha \neq \beta :=\neg \alpha =\beta $ , and for each type $\tau $ and natural number n, $\left \langle \tau \right \rangle _n:=\underbrace {\langle \tau ,\ldots ,\tau \rangle }_{\times n}$ .
Note that ‘ $=$ ’ as it occurs in clause (viii) of Definition 3, is not to be confused with $\ulcorner =_{\langle \tau ,\tau \rangle }\urcorner $ , for any type $\tau $ . Rather, it is a syncategorematic expression. A different option would have been to have taken formulae of the form $\ulcorner vv=uu\urcorner $ to be abbreviations of, e.g., plural coextensiveness (i.e., $yy_{\tau }=zz_{\tau }:=\forall x_{\tau }(x\prec yy_{\tau }\leftrightarrow x\prec zz_{\tau })$ ), or plural coextensiveness and existence (i.e., $yy_{\tau }=zz_{\tau }:=\exists ww_{\tau }\forall x_{\tau }(x\prec yy_{\tau }\leftrightarrow x\prec ww)\wedge \forall x_{\tau }(x\prec yy_{\tau }\leftrightarrow x\prec zz_{\tau })$ ). None of these options has been pursued here in order not to presuppose any particular view on whether the identity of pluralities requires their existence, or whether it suffices that they be coextensive.
We will also make use of the following defined expressions:
Definition 5 (Existence).
For any type $\tau $ , type $\tau $ -singular term or plural variable $\zeta $ , and where u is the first type $\tau $ variable (singular if $\zeta $ is singular, plural if $\zeta $ is plural) that occurs nowhere in $\zeta $ : $E[\zeta ]:=\exists u(u=\zeta )$ .
Regimenting finitary plenitude requires defining what it is to be a finite cardinality. This and related notions are defined as follows (in a manner akin to Frege’s own definitions):
Definition 6. Where, for every type $\tau $ : (i) $\alpha $ is a type $\langle \tau \rangle $ -singular term; (ii) $\beta $ is a type $\tau $ -singular term; (iii) $\xi $ is a type $\langle \langle \tau \rangle \rangle $ -singular term; (iv) v is the first type $\langle \tau \rangle $ -singular variable that occurs nowhere free in $\alpha $ , $\beta $ or $\xi $ ; (v) w is the first type $\tau $ -singular variable that occurs nowhere free in $\xi $ ; (vi) $\gamma $ is a type $\langle \langle \langle \tau \rangle \rangle \rangle $ -singular term, (vii) r and t are, respectively, the first and the second type $\langle \langle \tau \rangle \rangle $ -singular variables that occur nowhere free in $\gamma $ :
(a) Nothing: $\mathbf {0}_{\langle \langle \tau \rangle \rangle }=_{\mathrm {df}}\lambda x_{\langle \tau \rangle }(\neg \exists y_{\tau }(x(y)))$ ;
(b) Minus: $\alpha -\beta :=\lambda v(\alpha (v)\wedge v\neq \beta )$ ;
(c) Successor of: $\mathbf {S}_{\langle \langle \langle \tau \rangle \rangle \rangle _2}[\xi ]:=\lambda v(\exists w(v(w)\wedge \xi (v-w))))$ ;
(d) Successor: $\mathbf {S}_{\langle \langle \langle \tau \rangle \rangle \rangle _2}=_{\mathrm {df}}\lambda x_{\langle \langle \tau \rangle \rangle }y_{\langle \langle \tau \rangle \rangle }(y=\textbf {S}_{\langle \langle \langle \tau \rangle \rangle ,\langle \langle \tau \rangle \rangle \rangle }[x])$ ;
(e) Hereditary: $Her[\gamma ]:=\forall r\forall t((\gamma (r)\wedge \textbf {S}(r,t))\rightarrow \gamma (t))$ ;
(f) Finite cardinality: $\mathbf {N}_{\langle \langle \langle \tau \rangle \rangle \rangle }=_{\mathrm {df}}\lambda x_{\langle \langle \tau \rangle \rangle }(\forall y_{\langle \langle \langle \tau \rangle \rangle \rangle }((y(\textbf {0})\wedge Her[y])\rightarrow y(x)))$ .
For each type $\tau $ , finitary plenitude $_{\tau }$ is then regimented as follows, where FP is just FPe :
Moreover, according to higher-type arithmetical ontology:
-
Arithmetical primitives: For every type $\tau $ :
-
(a) To be the natural number zero $_{\langle \langle \tau \rangle \rangle }$ just is to be $\mathbf {0}_{\langle \langle \tau \rangle \rangle }$ —i.e., the property of having no instances.
-
(b) To be the successor relation $_{\langle \langle \langle \tau \rangle \rangle \rangle _2}$ just is to be $\mathbf {S}_{\langle \langle \langle \tau \rangle \rangle \rangle _2}$ —i.e., the relation of being $x_{\langle \langle \tau \rangle \rangle }$ and $y_{\langle \langle \tau \rangle \rangle }$ such that $y_{\langle \langle \tau \rangle \rangle }=\textbf {S}_{\langle \langle \langle \tau \rangle \rangle \rangle _2}[x_{\langle \langle \tau \rangle \rangle }]$ , where $\mathbf {S}_{\langle \langle \langle \tau \rangle \rangle \rangle _2}[x_{\langle \langle \tau \rangle \rangle }]$ is the property of being a $z_{\langle \tau \rangle }$ such that some $u_{\tau }$ falls under $z_{\langle \tau \rangle }$ while the property of being a $z_{\langle \tau \rangle }$ distinct from $u_{\tau }$ falls under $x_{\langle \langle \tau \rangle \rangle }$ .
-
(c) To be the property of being a natural number $_{\langle \langle \langle \tau \rangle \rangle \rangle }$ just is to be $\mathbf {N}_{\langle \langle \langle \tau \rangle \rangle \rangle }$ —i.e., the property of having all of $\mathbf {0}_{\langle \langle \tau \rangle \rangle }$ ’s hereditary properties, where a property $p_{\langle \langle \langle \tau \rangle \rangle \rangle }$ is hereditary just in case $x_{\langle \langle \tau \rangle \rangle }$ has it only if $x_{\langle \langle \tau \rangle \rangle }$ ’s successor has it.
-
The base deductive systems of interest in the remainder of the paper are characterized in terms of the schemata and inference rules in Table 1.
${}^{\textrm {a}}\,$ Provided that v occurs free nowhere in $\varphi $ .
${}^{\textrm {b}}\,$ Where $\alpha $ is free for v in $\varphi $ and $\varphi ^{v}_{\alpha }$ is a formula that results from substituting $\alpha $ for all free occurrences of v in $\varphi $ .
${}^{\textrm {c}}$ Where $\varphi '$ is a formula which results from $\varphi $ by having $\gamma $ occur at some places where $\alpha $ occurs, re-lettering bound variables to ensure that no variables free in $\alpha =\gamma $ are bound in $\varphi $ or in $\varphi '$ .
${}^{\textrm {d}}$ Given a list $\mu ^1$ , $\ldots $ , $\mu ^n$ of all atomic terms or plural variables (except $\ulcorner =_{\langle \tau ,\tau \rangle }\urcorner $ , for all types $\tau $ ) free in $\beta $ , $E_{\beta }:=E[\mu ^1]\wedge \cdots \wedge E[\mu ^n]$ .
In Table 1 $\varphi $ and $\psi $ range over $\mathrm {PMT}$ -formulae. In addition, for every type $\tau $ , v ranges over singular as well as plural variables of type $\tau $ , $\alpha $ and $\gamma $ range over type $\tau $ -singular terms as well as over type $\tau $ -plural variables, u is a singular type $\tau $ -variable, and $uu$ and $tt$ are plural type $\tau $ -variables.
Furthermore, for all $n\in \mathbb {N}^{+}$ and i such that $1\leq i\leq n$ and every type $\tau ^i$ : (i) $\delta ^i$ ranges over type $\tau ^i$ -singular terms; (ii) $\overline {\delta }$ abbreviates the sequence $\delta ^1,\ldots ,\delta ^n$ ; (iii) $u^i$ ranges over type $\tau ^i$ -variables; (iv) $\overline {u}$ abbreviates the sequence $u^1\ldots u^n$ ; (v) $\beta $ ranges over type $\langle \tau ^1,\ldots ,\tau ^n\rangle $ -singular terms; and (vi) $\varphi ^{\overline {u}}_{\overline {\delta }}$ is the result of simultaneously substituting each free occurrence of $u^i$ in $\varphi $ by $\delta ^i$ , where each $\delta ^i$ is free for $u^i$ in $\varphi $ .
The present interest is in the deductive systems in Table 2 and their combinations characterized in Definition 7.
Definition 7 (Systems of modal type theory).
If $A\in Mod:=\{\mathtt {K},\mathtt {T},\mathtt {S5}\}$ , $B\in Qu:=\{\mathtt {QL},\mathtt {PQL}\}$ , then $AB$ is that system whose axioms and inference rules result from putting together, respectively, the axioms and inferences of A and B. If B is $\mathtt {QL}$ , then it and $AB$ are formulated in $\mathrm {MT}$ . If it is $\mathtt {PQL}$ then it and $AB$ are formulated in $\mathrm {PMT}$ . Further, each one of $B_{\mathtt {SA}}$ , $B_{\mathtt {C}}$ , $B_{\mathtt {SA,C}}$ , $AB_{\mathtt {SA}}$ , $AB_{\mathtt {C}}$ , $AB_{\mathtt {SA,C}}$ is the system that results from adding to B, or $AB$ , respectively, all instances of $\mathrm {SA}$ , all instances of $\mathrm {Comp}_{\mathrm {C}}$ , and all instances of both $\mathrm {SA}$ and $\mathrm {Comp}_{\mathrm {C}}$ .
Moreover, for each one of the defined deductive systems S, $\ulcorner \vdash _{S}\urcorner $ stands for derivability in S, ‘ $\vdash $ ’ stands for $\vdash _{\mathtt {KQL}_{\mathtt {C}}}$ , and $S^{\mathtt {FP}}$ is that system whose theorems are those formulae $\varphi $ such that . Finally, and as usual, for each system S, $\Gamma \vdash _{S}\varphi $ if and only if there are $\gamma ^1,\ldots ,\gamma ^n\in \Gamma $ such that $\vdash _{S}\gamma ^1\rightarrow (\gamma ^2\rightarrow (\ldots \rightarrow (\gamma ^n\rightarrow \varphi )\ldots ))$ .
FP does not figure as an extra axiom of any of the systems characterized in Definition 7 is so that its metaphysical necessity not be presupposed without argument—which it would otherwise be, owing to the presence of the $\mathrm {Nec}$ rule. While, I’ll later argue for necessity of FP, the system $\mathtt {KQL}_{\mathtt {C}}^{\mathtt {FP}}$ which will be shown to interpret $\mathtt {PA}_{\mathtt {2}}$ does not have FP’s metaphysical necessity as a theorem.
4 Interpretability of $\mathtt {PA}_{\mathtt {2}}$ in $\mathtt {KQL}_{\mathtt {C}}^{\mathtt {FP}}$
Recall that $\mathtt {PA}_{\mathtt {2}}$ is formulated in a second-order language (without $\lambda $ -terms), $\mathrm {PA}_{2}$ , whose only non-logical primitives are the individual constant ‘ $0$ ’, the unary predicate ‘N’ (‘is a natural number’) and the binary predicate ‘S’ (‘is succeeded by’). $\mathtt {PA}_{\mathtt {2}}$ will be shown to be interpretable in $\mathtt {KQL}_{\mathtt {C}}^{\mathtt {FP}}$ via Russellian translations—so called owing to their alignment with Russell and Whitehead’s take on arithmetic’s subject matter.
Definition 8 (Russellian translations).
For each type $\tau $ , the Russellian translation $_{\tau }$ from $\mathrm {PA}_{2}$ to $\mathrm {MT}$ , $(\cdot )^{\mathscr {R}}_{\tau }$ is defined as follows, where $\varphi $ and $\psi $ are $\mathrm {PA}_{2}$ -formulae, v is an individual or predicate variable of $\mathrm {PA}_{2}$ , $\ulcorner x^i\urcorner $ is $\mathrm {PA}_{2}$ ’s ith individual variable, $\ulcorner x^i_{\langle \langle \tau \rangle \rangle }\urcorner $ is $\mathrm {MT}$ ’s ith type $\langle \langle \tau \rangle \rangle $ -variable, $\ulcorner X^i_{n}\urcorner $ is $\mathrm {PA}_{2}$ ’s ith n-ary predicate variable, and $\ulcorner x^i_{\langle \langle \langle \tau \rangle \rangle \rangle _n}\urcorner $ is $\mathrm {MT}$ ’s ith type $\langle \langle \langle \tau \rangle \rangle \rangle _n$ -variable:
(a) $(\text {`}0\text {'})^{\mathscr {R}}_{\tau }=\ulcorner \textbf {0}_{\langle \langle \tau \rangle \rangle }\urcorner $ ;
(b) $(\text {`}N\text {'})^{\mathscr {R}}_{\tau }=\ulcorner \textbf {N}_{\langle \langle \langle \tau \rangle \rangle \rangle }\urcorner $ ;
(c) $(\text {`}S\text {'})^{\mathscr {R}}_{\tau }=\ulcorner \textbf {S}_{\langle \langle \langle \tau \rangle \rangle \rangle _2}\urcorner $ ;
(d) $(\text {`}=\text {'})^{\mathscr {R}}_{\tau }= \ulcorner =_{\langle \langle \langle \tau \rangle \rangle \rangle _2}\urcorner $ ;
(e) $(\ulcorner x^i\urcorner )^{\mathscr {R}}_{\tau }=\ulcorner x^i_{\langle \langle \tau \rangle \rangle }\urcorner $ ;
(f) $(\ulcorner X^{i}_n\urcorner )^{\mathscr {R}}_{\tau }=\ulcorner x^i_{\langle \langle \langle \tau \rangle \rangle \rangle _n}\urcorner $ ;
(g) $(\ulcorner \beta (\alpha ^1,\ldots ,\alpha ^n)\urcorner )^{\mathscr {R}}_{\tau }= \ulcorner (\beta )^{\mathscr {R}}_{\tau }((\alpha ^1)^{\mathscr {R}}_{\tau },\ldots ,(\alpha ^n)^{\mathscr {R}}_{\tau })\urcorner $ ;
(h) $(\ulcorner \neg \varphi \urcorner )^{\mathscr {R}}_{\tau }=\ulcorner \neg (\varphi )^{\mathscr {R}}_{\tau }\urcorner $ ;
(i) $(\ulcorner \varphi \wedge \psi \urcorner )^{\mathscr {R}}_{\tau }=\ulcorner (\varphi )^{\mathscr {R}}_{\tau }\wedge (\psi )^{\mathscr {R}}_{\tau }\urcorner $ ;
(j) $(\ulcorner \forall v\varphi \urcorner )^{\mathscr {R}}_{\tau }=\ulcorner \forall (v)^{\mathscr {R}}_{\tau }(\varphi )^{\mathscr {R}}_{\tau }\urcorner $ .
In Definition 8 is characterized a family of translations, one for each type, so as to reflect upper’s commitment to the systematic ambiguity across types of the arithmetical vocabulary. The following is the paper’s main result (the numbering of the theorems and lemmas stated in the main text reflect the order by which they are proven in Appendixes A–F):
Theorem 8 (Interpretability).
$\mathtt {KQL}_{\mathtt {C}}^{\mathtt {FP}}$ interprets $\mathtt {PA}_{\mathtt {2}}$ via the Russellian translation $(\cdot )^{\mathscr {R}}_{\tau }$ , for each type $\tau $ . More precisely, for every closed theorem $\varphi $ of $\mathtt {PA}_{\mathtt {2}}$ , $(\varphi )^{\mathscr {R}}_{\tau }$ is a theorem of $\mathtt {KQL}_{\mathtt {C}}^{\mathtt {FP}}$ , for every type $\tau $ .
In Definition 11 (Appendix A) is given an axiomatization of $\mathtt {PA}_{\mathtt {2}}$ . The proofs of the Russellian translations of all of $\mathtt {PA}_{\mathtt {2}}$ ’s axioms are routine (they will be given in Appendix A)—except for SIC $_{\tau }$ , the Russellian translation of successor is injective:
In the remainder of this section I’ll go through the main ideas involved in the proof of SIC $_{\tau }$ from $\mathrm {FP}$ . I leave the details for Appendix A, where a full proof of Theorem 8 can be found.
A crucial result appealed to in R&W’s proof of SIC $_{\tau }$ from InfAx is the equinumerosity lemma, according to which, for every natural number, if a property $y_{\langle \tau \rangle }$ has it, then any property $z_{\langle \tau \rangle }$ has it if and only if $y_{\langle \tau \rangle }$ and $z_{\langle \tau \rangle }$ are equinumerous. In the context $\mathtt {KQL}_{\mathtt {C}}$ it is possible to prove the following strengthening of this lemma—one of the main results on which the $\mathtt {KQL}^{\mathtt {FP}}_{\mathtt {C}}$ -derivation of relies:
Lemma 6 (Modal equinumerosity lemma).
$\vdash \forall x_{\langle \langle \tau \rangle \rangle }(\textbf {N}(x)\rightarrow \Box \forall y_{\langle \tau \rangle }(x(y)\rightarrow \forall z_{\langle \tau \rangle }(x(z)\leftrightarrow y\approx z)))$ .
Two other important results are the following:
Lemma 7 (Impossible sharing).
$\vdash \forall x_{\langle \langle \tau \rangle \rangle }\forall y_{\langle \langle \tau \rangle \rangle }((\textbf {N}(x)\wedge \textbf {N}(y)\wedge \Diamond \exists z_{\langle \tau \rangle }(x(z)\wedge y(z)))\rightarrow x=y)$ .
Lemma 8. $\vdash \mathrm {FP}_{\mathrm {e}}\rightarrow \mathrm {FP}_{\tau }$ .
Lemmas 6, 7 and 10 are proved in Appendix A. On their basis, can be derived in $\mathtt {KQL}^{\mathtt {FP}}_{\mathtt {C}}$ . To see this, assume, for arbitrarily existing $x_{\langle \langle \tau \rangle \rangle }$ and $y_{\langle \langle \tau \rangle \rangle }$ , that they are natural numbers, and that $z_{\langle \langle \tau \rangle \rangle }$ is a successor of both. Then, $z_{\langle \langle \tau \rangle \rangle }$ is a natural number, by the arithmetical axiom according to which the successor of a natural number is itself a natural number. So, $\mathrm {FP}_{\tau }$ implies that it could have been that $z_{\langle \langle \tau \rangle \rangle }$ was instantiated by some $w_{\langle \tau \rangle }$ . So, by the definition of successor, it could have been that there were $s_{\tau }$ and $t_{\tau }$ such that $s_{\tau }$ and $t_{\tau }$ both instantiated $w_{\langle \tau \rangle }$ , $w_{\langle \tau \rangle }-s_{\tau }$ (i.e., the property of being a $w_{\langle \tau \rangle }$ distinct from $s_{\tau }$ ) instantiated $x_{\langle \langle \tau \rangle \rangle }$ , and $w_{\langle \tau \rangle }-t_{\tau }$ (i.e., the property of being a $w_{\langle \tau \rangle }$ distinct from $t_{\tau }$ ) instantiated $y_{\langle \langle \tau \rangle \rangle }$ . In such a case, $w_{\langle \tau \rangle }-s_{\tau }$ would have been equinumerous with $w_{\langle \tau \rangle }-t_{\tau }$ . So it could have been that there were $P_{\langle \tau \rangle }$ and $Q_{\langle \tau \rangle }$ such that $P_{\langle \tau \rangle }$ instantiated $x_{\langle \langle \tau \rangle \rangle }$ , $Q_{\langle \tau \rangle }$ instantiated $y_{\langle \langle \tau \rangle \rangle }$ , and $P_{\langle \tau \rangle }$ was equinumerous with $Q_{\langle \tau \rangle }$ . So, by the modal equinumerosity lemma, it could have been that there was a $P_{\langle \tau \rangle }$ such that $P_{\langle \tau \rangle }$ instantiated both $x_{\langle \langle \tau \rangle \rangle }$ and $y_{\langle \langle \tau \rangle \rangle }$ . Hence, $x_{\langle \langle \tau \rangle \rangle }=y_{\langle \langle \tau \rangle \rangle }$ , by the impossible sharing lemma. So, if $\mathrm {FP}_{\tau }$ , then any natural numbers $x_{\langle \langle \tau \rangle \rangle }$ and $y_{\langle \langle \tau \rangle \rangle }$ having the same successor are identical. Hence, $\mathrm {SIC}_{\tau }$ , by Lemma 10.
5 Necessity in $\mathtt {S5PQL}_{\mathtt {C}}$ of the true purely arithmetical formulae of $\mathrm {MT}$
The paper’s second main result—that the truths of pure arithmetic are all metaphysically necessary—requires getting clearer on what formulae of the typed modal language $\mathrm {MT}$ count, from the standpoint of upper, as purely arithmetical.
A first suggestion is that the purely arithmetical formulae are those formulae of $\mathrm {MT}$ which are Russellian translations of formulae of $\mathtt {PA}_{\mathtt {2}}$ . Alas, this option is unsatisfactory. For instance, it implies that the Russellian translations of all formulae of pure second-order logic (formulae without any distinctively arithmetical vocabulary) express purely arithmetical facts. Among these is, for instance, a formula that intuitively expresses that (i) there is some type $\langle \langle \langle e\rangle \rangle \rangle $ -property X with a cardinality greater than some infinite cardinality; (ii) there is some type $\langle \langle \langle e\rangle \rangle \rangle $ -property Y with a cardinality greater than X; and (iii) there is some type $\langle \langle \langle e\rangle \rangle \rangle $ -property Z with a cardinality greater than Y. But it is not reasonable to think that, even if true, the conjunction of (i)–(iii) is a purely arithmetical fact. After all, this claim seems to have little to do with what is true about the type $\langle \langle e\rangle \rangle $ -natural numbers. In general, questions concerning how many different infinite cardinalities are instantiated are not under the purview of arithmetic.
Likewise, a type-theoretic version of the continuum hypothesis can be formulated in pure second-order logic.Footnote 26 But even if the continuum hypothesis turns out to be neither necessarily true nor necessarily false, that does not show that some truths of pure arithmetic are not necessarily true. For, arguably, neither the continuum hypothesis is a truth of pure arithmetic nor its negation is.
A better option arises once $\mathrm {PA}_{2}$ ’s first-order quantifiers are interpreted as implicitly restricted to the natural numbers, as is already customary to do in arithmetical theories. Consider the following class of translations from $\mathrm {PA}_{2}$ to $\mathrm {MT}$ :
Definition 9 (Arithmetical translations).
For each type $\tau $ , the arithmetical translation $_{\tau }$ from $\mathrm {PA}_{2}$ to $\mathrm {MT}$ , $(\cdot )^{\mathscr {A}}_{\tau }$ , is defined exactly as the Russellian translation $_{\tau }$ , except that $(\ulcorner \forall v\varphi \urcorner )^{\mathscr {A}}_{\tau }=\ulcorner \forall (v)^{\mathscr {A}}_{\tau }(\textbf {N}((v)^{\mathscr {A}}_{\tau })\rightarrow (\varphi )^{\mathscr {A}}_{\tau })\urcorner $ .
The proposal is then that, for each type $\tau $ , the formulae of $\mathrm {MT}$ which express purely arithmetical facts, as they concern the type $\langle \langle \tau \rangle \rangle $ -natural numbers, are the images of the arithmetical translation $_{\tau }$ :
Definition 10 ( $\tau $ -Arithmetical formulae).
A formula $\varphi $ of $\mathrm {MT}$ is a closed $\tau $ -arithmetical formula if and only if $\varphi =(\psi )^{\mathscr {A}}_{\tau }$ , for some closed formula $\psi $ of $\mathrm {PA}_{2}$ .
Each $\tau $ -arithmetical formula is obtained from some $\mathrm {PA}_{2}$ -formula $\varphi $ by indexing $\varphi $ ’s variables with the type $\langle \langle \tau \rangle \rangle $ , substituting the primitives of $\mathrm {PA}_{2}$ occurring in $\varphi $ by their Russellian characterizations, and restricting the type $\langle \langle \tau \rangle \rangle $ -quantifiers to natural numbers.
We are now able to state the paper’s second main result:
Theorem 9. $\vdash _{\mathtt {S5PQL}_{\mathtt {C}}}(\varphi )^{\mathscr {A}}_{\tau }\rightarrow \Box (\varphi )^{\mathscr {A}}_{\tau }$ , for every closed formula $\varphi $ of $\mathrm {PA}_{2}$ and every type $\tau $ .
Theorem 9 is a corollary of a more general result, Lemma 28 (both proven in Appendix C). Say that a relation is arithmetical if and only if it exists necessarily and necessarily, whatever it holds of, those things are natural numbers and it necessarily holds of them. Then, Lemma 28, proven by induction on the complexity of formulae, establishes that if the parameters occurring in $(\varphi )^{\mathscr {A}}_{\tau }$ are all natural numbers or arithmetical relations, then $(\varphi )^{\mathscr {A}}_{\tau }\rightarrow \Box (\varphi )^{\mathscr {A}}_{\tau }$ . Theorem 9 then follows.
For most cases, Lemma 28’s proof is straightforward, owing especially to the following facts: (i) natural numbers are necessarily natural numbers; (ii) if a natural number is a successor of another, then it is necessarily so; (iii) identical natural numbers are necessarily identical; and (iv) if an arithmetical relation holds of some natural numbers, then it necessarily holds of them.
The one tricky case is when $\varphi $ is a universal quantification into predicate position. The proof of this case relies on the following observation: that if some n-ary relation R is such that $(\varphi )^{\mathscr {A}}_{\tau }$ , then there is some n-ary arithmetical relation U such that $((\varphi )^{\mathscr {A}}_{\tau })^{R}_{U}$ . The witness arithmetical relation U is characterized in terms of the plurality $uu$ of those n-tuples (understood as haecceitistic relations) of natural numbers of which R obtains. It is the relation of being $x_1$ , $\ldots $ , $x_n$ such that this n-tuple is one of the n-tuples in $uu$ .
Owing to pluralities’ rigidity, and to the natural numbers’ necessary existence, U can be shown to be an arithmetical relation. And from this observation it is shown that if every relation R is such that $(\varphi )^{\mathscr {A}}_{\tau }$ , then, necessarily, every relation R whatsoever (and not just every arithmetical relation) is such that $((\varphi )^{\mathscr {A}}_{\tau })$ , thus concluding Lemma 28’s proof.
Once Theorem 9 is conjoined with the view that the truths of pure arithmetic are those truths expressible by closed $\tau $ -arithmetical formulae, it implies that all truths of pure arithmetic are necessary truths expressible in a pure modal type theory, from which it follows, given logicality, that all truths of arithmetic are logically true—thus vindicating the form of logicism encapsulated in upper.
Theorem 9’s proof, in $\mathtt {S5PQL}_{\mathtt {C}}$ , involves principles of the logic of plurals. So, should a proponent of upper also be committed to plural logic really being logic? I am sympathetic to this view, and anticipate that purely logical, neoRussellian characterizations of the primitives of other mathematical domains might have to rely on plural resources. Notwithstanding, upper’s proponents need not be committed to the view in order to have available a successful defense of the logical truth of all truths of pure arithmetic premised on Theorem 9. It suffices that plural logic’s principles be (plainly) true. For, if they are, then every truth of pure arithmetic is a necessary truth expressible by a formula of pure modal type theory. Even if plural logic isn’t really logic, it can still be used to show that some truths are logically true.Footnote 27
6 In favor of the necessary truth of every theorem of $\mathtt {S5PQL}_{\mathtt {SA,C}}^{\mathtt {FP}}$
Given Theorem 9, and provided that Definition 10 pins down the purely arithmetical formulae, necessity of arithmetic will have been established if all theorems of $\mathtt {S5PQL}_{\mathtt {C}}$ can be shown to be true. Further, given Theorem 8, necessity of FP and interpretability of arithmetic will have been established if it can be shown that $\mathtt {KQL}^{\mathtt {FP}}_{\mathtt {C}}$ ’s theorems are all metaphysically necessary. And a couple of auxiliary results (Lemmas 36 and 40, proven in Appendix E) presuppose schema $\mathrm {SA}$ .Footnote 28
Accordingly, in this section I’ll start by offering some reasons for thinking that the result of prefixing any instance of any axiom of $\mathtt {S5PQL}_{\mathtt {SA,C}}$ with any sequence of necessity operators and universal quantifiers, in any order, is true. From this it will follow that every theorem of $\mathtt {S5PQL}_{\mathtt {SA,C}}$ is true, assuming the truth-preservingness of modus ponens—as shown in Lemma 30 in Appendix D. The truth of all theorems of $\mathtt {S5PQL}_{\mathtt {C}}$ and $\mathtt {KQL}^{\mathtt {FP}}_{\mathtt {C}}$ , and of all instances of $\mathrm {SA}$ , is then obtained as a corollary.
I’ll then offer offering some considerations in favor of FP. Since it is a theorem of $\mathtt {S5PQL}_{\mathtt {SA,C}}$ that if FP is true, then the result of prefixing it with any sequence of necessity operators and universal quantifiers, in any order, is also true (a straightforward corollary of Lemma 15, proven in Appendix B), this will establish the necessary truth of all theorems of $\mathtt {S5PQL}^{\mathtt {FP}}_{\mathtt {SA,C}}$ (owing to $\mathtt {S5PQL}_{\mathtt {SA,C}}$ ’s $\mathrm {Nec}$ rule and axiom $\mathrm {K}$ ).
6.1 Necessity of $\mathtt {S5PQL}_{\mathtt {SA,C}}$
System $\mathtt {K}$ constitutes the weakest normal modal logic. It is sound for metaphysical necessity. Indeed, and citing Williamson [Reference Williamson81, p. 44], ‘ $(\ldots )$ most metaphysicians accept $\mathtt {S5}$ [a system stronger than $\mathtt {K}$ ] as the propositional modal logic of metaphysical modality ( $\ldots $ )’.Footnote 29 The $\mathrm {PMT}$ -instances of schemata $\forall {\mathrm {1}}$ , $\forall {\mathrm {2}}$ and $\forall {\mathrm {E}}$ are all commonly taken to be metaphysically necessary. Schema SA encodes serious actualism (so called by Plantinga [Reference Plantinga53], called ‘being constraint’ by Williamson [Reference Williamson81]), a principle according to which falling under a (genuine) predicate implies being identical to something. Since I have little to add to the defense of serious actualism offered in [Reference Jacinto38], I won’t here further argue for this principle.Footnote 30
Axiom schemata $\forall {=}$ and $\mathrm {Ind}$ are commonly thought to be among the principles governing identity: everything is self-identical, and identicals are indiscernible. Notwithstanding, some might want to take issue with $\mathrm {Ind}$ in the present context. For $\mathrm {Ind}$ is known to have false instances when at least one of the expressions flanking the identity predicate is a nonrigid designator, and some might worry that complex predicates are nonrigid designators.
As an example of a failure of $\mathrm {Ind}$ , consider the sentence ‘if Joe Biden is the US president, then if it is necessary that, if he exists, then Joe Biden is Joe Biden, then it is necessary that, if he exists, Joe Biden is the US president’—‘ $b=f(a)\rightarrow (\Box (\exists x(x=b)\rightarrow b=b)\rightarrow \Box (\exists x(x=b)\rightarrow b=f(a)))$ ’—where ‘b’ and ‘a’ are individual constants, assumed to be rigid designators, standing for, respectively, Biden and the US, and ‘f’ is a functional term standing for ‘the president of’-function, and which I’ll assume for the sake of exposition is a nonrigid designator.
At each world of evaluation w, ‘ $f(a)$ ’ picks out whoever is the US president at w. Since the US could have had different presidents, there will be worlds of evaluation at which ‘ $f(a)$ ’ picks out distinct individuals. Now, ‘ $b=f(a)$ ’ and ‘ $\Box (\exists x(x=b)\rightarrow b=b)$ ’ are both true at the actual world. Yet, ‘ $\Box (\exists x(x=b)\rightarrow b=f(a))$ ’ is not true at the actual world, since there is at least one world of evaluation w such that Joe Biden exists at w and ‘ $f(a)$ ’ designates Kamala Harris at w. So, ‘ $\exists x(x=b)\rightarrow b=f(a)$ ’ is not true at w. So, $\mathrm {Ind}$ has false instances when the identity signed is flanked by some nonrigid designators.
The worry presently under consideration is thus that complex predicates semantically function in a manner akin to functional terms, being nonrigid designators themselves. But this worry is unfounded. On the present conception, complex predicates are rigid designators—provided that they designate anything at all. Their presence in the language does not lead to there being false instances of $\mathrm {Ind}$ .Footnote 31 Briefly, and roughly, in my view the semantic value of each complex predicate $\ulcorner \lambda u^1\ldots u^n\varphi \urcorner $ consists in a relation R compositionally determined in terms of the propositional function that is formula $\varphi $ ’s semantic value; R obtains, at each world w, of $u^1$ , $\ldots $ , $u^n$ , just in case the semantic value of the formula $\varphi $ maps $u^1$ , $\ldots $ , $u^n$ to a proposition which is true at w. So, $\ulcorner \lambda u^1\ldots u^n\varphi =\alpha \urcorner $ is true at a world w if, at w, this relation R is identical to the denotation of $\alpha $ , and is not true at w otherwise. There is no world-relative denotation involved.
This is not, however, the place to further develop this view on the semantics of complex predicates.Footnote 32 For the present purposes, suffice it to say that in Appendix F is presented a standard model-theoretic treatment of complex predication on which the value of a complex predicate in a model is not world-dependent, and which validates $\mathrm {Ind}$ . At the very least, this shows the coherence of the view that complex predicates are rigid designators, if they designate anything at all.
Turning now to schemata $\mathrm {Ab1}$ and $\mathrm {Ab2}$ , $\mathrm {Ab1}$ is fairly uncontroversial. If a predication involving a complex predicate is true, then the corresponding formula is also true. $\mathrm {Ab2}$ is also fairly uncontroversial, as it is a weakening of more common principles such as (i) that a complex predication is true if the corresponding formula is true, and (ii) that a complex predication is true if the corresponding formula is true and the entities of which the complex predicate is predicated all exist.
One reason to prefer $\mathrm {Ab2}$ to both (i) and (ii) concerns the following theses—one for each type $\tau $ :
-
Contingentism $_{\tau }$ : There could have been some $x_{\tau }$ that could have been nothing. $\Diamond \exists x_{\tau }\Diamond \neg \exists y_{\tau }(x=y).$
Contingentism—i.e., contingentism $_{e}$ —is seemingly plausible. For instance, prima facie, Joe Biden could have been nothing. In addition, some philosophers find contingentism $_{\tau }$ attractive also whenever $\tau \neq e$ .Footnote 33 But principle (ii) (as well as (i)) is inconsistent with the following theses: $(a)$ the truth of a predication implies the being of the relation that the predicate stands for; $(b)$ the property of being distinct from a given $x_{\tau }$ exists only if that particular $x_{\tau }$ exists;Footnote 34 and $(c)$ contingentism $_{\tau }$ . To see why, assume, for instance, that Biden is a contingent being (and so, is a witness to $(c)$ ). Then, and where ‘h’ stands for Kamala Harris, ‘ $h\neq b$ ’ will be true at any world w at which Kamala Harris exists but Joe Biden doesn’t. Then, by (ii), ‘ $\lambda x(x\neq b)(h)$ ’ is true at w. So, by $(a)$ , $\lambda x(x\neq b)$ exists at w. But this conflicts with $(b)$ , since Joe Biden doesn’t exist at w. By contrast with principle (ii), $\mathrm {Ab2}$ leads to no contradiction when conjoined with $(a)$ – $(c)$ . To conclude, then, $\mathrm {Ab1}$ and $\mathrm {Ab2}$ are extremely plausible.
According to $\mathrm {Comp}_{\mathrm {C}}$ , there is a relation that a complex predicate stands for provided that the values of the parameters occurring in it all exist. One rationale in favor of the metaphysical necessity of its instances goes via a principle of ‘semantic determination’ according to which (i) the semantic value of a complex predicate $\lambda \overline {u}\varphi $ with parameters $t^1$ , $\ldots $ , $t^n$ exists at world w if the propositional function which is the semantic value of $\varphi $ relative to an assignment of values to $t^1$ , $\ldots $ , $t^n$ exists at w, and (ii) this propositional function exists at w just in case the semantic values of the terms occurring in $\varphi $ , alongside with the values of the parameters $t^1$ , $\ldots $ , $t^n$ , all exist at w. But this is just what is required by $\mathrm {Comp}_{\mathrm {C}}$ , assuming that the semantic values of $\mathrm {MT}$ ’s identity predicates are all necessary beings.Footnote 35
Principle $\mathrm {Comp}_{\mathrm {C}}$ is jointly consistent both with all instances of contingentism $_{\tau }$ and with all instances of necessitism $_{\tau }$ where, for each type $\tau $ , necessitism $_{\tau }$ is contingentism $_{\tau }$ ’s contradictory—the thesis that necessarily, every $x_{\tau }$ is necessarily something.Footnote 36 The subscripted ‘ $\mathrm {C}$ ’ in ‘ $\mathrm {Comp}_{\mathrm {C}}$ ’ signals its compatibility with a contingentist world-view. Williamson [Reference Williamson81] ultimately argued that contingentists and necessitists alike should find $\mathrm {Comp}_{\mathrm {C}}$ to be too weak for some of the development of modal mathematics. Be that as it may, Theorem 8 shows that $\mathrm {Comp}_{\mathrm {C}}$ suffices for the interpretability of arithmetic in modal type theory.Footnote 37
The instances of the plural principles $\prec {\mathrm {I}}$ , $\mathrm {PluComp}$ and ${=}\mathrm {I}_{\mathrm {P}}$ are all (typed versions of) theorems of standard (free) modal plural logic.Footnote 38 Concerning $\Box {\mathrm {E}}_{\mathrm {P}}$ , given a conception of pluralities as, in some sense, being nothing over and above their members, it is an uncontroversial thesis that necessarily, it was once the case that necessarily, if the $uu$ exist and every one of the $uu$ existed then, then the $uu$ also existed then.Footnote 39 And it is a consequence of this thesis that if the $vv$ exist and all the entities which are among the $vv$ necessarily exist, then the $vv$ necessarily exist.
So, it is reasonable to think not only that all of $\mathtt {S5PQL}_{\mathtt {SA,C}}$ ’s axioms are metaphysically necessary, but also that the results of prefixing them with any sequences of necessity operators and universal quantifiers, in any order, are all true. But then, all theorems of $\mathtt {S5PQL}_{\mathtt {SA,C}}$ are true—assuming that $\mathrm {MP}$ is truth-preserving, as is all but reasonable to assume—owing to Lemma 30 (as already mentioned). So, all theorems of $\mathtt {S5PQL}_{\mathtt {SA,C}}$ are metaphysically necessary, due to the $\mathtt {S5PQL}_{\mathtt {SA,C}}$ ’s $\mathrm {Nec}$ rule. This establishes necessity of arithmetic on the assumption that Definition 10 does pin down the purely arithmetical formulae.
6.2 The truth of finitary plenitude
As it turns out, FP implies its metaphysical necessity, by $\mathtt {S5QL}_{\mathtt {C}}$ -reasoning (see Appendix B):
Lemma 15. .
So, given what has been argued so far, interpretability of arithmetic will have been established provided that FP can be shown to be true. Accordingly, in the remainder of this section I’ll put forward a couple of considerations in favor of FP.
The first of these—a minor adaptation of an argument of Williamson’s [Reference Williamson81, p. 144]—goes as follows. It is plausible to think that there could have been n donkeys, for every finite cardinality n. The contradictory hypothesis—that the metaphysical laws imply the existence of an upper bound on the finite cardinality that possibly numbers being a donkey—is presumably absurd. So, every finite cardinality n could have been instantiated. Hence, is true.
Even if ‘easy,’ the argument is compelling. In addition, anyone resisting it must also be committed to the metaphysical laws being such that, for every property whatsoever, there is an upper bound on the finite cardinality that numbers it: there is a bound on the finite cardinality that possibly numbers being a quark, being a spatiotemporal location, being one of someone’s hairs, etc. But such a view is surely absurd. It is absurd to think that there is a natural number n such that no property whatsoever could have had more than n-many instances. In any case, at this point the onus should be on the opponent of to explain why it should be rejected, not on ’s proponent.
Finitary plenitude is also a consequence of prominent principles of plenitude (indeed, a failure to imply $\mathrm {FP}$ would presumably reveal the weakness of any such principle). In what follows I’ll focus on Lewis’s [Reference Lewis43] popular principle of recombination,Footnote 40 according to which, for any objects in any worlds, there is a world that contains any number of duplicates of all those objects, ‘size and shape permitting.’
There has been much discussion concerning how best to formulate a more precise version of recombination, and even whether the principle is consistent.Footnote 41 Regardless, those attracted to at least its spirit, if not its letter, should find the following, much weaker principle close to a triviality:
-
Generation: There is an attribute of individuals numbered by some finite cardinality which, necessarily, no matter what finite cardinality numbers it, its successor could have numbered it.
$$\begin{align*}\exists x_{\langle e\rangle }\exists y_{\langle \langle e\rangle \rangle }((\textbf {N}(y)\wedge y(x))\wedge \forall w_{\langle \langle e\rangle \rangle }\forall z_{\langle \langle e\rangle \rangle }((\textbf {N}(w)\wedge \textbf {S}(w,z))\rightarrow \Box (w(x)\rightarrow \Diamond z(x)))).\end{align*}$$
Generation is a consequence of the claim that, no matter how finitely many duplicates of a chosen individual there are, there could have been one more. Arguably, it will be amongst recombination’s consequences independently of what its best regimentation is, and while being substantially weaker than it. But is derivable from generation in $\mathtt {S5QL}_{\mathtt {SA,C}}$ (a proof of this result can be found in Appendix E):
Lemma 36. .
Thus, those attracted to the spirit, if not the letter, of Lewis’s principle of recombination—so that they advocate at least generation—should also accept finitary plenitude.
Hence, FP is not only minimally demanding—the conclusion of ‘easy’ but compelling arguments—but also a consequence of generation, and so of a popular, general, conception of what is possible. There are, then, good reasons for thinking that FP is true, and so metaphysically necessary.
To those that remain unconvinced, I would just like to point out that FP is indeed significantly weaker than InfAx. On the one hand, FP is consistent with the necessary falsehood of InfAx even in the context of the deductive system $\mathtt {S5PQL}_{\mathtt {SA,N}}$ —a system just like $\mathtt {S5PQL}_{\mathtt {SA,C}}$ except that it has among its axioms all instances of the necessitist comprehension principle $\mathrm {Comp}_{\mathrm {N}}$ :
Footnote 42 $\mathrm {Comp}_{\mathrm {N}}$ is a necessitist comprehension principle for modal type theory insofar as necessitism $_{\tau }$ is derivable in $\mathtt {KQL}_{\mathtt {N}}$ , for every type $\tau $ .Footnote 43 A model witnessing the consistency of $\mathtt {S5PQL}^{\mathtt {FP}}_{\mathtt {SA,N}}$ with is given in Lemma 45 in Appendix F. In this context, it should also be mentioned that $\mathtt {S5PQL}^{\mathtt {FP}}_{\mathtt {SA,C}}$ is consistent with , for all types $\tau $ , and . This is shown in Lemma 44 in Appendix F. So, FP is furthermore consistent with a thoroughly contingentist metaphysics.
On the other hand, FP is derivable from InfAx already in $\mathtt {TQL}_{\mathtt {C}}$ , the result of augmenting $\mathtt {KQL}_{\mathtt {C}}$ with the modal axiom $\mathrm {T}$ (see Lemma 42 in Appendix E). So, $\mathtt {PA}_{\mathtt {2}}$ ’s interpretability in $\mathtt {KQL}^{\mathtt {FP}}_{\mathtt {C}}$ makes the case for upper stronger than the case for a logicism along Russellian lines has arguably ever been.
7 Other approaches
By way of conclusion, I will briefly compare and contrast upper with four seemingly related philosophies of arithmetic: Hellman’s [Reference Hellman31] modal structuralism, Hodes’s [Reference Hodes33] alternative theory, Roeper’s [Reference Roeper60] vindication of logicism, and Linnebo and Shapiro’s [Reference Linnebo and Shapiro46] arithmetical potentialism.
7.1 Modal structuralism
Hellman’s [Reference Hellman31] modal structuralism is sometimes viewed as a logicist programme couched in modal and higher-order resources. Applied to arithmetic, it is the view that there are no natural numbers, and that the real content of each arithmetical theorem $\varphi $ is given by the following formula of pure second-order modal logic (where (i) $Ax(\mathtt {PA}_{\mathtt {2}})$ is the conjunction of $\mathtt {PA}_{\mathtt {2}}$ ’s axioms, (ii) ‘X’ is a unary predicate variable, ‘S’ is a binary predicate variable and ‘z’ is an individual variable, and (iii) $Ax(\mathtt {PA}_{\mathtt {2}})^{N\;S\;0}_{X\;R\;z}$ is the result of replacing every occurrence of ‘N’, ‘S’ and ‘ $0$ ’ in $Ax(\mathtt {PA}_{\mathtt {2}})$ by, respectively, ‘X’, ‘R’ and ‘z’, and likewise for $\varphi ^{N\;S\;0}_{X\;R\;z}$ ): $\Diamond \exists X\exists R \exists z (Ax(\mathtt {PA}_{\mathtt {2}})^{N\;S\;0}_{X\;R\;z})\wedge \Box \forall X\forall R\forall z(Ax(\mathtt {PA}_{\mathtt {2}})^{N\;S\;0}_{X\;R\;z}\rightarrow \varphi ^{N\;S\;0}_{X\;R\;z})$ . That is, it is possible that some property, relation and individual ‘play the role’ of being a natural number, successor, and zero with respect to $\mathrm {PA}_{2}$ ’s axioms, and necessarily, whatever plays those roles satisfies $\varphi ^{N\;S\;0}_{X\;R\;z}$ .
Modal structuralism thus shares with upper the views that the real content of each arithmetical statement can be expressed by a formula of a pure higher-order modal language, and that logic includes higher-order modal resources. Notwithstanding, a respect in which upper and modal structuralism clearly differ is that modal structuralists reject the existence of natural numbers. By contrast, upper is committed to their existence. Another significant difference is that modal structuralism is presumably not committed to higher-orderism. For Hellman [Reference Hellman31] subscribes to Quine’s view that ‘some predication relation is hidden in the notation ‘ $F(x)$ ’, once we admit F as a quantifiable variable’, and speaks of the values of second-order variables as ‘classes’ and ‘extensions’, taking them to be identical if coextensional.
Concerning modal structuralism’s logicist character, under Hellman’s specific way of interpreting its theorems $\mathtt {PA}_{\mathtt {2}}$ is interpretable in second-order modal logic only if it is the case that $\Diamond \exists X\exists R \exists z (Ax(\mathtt {PA}_{\mathtt {2}})^{N\;S\;0}_{X\;R\;z})$ , a claim true only if is. So, modal structuralism’s interpretability result affords a reduction of arithmetic to logic only provided that is logically true. For this reason, upper seemingly does at least as well as modal structuralism vis-à-vis the two theories’ respective claims to logicality. For, at least in the context of $\mathtt {S5PQL}_{\mathtt {SA,N}}$ and weaker systems, FP is consistent with both and . By contrast, implies FP already in the context of $\mathtt {S5QL}_{\mathtt {C}}$ (this is a straightforward consequence of Lemma 15 in Appendix B, and Lemma 42 in Appendix E). We can thus conclude that if is logically true, so is FP, whereas the logical truth of FP does not appear to imply ’s logical truth.Footnote 44
In Hellman’s [Reference Geoffrey29] more recent development of modal structuralism, to which I’ll refer to as ‘modal structuralism+’, higher-order quantification is replaced by a combination of plural quantification and mereology, owing to Hellman’s qualms about the nominalist pedigree of classes (which, recall, he takes to be the values of second-order variables). The differences between upper and modal structuralism+ are even greater than those between upper and modal structuralism. In particular, it is unreasonable to think that modal structuralism+ affords a reduction of arithmetic to logic. For parthood is commonly regarded as a nonlogical notion. Relatedly, modal structuralism+’s adopted mereological theory is classical extensional mereology, which includes the strong unrestricted fusion schema according to which for any satisfied condition, there is a fusion of those things that satisfy it. Yet, there arguably is no such thing as the fusion of my nose with the Eiffel Tower, in which case the unrestricted fusion schema has false instances. Further, even if the unrestricted fusion schema turns out to be possibly true, I for one find it unreasonable to think that it is necessarily true. In any case, it arguably isn’t a logical truth, and so modal structuralism+ does not found arithmetic in logic.Footnote 45
Still by way of comparison, it will be useful to consider how modal structuralism and modal structuralism+ account for arithmetic’s applicability. For instance, according to Hellman, to attribute the number $2$ to the property is a moon of Mars really consists in taking something like (1) to be true:
According to (1), necessarily, for every property X, relation R and individuals z and x, if the actually true material facts are true, as well as $Ax(\mathtt {PA}_{\mathtt {2}})^{N\;S\;0}_{X\;R\;z}$ , and x is ‘the successor’ of z, then there is a bijection between the properties is identical to z or to x and is a moon of Mars.
The condition that the actually true material facts be true is required for a number of reasons. For instance, if there is some possibility at which Mars has three moons rather than two and there are some individuals arranged in an $\omega $ -sequence, then (2) will be false:
But if it is furthermore required that the actually true material facts be true in the possibility in question, then the case where Mars has three moons rather than two is ruled out as a counterexample.
The nonvacuous true of (1) requires that it be possible for both the actually true material facts to be true and for some individuals to be arranged in an $\omega $ -sequence—a possibility postulated by Hellman. The nonvacuous truth of possibility claims of number (e.g., that Mars could have had three moons) likewise requires that, for every possibility, it be possible for the material facts true at that possibility to be compossible with there being some individuals arranged in an $\omega $ -sequence. But while some might be eased into thinking that is (logically) true, the claim that for every possibility there is a ‘shadow possibility’ in which the material facts are the same and yet there are also infinitely many individuals is highly controversial. As I see it, this claim is almost (if not exactly as) controversial as the claim that is necessarily true.
So, with respect to how to account for the applicability of arithmetic, modal structuralism’s contrast with upper couldn’t be greater. From the standpoint of upper’s proponents, for Mars to have two moons just is for the property $2_{\langle \langle e\rangle }$ to be had by the property is a moon of Mars. Indeed, it is the recognition of the natural numbers as finite cardinalities—the main contrasting point between upper and modal structuralism—that makes it so easy for upper’s proponents to accommodate the applicability of arithmetic in what appears to be a straightforward manner.Footnote 46
7.2 The alternative theory
According to Hodes [Reference Hodes32, Reference Hodes33], arithmetical sentences have truth-conditions which concern what holds of finite cardinality (type $\langle \langle e\rangle \rangle $ ) properties. These truth-conditions are expressible solely in terms of logical expressions, and the ones which obtain are expressible by logical truths. But since arithmetic itself is formulated via numerals, which appear to be individual (type e) terms, a semantic story is required explaining how arithmetical sentences get to have truth-conditions concerned not with individuals but rather with what is true of the finite cardinality properties.
Hodes proposes a semantic account formulated via a ternary designation relation. Let a representor be a $1$ – $1$ function from finite cardinality properties to individuals. Then, the ternary designation relation relates representors and expressions to worldly entities. Relative to different representors, the same expression—in particular, the same arithmetical expression—may designate different entities.
Specifically, relative to each representor the designation of ‘N’—the ‘is a natural number’ predicate—is the representor’s range. As to the designation of numerals, Hodes takes each numeral to be shorthand for a singular term formulated in terms of ‘ $\#$ ’—the number of-operator. ‘ $0$ ’ is shorthand for ‘ $\#(\lambda x_e(\bot ))$ , and $\ulcorner S(n)\urcorner $ is shorthand for $\ulcorner \#(\lambda x_e(x_e\leq n)\vee \neg N(n))\urcorner $ . The designation of ‘ $\leq $ ’—the ‘is less-than-or-equal to’ predicate (applicable to individuals)—relative to each representor is specified in terms of the less-than-or-equal-to relation between properties of properties of individuals: ‘ $\leq $ ’ designates, relative to each representor f, that relation that obtains between $x_e$ and $y_e$ if and only if there are cardinality properties $Q_{\langle \langle e\rangle \rangle }$ and $R_{\langle \langle e\rangle \rangle }$ such that $f(Q_{\langle \langle e\rangle \rangle })=x_e$ , $f(R_{\langle \langle e\rangle \rangle })=y_e$ , and $Q_{\langle \langle e\rangle \rangle }$ is less-than-or-equal-to $R_{\langle \langle e\rangle \rangle }$ . The number of-operator $\#$ designates, relative to each representor f, a function from properties of individuals to individuals which maps each property $X_{\langle e\rangle }$ to an individual $y_e$ if and only if there is some cardinality property $Q_{\langle \langle e\rangle \rangle }$ such that $f(Q_{\langle \langle e\rangle \rangle })=y_e$ and $X_{\langle e\rangle }$ falls under $Q_{\langle \langle e\rangle \rangle }$ .
Truth relative to a representor is defined in a standard manner. Relative to each representor, the designation of a numeral can then be seen as representing a particular finite cardinality property, and arithmetical sentences are true or false in virtue of facts about individuals which can be seen as ‘encoding’ facts about finite cardinality properties.
The crux of Hodes’s semantic account is that plain truth consists of truth relative to every representor and plain falsehood of falsehood relative to every representor. This supervaluationist conception of truth thus ‘abstracts away’ from any contribution that specific individuals and relations between them may make to the truth of arithmetical sentences. What remains are truth-conditions concerned solely with the finite cardinality properties. Thus, a sentence such as ‘ $7+5=12$ ’ is true owing not to any particular individuals standing in any particular relation, but instead owing to the finite cardinality properties exactly seven $_{\langle \langle e\rangle \rangle }$ , exactly five $_{\langle \langle e\rangle \rangle }$ and exactly twelve $_{\langle \langle e\rangle \rangle }$ standing in the addition relation.
Among other things, Hodes’s semantic account yields a prima facie elegant solution to the Caesar problem for Fregean and NeoFregean logicists. While relative to each representor ‘ $7$ ’ designates a particular individual, and ‘ $\text {Caesar}=7$ ’ will be true relative to those representors assigning Caesar to exactly seven $_{\langle \langle e\rangle \rangle }$ , and false relative to the remaining ones, this sentence is neither true nor false.
Yet, and as acknowledged by Hodes [Reference Hodes32], this semantic account presupposes InfAx’s truth. For instance, if there are only finitely many individuals, then there are no representors, owing to there being no $1$ – $1$ function mapping the finite cardinality properties to the individuals. After all, if there are only finitely many individuals, then even if the finite cardinality properties are themselves finitely many, they will be more numerous than the individuals (if there are only n-many individuals, there are at least ( $n+1$ )-many finite cardinality properties, since having zero instances is a cardinality property). So, since truth is truth relative to every representor, if there are only finitely many individuals, then Hodes’s simple account of the semantics of the language of arithmetic will imply the truth of every arithmetical sentence whatsoever—including obvious falsehoods such as ‘ $0=1$ ’ and ‘every natural number is identical to $0$ ’.
Even though the semantic account offered in [Reference Hodes32] presupposes the truth of InfAx, Hodes [Reference Hodes32] is of the opinion that ‘Arithmetic should be able to face boldly the dreadful chance that in the actual world there are only finitely many objects’. Further, and as has previously been discussed, it is difficult to see how the InfAx can count as a logical truth.
In the last couple of paragraphs of [Reference Hodes32] it is suggested to replace InfAx by FP. Indeed, Hodes concludes by advocating the view that ‘mathematics is higher-order modal [our emphasis] logic’, with the development and defense of this view being left for another occasion. It is thus not unreasonable to take a view along the lines of upper to be among those advocated in [Reference Hodes32]. Notwithstanding, no arguments for the claim that ‘mathematics is higher-order modal logic’, or logico-mathematical results potentially figuring in such arguments, are offered in [Reference Hodes32].
In [Reference Hodes33] is undertaken the task of developing a model theory for the language of arithmetic which is both neutral on the truth of the InfAx and capable of accounting for how we are able to speak as if numbers were individuals, given a commitment to the truth-conditions of arithmetical sentences concerning what is true of the finite cardinality properties. In this model theory, which presupposes the truth and logicality of FP, representors may map finite cardinality properties to individuals not all of which need actually exist (i.e., not all of which need be in the domain of the ‘actual world’ of a model). The underlying idea is that arithmetical have truth-conditions which concern finite cardinality properties in virtue of these being represented by possibly existing individuals.
Hodes shows that potentialist versions of some arithmetical truths are true in every model, where the potentialist version of a sentence is obtained by prefixing each existential quantifier occurring in the original sentence with a possibility operator.Footnote 47 He also shows that possibilist versions of more arithmetical truths are true in every model, where the possibilist version of a sentence is obtained by substituting each existential quantifier occurring in the original sentence with ‘ $\overset {\prime }{\exists }$ ’, the possibilist existential quantifier, where $\overset {\prime }{\exists }x\varphi $ is true at a world w if and only if some possible individual satisfies $\varphi $ at w. Still, as Hodes also shows this alternative semantic account of the language of arithmetic, while consistent with , will not count the potentialist or the possibilist versions of all arithmetical truths as true in every model.Footnote 48
Owing to the mismatch between what is true of finite cardinality properties and the truth of arithmetical sentences formulated in terms of numerals, understood by Hodes as individual (type e) terms, Hodes sees the project of developing an account of the semantics of the arithmetical language vindicating logicism and NaC, not committed to the axiom of infinity, and consistent with contingentism as having failed. As he put it [Reference Hodes33, p. 391],
‘I tentatively conclude that an Individual-Actualist who accepts the Alternative theory does best to accept an actual infinitude [of individuals]’.
Roughly, Hodes’s Alternative theory consists of the conjunction of NaC with the claim that the semantics of the arithmetical language is to be accounted for by ‘supervaluating’ over representors, and that what makes arithmetical sentences true or false are facts about finite cardinality properties expressible by, respectively, logical true and logically false sentences of higher-order logic.
The subsequent discussion in [Reference Hodes33] proceeds on the assumption of InfAx’s truth, with the semantic account in terms of merely possible individuals being abandoned. This suggests that in [Reference Hodes33] a view along the lines of upper was no longer endorsed.
In my view, contingentist serious actualists should already be suspicious of a semantic theory formulated in terms of representors having in their range merely possible individuals. For representors are functions. So, and assuming functions are relations, by serious actualism there are no representors having in their (actual) range merely possible individuals. Relatedly, some representors may map exactly zero $_{\langle \langle e\rangle \rangle }$ and exactly one $_{\langle \langle e\rangle \rangle }$ to incompossible individuals. But in such a case contingentist serious actualists would reject the claim that these incompossible individuals possibly stand in any relation. So, ‘ $\Diamond (0\leq 1)$ ’ will be false relative to some representors. But then, ‘ $\Diamond (0\leq 1)$ ’ is not logically true—i.e., not true in all models (satisfying FP) relative to all worlds and all representors—though it is true (in all models satisfying FP, and relative to all worlds and all representors) that exactly zero $_{\langle \langle e\rangle \rangle }$ is less than exactly one $_{\langle \langle e\rangle \rangle }$ .
Furthermore, even if higher-orderist proponents of NaC owe some explanation for seemingly true talk of numbers as individuals, there is reason to think that appealing to a semantic theory for the language of arithmetic along the lines of Hodes’s is wrong-headed. As an example, Hodes is concerned with how to account for the occurrence of number-expressions in sentences such as the following:
As it occurs in (3), ‘the number of students’ appears to be functioning as an individual term. If so, then ‘three’ would appear to itself be functioning in (3) as an individual term (assuming (3)’s truth).
Yet, from higher-orderists’ standpoint, seemingly true talk of numbers as individuals is arguably part of a much more general phenomenon. For instance, and assuming its truth, in the sentence
the expressions ‘the colour of the sky’ and ‘blue’ appear to be functioning as individual terms. Yet, being blue is arguably a paradigmatic case of a property, not of an individual. Similarly, as they occur in (5),
‘the smallest equivalence relation’ and ‘the identity relation’ appear to be functioning as individual terms (assuming the truth of (5)). Yet, the identity relation is a paradigmatic case of a relation, not of an individual.
Now, the view that, in general, talk of properties as individuals is to be semantically accounted for by taking properties to have individuals representing them seems unmotivated at best. Indeed, it faces a general version of the problem afflicting Hodes’s strategy for semantically accounting for talk of numbers as individuals. Just as there is no one-to-one function from the finite cardinality properties to the (actually existing) individuals if there are only finitely many individuals, there is also no one-to-one function mapping properties to individuals, by a higher-order version of Cantor’s theorem. And in this case assuming the truth of InfAx does not solve the problem.
Since seemingly true talk of properties as individuals would, in general, appear not to be correctly accounted for by bringing in representors, it is worth considering a different strategy for accounting for seemingly true talk of properties as individuals.Footnote 49 Such an explanation would be of interest not only to those committed to NaC, but also to other projects in higher-order metaphysics. My ‘two cents’ on the matter are that such talk is explained by the fact that natural language quantifier-expressions are flexible with respect to the type of the quantifier they pick at a particular context. For this reason they are flexible with respect to what lies in their ‘domains of quantification’ at different contexts, and so sometimes straddle across type-distinctions. Developing this view is, however, outside the scope of this paper.Footnote 50 For the present purposes, the relevant point is that higher-orderist proponents of NaC are under little pressure to accept Hodes’s specific semantic account of the language of arithmetic. For they would then still be faced with explaining seemingly true talk of properties as individuals in general. Conversely, a general, higher-orderist explanation of seemingly true talk of properties as individuals—which would not go along the lines of Hodes’s semantic theory, for the reasons just mentioned—would ipso facto deliver an explanation for seemingly true talk of natural numbers as individuals.
Notwithstanding, there are important points of agreement between upper and Hodes’s views on arithmetic: (a) the truth-conditions of arithmetical sentences concern what is true of the finite cardinality properties; (b) these truth-conditions can ultimately be expressed in purely logical terms; (c) the true sentences of higher-order logic expressing them are logical truths.
In addition, Hodes did acknowledge the difficulty in establishing, in higher-order logic, certain true facts about finite cardinalities—e.g., that eleven $_{\langle \langle e\rangle \rangle }$ is not the sum of seven $_{\langle \langle e\rangle \rangle }$ and five $_{\langle \langle e\rangle \rangle }$ —unless InfAx’s logicality was assumed, despite the unreasonableness of this assumption. A further point of agreement between the views in [Reference Hodes32] and upper is thus the solution found to this difficulty: to recognize that ‘necessity’ and ‘possibility’ are themselves logical expressions, and that FP is a logical truth.
But Hodes has not proceeded to show that all the theorems of a natural theory of the finite cardinality properties—encapsulated in the Russellian translation of $\mathtt {PA}_{\mathtt {2}}$ —are logically derivable, or that all the relevant truths about the finite cardinality properties—encapsulated by those truths which are arithmetical translations of formulae of $\mathtt {PA}_{\mathtt {2}}$ —are logically necessarily. Instead, in [Reference Hodes33] he gave up on replacing InfAx with FP, owing to the failure to provide a satisfactory semantic account of the arithmetical language in terms of representors mapping finite cardinality properties to (perhaps merely) possible individuals. By doing so the account of arithmetic in [Reference Hodes33] ends up relying on a nonlogical principle.
From the standpoint of upper’s proponents, Hodes drew the wrong conclusion from this failure. It is the semantic account in terms of representors that must ultimately be dismissed, not the logicism about arithmetic. The arithmetical facts about the finite cardinality properties are all logical truths, and the logical principle FP plays a crucial role in the derivation of some of those facts. Even if he changed his mind later on, Hodes was right in claiming, in [Reference Hodes32], that mathematics (well, at least arithmetic) is higher-order modal logic.
7.3 A vindication of logicism
Roeper [Reference Roeper60] aims to ‘vindicate logicism’ by improving both Frege’s logicist account of arithmetic—via a consistent logicist characterization of cardinal number—and the neoFregean’s account of arithmetic—via a characterization of cardinal number from which Hume’s principle can be logically derived.
According to Roeper, to be a cardinal number is to be a cardinality property—a property such as one, $255587$ , countably many, etc. Further, Roeper takes cardinality properties to be applicable to pluralities of individuals—e.g., the Beatles were four, and there are seven Daltons. Roeper’s characterization of cardinality properties is the following (I’ll use variables indexed with ‘ $\langle ee\rangle $ ’ for variables ranging over properties of pluralities). A property of pluralities $X_{\langle ee\rangle }$ of individuals is a cardinality property just in case $X_{\langle ee\rangle }$ is: (i) Strongly rigid: for every plurality $yy_{e}$ , the $yy$ are necessarily $X_{\langle ee\rangle }$ s, or the $yy$ are necessarily not $X_{\langle ee\rangle }$ s (i.e., $\forall yy_e(\Box X_{\langle ee\rangle }(yy)\vee \neg \Box X_{\langle ee\rangle }(yy))$ );Footnote 51 (ii) Strictly congruent: for all $yy_e$ and $zz_e$ , necessarily, if the $yy$ are $X_{\langle ee\rangle }$ s, then the $yy$ and the $zz$ are equinumerous if and only if the $zz$ are $X_{\langle ee\rangle }$ s (i.e., $\forall yy_e\forall zz_e\Box (X(yy)\rightarrow (yy\approx zz\leftrightarrow X(zz)))$ ); and (iii) Consistent: $X_{\langle ee\rangle }$ could have been instantiated (i.e., $\Diamond \exists yy_e(X_{\langle ee\rangle }(yy))$ ).
Insofar as the natural numbers are cardinal numbers, it is to be expected that it is a consequence of Roeper’s account of cardinality properties that every natural number property (understood as a property of pluralities) could have been instantiated, and so that it is committed to FP, owing to cardinality properties being consistent.Footnote 52 This is an important respect in which upper and Roeper’s views are in agreement.
The disagreement concerning whether the natural numbers are properties of pluralities or instead properties of properties also seems to be minor. For, given a conception of the natural numbers as properties of pluralities, one can define corresponding properties of properties, and vice versa.Footnote 53 So, Roeper and upper’s proponents can presumably agree that, in some sense, talk of natural numbers as properties of properties is meaningful, as is talk of natural numbers as properties of pluralities.
Now, a more significant difference between the two views is that Roeper includes in the characterization of what it is to be a natural number that it is a cardinality property, and so possibly instantiated. By contrast, upper’s characterization does not include that condition. According to upper, to be a natural number is simply to have all the hereditary properties (under the successor relation) had by zero.
This difference turns out to be of significance. While proponents of upper and Roeper are both committed to the logical truth of FP, without the adoption of commitments beyond those of upper logicists Roeper would not have been able to derive the principle that the successor of a natural number is itself a natural number. After all, to show, this, Roeper would have to first establish that the successor of a natural number is a cardinality property, and so that the successor of a natural number is possibly instantiated. By contrast that every natural number has a successor is a straightforward consequence of upper logicists’ characterization of the property of being a natural number.
Indeed, and as I will go on to explain, in order to be able to derive the principle that every natural number has a successor natural number Roeper commits to the logical truth of postulate—a principle (characterized below) which, at least syntactically, has strong affinities to Frege’s infamous basic law V, according to which two properties have the same extension just in case they are equinumerous. And like in the case of basic law V, there are excellent reasons for thinking that postulate is false.
A further commitment of Roeper’s is to properties also belonging, secondarily, to the category of individuals, owing to our seemingly being able to refer to them to via individual (type e) terms. The postulate, as Roeper calls it, which governs the move from expressing a property via a predicate to referring to that property via a singular term is the following (where $\varphi $ and $\psi $ are open formulae in which $xx_e$ occurs free):
Postulate: Properties $\rho xx_e(\varphi )$ and $\rho xx_e(\psi )$ are identical if and only if for all $xx_e$ , necessarily, $\varphi $ if and only if $\psi $ .
The $\rho $ -operator, as used in the formulation of postulate, is a variable binding operator which binds plural variables. It is ‘introduced by stipulation; its role is to change a predicate expressing a property into a singular term for that property’ [Reference Roeper60, sec. 2.2, p. 366].Footnote 54
Since ‘Any attempt to formulate the sameness in question will be ungrammatical’, Roeper holds that postulate ‘is not a definition, but it records the effect of the stipulation [concerning the role of $\rho $ ] to the extent that it can be expressed in the symbolism’ [Reference Roeper60, sec. 2.2, p. 367]. Importantly, and by contrast with the neoFregeans’ take on the relationship between Hume’s principle and the ‘number of’ operator, Roeper does not take postulate to be a definition of $\rho $ ; $\rho $ ’s meaning is already fixed when used in the formulation of postulate.
Roeper seemingly takes postulate to be logically true, and the system in which he offers a (purportedly logicist) derivation of $\mathtt {PA}_{\mathtt {2}}$ has a weakening of postulate amongst its axioms. Yet, not only does postulate have strong affinities with basic law V, there is also reason to think that, far from being logically true, postulate is itself false, since a form of Russell’s paradox can be derived from it. Consider the following predicate:
Assume, for reductio, that postulate is true. Then, by plural comprehension we have that $\exists uu_e(\forall u_e(u\prec uu\leftrightarrow u=\rho zz_e(\neg zz\sqsubset zz)))$ —since $\forall zz_e(\Box (\neg zz\sqsubset zz\leftrightarrow \neg zz\sqsubset zz))$ , and so $\rho zz_e(\neg zz\sqsubset zz)=\rho zz_e(\neg zz\sqsubset zz)$ , by postulate, and therefore $\exists x_e(x=\rho zz_e(\neg zz\sqsubset zz))$ . Let $rr_e$ —the Russell plurality—be one such plurality $uu_e$ such that $\forall u_e(u\prec uu_e\leftrightarrow u=\rho zz_e(\neg zz\sqsubset zz))$ .
Suppose that $rr_e\sqsubset rr_e$ . Then, $\rho zz_e(X(zz))\prec rr_e\wedge X(rr_e)$ , for some $X_{\langle ee\rangle }$ . So, $\rho zz_e(X_{\langle ee\rangle }(zz))=\rho zz_e(\neg zz\sqsubset zz)$ . Hence, $\forall zz_e\Box (X_{\langle ee\rangle }(zz)\leftrightarrow \neg zz\sqsubset zz)$ , by postulate. Since $X_{\langle ee\rangle }(rr_e)$ , we have that $\neg rr_e\sqsubset rr_e$ . Suppose now that $\neg rr_e\sqsubset rr_e$ . We have that $\rho zz_e(\neg zz\sqsubset zz)\prec rr_e$ . So, $\exists X_{\langle ee\rangle }(\rho zz_e(X(zz))\prec rr_e\wedge X(rr_e))$ , by a version of a principle of comprehension (and indeed, of $\mathrm {Comp}_{\mathrm {C}}$ ) for properties of pluralities. So, $rr_e\sqsubset rr_e$ . Contradiction. So, postulate is false.
Now, the formal theory in which Roeper offers a derivation of $\mathtt {PA}_{\mathtt {2}}$ has as an axiom not postulate but rather C-postulate, a weakening of postulate. This theory is formulated in a language stripped off of modal operators.Footnote 55 The C-postulate is formulated in terms of the operator ‘C’. This operator, which binds plural variables, is defined as follows in terms of conditions arising from the formulations of strict congruency and consistency by removing the modal operators occurring in them (where $xx_e$ occurs free in $\varphi $ , and $yy_e$ and $zz_e$ are free for $xx_e$ in $\varphi $ ):
Intuitively, $C xx_e(\varphi )$ states that $\lambda xx_e(\varphi )$ is a cardinality property. The C-postulate is formulated as follows (where $xx_e$ occurs free in both $\varphi $ and $\psi $ ):
C-postulate: If $C xx_e(\varphi )$ and $C xx_e(\psi )$ , then properties $\rho xx_e(\varphi )$ and $\rho xx_e(\psi )$ are identical if and only if for all $xx_e$ , $\varphi $ if and only if $\psi $ .
Roeper goes on to lay down some further principles relating talk of natural numbers as properties to talk of natural numbers as individuals, and defines the plurality of natural numbers as the smallest plurality of cardinalities (qua individuals) which includes zero and is closed under succession.
Roeper then shows that the axioms of $\mathtt {PA}_{\mathtt {2}}$ are derivable in his theory, given his characterization of the plurality of natural numbers.Footnote 56 Importantly, while Roeper is able to straightforwardly derive the result that different natural numbers have different successors on the basis of his definition of cardinality properties as (strictly congruent and) instantiated (rather than just possibly instantiated) properties, his proof that every natural number has a successor natural number makes essential use of the C-postulate. Indeed, and in Fregean fashion, Roeper’s derivation relies on the observation that the plurality of numbers smaller than or equal to n has as its natural number the successor of n.
An important difference between Roeper’s logicist account of arithmetic and upper’s is thus that while Roeper’s initial characterization of cardinality properties is formulated in modal terms, and seemingly implies FP, the characterization of cardinality properties and natural numbers which is actually used in the derivation of $\mathtt {PA}_{\mathtt {2}}$ ’s axioms presupposes that every cardinality property, and so also every natural number property, is actually—rather than just possibly—instantiated. Further, and by contrast with Theorem 8’s proof, Roeper’s proof that there are infinitely many natural numbers essentially appeals to C-postulate and proceeds not via the observation that modal space can be sufficiently rich so as to have all natural numbers, qua properties, be possibly instantiated just in case they are infinitely many, but instead via the observation that if the natural numbers are individuals, then the plurality of natural numbers smaller than or equal to n will have as its natural number n’s successor. Thus, the result is proven in a way akin to how Fregeans demonstrate the infinitude of the natural numbers.
Now, how satisfactory is Roeper’s logicist account of arithmetic? As has been shown, the principle which is the linchpin of Roeper’s account, postulate, is presumably false, and perhaps even logically contradictory. But then, what reason is there for thinking that its weakening, C-postulate, is true? That is, what reason is there for thinking that while, in general, properties of pluralities do not have reifications, some properties of pluralities—specifically, the cardinality properties—do? What makes them special?
Strikingly, neoFregeanism may offer the best way of defending the analyticity, if not the logicality, of C-postulate. For, the $\rho $ -operator may be defined in a natural manner in terms of the neoFregeans’ ‘number-of’ operator $\#$ (applied to pluralities rather than properties), where $\varphi $ is a predicate in which $xx_e$ occurs free and in which $y_e$ does not occur free, and $\iota $ is a variable-binding, definite-description operator:
That is, the property of being $xx_e$ such that $\varphi $ is either the number of some things such that $\varphi $ , provided that being such that $\varphi $ is a cardinality property, or else is the number of the plurality of all things.Footnote 57
Once the $\rho $ -operator is so defined, the C-postulate is derivable from plural Hume’s principle, the plural version of Hume’s principle according to which the number of the $xx_e$ is identical to the number of the $yy_e$ if and only if the $xx_e$ are equinumerous with the $yy_e$ ( $\forall xx_e\forall yy_e(\#(xx)=\#(yy)\leftrightarrow xx\approx yy)$ ).
I leave the exercise of deriving this result to the reader. What it shows is that neoFregeans can take the $\rho $ -operator, as it occurs in C-postulate, as an independently interpreted operator, and not take C-postulate to be a definition of $\rho $ . Presumably, there is even room for them to adhere to the idea that the natural numbers are not only individuals but also themselves properties of pluralities, if they nonetheless stick to Roeper’s type-restrictions in their formal theory.
So, neoFregeanism may constitute the best shot at defending Roeper’s C-postulate. Of course, such a defense will only be as successful as the neoFregeans’ defense of the truth and analyticity of plural Hume’s principle. As Roeper himself remarks, that defense is plagued with difficulties—e.g., the Caesar problem and bad company-style objections.Footnote 58 Arguably, upper is not faced with objections of this sort.
7.4 Arithmetical potentialism
According to arithmetical potentialism [Reference Linnebo and Shapiro46], there could be no completed infinitude of natural numbers. Rather, only the following weaker thesis is true:
Potential infinity: Necessarily, every natural number could have had some successor. $\Box \forall x(N(x)\rightarrow \Diamond \exists y(S(x,y))).$
arithmetical potentialism and upper are both formulated in terms of modal resources. Further, they agree that, for each natural number n, there could have been at least n individuals, and none of these views is committed to . Still, the two views differ in a number of respects.Footnote 59
First, potentialists take natural numbers to be individuals, a view rejected by upper’s proponents. An arguably more significant difference is that potentialists reject the possible existence of all possible natural numbers, while arithmetical upper logicists advocate the necessary existence of all possible natural numbers. Further, while some arithmetical potentialists might be attracted to logicist projects, arithmetical potentialism is not directly concerned with the question whether arithmetic is reducible to logic.Footnote 60
To mention just one difficulty for the potentialist programme, it would appear that mathematical entities, qua abstract entities, necessarily exist if they possibly do, so that all possible numbers actually exist after all. Yet, the actual existence of all natural numbers is inconsistent with arithmetical potentialism. One reply available to potentialists is to formulate their view in terms of a modality other than metaphysical necessity while acknowledging the metaphysically necessary existence of all possible natural numbers. They then face the challenge of explaining what this modality consists of.
Focusing on just one alternative interpretation, some potentialists have interpreted the modality in question as interpretational modality (see, e.g., [Reference Linnebo44]). On this understanding, ‘ $\Diamond \varphi $ ’ means that ‘we can abstract so as to make it the case that $\varphi $ ’ [Reference Linnebo44, chap. 3, p. 61], while ‘ $\Box \varphi $ ’ means that ‘no matter how we abstract and thereby shift the meaning of the language, $\varphi $ ’ [Reference Linnebo44, chap. 12, p. 205]. As Linnebo [Reference Linnebo44, chaps. 3, pp. 61–62] puts it, the ‘interpretational’ interpretation of the modal operators is ‘concerned with how the language is interpreted, not with how reality is. In particular, every interpretational possibility is compatible with the metaphysically actual world’.
According to Linnebo [Reference Linnebo44, chap. 8], abstraction is something made by a community. For a community to abstract with respect to an equivalence relation $\sim $ is for it to extend its language and the meanings of the expressions occurring in it so that the meaning of, e.g., the quantifiers and the identity predicate shift so as to make it the case that, e.g., ‘ $\exists w\exists v(Abs_{\sim }(w,y)\wedge Abs_{\sim }(v,z)\wedge w=v)$ ’ is true whenever $y\sim z$ for some y and z, where ‘ $Abs_{\sim }$ ’ is a predicate that applies to any a and b whenever a is the abstract of b with respect to the equivalence relation $\sim $ . This requires that the community uses their language in accordance with appropriate assertibility conditions (specified in [Reference Linnebo44, chap. 8]). Once they do so, the quantifiers’ meanings become ‘more expansive’. Whereas before y’s and z’s respective abstracts were not within the language’s quantifiers’ range, after the meaning shifts they come to be.
In the arithmetical context, one salient option is to abstract via Hume’s principle.Footnote 61 Let ‘ $\overline {0}$ ’ abbreviate ‘number of the property of being self-distinct’ and ‘ $1$ ’ abbreviate ‘number of the property of being identical to $\overline {0}$ ’. Since the property of being self-distinct is equinumerous with itself, we can abstract on equinumerosity, thereby shifting the meanings of our language’s expressions so as to make ‘ $\overline {0}$ is identical to something’ true. Then, ‘the property of being identical to $\overline {0}$ is equinumerous with itself’ will be true under such an extension. And in such a case one can further shift the meanings of our language’s expressions so as to make true ‘ $\overline {1}$ is identical to something’. After this second step of abstraction it will also be true that ‘ $\overline {1}$ is the successor of $\overline {0}$ ’. Since whenever one abstracts on equinumerosity the meanings of our language’s expressions will shift so as to make it the case that ‘something is x’s successor’ is satisfied by some ‘previously given number’, potentialists will contend potential infinity comes out true.
Now, it appears that arithmetical potentialists must take ‘ $\Diamond $ ’ to express a propositional rather than a sentential operator, so that it acts not on formulae but rather on their semantic values. For instance, potentialists take de re modal constructions to be well formed, and several of the theses advocated by them involve quantification into modal contexts. But it is notoriously difficult to make sense of quantification into modal contexts when modal operators are understood as acting on formulae rather than propositions. For example, how are we to make sense of the ‘previously given number’ which will satisfy ‘something is x’s successor’ on a permissible shift of the language’s meanings, after a previous permissible shift of the language’s meanings? After all, we have no number, or even possible number, being given. What we have are language extensions.
But if ‘ $\Box $ ’ and ‘ $\Diamond $ ’ express propositional operators, then presumably ‘ $\Box \forall x(N(x)\rightarrow \Diamond \exists y(S(x,y)))$ ’ expresses that no matter how we abstract so as to shift the meaning of our expressions, for every natural number x, we can abstract so as to shift the meaning of our expressions so that something is x’s successor, with ‘every’, ‘some’, ‘number’, ‘successor’ and so on being used with their actual meanings, not with the meanings they might have had. If that is right, it would appear that ‘ $\Box \forall x(N(x)\rightarrow \Diamond \exists y(S(x,y)))$ ’ expresses no more than that for every natural number, something is its successor. The modal operators occurring it are vacuous.
This observation is reminiscent of the riddle how many legs does a dog have if you call his tail a leg? Just as the right answer is four since calling a tail a leg does not make it a leg, so it is that being able to abstract so as to make it the case that our language’s expressions mean something different from what they actually mean does not make it the case that they actually mean something different. So, if the potentialists’ modal operators act on semantic values rather than formulae, then it is the semantic values that formulae in fact have, not the ones that they would have, which are relevant to the truth of modalized claims.
Yet, this is not how ‘ $\Diamond $ ’ is supposed to function from the standpoint of potentialists. According to them, the truth of ‘ $\Box \forall x(N(x)\rightarrow \Diamond \exists y(S(x,y)))$ ’ should depend not on the meaning that the expressions occurring in it actually have, but rather on the meanings that they would have under some ‘legitimate abstraction’. There is thus an important tension in the potentialists’ interpretational understanding of the modal operators.Footnote 62 By contrast, upper is formulated in terms of plain old, familiar metaphysical modality, and so does not immediately face this sort of difficulty (which is not to say that the question of how to understand metaphysical modality in other terms is not worth exploring).
Let us bracket the concern with the coherence of the arithmetical potentialists’ use of interpretational modality. What would be clarified by understanding arithmetical potentialism in such a way? A possible response (based on [Reference Linnebo44]) is that appealing to interpretational modality promises to account for the sense in which the natural numbers’ existence is ‘thin’. It is thin because it is a product of little more but a shift in the meaning of the language’s quantifiers. Barely no contribution from the world is needed.
The idea is the following. In our present language we do quantify over all the infinitely many natural numbers. But arithmetical potentialism allows us to understand why their existence is ontologically thin. On the basis of the property of being self-distinct’s equinumerosity with itself the meanings of our quantifiers have shifted, so that ‘ $0$ exists’ is now true, with the equinumerosity of the property of being self-distinct grounding $0$ ’s existence. And once that shift occurred, further meaning shifts were made available—without any substantive change in the underlying facts. The present truth of ‘there are infinitely many natural numbers’ is ultimately grounded in those shifts and in equinumerosity facts which come ‘for free’—specifically, that the property of being self-distinct is equinumerous with itself, as is the property of being less than or equal to n, for each natural number n. From this standpoint, the potentiality of the natural numbers encapsulates the permissibility of abstraction-based meaning change through which our quantifiers ended up ranging over infinitely many natural numbers.
Upper’s proponents will tell a different story. Given adherence to a logical theory as weak as $\mathtt {KQL}^{\mathtt {FP}}_{\mathtt {C}}$ , $\mathtt {PA}_{\mathtt {2}}$ can be seen to be true by taking the natural numbers to be finite cardinalities. Logic on its own will then imply not only the actual but also the necessary existence of the infinitely many natural numbers. The natural numbers will then be thin at least insofar as their necessary existence is implied by principles of an extremely general nature—indeed, by purely logical principles.
Which story is more satisfactory? Well, the potentialist explanation arguably relies on a creation myth: the myth of a language created by a community of speakers coordinating on its use so that the meanings of its quantifiers expanded further and further, with the consequence that they have ended up ranging over the infinitely many numbers. I know of no empirical data supporting the claim that the language of any actual community has been created in this way. Furthermore, it would appear that no language of any human community could have been created in this way. After all, in order for our quantifiers to range over infinitely many natural numbers there would have had to have been infinitely many rounds of coordinated meaning-expansion. But surely, no human community could have gone through so many rounds of coordinated meaning-expansion.Footnote 63
So, how can a story along these lines be more satisfactory, vis-à-vis the natural numbers’ ‘thinness’, than the one put forward by upper’s proponents? I, for one, don’t think it is.
8 Conclusion
The paper’s main aim was to introduce upper, a novel, neoRussellian philosophy of arithmetic, and to offer a partial defense of this view. Upper was characterized in Section 2, while its defense took place in Sections 3–6. After having laid out upper’s ground type-theoretic modal language (in Section 3), it was shown that $\mathtt {KQL}_{\mathtt {C}}^{\mathtt {FP}}$ interprets $\mathtt {PA}_{\mathtt {2}}$ via a translation that respects NaC and the Russellian conception of the arithmetical primitives (in Section 4), and that $\mathtt {S5PQL}_{\mathtt {C}}$ implies that every purely arithmetical formula of $\mathrm {MT}$ is necessarily true (in Section 5). Then, a defense was offered in Section 6 of the metaphysical necessity of all theorems of $\mathtt {S5PQL}_{\mathtt {SA,C}}^{\mathtt {FP}}$ . Finally, upper was compared (in Section 7) with Hellman’s modal structuralism, Hodes’s alternative theory, Roeper’s vindication of logicism, and Linnebo and Shapiro’s arithmetical potentialism.
By way of conclusion, upper is a clear instance of the following, more general logicist view:
Finitary Upper Mathematical Logicism (FUML): the conjunction of necessity of FP, higher-orderism and logicality with the following theses:
-
– Higher-type mathematical ontology: The mathematical entities are all of higher types, and mathematics’s primitives are all truly characterizable as higher type-entities definable in pure modal type theory;
-
– Interpretability of mathematics: For every standard, true deductive theory $T_f$ of any mathematical field f, there is a deductive system formulated in a pure modal and type-theoretic language, and that includes FP among its axioms, whose theorems: (i) are all metaphysically necessary truths; and (ii) include the mathematical primitives-respecting translations of all theorems of $T_f$ ;
-
– Necessity of mathematics: The mathematical truths are all metaphysically necessary truths expressible in a pure modal type theory.
Arguably, FUML affords an attractive picture of the mathematical realm and the relationship between mathematics and logic. According to FUML, ‘mathematical entities are higher type-entities, mathematics is higher-order modal logic, and FP is true’. Despite FUML’s attractiveness, defending it presumably amounts to a never-ending project as the process of formulating novel mathematical theories is seemingly never-ending. But that need not count against FUML. Indeed, the development of an upper logicist philosophy of mathematics would presumably amount to ‘mathematical philosophy’, in Russell’s sense of the expression, at its best: a programme of continuously revealing mathematics’s underlying foundations.
The present paper’s aim has been to offer a defense of upper, not of FUML. Notwithstanding, a cogent defense of the former presumably constitutes a ‘proof of concept’ with respect to the latter view’s prospects: if the finitary upper logicist picture is true of arithmetic, then it might cogently extend to other mathematical fields, perhaps all. For instance, in my view recent work on the relationship between set theory and type theory [Reference Degen and Johannsen14, Reference Linnebo and Rayo45] can be deployed at the service of a defense of a form of finitary upper set-theoretical logicism. Alas, the development of such a defense will have be to be left for future work.
A Interpretability of $\mathtt {PA}_{\mathtt {2}}$ in $\mathtt {KQL}_{\mathtt {C}}^{\mathtt {FP}}$ via Russellian translations
In this section is proven Theorem 8. In order to do so, it will be helpful to start by briefly presenting the axiomatization of $\mathtt {PA}_{\mathtt {2}}$ that will be considered:
Definition 11 ( $\mathtt {PA}_{\mathtt {2}}$ ).
$\mathtt {PA}_{\mathtt {2}}$ is the deductive system in language $\mathrm {PA}_{2}$ whose inference rules are $\mathrm {MP}$ and $\mathrm {Gen}$ and whose axioms are all instances of the following axiom schemata (Table 3):
aWhere, U ranges over n-ary predicate variables, $v^1, \ldots, v^n$ range over individual variables, and U occurs nowhere free in $\varphi $ .
The proofs to be offered in what follows will, for the most part, be presented as natural deduction proofs. Suffice it to say that when a line where $\varphi $ occurs is labelled as an assumption, it is really an instance of the $\mathtt {PL}$ -theorem $\varphi \rightarrow \varphi $ . We will also be frequently appealing to the following derived rules:
-
– $\mathrm {(QGen)}$ $\Gamma \vdash _{S} E[v]\rightarrow \varphi \Rightarrow \Gamma \vdash _{S} \forall v\varphi $ , where v occurs nowhere free in $\Gamma $ ;
-
– $(\exists \mathrm {E})$ $(\Gamma \vdash _{S} \exists v\varphi \;\&\;\Delta \vdash _{S} \varphi ^v_{u}\rightarrow \psi )\Rightarrow \Gamma ,\Delta \vdash _{S} \psi $ , where u occurs nowhere free in $\Gamma $ , $\Delta $ or $\psi $ , and the usual provisos preventing recapture of variables apply.
It is a routine exercise to show that these are derived rules of $\mathtt {QL}$ and all its extensions considered in the paper. Finally, the justifications provided for our lines of deduction will be given in terms of the previous lines from which that line is derived, and the name of the system sanctioning the reasoning involved.
Lemma 1. $\vdash _{\mathtt {QL}_{\mathtt {C}}} E[\mathbf {0}]$ , $\vdash _{\mathtt {QL}_{\mathtt {C}}} E[\textbf {S}]$ , $\vdash _{\mathtt {QL}_{\mathtt {C}}} E[\textbf {N}]$ .
Proof of Lemma 1.
A straightforward consequence of $\mathrm {Comp}_{\mathrm {C}}$ , by $\mathtt {PL}$ -reasoning.
Theorem 1 (Axiom ).
$\vdash _{\mathtt {QL}_{\mathtt {C}}}\textbf {N}(\mathbf {0})$ .
Theorem 2 (Axiom ).
$\vdash _{\mathtt {QL}_{\mathtt {C}}}\forall x_{\langle \langle \tau \rangle \rangle }\forall y_{\langle \langle \tau \rangle \rangle }((\textbf {N}(x)\wedge \textbf {S}(x,y))\rightarrow \textbf {N}(y))$ .
Theorem 3 (Axiom $\mathtt {PA}_{\mathtt {2}}$ : $(\text {SF})^{\mathscr {R}}_{\tau }$ ).
$\vdash _{\mathtt {QL}}\forall x_{\langle \langle \tau \rangle \rangle }\forall y_{\langle \langle \tau \rangle \rangle }\forall z_{\langle \langle \tau \rangle \rangle }((\textbf {N}(x)\wedge \textbf {S}(x,y)\wedge \textbf {S}(x,z))\rightarrow y=z)$ .
Proof of Theorem 3.
Provable by straightforward $\mathtt {QL}$ -reasoning.
Before we turn to the proof of the arithmetical axiom according to which zero has no successor, it will be useful to prove an auxiliary lemma about empty relations, which are defined as follows:
Definition 12 (Empty relations).
For every positive integer n and all types $\tau ^1$ , $\ldots $ , $\tau ^n$ :
So, an empty relation is one that obtains between whatever things are both self-identical and not self-identical. The following lemma will be useful later on. As it turns out empty relations exist as a matter of logic, and, unsurprisingly, obtain between no things:
Lemma 2. $\vdash _{\mathtt {QL}_{\mathtt {C}}} E[\Lambda ]$ and $\vdash _{\mathtt {QL}}\neg \Lambda (y^{1},\ldots ,y^{n})$ .
Proof of Lemma 2.
$E[\Lambda ]$ is provable from $\mathrm {Comp}_{\mathrm {C}}$ , and $\neg \Lambda (y^{1},\ldots ,y^{n})$ is provable by $\mathtt {QL}$ -reasoning.
Theorem 4 (Axiom $\mathtt {PA}_{\mathtt {2}}$ : ).
$\vdash _{\mathtt {QL}_{\mathtt {C}}}\neg \exists x_{\langle \langle \tau \rangle \rangle }(\textbf {S}(x,\textbf {0}))$ .
Theorem 5 (Axiom $\mathtt {PA}_{\mathtt {2}}$ : ).
$\vdash _{\mathtt {QL}_{\mathtt {C}}}\forall x_{\langle \langle \tau \rangle \rangle }(\textbf {N}(x)\rightarrow \exists y_{\langle \langle \tau \rangle \rangle }(\textbf {S}(x,y)))$ .
Proof of Theorem 5.
Provable from Lemma 1 by straightforward $\mathtt {QL}_{\mathtt {C}}$ -reasoning.
Theorem 6 (Axiom $\mathtt {PA}_{\mathtt {2}}$ : ).
$\vdash _{\mathtt {QL}}\forall y_{\langle \langle \langle \tau \rangle \rangle \rangle }((y(\textbf {0})\wedge Her[y])\rightarrow \forall x_{\langle \langle \tau \rangle \rangle }(\textbf {N}(x)\rightarrow y(x)))$ .
Proof of Theorem 6.
Provable by straightforward $\mathtt {QL}$ -reasoning.
We will now work our way towards the proof that successor is an injective relation. In order to do so, a number of auxiliary lemmas will be required—starting with a familiar one:
Lemma 3. $\vdash _{\mathtt {QL}_{\mathtt {C}}}\forall x_{\langle \langle \tau \rangle \rangle }(\textbf {N}(x)\rightarrow (x\neq \textbf {0}\rightarrow \exists y_{\langle \langle \tau \rangle \rangle }(\textbf {S}(y,x)\wedge \textbf {N}(y))))$ .
Proof of Lemma 3.
Follows from Lemma 1 and Theorems 1, 2 and 6, by $\mathtt {QL}_{\mathtt {C}}$ -reasoning.
Our next auxiliary result, the equinumerosity lemma, is directly concerned with numbers qua cardinalities. It plays a crucial role in Russell’s original derivation of the injectivity of the successor relation, and it is also proved in the Appendix of [Reference Boolos and George5]. As its name registers, it is formulated in terms of the equinumerosity relation. So let me quickly define it before getting on to stating the lemma.
Definition 13 (Subrelation).
For all types $\tau _1$ , $\ldots $ , $\tau _n$ and positive integers n, and provided that none of $v^1$ , $\ldots $ , $v^n$ occurs free in $\alpha $ nor $\beta $ :
(a) $\alpha _{\langle \tau ^1,\ldots ,\tau ^n\rangle }\subseteq \beta _{\langle \tau ^1,\ldots ,\tau ^n\rangle }:=\forall v^1\ldots \forall v^n(\alpha (v^1,\ldots ,v^n)\leftrightarrow \beta (v^1,\ldots ,v^n))$ ; (b) $\alpha _{\langle \tau ^1,\ldots ,\tau ^n\rangle }\subset \beta _{\langle \tau ^1,\ldots ,\tau ^n\rangle }:=\alpha \subseteq \beta \wedge \neg \beta \subseteq \alpha $ .
Definition 14 (Equinumerosity).
Where, for all types $\tau $ and $\rho $ , $\alpha $ is a type $\langle \tau \rangle $ -singular term, $\beta $ is any type $\langle \rho \rangle $ -singular term, $\pi $ is any type $\langle \langle \tau \rangle ,\langle \rho \rangle \rangle $ -singular term, v and s are, respectively, the first and second type $\tau $ -singular variables that occur nowhere free in $\alpha $ , $\beta $ or $\pi $ , u and t are, respectively, the first and second type $\rho $ -singular variables that occur nowhere free in $\alpha $ , $\beta $ or $\pi $ (or the third and fourth, if $\tau =\rho $ ):
(a) $1{-}1[\pi ]:=\forall v\forall s\forall u((\pi (v,u)\wedge \pi (s,u))\rightarrow v=s)$ ; (b) $Dom[\pi ,\alpha ]:=\forall v(\alpha (v)\leftrightarrow \exists u(\pi (v,u)))$ ; (c) $Func[\pi ]:=\forall v\forall u\forall t((\pi (v,u)\wedge \pi (v,t))\rightarrow u=t)$ ; (d) $Img[\pi ,\beta ]:=\forall u(\beta (u)\leftrightarrow \exists v(\pi (v,u))))$ ; (e) $Inj_{\langle \langle \langle \tau \rangle ,\langle \rho \rangle \rangle ,\langle \tau \rangle ,\langle \rho \rangle \rangle }=_{\mathrm {df}}\lambda x_{\langle \langle \tau \rangle ,\langle \rho \rangle \rangle }y_{\langle \tau \rangle }z_{\langle \rho \rangle }(Func[x]\wedge 1-1[x]\wedge Dom[x,y]\wedge \exists w_{\langle \rho \rangle }(Img[x,w]\wedge w\subseteq z))$ ; (f) $Bij_{\langle \langle \langle \tau \rangle ,\langle \rho \rangle \rangle ,\langle \tau \rangle ,\langle \rho \rangle \rangle }=_{\mathrm {df}}\lambda x_{\langle \langle \tau \rangle ,\langle \rho \rangle \rangle } y_{\langle \tau \rangle }z_{\langle \rho \rangle }(Func[x]\wedge 1-1[x]\wedge Dom[x,y]\wedge Img[x,z])$ ; (e) $\approx _{\langle \langle \tau \rangle ,\langle \rho \rangle \rangle }=_{\mathrm {df}}\lambda y_{\langle \tau \rangle }z_{\langle \rho \rangle } \exists x_{\langle \langle \tau \rangle ,\langle \rho \rangle \rangle }(Bij(x,y,z))$ .
Lemma 4 (Equinumerosity lemma).
$\vdash _{\mathtt {QL}_{\mathtt {C}}}\forall x_{\langle \langle \tau \rangle \rangle }(\textbf {N}(x)\rightarrow \forall y_{\langle \tau \rangle }(x(y)\rightarrow \forall z_{\langle \tau \rangle } (x(z)\leftrightarrow y\approx z)))$ .
Proof of Lemma 4.
A consequence of Lemma 1 and Theorem 6, by $\mathtt {QL}_{\mathtt {C}}$ -reasoning.
Our remaining auxiliary lemmas go beyond the previous results by integrating modality. For this reason we make the reasoning explicit.
Lemma 5. $\vdash \forall x_{\langle \langle \tau \rangle \rangle }(\textbf {N}(x)\rightarrow (\Box \textbf {N}(x)\wedge \Box E[x]))$ .
Our next result is a modal strengthening of the equinumerosity lemma:
Lemma 6 (Modal equinumerosity lemma).
$\vdash \forall x_{\langle \langle \tau \rangle \rangle }(\textbf {N}(x)\rightarrow \Box \forall y_{\langle \tau \rangle }(x(y)\rightarrow \forall z_{\langle \tau \rangle }(x(z)\leftrightarrow y\approx z)))$ .
The next result is arguably the crux of our proof that is a theorem of $\mathtt {KQL}_{\mathtt {C}}^{\mathtt {FP}}$ :
Lemma 7 (Impossible sharing).
$\vdash \forall x_{\langle \langle \tau \rangle \rangle }\forall y_{\langle \langle \tau \rangle \rangle }((\textbf {N}(x)\wedge \textbf {N}(y)\wedge \Diamond \exists z_{\langle \tau \rangle }(x(z)\wedge y(z)))\rightarrow x=y)$ .
Theorem 7 (Axiom $\mathtt {PA}_{\mathtt {2}}$ : ).
$\vdash \forall x_{\langle \langle \tau \rangle \rangle }(\textbf {N}(x)\rightarrow \Diamond \exists y_{\langle \tau \rangle }(x(y)))\rightarrow \forall x_{\langle \langle \tau \rangle \rangle }\forall y_{\langle \langle \tau \rangle \rangle } \forall z_{\langle \langle \tau \rangle \rangle }((\textbf {N}(x)\wedge \textbf {N}(y)\wedge \textbf {S}(x,z)\wedge \textbf {S}(y,z))\rightarrow x=y)$ .
Next we prove a couple of lemmas from which it will follow that, for every type $\tau $ , $\mathrm {FP}_{\tau }$ is derivable in $\mathtt {KQL}_{\mathtt {C}}$ from $\mathrm {FP}_{\mathrm {e}}$ . The first of these results will make use of the following definitions:
Definition 15 (Representative).
For all positive integers n and $i\leq n$ , types $\tau ^1$ , $\ldots $ , $\tau ^n$ , and every $\tau ^i$ -singular term $\alpha $ , and where there is some $j\leq n$ such that $\tau ^j=e$ only if $\tau ^i=e$ , let
where $v^1$ , $\ldots $ , $v^n$ are the first distinct variables of, respectively, types $\tau ^1$ , $\ldots $ , $\tau ^n$ that occur free nowhere in $\alpha $ . We say that $\uparrow [\alpha ]_{\langle \tau ^1,\ldots ,\tau ^n\rangle }$ is the $\langle \tau ^1,\ldots ,\tau ^n\rangle $ -representative of $\alpha $ .
Definition 16 (Lift).
For all positive integers n and $i\leq n$ , types $\tau ^1$ , $\ldots $ , $\tau ^n$ , and every $\langle \tau ^i\rangle $ -singular term $\beta $ , and where there is some $j\leq n$ such that $\tau ^j=e$ only if $\tau ^i=e$ , let
where v and u are, respectively, the first distinct singular variables of, respectively, type $\langle \tau ^1,\ldots ,\tau ^n\rangle $ and type $\tau ^i$ that occur nowhere free in $\beta $ . We say that $\Uparrow [\beta ]_{\langle \langle \tau ^1,\ldots ,\tau ^n\rangle \rangle }$ is the $\langle \langle \tau ^1,\ldots ,\tau ^n\rangle \rangle $ -lift of $\beta $ .
To give an example, let $\alpha $ be ‘ $x_e$ ’, $n=3$ , $\tau _1=e$ , $\tau _2=\langle e\rangle $ and $\tau _{3}=e$ . Then, $\alpha $ ’s $\langle e,\langle e \rangle ,e\rangle $ -representative is $\uparrow [x_{e}]_{\langle e,\langle e\rangle ,e\rangle }:=\lambda y_{e}z_{\langle e\rangle }w_{e}(y=x_{e}=w\wedge z=\Lambda )$ . Further, let $\beta $ be the variable ‘ $u_{\langle e\rangle }$ ’. Then, $\Uparrow [u_{\langle e\rangle }]_{\langle \langle e,\langle e\rangle ,e\rangle \rangle }:=\lambda t_{\langle e,\langle e\rangle ,e\rangle }(\exists x_{e}(u(x)\wedge t=\uparrow [x]_{\langle e,\langle e\rangle ,e\rangle }))$ . Finally, $x_{\langle e\rangle }$ has no type $\langle e,\langle e\rangle \rangle $ representative, since $\langle e\rangle \neq e$ .
Intuitively, $x_{\langle \tau ^i\rangle }$ and its $\langle \langle \tau ^1,\ldots ,\tau ^n\rangle \rangle $ -lift are equinumerous, for every $i\leq n$ . We begin by establishing that this is indeed so.
Lemma 8. $\vdash \Box \forall x_{\langle \tau ^i\rangle }(x\approx \Uparrow [x]_{\langle \langle \tau ^1,\ldots ,\tau ^n\rangle \rangle }\wedge E[\Uparrow [x]_{\langle \langle \tau ^1,\ldots ,\tau ^n\rangle \rangle }])$ , where $1\leq i\leq n$ and, for every $j\leq n$ , $\tau ^j=e$ only if $\tau ^i=e$ .
Lemma 9. $\forall x_{\langle \langle \tau \rangle \rangle }(\textbf {N}(x)\rightarrow \exists y_{\langle \langle \sigma \rangle \rangle }(\textbf {N}(y)\wedge \Box \forall z_{\langle \sigma \rangle }\forall u_{\langle \tau \rangle }((y(z)\wedge z\approx u)\rightarrow x(u))))$ .
Lemma 10. $\vdash \mathrm {FP}_{\mathrm {e}}\rightarrow \mathrm {FP}_{\tau }$ .
Proof of Lemma 10.
The lemma’s proof is by induction on the set of types. The base case is obvious, since $\vdash \mathrm {FP}_{e}\rightarrow \mathrm {FP}_{e}$ , by $\mathtt {PL}$ . We show that $\vdash \mathrm {FP}_{e}\rightarrow \mathrm {FP}_{\langle \tau ^1,\ldots ,\tau ^n\rangle }$ . So, as the inductive hypothesis, assume that $\vdash \mathrm {FP}_{e}\rightarrow \mathrm {FP}_{\tau ^i}$ , for all positive integers n and i such that $1\leq i\leq n$ . Now, let: $\tau :\langle \tau ^1,\ldots ,\tau ^n\rangle $ , and
To prove that $\vdash \mathrm {FP}_{e}\rightarrow \mathrm {FP}_{\langle \tau ^1,\ldots ,\tau ^n\rangle }$ , it suffices, given the inductive hypothesis, to show that $\vdash \mathrm {FP}_{\xi }\rightarrow \mathrm {FP}_{\langle \tau ^1,\ldots ,\tau ^n\rangle }$ . This is what we’ll now show:
This concludes the proof.
Lemma 11. For every theorem $\varphi $ of $\mathtt {PA}_{\mathtt {2}}$ , , for every type $\tau $ , where $\overline {\forall }(\varphi )^{\mathscr {R}}_{\tau }$ is the result of prefixing $(\varphi )^{\mathscr {R}}_{\tau }$ with any sequence of universal quantifiers binding all variables occurring in $(\varphi )^{\mathscr {R}}_{\tau }$ .
Proof of Lemma 11.
The proof is by induction on the length of a $\mathtt {PA}_{\mathtt {2}}$ -derivation. The base case has the following subcases: (i) $\varphi $ is an arithmetical axiom of $\mathtt {PA}_{\mathtt {2}}$ ; (ii) $\varphi $ is a $\mathtt {PL}$ theorem; (iii) $\varphi $ is an instance of $\forall {\mathrm {1}}$ , $\forall {\mathrm {2}}$ or $\mathrm {Ind}$ ; (iv) $\varphi $ is an instance of $\mathrm {{=}I}$ ; (v) $\varphi $ is an instance of $\mathrm {C}{\forall }\mathrm {E}$ ; (vi) $\varphi $ is an instance of $\mathrm {Comp}$ :
Subcase (i): $\varphi $ is an arithmetical axiom of $\mathtt {PA}_{\mathtt {2}}$ . Then , by Theorems 1–7. Hence, , by $\mathtt {QL}$ -reasoning. So, , by $\mathtt {QL}$ -reasoning. So, , by $\mathtt {QL}$ -reasoning.
Subcase (ii): $\varphi $ is a $\mathtt {PL}$ -theorem. Then $\vdash (\varphi )^{\mathscr {R}}_{\tau }$ , since $(\cdot )^{\mathscr {R}}_{\tau }$ preserves Boolean connectives, and every $\mathtt {PL}$ -theorem is an axiom of $\mathtt {KQL}_{\mathtt {C}}$ . So, $\vdash \overline {\forall }(\varphi )^{\mathscr {R}}_{\tau }$ , by $\mathtt {QL}$ -reasoning. Hence, $\vdash \mathrm {FP}\rightarrow \overline {\forall }(\varphi )^{\mathscr {R}}_{\tau }$ , by $\mathtt {PL}$ -reasoning.
Subcase (iii): $\varphi $ is an instance of ${\forall }\mathrm {1}$ , ${\forall }\mathrm {2}$ , or $\mathrm {Ind}$ . Then, $\vdash (\varphi )^{\mathscr {R}}_{\tau }$ . So, $\vdash \overline {\forall }(\varphi )^{\mathscr {R}}_{\tau }$ , by $\mathtt {QL}$ -reasoning. Hence, $\vdash \mathrm {FP}\rightarrow \overline {\forall }(\varphi )^{\mathscr {R}}_{\tau }$ , by $\mathtt {PL}$ -reasoning.
Subcase (iv): $\varphi $ is an instance of $\mathrm {{=}I}$ . Then $\varphi :v=v$ . Let $\overline {\forall }$ be a sequence of the form $\forall ^{\star }\forall (v)^{\mathscr {R}}_{\tau }\forall ^{\dagger }$ , where $\forall ^{\star }$ and $\forall ^{\dagger }$ are themselves sequences of universal quantifiers and $\forall (v)^{\mathscr {R}}_{\tau }$ does not occur in $\forall ^{\dagger }$ . Then, $\vdash (v=v)^{\mathscr {R}}_{\tau }\rightarrow \forall ^{\dagger }((v=v)^{\mathscr {R}}_{\tau })$ , by repeated applications of $\forall {\mathrm {1}}$ , $\forall {\mathrm {2}}$ and $\mathrm {Gen}$ . By a further application of $\mathrm {Gen}$ we get that $\vdash \forall (v)^{\mathscr {R}}_{\tau }((v=v)^{\mathscr {R}}_{\tau }\rightarrow \forall ^{\dagger }((v=v)^{\mathscr {R}}_{\tau }))$ , and so $\vdash \forall (v)^{\mathscr {R}}_{\tau }((v=v)^{\mathscr {R}}_{\tau })\rightarrow \forall (v)^{\mathscr {R}}_{\tau }\forall ^{\dagger }((v=v)^{\mathscr {R}}_{\tau })$ , by $\forall {\mathrm {1}}$ . Since $\vdash \forall (v)^{\mathscr {R}}_{\tau }((v=v)^{\mathscr {R}}_{\tau })$ , by $\forall {=}$ and $\mathtt {QL}$ -reasoning, we thus obtain the result that $\vdash \forall (v)^{\mathscr {R}}_{\tau }\forall ^{\dagger }((v=v)^{\mathscr {R}}_{\tau })$ . So, $\vdash \forall ^{\star }\forall (v)^{\mathscr {R}}_{\tau }\forall ^{\dagger }((v=v)^{\mathscr {R}}_{\tau })$ by repeated applications of $\mathrm {Gen}$ . Therefore, $\vdash \mathrm {FP}\rightarrow \overline {\forall }((v=v)^{\mathscr {R}}_{\tau })$ , by $\mathtt {PL}$ -reasoning.
Subcase (v): $\varphi $ is an instance of $\mathrm {C}{\forall }\mathrm {E}$ . Then, $\varphi :=\forall v\psi \rightarrow \psi ^{v}_{\alpha }$ . We have that $\vdash E[(\alpha )^{\mathscr {R}}_{\tau }]\rightarrow (\forall (v)^{\mathscr {R}}_{\tau }(\psi )^{\mathscr {R}}_{\tau }\rightarrow (\psi ^{v}_{\alpha })^{\mathscr {R}}_{\tau })$ . Now, $\alpha $ is either the constant ‘ $0$ ’, or else an individual variable u. Suppose first that it is the constant ‘ $0$ ’. Then, $\vdash E[(\alpha )^{\mathscr {R}}_{\tau }]$ , by Lemma 1. So, $\vdash \forall (v)^{\mathscr {R}}_{\tau }(\psi )^{\mathscr {R}}_{\tau }\rightarrow (\psi ^{v}_{\alpha })^{\mathscr {R}}_{\tau }$ , by $\mathtt {PL}$ . That is, $\vdash (\varphi )^{\mathscr {R}}_{\tau }$ . Therefore, $\vdash \overline {\forall }(\varphi )^{\mathscr {R}}_{\tau }$ , by $\mathtt {QL}$ -reasoning. Hence, $\vdash \mathrm {FP}\rightarrow \overline {\forall }(\varphi )^{\mathscr {R}}_{\tau }$ , by $\mathtt {PL}$ -reasoning.
Suppose instead that $\alpha $ is the variable u. Then, $\vdash E[(u)^{\mathscr {R}}_{\tau }]\rightarrow (\forall (v)^{\mathscr {R}}_{\tau }(\psi )^{\mathscr {R}}_{\tau }\rightarrow {(\psi )^{\mathscr {R}}_{\tau }}^{(v)^{\mathscr {R}}_{\tau }}_{(u)^{\mathscr {R}}_{\tau }})$ , and $\overline {\forall }$ is a sequence of the form $\forall ^{\star }\forall (u)^{\mathscr {R}}_{\tau }\forall ^{\dagger }$ , where $\forall ^{\star }$ and $\forall ^{\dagger }$ are themselves sequences of universal quantifiers, and $\forall (u)^{\mathscr {R}}_{\tau }$ does not occur in $\forall ^{\dagger }$ . By repeated applications of $\mathrm {Gen}$ , $\forall {\mathrm {1}}$ and $\forall {\mathrm {2}}$ we get that $\vdash E[(u)^{\mathscr {R}}_{\tau }]\rightarrow \forall ^{\dagger }(\forall (v)^{\mathscr {R}}_{\tau }(\psi )^{\mathscr {R}}_{\tau }\rightarrow {(\psi )^{\mathscr {R}}_{\tau }}^{(v)^{\mathscr {R}}_{\tau }}_{(u)^{\mathscr {R}}_{\tau }})$ . A further application of $\mathrm {Gen}$ shows that $\vdash \forall (u)^{\mathscr {R}}_{\tau }(E[(u)^{\mathscr {R}}_{\tau }]\rightarrow \forall ^{\dagger }(\forall (v)^{\mathscr {R}}_{\tau }(\psi )^{\mathscr {R}}_{\tau }\rightarrow {(\psi )^{\mathscr {R}}_{\tau }}^{(v)^{\mathscr {R}}_{\tau }}_{(u)^{\mathscr {R}}_{\tau }}))$ , and we get that $\vdash \forall (u)^{\mathscr {R}}_{\tau }(E[(u)^{\mathscr {R}}_{\tau }])\rightarrow \forall (u)^{\mathscr {R}}_{\tau }\forall ^{\dagger }(\forall (v)^{\mathscr {R}}_{\tau }(\psi )^{\mathscr {R}}_{\tau }\rightarrow {(\psi )^{\mathscr {R}}_{\tau }}^{(v)^{\mathscr {R}}_{\tau }}_{(u)^{\mathscr {R}}_{\tau }})$ , by a further application of $\forall {\mathrm {1}}$ . So, $\vdash \forall (u)^{\mathscr {R}}_{\tau }\forall ^{\dagger }(\forall (v)^{\mathscr {R}}_{\tau }(\psi )^{\mathscr {R}}_{\tau }\rightarrow {(\psi )^{\mathscr {R}}_{\tau }}^{(v)^{\mathscr {R}}_{\tau }}_{(u)^{\mathscr {R}}_{\tau }})$ , by $\forall \mathrm {E}2$ . Finally, repeated applications of $\mathrm {Gen}$ yield that $\vdash \forall ^{\star }\forall (u)^{\mathscr {R}}_{\tau }\forall ^{\dagger }(\forall (v)^{\mathscr {R}}_{\tau }(\psi )^{\mathscr {R}}_{\tau }\rightarrow {(\psi )^{\mathscr {R}}_{\tau }}^{(v)^{\mathscr {R}}_{\tau }}_{(u)^{\mathscr {R}}_{\tau }})$ . That is, $\vdash \overline {\forall }(\varphi )^{\mathscr {R}}_{\tau }$ . So, $\vdash \mathrm {FP}\rightarrow \overline {\forall }(\varphi )^{\mathscr {R}}_{\tau }$ , by $\mathtt {PL}$ -reasoning.
Subcase (vi): $\varphi $ is an instance of $\mathrm {Comp}$ . Then, $\varphi :=\exists U\forall v^1\ldots \forall v^n(U(v^1,\ldots ,v^n)\leftrightarrow \psi )$ . By $\mathrm {Comp}_{\mathrm {C}}$ we get $\vdash E_{\lambda (v^1)^{\mathscr {R}}_{\tau }\ldots (v^n)^{\mathscr {R}}_{\tau }(\psi )^{\mathscr {R}}_{\tau }}\rightarrow E[\lambda (v^1)^{\mathscr {R}}_{\tau }\ldots (v^n)^{\mathscr {R}}_{\tau }(\psi )^{\mathscr {R}}_{\tau }]$ . Now, $\vdash E[\lambda (v^1)^{\mathscr {R}}_{\tau }\ldots (v^n)^{\mathscr {R}}_{\tau }(\psi )^{\mathscr {R}}_{\tau }]\rightarrow (E[(v^n)^{\mathscr {R}}_{\tau }]\rightarrow (\ldots \rightarrow (E[(v^1)^{\mathscr {R}}_{\tau }]\rightarrow ((\psi )^{\mathscr {R}}_{\tau }\leftrightarrow \lambda (v^1)^{\mathscr {R}}_{\tau }\ldots (v^n)^{\mathscr {R}}_{\tau }(\psi )^{\mathscr {R}}_{\tau })((v^1)^{\mathscr {R}}_{\tau },\ldots ,(v^n)^{\mathscr {R}}_{\tau }))\ldots ))$ . So, repeated applications of $\mathrm {Gen}$ , $\forall {\mathrm {1}}$ , $\forall {\mathrm {2}}$ and ${\forall \mathrm {E}2}$ yield $\vdash E[\lambda (v^1)^{\mathscr {R}}_{\tau }\ldots (v^n)^{\mathscr {R}}_{\tau }(\psi )^{\mathscr {R}}_{\tau }]\rightarrow \forall (v^1)^{\mathscr {R}}_{\tau }\ldots \forall (v^n)^{\mathscr {R}}_{\tau } ((\psi )^{\mathscr {R}}_{\tau }\leftrightarrow \lambda (v^1)^{\mathscr {R}}_{\tau }\ldots (v^n)^{\mathscr {R}}_{\tau }(\psi )^{\mathscr {R}}_{\tau }) ((v^1)^{\mathscr {R}}_{\tau },\ldots ,(v^n)^{\mathscr {R}}_{\tau })$ . Hence, $\vdash E[\lambda (v^1)^{\mathscr {R}}_{\tau }\ldots (v^n)^{\mathscr {R}}_{\tau }(\psi )^{\mathscr {R}}_{\tau }]\hspace{-1pt}\rightarrow\hspace{-1pt} \exists (U)^{\mathscr {R}}_{\tau }\forall (v^1)^{\mathscr {R}}_{\tau }\ldots \forall (v^n)^{\mathscr {R}}_{\tau } ((\psi )^{\mathscr {R}}_{\tau } \leftrightarrow (U)^{\mathscr {R}}_{\tau }((v^1)^{\mathscr {R}}_{\tau },\ldots ,(v^n)^{\mathscr {R}}_{\tau }))$ , by $\mathtt {QL}$ -reasoning. So, $\vdash E_{\lambda (v^1)^{\mathscr {R}}_{\tau }\ldots (v^n)^{\mathscr {R}}_{\tau }(\psi )^{\mathscr {R}}_{\tau }}\rightarrow \exists (U)^{\mathscr {R}}_{\tau }\forall (v^1)^{\mathscr {R}}_{\tau } \ldots \forall (v^n)^{\mathscr {R}}_{\tau } ((\psi )^{\mathscr {R}}_{\tau }\leftrightarrow (U)^{\mathscr {R}}_{\tau }((v^1)^{\mathscr {R}}_{\tau },\ldots ,(v^n)^{\mathscr {R}}_{\tau }))$ . But then, proceeding as in the proof of subcase (iv), we can derive, by repeated applications of $\mathrm {Gen}$ , $\forall {\mathrm {1}}$ , $\forall {\mathrm {2}}$ and ${\forall \mathrm {E}2}$ , that $\vdash \overline {\forall }\exists (U)^{\mathscr {R}}_{\tau }\forall (v^1)^{\mathscr {R}}_{\tau }\ldots \forall (v^n)^{\mathscr {R}}_{\tau } ((\psi )^{\mathscr {R}}_{\tau }\leftrightarrow (U)^{\mathscr {R}}_{\tau }((v^1)^{\mathscr {R}}_{\tau },\ldots ,(v^n)^{\mathscr {R}}_{\tau }))$ . That is, $\vdash \overline {\forall }(\varphi )^{\mathscr {R}}_{\tau }$ . So, $\vdash \mathrm {FP}\rightarrow \overline {\forall }(\varphi )^{\mathscr {R}}_{\tau }$ . This concludes the proof of the base case.
For the inductive case, there are two subcases two consider: (i) $\varphi $ is derived by an application of $\mathrm {Gen}$ ; (ii) $\varphi $ is derived by an application of $\mathrm {MP}$ .
Subcase (i): $\varphi $ is derived by an application of $\mathrm {Gen}$ . Then, $\varphi :=\forall v\psi $ . By the induction hypothesis, we have that $\vdash \mathrm {FP}\rightarrow \overline {\forall }(\psi )^{\mathscr {R}}_{\tau }$ , for any universal closure $\overline {\forall }(\psi )^{\mathscr {R}}_{\tau }$ of $(\psi )^{\mathscr {R}}_{\tau }$ . But then, $\vdash \mathrm {FP}\rightarrow \overline {\forall }\forall (v)^{\mathscr {R}}_{\tau }(\psi )^{\mathscr {R}}_{\tau }$ , for any universal closure $\overline {\forall }\forall (v)^{\mathscr {R}}_{\tau }(\psi )^{\mathscr {R}}_{\tau }$ of $\forall (v)^{\mathscr {R}}_{\tau }(\psi )^{\mathscr {R}}_{\tau }$ . I.e., we have that $\vdash \mathrm {FP}\rightarrow \overline {\forall }(\varphi )^{\mathscr {R}}_{\tau }$ .
Subcase (ii): $\varphi $ is derived by an application of $\mathrm {MP}$ . Then, by the induction hypothesis, we have that $\vdash \mathrm {FP}\rightarrow \overline {\forall }(\psi )^{\mathscr {R}}_{\tau }$ and $\vdash \mathrm {FP}\rightarrow \overline {\forall }(\psi \rightarrow \varphi )^{\mathscr {R}}_{\tau }$ . Let $\overline {\forall }$ be the sequence $\forall v^1\ldots \forall v^n$ . Then, by repeated applications of $\forall {\mathrm {E}}$ , we get $\vdash \mathrm {FP}\rightarrow (E[v^n]\rightarrow (\ldots \rightarrow (E[v^1]\rightarrow (\psi )^{\mathscr {R}}_{\tau })\ldots ))$ and $\vdash \mathrm {FP}\rightarrow (E[v^n]\rightarrow (\ldots \rightarrow (E[v^1]\rightarrow (\psi \rightarrow \varphi )^{\mathscr {R}}_{\tau })\ldots ))$ . By $\mathtt {PL}$ -reasoning we get $\vdash \mathrm {FP}\rightarrow (E[v^n]\rightarrow (\ldots \rightarrow (E[v^1]\rightarrow (\varphi )^{\mathscr {R}}_{\tau })\ldots ))$ . But then, proceeding as in the proof of subcase (v) of the base case, we can derive, by repeated applications of $\mathrm {Gen}$ , $\forall {\mathrm {1}}$ , $\forall {\mathrm {2}}$ and ${\forall \mathrm {E}2}$ , $\vdash \mathrm {FP}\rightarrow \overline {\forall }(\varphi )^{\mathscr {R}}_{\tau }$ . This concludes the proof.
We thus obtain Theorem 8 as a straightforward corollary of Lemma 11:
Theorem 8 (Interpretability).
$\mathtt {KQL}_{\mathtt {C}}^{\mathtt {FP}}$ interprets $\mathtt {PA}_{\mathtt {2}}$ via the Russellian translation $(\cdot )^{\mathscr {R}}_{\tau }$ , for each type $\tau $ . More precisely, for every closed theorem $\varphi $ of $\mathtt {PA}_{\mathtt {2}}$ , $(\varphi )^{\mathscr {R}}_{\tau }$ is a theorem of $\mathtt {KQL}_{\mathtt {C}}^{\mathtt {FP}}$ , for every type $\tau $ .
B Proof that $\mathrm {FP}$ implies its necessity
We begin by observing that the following result, stronger than Theorem 6, is also derivable by using nothing but $\mathtt {QL}$ -reasoning:
Lemma 12. $\vdash _{\mathtt {QL}}\forall y_{\langle \langle \langle \tau \rangle \rangle \rangle }((y(\textbf {0})\wedge Her[y])\rightarrow (\textbf {N}(x_{\langle \langle \tau \rangle \rangle })\rightarrow y(x_{\langle \langle \tau \rangle \rangle })))$ .
Owing to Lemma 12, it is easy to see that Lemma 5’s proof can easily be adapted to establish the following lemmas:
Lemma 13. $\vdash \textbf {N}(x)\rightarrow E[x]$ .
Lemma 14. $\vdash \textbf {N}(x)\rightarrow (\Box \textbf {N}(x)\wedge \Box E[x])$ .
The following result can then be established:
Lemma 15. .
C Necessity of arithmetic
The main aim of this section is to prove that the result $(\varphi )^{\mathcal {A}}_{\tau }$ of translating each closed formula $\varphi $ of $\mathrm {PA}_{2}$ to $\mathrm {PA}_{2}$ is, if true, necessarily so. That is, it is proven that $(\varphi )^{\mathcal {A}}_{\tau }\rightarrow \Box (\varphi )^{\mathcal {A}}_{\tau }$ , for every closed formula $\varphi $ of $\mathrm {PA}_{2}$ .
Say that a relation is arithmetical just in case it necessarily exists and is essentially and rigidly about natural numbers. That is:
Definition 17 (Arithmetical relations).
Where $\alpha $ is a type $\langle \langle \langle \tau \rangle \rangle \rangle _n$ -singular term, and, for every i such that $1\leq i\leq n$ , $v^{i}$ is a type $\langle \langle \tau \rangle \rangle $ -singular variable which occurs nowhere in $\alpha $ :
Then, the proof that $(\varphi )^{\mathcal {A}}_{\tau }\rightarrow \Box (\varphi )^{\mathcal {A}}_{\tau }$ , for every closed formula $\varphi $ of $\mathrm {PA}_{2}$ , relies on the following lemma: if all the parameters occurring in $(\varphi )^{\mathcal {A}}_{\tau }$ are natural numbers or arithmetical relations, then $(\varphi )^{\mathcal {A}}_{\tau }\rightarrow \Box (\varphi )^{\mathcal {A}}_{\tau }$ . It furthermore relies on the result that, for every formula $\psi $ of $\mathrm {PA}_{2}$ , if all the parameters occurring in $(\forall V\psi )^{\mathcal {A}}_{\tau }$ are either natural numbers or arithmetical relations, then if every arithmetical relation V is such that $(\psi )^{\mathcal {A}}_{\tau }$ , then every relation V is such that $(\psi )^{\mathcal {A}}_{\tau }$ .
This last result relies on showing that whenever some relation V is such that $(\psi )^{\mathcal {A}}_{\tau }$ , there is a relation of a particular kind, defined in terms of V, which is arithmetical and such that $(\psi )^{\mathcal {A}}_{\tau }$ . I’ll now turn to the definition of this particular kind of arithmetical relation, which will be characterized partly in terms of plural resources.
I’ll start by defining a relation’s ‘pluralization’ and a plurality’s ‘relationization’. I’ll then show that if U is the relationization of the pluralization of some relation R, then U is an arithmetical relation.
Definition 18 (Pluralization).
For each type $\tau $ , type $\langle \langle \langle \tau \rangle \rangle \rangle _n$ -plural variable $vv$ , type $\langle \langle \langle \tau \rangle \rangle \rangle _n$ -singular variable V, and where u is a type $\langle \langle \langle \tau \rangle \rangle \rangle _n$ singular variable distinct from V:
So, the $yy_{\langle \langle \langle \tau \rangle \rangle \rangle _n}$ are a pluralization of relation $x_{\langle \langle \langle \tau \rangle \rangle \rangle _n}$ —that is, $\mathrm {Pl}[yy_{\langle \langle \langle \tau \rangle \rangle \rangle _n},x_{\langle \langle \langle \tau \rangle \rangle \rangle _n}]$ —just in case the $yy_{\langle \langle \langle \tau \rangle \rangle \rangle _n}$ are relations each one of which holds of $w^1$ , $\ldots $ , $w^n$ just in case each $w^i$ is identical to some $z^i$ such that $z^i$ is a natural number and $z^1$ , $\ldots $ , $z^n$ stand in relation $x_{\langle \langle \langle \tau \rangle \rangle \rangle _n}$ .
Definition 19 (Relationization).
For each type $\tau $ and type $\langle \langle \langle \tau \rangle \rangle \rangle _n$ -plural variable $vv$ :
So, the relationization of the $yy_{\langle \langle \langle \tau \rangle \rangle \rangle _n}$ , $\mathrm {Rel}[yy_{\langle \langle \langle \tau \rangle \rangle \rangle _n}]$ , is the relation of being $w^{1}_{\langle \langle \tau \rangle \rangle }$ , $\ldots $ , $w^{n}_{\langle \langle \tau \rangle \rangle }$ such that $w^{1}_{\langle \langle \tau \rangle \rangle }$ , $\ldots $ , $w^{n}_{\langle \langle \tau \rangle \rangle }$ stand in some relation among the $yy_{\langle \langle \langle \tau \rangle \rangle \rangle _n}$ .
Proving the result that a relation is arithmetical if it is the relationization of the pluralization of some relation requires establishing a number of auxiliary lemmas about pluralities of ‘ordered n-tuples’ of natural numbers. I’ll first introduce a type-theoretic characterization of ordered n-tuples’ of natural numbers, and of pluralities of these. I’ll then proceed to the proof of the auxiliary lemmas.
Definition 20 (Ordered n-tuples of natural numbers).
For each type $\tau $ , and where $\alpha $ is a type $\langle \langle \langle \tau \rangle \rangle \rangle _{n}$ -singular term, for each i such that $1\leq i\leq n$ , $v^i$ and $u^i$ are type $\langle \langle \tau \rangle \rangle $ -singular variables none of which occur in $\alpha $ , and such that $v^i$ and $u^j$ are distinct, for every i and j such that $1\leq i,j\leq n$ :
Then, pluralities of ordered n-tuples are defined as follows:
Definition 21 (Plurality of ordered n-tuples of natural numbers).
Where $vv$ is a type $\langle \langle \langle \tau \rangle \rangle \rangle _{n}$ -plural variable, and u is a type $\langle \langle \langle \tau \rangle \rangle \rangle _{n}$ -singular variable, for each type $\tau $ :
Lemma 16. $\vdash \mathrm {OrdN}[\alpha ]\rightarrow (\Box \mathrm {OrdN}[\alpha ]\wedge \Box E[\alpha ])$ .
Lemma 17. $\vdash _{\mathtt {S5PQL}_{\mathtt {C}}} (\mathrm {POrdN}[vv]\wedge E[vv])\rightarrow (\Box \mathrm {POrdN}[vv]\wedge \Box E[vv])$ .
Lemma 18. $\vdash \mathrm {OrdN}[\alpha ]\rightarrow \Box (\alpha (w^{1},\ldots ,w^{n})\rightarrow (\bigwedge _{1\leq i\leq n}(\textbf {N}(w^{i}))\wedge \Box \alpha (w^{1},\ldots ,w^{n})))$ .
Lemma 19. $\vdash _{\mathtt {S5PQL}_{\mathtt {C}}}(\mathrm {POrdN}[vv]\wedge E[vv])\rightarrow \Box \forall w^1\ldots \Box \forall w^n\Box (\mathrm {Rel}[yy](w^{1},\ldots ,w^{n}) \rightarrow (\bigwedge _{1\leq i\leq n}(\textbf {N}(v^i))\wedge \Box Rel[yy](w^{1},\ldots ,w^{n})))$ .
Lemma 20. $\vdash _{\mathtt {PQL}}\mathrm {Pl}[vv,V]\rightarrow \mathrm {POrdN}[vv]$ .
Lemma 21. $\vdash _{\mathtt {S5PQL}_{\mathtt {C}}} (\mathrm {Pl}[vv,V]\wedge E[vv])\rightarrow \Box \forall w^1\ldots \Box \forall w^n\Box (\mathrm {Rel}[vv](w^{1},\ldots ,w^{n})\rightarrow (\bigwedge _{1\leq i\leq n}(\textbf {N}(v^i))\wedge \Box \mathrm {Rel}[vv](w^{1},\ldots ,w^{n})))$ .
Lemma 22. $\vdash _{\mathtt {S5PQL}_{\mathtt {C}}} (\mathrm {Pl}[vv,V]\wedge E[vv])\rightarrow \Box E[\mathrm {Rel}[vv]]$ .
We are now in a position to offer a proof of Theorem 9, a crucial element in our case for the claim the truths of pure arithmetic are all metaphysically necessary (Section 5). The proof of this result makes use of some basic facts about arithmetical relations:
Lemma 23. $\vdash _{\mathtt {S5PQL}_{\mathtt {C}}}(\mathrm {Pl}[vv,V]\wedge E[vv])\rightarrow Ar[\mathrm {Rel}[vv]]$ .
Lemma 24. $\vdash _{\mathtt {KQL}}Ar[\Lambda ]$ .
Lemma 25. $\vdash _{\mathtt {S5PQL}}Ar[v_{\langle \langle \langle \tau \rangle \rangle \rangle _{n}}]\rightarrow \Box Ar[v_{\langle \langle \langle \tau \rangle \rangle \rangle _{n}}]$ .
Proof of Lemma 25.
A straightforward consequence, by $\mathtt {S5}$ -reasoning, of the definition of $Ar[\cdot ]$ .
Now, consider the following notational conventions. For each type $\tau $ and formula $\varphi $ of $\mathrm {MT}$ , and where $\alpha ^1$ , $\ldots $ , $\alpha ^n$ are all the type $\tau $ -singular parameters occurring in $\varphi $ , let
Similarly, where $\beta ^1$ , $\ldots $ , $\beta ^m$ are all the type $\langle \langle \tau \rangle \rangle \rangle _n$ -singular parameters occurring in $\varphi $ , for any n, let
Finally, for each type $\tau $ and formula $\varphi $ of $\mathrm {MT}$ , let $\overline {Ar}_{\tau }[\varphi ]:=\textbf {N}[\varphi ]\wedge Ar[\varphi ]$ . Then:
Lemma 26. For every formula $\varphi $ of $\mathrm {PA}_{2}$ in which the n-ary predicate variable V occurs free:
-
(A) $\vdash _{\mathtt {PQL}_{\mathtt {C}}}\textbf {N}[(\varphi )^{\mathscr {A}}_{\tau }]\rightarrow \forall yy_{\langle \langle \langle \tau \rangle \rangle \rangle _n}(\mathrm {Pl}[yy,V_{\langle \langle \langle \tau \rangle \rangle \rangle _n}] \rightarrow ((\varphi )^{\mathscr {A}}_{\tau }\leftrightarrow ((\varphi )^{\mathscr {A}}_{\tau })^{V_{\langle \langle \langle \tau \rangle \rangle \rangle _n}}_{\mathrm {Rel}[yy]}))$ .
-
(B) $\vdash _{\mathtt {QL}_{\mathtt {C}}}\textbf {N}[(\varphi )^{\mathscr {A}}_{\tau }]\rightarrow (\neg \exists z^{1}_{\langle \langle \tau \rangle \rangle }\ldots \exists z^{n}_{\langle \langle \tau \rangle \rangle }(\bigwedge _{1\leq i\leq n}(\textbf {N}(z^i))\wedge V(z^1,\ldots ,z^n))\rightarrow ((\varphi )^{\mathscr {A}}_{\tau }\leftrightarrow ((\varphi )^{\mathscr {A}}_{\tau })^{V_{\langle \langle \langle \tau \rangle \rangle \rangle _n}}_{\Lambda _{\langle \langle \langle \tau \rangle \rangle \rangle _n}}))).$
Proof of Lemma 26.
The proof is by induction on the complexity of $\varphi $ . We prove $(A)$ and $(B)$ in one go, considering the base and inductive cases for each one of them.
Base case: the only interesting subcase is when $\varphi :=U(\alpha ^1,\ldots ,\alpha ^n)$ and $V=U$ . Case $(A)$ :
Case $(B)$ :
Inductive case (i): $\varphi :=\neg \psi $ . Cases $(A)$ and $(B)$ follow straightforwardly from the inductive hypothesis.
Inductive case (ii): $\varphi :=\psi \wedge \chi $ . Case $(A)$ :
The proof of case $(B)$ (inductive case (ii)) proceeds in a similar fashion, and so is left to the reader.
Inductive case (iii): $\varphi :=\forall v\psi $ . Case $(A)$ :
The proof of $(B)$ (inductive case (iii)) proceeds in a similar fashion. The proof of inductive case (iv), $\varphi :=\forall V\psi $ (cases $(A)$ and $(B)$ ), is similar to that of inductive case (iii), and so is also left out. This concludes the proof.
Lemma 27. $\vdash _{\mathtt {S5PQL}_{\mathtt {C}}}\overline {Ar}[(\forall V\psi )^{\mathscr {A}}_{\tau }]\hspace{-1pt}\rightarrow\hspace{-1pt} (\forall V_{\langle \langle \langle \tau \rangle \rangle \rangle _n}(Ar(V)\rightarrow (\psi )^{\mathscr {A}}_{\tau })\rightarrow \forall V_{\langle \langle \langle \tau \rangle \rangle \rangle _n}((\psi ))^{\mathscr {A}}_{\tau }))$ , for every formula $\psi $ of $\mathrm {PA}_{2}$ .
The next lemma is the crux of the proof here given that pure arithmetic is necessary:
Lemma 28. For each type $\tau $ , $\overline {Ar}[(\varphi )^{\mathscr {A}}_{\tau }]\vdash _{\mathtt {S5PQL}_{\mathtt {C}}}(\varphi )^{\mathscr {A}}_{\tau }\rightarrow \Box (\varphi )^{\mathscr {A}}_{\tau }$ , for every formula $\varphi $ of $\mathrm {PA}_{2}$ and every type $\tau $ .
Proof of Lemma 28.
The proof is by induction on the complexity of $\varphi $ .
Base case (i): $\varphi :=N(\alpha )$ . Then, $(\varphi )^{\mathscr {A}}_{\tau }=\textbf {N}(\alpha _{\langle \langle \tau \rangle \rangle })$ . So:
This establishes base case (i). Base case (ii): $\varphi :=S(\alpha ,\beta )$ . Then, $(\varphi )^{\mathscr {A}}_{\tau }=\textbf {S}(\alpha _{\langle \langle \tau \rangle \rangle },\beta _{\langle \langle \tau \rangle \rangle })$ . $\overline {Ar}_{\tau }[(\varphi )^{\mathscr {A}}_{\tau }]=\bigwedge _{\substack {\delta \in \{\alpha _{\langle \langle \tau \rangle \rangle },\beta _{\langle \langle \tau \rangle \rangle }\},\\\delta \neq \ulcorner 0_{\langle \langle \tau \rangle \rangle }\urcorner }}(\textbf {N}(\delta ))$ . So:
This establishes base case (ii). Base case (iii): $\varphi :=V(\alpha ^1,\ldots ,\alpha ^n)$ . Then, $(\varphi )^{\mathscr {A}}_{\tau }=V_{\langle \langle \langle \tau \rangle \rangle ,\ldots ,\langle \langle \tau \rangle \rangle \rangle }(\alpha ^1_{\langle \langle \tau \rangle \rangle },\ldots ,\alpha ^{n}_{\langle \langle \tau \rangle \rangle })$ , and $\overline {Ar}_{\tau }[(\varphi )^{\mathscr {A}}_{\tau }]=Ar(V_{\langle \langle \langle \tau \rangle \rangle ,\ldots ,\langle \langle \tau \rangle \rangle \rangle })\wedge \bigwedge _{\substack {\delta \in \{\alpha _{\langle \langle \tau \rangle \rangle },\beta _{\langle \langle \tau \rangle \rangle }\},\\\delta \neq \ulcorner 0_{\langle \langle \tau \rangle \rangle }\urcorner }}(\textbf {N}(\delta ))$ . So we have that:
This establishes base case (iii). Base case (iv): $\varphi :=\alpha =\beta $ . Then, $(\varphi )^{\mathscr {A}}_{\tau }=\bigwedge _{\substack {\delta \in \{\alpha _{\langle \langle \tau \rangle \rangle },\beta _{\langle \langle \tau \rangle \rangle }\},\\\delta \neq \ulcorner 0_{\langle \langle \tau \rangle \rangle }\urcorner }}(\textbf {N}(\delta ))$ . So:
This establishes base case (iv). Inductive case (i): $\varphi :=\neg \psi $ . Since $\overline {Ar}_{\tau }[(\psi )^{\mathscr {A}}_{\tau }]=\overline {Ar}_{\tau }[(\neg \psi )^{\mathscr {A}}_{\tau }]$ , we have that:
This establishes inductive case (i). Inductive case (ii): $\varphi :=\psi \wedge \chi $ . We have that:
This establishes inductive case (ii). Inductive case (iii): $\varphi :=\forall x\psi $ . We have that:
This establishes inductive case (iii). Inductive case (iv): $\varphi :=\forall V\psi $ . We have that:
This establishes inductive case (iv), and the lemma.
Theorem 9 is then an immediate corollary of Lemma 28:
Theorem 9. $\vdash _{\mathtt {S5PQL}_{\mathtt {C}}}(\varphi )^{\mathscr {A}}_{\tau }\rightarrow \Box (\varphi )^{\mathscr {A}}_{\tau }$ , for every closed formula $\varphi $ of $\mathrm {PA}_{2}$ and every type $\tau $ .
D Relationship between $\mathtt {S5PQL}_{\mathtt {SA,C}}$ and $\mathtt {S5PQL}_{\mathtt {SA,C}}$
In this section is shown that the theorems of $\mathtt {S5PQL}_{\mathtt {SA,C}}$ are the same as those of the following system:
Definition 22 (System $\mathtt {S5PQL}_{\mathtt {SA,C}}$ ).
$\mathtt {S5PQL}_{\mathtt {SA,C}}$ is that axiomatic system whose axioms consist of the result of prefixing any sequence (including the empty sequence) of metaphysical necessity operators and universal quantifiers, in any order, to $\mathtt {S5PQL}_{\mathtt {SA,C}}$ ’s axioms, and whose only inference rule is $\mathrm {MP}$ .
The following lemma will be useful in what follows.
Lemma 29. If $\vdash _{\mathtt {S5PQL}_{\mathtt {SA,C}}}\varphi $ , then $\vdash _{\mathtt {S5PQL}_{\mathtt {SA,C}}}\Box \varphi $ and $\vdash _{\mathtt {S5PQL}_{\mathtt {SA,C}}}\forall v\varphi $ .
Proof of Lemma 29.
The proof is by induction on the length of a $\vdash _{\mathtt {S5PQL}_{\mathtt {SA,C}}}$ -derivation.
Base case: $\varphi $ is an axiom. Clearly, $\vdash _{\mathtt {S5PQL}_{\mathtt {SA,C}}}\Box \varphi $ and $\vdash _{\mathtt {S5PQL}_{\mathtt {SA,C}}}\forall v\varphi $ , as the result of prefixing a necessity operator or a universal quantifier to any axiom of $\mathtt {S5PQL}_{\mathtt {SA,C}}$ is itself an axiom of $\mathtt {S5PQL}_{\mathtt {SA,C}}$ .
Inductive case: suppose that $\vdash _{\mathtt {S5PQL}_{\mathtt {SA,C}}}\varphi $ . Then, by the inductive hypothesis: (i) $\vdash _{\mathtt {S5PQL}_{\mathtt {SA,C}}}\Box \psi $ ; (ii) $\vdash _{\mathtt {S5PQL}_{\mathtt {SA,C}}}\forall v\psi $ ; (iii) $\vdash _{\mathtt {S5PQL}_{\mathtt {SA,C}}}\Box (\psi \rightarrow \varphi )$ ; and (iv) $\vdash _{\mathtt {S5PQL}_{\mathtt {SA,C}}}\forall v(\psi \rightarrow \varphi )$ . We have that $\Box (\psi \rightarrow \varphi )\rightarrow (\Box \psi \rightarrow \Box \varphi )$ is an axiom of $\mathtt {S5PQL}_{\mathtt {SA,C}}$ . So, by (i) and (iii), via two applications of $\mathrm {MP}$ , we get that $\vdash _{\mathtt {S5PQL}_{\mathtt {SA,C}}}\Box \varphi $ . Similarly, we have that $\forall v(\psi \rightarrow \varphi )\rightarrow (\forall v\psi \rightarrow \forall v\varphi )$ is an axiom of $\mathtt {S5PQL}_{\mathtt {SA,C}}$ . So, by (ii) and (iv), via two applications of $\mathrm {MP}$ , we get that $\vdash _{\mathtt {S5PQL}_{\mathtt {SA,C}}}\forall v\varphi $ . This concludes the proof.
The main result of this section is then the following:
Lemma 30. $\vdash _{\mathtt {S5PQL}_{\mathtt {SA,C}}}\varphi $ if and only if $\vdash _{\mathtt {S5PQL}_{\mathtt {SA,C}}}\varphi $ .
Proof of Lemma 30.
I’ll start by showing that if $\vdash _{\mathtt {S5PQL}_{\mathtt {SA,C}}}\varphi $ , then $\vdash _{\mathtt {S5PQL}_{\mathtt {SA,C}}}\varphi $ . The proof is by induction on the length of a $\vdash _{\mathtt {S5PQL}_{\mathtt {SA,C}}}$ -derivation. The base case is trivial, as any axiom of $\mathtt {S5PQL}_{\mathtt {SA,C}}$ is an axiom of $\mathtt {S5PQL}_{\mathtt {SA,C}}$ . For the inductive case, there are three subcases to consider: (i) $\varphi $ is proved via an application of $\mathrm {Gen}$ ; (ii) $\varphi $ is proved via an application of $\mathrm {Nec}$ ; and (iii) $\varphi $ is proved via an application of $\mathrm {MP}$ .
Subcase (i): $\varphi $ is proved via an application of $\mathrm {Gen}$ . Then $\varphi :=\forall v\psi $ . By the inductive hypothesis, $\vdash _{\mathtt {S5PQL}_{\mathtt {SA,C}}}\psi $ . But then, $\vdash _{\mathtt {S5PQL}_{\mathtt {SA,C}}}\forall v\psi $ , by Lemma 29.
Subcase (ii): $\varphi $ is proved via an application of $\mathrm {Nec}$ . Then $\varphi :=\Box \psi $ . By the inductive hypothesis, $\vdash _{\mathtt {S5QL}_{\mathtt {SA,C}}}\psi $ . But then, $\vdash _{\mathtt {S5QL}_{\mathtt {SA,C}}}\Box \psi $ , by Lemma 29.
Subcase (iii): $\varphi $ is proved via an application of $\mathrm {MP}$ . By the inductive hypothesis, $\vdash _{\mathtt {S5QL}_{\mathtt {SA,C}}}\psi $ and $\vdash _{\mathtt {S5QL}_{\mathtt {SA,C}}}\psi \rightarrow \varphi $ . But then, $\vdash _{\mathtt {S5QL}_{\mathtt {SA,C}}}\varphi $ , by $\mathrm {MP}$ .
So, $\vdash _{\mathtt {S5PQL}_{\mathtt {SA,C}}}\varphi $ , then $\vdash _{\mathtt {S5PQL}_{\mathtt {SA,C}}}\varphi $ . To see that if $\vdash _{\mathtt {S5PQL}_{\mathtt {SA,C}}}\varphi $ , then $\vdash _{\mathtt {S5PQL}_{\mathtt {SA,C}}}\varphi $ , note that all axioms of $\mathtt {S5PQL}_{\mathtt {SA,C}}$ are theorems of $\mathtt {S5PQL}_{\mathtt {SA,C}}$ , owing to $\mathtt {S5PQL}_{\mathtt {SA,C}}$ ’s rules $\mathrm {Nec}$ and $\mathrm {Gen}$ . So, whatever is derivable from $\mathtt {S5PQL}_{\mathtt {SA,C}}$ ’s axioms via $\mathrm {MP}$ is derivable in $\mathtt {S5PQL}_{\mathtt {SA,C}}$ by $\mathrm {MP}$ , $\mathrm {Nec}$ and $\mathrm {Gen}$ . This concludes the proof.
E Relations between FP, Generation and InfAx
In this section I will begin by showing that generation implies finitary plenitude. In order to do so, it will be useful to appeal to some facts concerning the smaller than relation between natural numbers. This relation is defined as follows:
Definition 23 (Smaller than).
For all types $\tau $ , let
Then, the relevant results about the smaller than relation are the following:
Lemma 31. $\vdash _{\mathtt {QL}_{\mathtt {C}}}\forall x_{\langle \langle \tau \rangle \rangle }y_{\langle \langle \tau \rangle \rangle }(\textbf {S}(x,y)\rightarrow x<y).$
Lemma 32. $\vdash _{\mathtt {QL}_{\mathtt {C}}}\forall x_{\langle \langle \tau \rangle \rangle }\forall y\forall z((x<y\wedge y<z)\rightarrow x<z)$ .
Proof of Lemmas 31 and 32.
The two lemmas follow straightforwardly by $\mathtt {QL}_{\mathtt {C}}$ -reasoning.
Lemma 33. $\vdash _{\mathtt {QL}_{\mathtt {C}}}\forall x_{\langle \langle \tau \rangle \rangle }\forall y_{\langle \langle \tau \rangle \rangle }((\textbf {N}(x)\wedge \textbf {N}(y))\rightarrow x<y\vee x=y\vee y<x).$
Proof of Lemma 33.
Provable from Lemmas 1, 12, 31, and 32, and Theorem 6, by $\mathtt {QL}_{\mathtt {C}}$ -reasoning.
Now, consider the following definition:
Definition 24 (Gen).
For every type $\tau $ , type $\langle \langle \tau \rangle \rangle $ -singular term $\alpha $ and $\langle \tau \rangle $ -singular term $\beta $ ,
We will prove a couple of auxiliary lemmas involving .
Lemma 34. , for every type $\tau $ .
Lemma 35. $\vdash _{\mathtt {KQL}_{\mathtt {C}}}\forall x_{\langle \langle \tau \rangle \rangle }\forall u_{\langle \langle \tau \rangle \rangle }(x<u\rightarrow (\Diamond \exists w_{\langle \tau \rangle }(u(w))\rightarrow \Diamond \exists w_{\langle \tau \rangle }(x(w))))$ .
Finally comes the proof that Generation implies FP:
Lemma 36. .
Proof of Lemma 36.
It is shown that , as —i.e., $\exists x_{\langle e\rangle }\exists y_{\langle \langle e\rangle \rangle }((\textbf {N}(y)\wedge y(x))\wedge \forall w_{\langle \langle e\rangle \rangle }\forall z_{\langle \langle e\rangle \rangle }((\textbf {N}(w)\wedge \textbf {S}(w,z))\rightarrow \Box (w(x)\rightarrow \Diamond z(x))))$ —is just .
I’ll now turn to mapping the relationships between FP and InfAx. Consider the following definition:
Definition 25 (Infinity).
Let:
(a) $Z[x_{\langle \langle \tau \rangle \rangle }]:=\forall y_{\langle \tau \rangle }(\neg \exists z_{\tau }(y(z))\rightarrow x(y))$ ; (b) $A[x_{\langle \langle \tau \rangle \rangle }]:=\forall y_{\langle \tau \rangle }\forall w_{\langle \tau \rangle }\forall z_{\tau }((x(y)\wedge \forall u_{\tau }(w(u)\leftrightarrow (y(u)\vee u=z)))\rightarrow x(w))$ ; (c) $Inf[z_{\langle \tau \rangle },x_{\langle \langle \tau \rangle \rangle }]:= Z[x_{\langle \langle \tau \rangle \rangle }]\wedge A[x_{\langle \langle \tau \rangle \rangle }]\wedge \neg x(z)$ ; (d) $Inf[z_{\langle \tau \rangle }]:= \exists x_{\langle \langle \tau \rangle \rangle }(Z[x_{\langle \langle \tau \rangle \rangle }]\wedge A[x_{\langle \langle \tau \rangle \rangle }]\wedge \neg x(z))$ .
$Inf[z_{\langle \tau \rangle }]$ states that there are infinitely many instances of $z_{\langle \tau \rangle }$ . In what follows are proven some results involving the following theses, and FP:
Axiom of Infinity ( InfAx $_{\tau }$ :) There are infinitely many type $\tau $ -entities;
Numerical Actuality $_{\tau }$ ( NA $_{\tau }$ ): Every natural number has some instance;
Essential Instances $_{\langle \tau ^1,\ldots ,\tau ^n\rangle }$ ( EI $_{\langle \tau ^1,\ldots ,\tau ^n\rangle }$ ): Whatever possibly stands in a type $\langle \tau ^1,\ldots ,\tau ^n\rangle $ -relation, stands in that relation whenever the relation exists
Identity as Coextensionality $_{\langle \tau ^1,\ldots ,\tau ^n\rangle }$ ( IC $_{\langle \tau ^1,\ldots ,\tau ^n\rangle }$ ): Coextensive type $\langle \tau ^1,\ldots ,\tau ^n\rangle $ -relations are identical;
Lemma 37. $\vdash \forall y_{\langle \langle \tau \rangle \rangle }\forall x_{\langle \langle \tau \rangle \rangle }\forall z_{\langle \tau \rangle }((Inf[\lambda z_{\tau }(z=z),x]\wedge \textbf {N}(y)\wedge y(z))\rightarrow x(z)).$
Lemma 38. $\vdash \forall x_{\langle \langle \tau \rangle \rangle }\forall y_{\langle \langle \tau \rangle \rangle }\forall z_{\langle \tau \rangle }\forall u_{\langle \tau \rangle }((\textbf {N}(x)\wedge \textbf {S}(x,y)\hspace{-0.5pt}\wedge\hspace{-0.5pt} x(z)\wedge y(u))\hspace{-1pt}\rightarrow\hspace{-1pt} \neg \exists f_{\langle \langle \tau \rangle, \langle \tau \rangle \rangle } (Inj[f,u,z]))$ .
Lemma 39.
Proof of Lemma 39.
Provable, by $\mathtt {QL}_{\mathtt {C}}$ -reasoning, from Lemmas 1, 2, 4, 37, and 38 and Theorems 1, 2, and 6.
Lemma 40. .
Lemma 41. .
Lemma 42. .
F Model theory
A basic model theory for $\mathrm {PMT}$ , based on the work of Gallin [Reference Gallin28], Stalnaker [Reference Stalnaker and Sinnott-Armstrong71] and Fritz and Goodman [Reference Fritz and Goodman25], is here offered. I start by defining frames and inhabited structures (henceforth called ‘structures’).
Definition 26 (Frame, Inhabited structure).
A frame is a pair $\mathbf {F}=\langle \textbf {W},\textbf {U}\rangle $ where $\exists x\in \textbf {W}$ , $\mathbf {U}$ is a function with domain $\mathbf {W}$ such that $\bigcup _{w\in \textbf {W}}(\textbf {U}(w))\neq \emptyset $ , and: (i) $\mathbf {F}_{e}:=\bigcup _{w\in \textbf {W}}(\textbf {U}(w))$ ; and (ii) $\mathbf {F}_{\langle \tau ^1,\ldots ,\tau ^n\rangle }:=(\wp (\unicode{x2A09} _{1\leq i\leq n}(\textbf {F}_{\tau ^i})))^{\textbf {W}}$ , for all $\tau ^1,\ldots ,\tau ^n\in \mathscr {T}$ and $n\in \mathbb {Z}^{+}$ . An inhabited structure is a triple $\mathbf {G}=\langle \textbf {W},\textbf {U},\textbf {D}\rangle $ where $\langle \textbf {W},\textbf {U}\rangle $ is a frame and $\mathbf {D}$ a function with domain $\mathbf {W}\times \mathscr {T}$ such that: (i) $\mathbf {D}_{w,e}=\textbf {U}(w)$ ; and (ii) $\mathbf {D}_{w,\langle \tau ^1,\ldots ,\tau ^n\rangle }\subseteq \textbf {F}_{\langle \tau ^1,\ldots ,\tau ^n\rangle }$ , for all $\tau ^1,\ldots ,\tau ^n\in \mathscr {T}$ .
The set $\mathbf {W}$ represents the collection of all possible worlds, and the type $\langle \tau ^1,\ldots ,\tau ^n\rangle $ -relations existing at a world w are represented by set-theoretic functions in $\mathbf {D}_{w,\langle \tau ^1,\ldots ,\tau ^n\rangle }$ . Since different functions in $\mathbf {D}_{w,\langle \tau ^1,\ldots ,\tau ^n\rangle }$ may have the same image at some world, this representation makes room for some distinct relations possibly sharing their extension. Pluralities of type $\tau $ -entities existing at a world w are represented by nonempty members of $\wp (\textbf {D}_{w,\tau })$ .Footnote 64 I’ll now turn to the standard definitions of variable-assignment, value, and satisfaction:
Definition 27 (Variable-assignments).
A variable-assignment $\sigma $ of a structure $\mathbf {G}$ is a function with domain the set of variables of $\mathrm {PMT}$ and which assigns to each singular type $\tau $ -variable v (plural type $\tau $ -variable $vv$ ) a value $\sigma (v)\in \bigcup _{w\in \textbf {W}}(\textbf {D}_{w,\tau })$ (a value $\sigma (vv)\in \bigcup _{w\in \textbf {W}}(\wp (\textbf {D}_{w,\tau })\setminus \{\emptyset \})$ ), for every $\tau \in \mathscr {T}$ . If $\sigma $ is a variable-assignment, v a singular type $\tau $ -variable (plural type $\tau $ -variable), and $o\in \bigcup _{w\in \textbf {W}}(\textbf {D}_{w,\tau })$ ( $o\in \bigcup _{w\in \textbf {W}}(\wp (\textbf {D}_{w,\tau })\setminus \{\emptyset \})$ ), $\sigma [o/v]$ is a variable-assignment just like $\sigma $ except (perhaps) that $\sigma [o/v](v)=o$ . $As_{\textbf {G}}$ is the collection of all variable-assignments of $\mathbf {G}$ .
Definition 28 (Value).
The value of an expression $\xi $ of $\mathrm {PMT}$ relative to $\sigma \in As_{\textbf {G}}$ and structure $\mathbf {G}$ , $\mathbf {V}^{\sigma }_{\textbf {G}}(\xi )$ , is defined as follows (where, if $\xi $ is an singular term or a formula, then $\mathbf {V}^{\sigma }_{\textbf {G}}(\xi )$ is a function with domain $\mathbf {W}$ ):
(a) $\mathbf {V}^{\sigma }_{\textbf {G}}(v)=\sigma (v)$ ; (b) $\mathbf {V}^{\sigma }_{\textbf {G}}(vv)=\sigma (vv)$ ; (c) $\mathbf {V}^{\sigma }_{\textbf {G}}(=_{\langle \rho ,\rho \rangle })(w)=\{\langle o,o\rangle :o\in \textbf {D}_{w,\rho }\}$ ; (d) $\mathbf {V}^{\sigma }_{\textbf {G}}(vv=uu)(w)=\{\langle \rangle :\textbf {V}^{\sigma }_{\textbf {G}} (vv)=\textbf {V}^{\sigma }_{\textbf {G}}(uu)\in \wp (\textbf {D}_{w,\delta })\setminus \{\emptyset \}\}$ ; (e) $\mathbf {V}^{\sigma }_{\textbf {G}}(\beta (\overline {\alpha }))(w)=\{\langle \rangle :\langle \textbf {V}^{\sigma }_{\textbf {G}}(\alpha ^1),\ldots ,\textbf {V}^{\sigma }_{\textbf {G}}(\alpha ^n) \rangle \in \textbf {V}^{\sigma }_{\textbf {G}}(\beta )(w)\}$ ; (f) $\mathbf {V}^{\sigma }_{\textbf {G}}(\neg \varphi )(w)=\{\langle \rangle :\langle \rangle \not \in \textbf {V}^{\sigma }_{\textbf {G}}(\varphi )(w)\}$ ; (g) $\mathbf {V}^{\sigma }_{\textbf {G}}(\varphi \wedge \psi )(w)=\textbf {V}^{\sigma }_{\textbf {G}}(\varphi )(w)\cap \textbf {V}^{\sigma }_{\textbf {G}}(\psi )(w)$ ; (h) $\mathbf {V}^{\sigma }_{\textbf {G}}(\Box \varphi )(w)=\bigcap _{u\in \textbf {W}}(\textbf {V}^{\sigma }_{\textbf {G}}(\varphi )(u))$ ; (i) $\mathbf {V}^{\sigma }_{\textbf {G}}(\alpha \prec vv)(w)=\{\langle \rangle :\textbf {V}^{\sigma }_{\textbf {G}}(\alpha )\in \textbf {V}^{\sigma }_{\textbf {G}}(vv)\;\&\;\textbf {V}^{\sigma }_{\textbf {G}}(vv)\in \wp (\textbf {D}_{w,\delta })\setminus \{\emptyset \}\}$ ; (j) $\mathbf {V}^{\sigma }_{\textbf {G}}(\forall v\varphi ) (w)=\bigcap _{o\in \textbf {D}_{w,\delta }}(\textbf {V}^{\sigma [o/v]}_{\textbf {G}}(\varphi )(w))$ ; (k) $\mathbf {V}^{\sigma }_{\textbf {G}}(\forall vv\varphi )(w)=\bigcap _{o\in \wp (\textbf {D}_{w,\delta })\setminus \{\emptyset \}}(\textbf {V}^{\sigma [o/vv]}_{\textbf {G}}(\varphi )(w))$ ; (l) $\mathbf {V}^{\sigma }_{\textbf {G}}(\lambda v^1\ldots v^n(\varphi ))(w)=\{\langle o^1,\ldots ,o^n\rangle \in \textbf {D}_{w,\langle \tau ^1,\ldots ,\tau ^n\rangle }: \langle \rangle \in \textbf {V}^{\sigma [o^1/v^1_{\tau ^1}]\ldots [o^n/v^n_{\tau ^n}]}_{\textbf {G}}(\varphi )(w)\}$ .
Definition 29 (Satisfaction).
Structure $\mathbf {G}$ , $w\in \textbf {W}$ and $\sigma \in As_{\textbf {G}}$ jointly satisfy formula $\varphi $ , $\mathbf {G},w,\sigma \vDash \varphi $ , if and only if $\langle \rangle \in \textbf {V}^{\sigma }_{\textbf {G}}(\varphi )(w)$ . $\mathbf {G}$ satisfies $\varphi $ , $\mathbf {G}\vDash \varphi $ , if and only if $\mathbf {G},w,\sigma \vDash \varphi $ , for all $\sigma \in As_{\textbf {G}}$ and $w\in \textbf {W}$ . A class of structures satisfies a collection of formulae if and only if every member of the class satisfies every formula in the collection.
The value of $\lambda v^1\ldots v^n(\varphi )$ in a structure $\mathbf {G}$ does not shift from world to world. For instance, $\mathbf {G},w,\sigma \vDash u=\lambda v^1\ldots v^n\varphi \rightarrow \Box (E[u]\rightarrow u=\lambda v^1\ldots v^n\varphi )$ : complex predicates satisfy an object language formulation of the claim that they are rigid designators. Indeed, the following lemma establishes that complex predicates do not constitute counterexamples to $\mathrm {Ind}$ in the context of our modal type theory:Footnote 65
Lemma 43. For every instance $\varphi $ of $\mathrm {Ind}$ and every inhabited structure $\mathbf {G}$ , $\mathbf {G}\vDash \varphi $ .
From upper’s proponents’ standpoint, not all structures represent ways modal reality might be. One requirement is that they be FP- and $\mathrm {SA}$ (seriously actualist)-structures:
Definition 30 (FP-structures).
A structure $\mathbf {G}=\langle \textbf {W},\textbf {U},\textbf {D}\rangle $ is an FP-structure if and only if, for every $n\in \textbf {N}$ , some $w\in \textbf {W}$ is such that $n\leq \textbf {D}_{w,e}$ .
Definition 31 ( $\mathrm {SA}$ -functions, $\mathrm {SA}$ -structures).
$\mathbf {A}_{\textbf {G},\langle \tau ^1,\ldots ,\tau ^n\rangle }=\{f\in \textbf {F}_{\langle \tau ^1,\ldots ,\tau ^n\rangle }:\forall w\in \textbf {W}(f(w)\subseteq \unicode{x2A09} _{1\leq i\leq n}(\textbf {D}_{w,\tau ^i}))\}$ is the set of $\mathbf {G}$ ’s type $\langle \tau ^1,\ldots ,\tau ^n\rangle $ $\mathrm {SA}$ -functions, for all $\tau ^1,\ldots ,\tau ^n\in \mathscr {T}$ and structures $\mathbf {G}$ . A structure $\mathbf {G}=\langle \textbf {W},\textbf {U},\textbf {D}\rangle $ is an $\mathrm {SA}$ -structure if and only if, for all $\tau ^1,\ldots ,\tau ^n\in \mathscr {T}$ , and every $w\in \textbf {W}$ , $\mathbf {D}_{w,\langle \tau ^1,\ldots ,\tau ^n\rangle }\subseteq \textbf {A}_{\textbf {G},\langle \tau ^1,\ldots ,\tau ^n\rangle }$ .
$\mathrm {FP}$ & $\mathrm {SA}$ -structures satisfy all of $\mathtt {S5PQL}^{\mathtt {FP}}_{\mathtt {SA}}$ ’s theorems. Further, $-$ closed structures (in the sense of [Reference Fritz and Goodman25, definition 6]) constitute a natural class of structures which satisfy $\mathrm {Comp}_{\mathrm {C}}$ [Reference Fritz and Goodman25, sec. 5.1, proposition 8]. The notion can be glossed as follows. Say that an entity is distinguishable from another from a world’s standpoint just in case, roughly, there is some relation in the domain of that world such that the first entity falls under that relation at some world, but the other doesn’t, or vice versa. Then, a structure is closed just in case each world’s domain contains all and only the entities which, from that world’s standpoint, are distinguishable from all other entities. It is natural to think that, from the standpoint of upper’s proponents, the inhabited structures which represent genuine ways modal reality might be are all upper-structures, in the following sense:
Definition 32 (upper-structures).
To be an -structure just is to be an FP-, $\mathrm {SA}$ - and closed structure.
Now, consider the following structures, where, for each inhabited structure $\mathbf {S}$ and world w of $\mathbf {S}$ , $Fix(w,\textbf {S})$ is the set of structure automorphisms fixed to world w of structure $\mathbf {S}$ (this notion is precisely defined in [Reference Fritz and Goodman25, definition 15, p. 668], and so I’ll omit the details), $\mathscr {N}$ is the set of finite von Neumann ordinals, $n^{\mathscr {N}}$ is the nth von Neumann ordinal for each $n\in \mathbb {N}$ , and $1^{*}:=\{1^{\mathscr {N}}\}$ :
Definition 33 (Structures $\mathbf {G}^{\circ }$ , $\mathbf {G}^{\bullet }$ ).
$\mathbf {G}^{\circ }=\langle \textbf {W}^{\circ },\textbf {U}^{\circ },\textbf {D}^{\circ }\rangle $ , where:
(i) $\mathbf {W}^{\circ }=\mathscr {N}\cup \{1^{*}\}$ ; (ii) for all $w\in \textbf {W}^{\circ }$ , $\mathbf {D}^{\circ }_{w,e}=w$ and $\mathbf {D}^{\circ }_{w,\langle \tau ^1,\ldots ,\tau ^n\rangle }:=(\{\emptyset \})^{\textbf {W}^{\circ }}$ for all $\tau ^1,\ldots ,\tau ^n\in \mathscr {T}$ . $\mathbf {G}^{\bullet }:=\langle \textbf {W}^{\circ },\textbf {U}^{\circ },\textbf {D}^{\bullet }\rangle $ , where, for all $w\in \textbf {W}^{\circ }$ , $\mathbf {D}^{\bullet }_{w,e}:=\textbf {D}^{\circ }_{w,e}$ and $\mathbf {D}^{\bullet }_{w,\langle \tau ^1,\ldots ,\tau ^n\rangle }=\{f\in \textbf {F}_{\langle \tau ^1,\ldots ,\tau ^n\rangle }:\forall \pi (\pi \in Fix(w,\textbf {G}^{\circ })\Rightarrow \pi (f)=f)\text { and }f\in \textbf {A}_{\textbf {G}^{\bullet }},\langle \tau ^1,\ldots ,\tau ^n\rangle \}$ , for all $\tau ^1,\ldots ,\tau ^n\in \mathscr {T}$ .
$\mathbf {G}^{\bullet }$ ’s existence is a consequence of $\mathsf {ZF}$ . For each $n\in \mathbb {N}$ , $\mathbf {D}^{\bullet }_{(n+1)^{\mathcal {N}},e}=\textbf {D}^{\bullet }_{n^{\mathcal {N}},e}\cup \{n^{\mathcal {N}}\}$ , and $\mathbf {D}^{\bullet }_{n^{\mathcal {N}},e}=n$ . Just like $1^{\mathscr {N}}$ , world $1^{*}$ ’s type e-domain has exactly one element—respectively, $0^{\mathscr {N}}$ and $1^{\mathscr {N}}$ . Clearly, , and . Since $0^{\mathscr {N}}$ and $1^{\mathscr {N}}$ are indistinguishable (as individuals) from the standpoint of $0^{\mathcal {N}}$ , some relations defined in terms of them do not exist at $0^{\mathscr {N}}$ (even though they will exist at all other worlds, owing to $1^{\mathscr {N}}$ and $1^{*}$ being distinguishable from the standpoint of those other worlds). So, for every $\tau \in \mathscr {T}$ . Clearly, $\mathbf {G}^{\bullet }$ is an $\mathrm {FP}$ - and $\mathrm {SA}$ -structure. That $\mathbf {G}^{\bullet }$ is $-$ closed is a consequence of Fritz and Goodman’s Proposition 6 [Reference Fritz and Goodman25, sec. 4.3], since it is generated, in the sense of [Reference Fritz and Goodman25, sec. 4.2, definition 15], by structure $\mathbf {G}^{\circ }$ . Hence, $\mathbf {G}^{\bullet }$ is an upper-structure, and so:Footnote 66
Lemma 44. $\mathbf {G}^{\bullet }$ satisfies all theorems of $\mathtt {S5PQL}^{\mathtt {FP}}_{\mathtt {SA,C}}$ , for every type $\tau $ , and .
Finally, consider the following inhabited structure, whose existence is guaranteed by $\mathsf {ZF}$ :
Definition 34 (Structure $\mathbf {N}$ ).
$\mathbf {N}:=\langle \mathscr {N},\textbf {U},\textbf {D}\rangle $ , where for every $w\in \textbf {W}$ , $\mathbf {D}_{w,e}=w$ and $\mathbf {D}_{w,\langle \tau ^1,\ldots ,\tau ^n\rangle }=\textbf {A}_{\textbf {H},\langle \tau ^1,\ldots ,\tau ^n\rangle }$ .
To conclude we note the following result:
Lemma 45. $\mathbf {N}$ satisfies all theorems of $\mathtt {S5PQL}^{\mathtt {FP}}_{\mathtt {SA,N}}$ , , and .
Granting $\mathsf {ZF}$ ’s consistency, Lemmas 44 and 45 establish the joint consistency of with nontrivial modal type theories, and so also with the weak system $\mathtt {KQL}_{\mathtt {C}}^{\mathtt {FP}}$ . Given the track record, vis-à-vis their consistency, of systems purporting to contain arithmetic, these lemmas are not an insignificant achievement.
Acknowledgements
My greatest thanks go to José Mestre, with whom I started thinking and working on the topics in this paper. Talks based on the paper's material have been delivered at the CMAFcIO Mathematical Logic Seminar, LanCog's Seminar Series in Analytic Philosophy, MLAG's Seminar, MIT's LLaMM Seminar, MIT's Topics in Philosophical Logic course, IFILNOVA's Open Seminar, Arché's Logic & Metaphysics Seminar, the 14th Panhellenic Logic Symposium, and the University of Lisbon's Philosophy of Mathematics course. I would like to thank the audiences for their input. Special thanks also to Fernando Ferreira, David Ingram, Vann McGee, Agustín Rayo, Chris Scambler, Jack Spencer, Bob Stalnaker and Gabriel Uzquiano for very helpful discussions on the paper.
Funding
This work was funded by national funds through the FCT - Fundação para a Ciência e a Tecnologia, I.P., under the R&D Unit Center for the Philosophy of Sciences (CFCUL), strategic project with references FCT I.P. and DOIs: UIDB/00678/2020, DOI 10.54499/UIDB/00678/2020 | UIDP/00678/2020, DOI 10.54499/UIDP/00678/2020, and the FCT projects CEECIND/04761/2017, PHI-ASM—Philosophical Investigation of Applications of Science in Mathematics with reference PeX.2022.06784.PTDC, and OGSR—New Perspectives on the Objects and Grounds of Structural Rules with reference PeX.2022.03194.PTDC.