4. Arithmetic and first-order foundations

4.1. Algebraic terms

Introduction

The concept of algebraic term, namely a term with a purely algebraic language L and a set V of variable symbols (no predicate, logical symbols nor binders), was first intuitively introduced in 1.5. We shall now formalize it as a kind of systems in a set theoretical framework. For convenience let us work with only one type (the generalization to many types is easy), and start with a wider class of systems we shall call L-drafts.
The formalization of this concept has a few possible variants.

In a first approach, an L-draft is an L-equational system (D, b) such that (Gr b ∪ D) is injective (where b: V_D → D and Gr b ∩ D = ∅). This injectivity means that

D is injective ;
b is injective, allowing for the simplification V_D ⊂ D and b = Id_{V_D}.
Im b ∩ Im D = ∅, thus Im b = ∁_D Im D.

The set D means the set of occurrences of symbols, split between the set Im b of occurrences of variable symbols, and the set Im D of occurrences of symbols from L. But this definition for drafts would make each variable occur exactly once, thus cannot directly include algebraic terms, where a variable may have multiple occurrences or none, as particular cases. The solution is to first replace the use of b by that of its inverse b', function from a set of occurrences V_D ⊂ D to the set V of available variables, then remove from it the bijectivity requirement.
We may keep the injectivity requirement on b', meaning the uniqueness of occurrence of any variable, as any multiplicity of occurrences of one variable in a term can be replaced by the use of a single occurrence of it referenced multiple times inside the term (this replacement gives a synonymous "term" abstractly conceived, ignoring any care for visible notations).
But when multiple variables are available (∃^≥2: V), we need to allow for non-surjective b' (terms leaving some available variables unused), especially to allow as a draft any term made of a single variable symbol.
Hence the relevance of a formalization where V_D = D∩V and b' = Id_{V_D}.

Drafts

Finally, let us formalize an L-draft as an injective L-system (D, ^tGr ψ_D) where:

Dom ψ_D = Im D = O_D = D\V is the set of occurrences of symbols from L in D;
V_D = ∁_D O_D = D∩V is the set of used variables (or occurrences of variables) in D;
〈V_D〉_L = D (well-foundedness).

Let us denote ψ_D = σ ⊓ l, i.e. ∀x∈O_D, ψ_D(x) = (σ(x), l_x) ∈ ^LD where σ∈L^O_D and l_x∈D^n_σ^_(x).
Equivalent formulations of well-foundedness are

∀A⊂O_D, (∀x∈O_D, Im l_x ⊂ A∪V ⇒ x∈A) ⇒ A=O_D
∀A⊂D, (V_D⊂A ∧ D_*(^LA) ⊂A) ⇒ A = D
∀A⊂D, V_D⊂A≠D ⇒ ∃x∈O_D\A, Im l_x⊂A
∀A⊂O_D, A≠∅ ⇒ ∃x∈A, A∩ Im l_x = ∅

A ground draft is a draft with no variable, i.e. V_D = ∅. Thus, ψ_D: D → ^LD and Sub_LD = {D}.
Variables in a draft may be reinterpreted as constants: extending ψ_D by Id_{V_D} : V_D → V, forms a ground (L∪V)-draft.

Sub-drafts and terms

Well-foundedness makes uninteresting the range of stable subsets of drafts, to be replaced by another stability concept : a subset A⊂D will be called a sub-draft of D (or a co-stable subset of D) if, denoting O_A = A∩O_D and ψ_A = ψ_{D|O_A}, we have (Im ψ_A⊂ ^LA), i.e. ∀x∈O_A, Im l_x⊂A.
Any sub-draft A of a draft D is a draft, i.e. fits the above axioms with fixed V, where the well-foundedness of A is obviously deduced from that of D by its last formulation.

Like with stable subsets, any intersection of sub-drafts is a sub-draft. Moreover, any union of sub-drafts is also a sub-draft (unlike for sub-algebras with an operation with arity >1, which from arguments in different sub-algebras may give a result escaping their union).

The sub-draft co-generated by a subset of a draft, is the intersection of all sub-drafts that include it. A term is a draft co-generated by a single element that is its root. Each element x of a draft D defines a term T_x with root x, sub-draft of D co-generated by {x}.

Each draft D is ordered by x ≤ y ⇔ x∈T_y.
This is obviously a preorder. Its antisymmetry, that is the uniqueness of the root of any term, can be proven as follows (a more elegant proof from more general concepts is in the study of Galois connections):

O_D

V_D

T_y

O_D

l_z

T_z

O_D

T_z

Equivalent

Categories of drafts

As particular relational systems, classes of L-drafts form concrete categories. Between two L-drafts D,E,

f ∈ Mor_L(D,E) ⇔ (f[O_D]⊂O_E ∧ ψ_E ⚬ f_{|O_D}= ^Lf⚬ψ_D)

where the equality condition can be split as

σ_E⚬f_{|O_D} = σ_D
∀x∈O_D, l_f(x) = f⚬l_x.

There, coproducts are given by disjoint union, like with relational systems.

Another concrete category is that of drafts with variables-preserving morphisms, where V is fixed, and

Mor_L,V(D,E) = {f ∈ Mor_L(D,E) | f_{|V_D} = Id_{V_D}}.

This is roughly the category of ground (L∪V)-drafts (reinterpreting variables as constants, except that a multiply used variable may be figured in the latter as multiple elements, but only as a multiply targeted single element in the former).
The definition of coproducts is modified there as

^L,V∐_i∈I D_i = (⋃_i∈I V_{D_i}) ∪ ∐_i∈I O_{D_i}

with the structure defined like before. In the binary case this coproduct between drafts D,E will be written D ⊔_V E.
Products in these categories cannot be directly made of the products of underlying sets. The description of products in the variables-preserving case is left as an exercise once understood draft condensation (4.3).

Images of drafts

If D,E are L-drafts and f ∈ Mor_L(D,E) then

For any sub-draft F⊂E, f*(F) is co-stable.
If f[V_D] ⊂ V_E then Im f is co-stable, and ∀x∈D, f[T_x] = T_f(x).

Intepretations of drafts in algebras

An interpretation of an L-draft D in an L-algebra E, is a morphism f∈Mor_L(D,E). Equivalently,

f_{|O_D}= φ_E⚬^Lf⚬ψ_D
∀x∈O_D, f(x) = σ(x)_E(f⚬l_x)

Theorem. For any L-draft D and any L-algebra E,

∀u∈E^V, ∃!h∈Mor_L(D,E), h_{|V_D} = u_{|V_D}.

Proof. For simplicity, let V = V_D by restricting u.
Uniqueness was seen with equational systems, and will be used to prove existence :

_A∈

_|V

_|B

_f∈K

^Lh

In other words, every L-algebra E is a module for every L-draft.

This formally justifies the concept of operation symbol defined by a term and interpreted as an operation in every L-algebra, by the concepts explained in 3.11 : the role of V-ary operation symbol played by any element s of an L-draft D (interpreted in each L-algebra E by ∀u∈E^V, s_E(u) = h(s) for the unique h∈Mor_L(D,E) such that h_{|V_D} = u_{|V_D}), preserved by all L-morphisms, is that of the operation symbol defined by the L-term with root s co-generated by s in D.

4.2. Quotient systems

Quotients of relational systems

Given a relational language L and two L-systems E and F, a quotient from E to F is a surjective morphism f : E↠F such that ^Lf[E] = F. We also say that the system F is a quotient of E by ∼_f.
This condition defines F from E and f, as the smallest L-structure on F such that f ∈ Mor_L(E, F).

For any equivalence relation R on E, the quotient set E/R receives the L-structure E/R defined as ^LR⃗[E], making (R⃗) a quotient from E to E/R.
Then for any h : E → X,

R ⊂ ∼_h ⇒ (h ∈ Mor_L(E,X) ⇔ h/R ∈ Mor_L(E/R,X)).

Now if L is an algebraic language, the condition for an f : E↠F between L-systems to be a quotient, is written

{(^Lf(x),f(y)) | (x,y) ∈ E} = F.

It implies ∀B⊂F, B ∈ Sub_LF ⇔ f *(B) ∈ Sub_LE.
Any morphism from a serial system to a partial algebra is a quotient to its image, which is a stable algebra.

Quotients in concrete categories

The concept of quotient has an equivalent definition expressible for any concrete category C, similar to the concept of embedding with sides reversed. This can be explained by being the same concept as embedding in the opposite category, once the concreteness of the category is re-expressed as a link with the category of sets, independently of the details of the category of sets, which can thus be replaced by any other category, namely the opposite category to the category of sets.

So, a quotient from E to F in C is a surjective morphism f : E↠F such that, ∀_CX, equivalently

Mor(F,X) = {g∈X^F | g⚬f ∈ Mor(E,X)}
Mor(F,X) = {h/f | h ∈ Mor(E,X) ∧ ∼_f ⊂ ∼_h}
{h ∈ Mor(E,X) | ∼_f ⊂ ∼_h} = {g⚬f | g∈Mor(F,X)}

Any retraction is a quotient.

Congruences

Given an algebraic language L, a congruence on an L-system E is an equivalence relation R on E such that, equivalently,

R ∈ Sub_L(E²)
The quotient system E/R is a partial algebra.
There exists a morphism f : E → F where F is a partial algebra and ∼_f = R.

Proof of equivalence.

^Lf

s_E

^Lf

Thus, the quotient of any algebra or other serial system by a congruence, is an algebra. In this case, a congruence can be essentially described as a binary relation satisfying the axioms of equality.

Quotients of modules

Not all quotients of modules are modules. But for any algebraic language L, any L-system E, and any L-equational system (K, b), where b : V→K and AC_V holds, if E is a b-module (where V is structureless) then its quotient by any congruence is also a b-module.
Proof.

F^V

E^V

seen before

While this proof only needs f to be a surjective morphism, not precisely a quotient, in practice this will only be used with quotients. Indeed, f is a quotient for the smallest structure of F which makes f a morphism; greater structures preserve existence; to ensure uniqueness we need F to still be a partial algebra, but when E is left serial this implies that f is a quotient anyway.
Actually the AC_V hypothesis is not needed, as infinite equational systems will be seen equivalent to sets of finite ones, where finite choice applies.

Intersections of congruences

On any L-system E, any intersection of equivalence relations is an equivalence relation ; and any intersection of congruences is a congruence.
This can be seen as a case of intersection of stable subsets, as equivalence relations are the stable relations by some structure, and the same for congruences. But here is another viewpoint on this:

Let ∀i∈I, f_i ∈ Mor_L(E,F_i), P = ∏_i∈I F_i and g = ⊓_i∈I f_i ∈ Mor_L(E, P). Then

∼_g = ⋂_i∈I ∼_{f_i}.

If all F_i are partial algebras then P is also a partial algebra, which explains that ∼_g is a congruence.
Let G = Im g ⊂ P. Like any morphism, g will be a quotient to its image G if E is serial and P is a partial algebra, but not necessarily otherwise.
Let ∀i∈I, h_i = π_i|G ∈ Mor_L(G,F_i). Then f_i = h_i⚬g. As with any composite of morphisms, if f_i is a quotient then h_i is also a quotient.

Minimal congruences

The minimal congruence of any L-system E is the intersection of all its congruences. Let us write it ≃_E, or simply ≃ when E is given by context. It is equality if and only if E is a partial algebra.
Let us call condensate of E its quotient by its minimal congruence, and π_E the natural surjection to it:

⤓E = E/≃
π_E = ≃⃗_E : E ↠ ⤓E

(⤓E, π_E) is initial in the category of (X,f) where X is a partial L-algebra, f ∈ Mor_L(E,X), and

Mor((X,f),(Y,g)) = {h ∈ Mor_L(X,Y) | h ⚬ f = g}
Mor((⤓E, π_E),(Y,g)) = {g/π_E}

The congruence of E generated by any equivalence relation R is the preimage of ≃_E/R by R⃗.

In the category of partial algebras, the coproduct is given by the condensate of the disjoint union.

Condensates of injective systems

For any injective L-system (E, ^tGr ψ_E), its minimal congruence ≃_E is 〈𝛿_E〉_L, and ⤓E is injective.

Proof:

l_x

l_y

_σ

^LR

_σ

l_x

l_y

⃖

zRx

_σ

⃖

_σ

l_z

l_y

l_x

l_y

l_z

l_x

zRx

left Euclidean

The formula R = 𝛿_E ∪ F_*(^LR) is equivalent to

R⃗ ∈ Mor_L(E,℘(E)) ∧ ∀x∈E\Dom ψ_E, R⃗(x) = {x}
where φ_℘(E)(s,u) = {x∈O_D | σ(x)=s ∧ l_x∈∏u}

which determines R when E is a draft.
The formula (*) expresses R ⊂ 𝛿_E ∪ F_*(^LR), and is equivalent to the conjunction of

∀(x,y)∈R, x ∉ Dom ψ_E ⇒ x = y
∀(x,y)∈Dom ψ_E, xRy ⇒ (x ∼_σ y ∧ Im(l_x×l_y) ⊂ R).

If R is an equivalence relation, 2. says the structure of E/R is injective, while 1. means it has the same variables as E. Thus in a draft, the equivalence relations which fit (*) are those of variables-preserving morphisms to some other draft.
Therefore, the minimal congruence of a draft is its only congruence by which the quotient is a (functional) draft with the same variables ; it is the preimage of = by any variables-preserving morphism to a functional draft.

4.3. Term algebras

Synonymous terms

In any draft, x ≃ y is the condition for the terms with roots x and y to be synonymous, i.e. taking the same value f(x) = f(y) for any interpretation f of the draft in a partial algebra.

Terms in different drafts D, E with the same variables, can be so compared by interpreting ≃ in their coproduct D ⊔_V E. Equivalently, for any functional draft F,

∀f∈Mor_L,V(D,F), ∀g∈Mor_L,V(E,F), ∀x∈D, ∀y∈E, x ≃ y ⇔ f(x) = g(y).

This equivalence comes from !: Mor_L,V(D ⊔_V E, F) ⇌ Mor_L,V(D,F) × Mor_L,V(E,F). Then,

∀h∈Mor_L,V(D,E), ∀x,y∈D, h(x) ≃ x ∧ (x ≃ y ⇔ h(x) ≃ h(y))

because ≃⃗_E ⚬ h ∈ Mor_L,V(D, ).
Thus ∀x,y∈D, the meaning of x ≃ y is the same for any sub-drafts of D which x and y are respectively considered elements of, such as the terms T_x and T_y (using h∈Mor_L,V(T_x ⊔_V T_y, D)).

Term algebras

For any set given set V, a V-ary term L-algebra is an L-system equivalently described as:

An algebraic L-draft D where V_D = V;
An injective L-algebra D generated by V where V ∩ Im φ_D= ∅ (thus D\Im φ_D = V);
A final L-draft in the sense of variables-preserving morphisms Mor_L,V ;
An L-algebra with basis V.

As any draft, it is qualified as ground if V = ∅, i.e. (by definition 2.) a ground term L-algebra is an algebra both minimal and injective, and thus also bijective; 4. means it is an initial L-algebra.
Intuitively, a V-ary term L-algebra T represents the set of all V-ary operation symbols possibly defined by L-terms ; any V-ary s∈L has a copy s_T(Id_V) ∈ T.

Proof of equivalence. Already 1. ⇔ 2. ⇒ (3. ∧ 4.).
Proof of 3. ⇒ 1. : any final L-draft D is

Functional : ∃f∈Mor_L,V(⤓D, D), f ⚬ ≃⃗_D ∈ Mor_L,V(D, D) ∴ f ⚬ ≃⃗_D = Id_D ∴ Inj ≃⃗_D
Serial : Mor_L,V(D∪(^LD\Dom D), D) ≠∅
V_D = V because V is a draft.

Let us prove 4. ⇒ 2. after reduction to the ground case by replacing variables by constants (replacing L by L∪V).
Any initial L-algebra (E,φ_E) is :

minimal : ∃f∈Mor_L(E, Min_LE), f∈Mor_L(E,E) ∴ f = Id_E ∴ Min_LE = E
injective : (F = ^LE ∧ φ_F = ^Lφ_E) ⇒ φ_E ⚬ ^Lφ_E = φ_E ⚬ φ_F ⇔ φ_E∈Mor_L(F,E)
∴ ∃f∈Mor_L(E,F), φ_E⚬f = Id_E ∴ f⚬φ_E = φ_F⚬^Lf = ^L(φ_E⚬f) = Id_F.∎

Term algebras in injective algebras

For any injective L-algebra (E,φ_E), any V ⊂ E \ Im φ_E, generates a term algebra 〈V〉_L ⊂ E. In particular the existence of an injective algebra implies that of a ground term algebra.

For any f∈Mor_L(K,E) from a ground draft K to an injective system E, the system Im f is both injective and minimal, i.e. a ground draft. As with any morphism, if K is serial then Im f is serial; if E is functional then Im f is functional. Thus, if K is a ground term algebra and E is an injective partial algebra then Im f is a ground term algebra, and thus f is injective (by essential uniqueness of initial or final objects).

Precisely to distinguish statements and their respective ranges of validity,

If K is a serial minimal system and E is an injective partial algebra then Im f is a ground term algebra;
If K is a functional ground draft and E is injective then f is injective (because ∼_f ⊂ f*(≃_{Im f}) = ≃_K = 𝛿_K).

Some more complicated statements can be obtained in non-ground cases by applying the above to the language L∪V.

Particular languages

If L has no constant then ∅ is a ground term L-algebra.
If L only has constants, then ground term L-algebras are the copies of L.
If L only has symbols with arity 0 or 1 then every L-term is functional (this may not be obvious ; we shall not use it later).

Trajectories by transformation sets

For any set of transformations L ⊂ E^E, seen as a set of function symbols interpreted in E,

∀x∈E, 〈{x}〉_L = {f(x)|f∈〈L〉_{Id,⚬}}
∀X⊂E, 〈X〉_L = ⋃_{f∈〈L〉_{Id,⚬}} f[X] = ⋃_x∈X 〈{x}〉_L

Denoting A = 〈X〉_L, M = 〈L〉_{Id,⚬} and K = {f(x) | (f,x)∈M×X}, the claim is that A = K. As we shall formalize later, both A and K mean the set of all composites of any number of functions in L, applied to any x∈X.
Proof of A ⊂ K

Proof of K ⊂ A

E^E

_{f}

_{A}

_{Id,⚬}

E^E

_{A}

_{f}

Free monoids

A free monoid is a monoid with a basis in the category of monoids. Then its basis is generating (for the same reason as the above case of categories of algebras).

Any unary term L-algebra M (if it exists) is an egg in the category of L-algebras, thus a monoid acting on all L-algebras with Mor_L ⊂ Mor_M.

If L is a language of function symbols and M is a unary term L-algebra then L is a basis of M in the category of monoids.
Proof:

s_M

s_E

_{Id,•}

representation theorem

X^X

N^L

_{e,•}

X^X

_|L

_{Id,•}

_{Id,⚬}

Conversely, if L is a basis of a monoid M in the category of monoids, then M is a unary term L-algebra.
Proof:

_{Id,•}

4.4. Integers and recursion

The set ℕ

An arithmetic is any theory describing the system ℕ of natural numbers. There are diverse such theories, depending on the choice of a logical framework, then of a language and axioms. Somehow, the main framework for arithmetic is second-order logic, but this leads to other versions depending on how second-order logic itself comes down to other frameworks.
Let us start with the set theoretical arithmetic, most precise arithmetic expressed in the framework of a strong enough set theory, interpreting second-order quantifiers by the powerset symbol. This defines "the standard ℕ", or rather its isomorphism class relative to the given universe (it amounts to defining the class of "finite sets", which are measurable by their number of elements, as we shall define in 4.6).

Definition. The set ℕ of natural numbers is a unary term {S}-algebra where S is a function symbol called the successor ; also expressible as a ground term {0,S}-algebra where 0 (zero) is a constant symbol.

The existence of such a system is our first expression of the axiom of infinity. Let us formalize it by inserting arithmetic into set theory, in the form of 3 constant symbols ℕ, 0, S with the following axioms :

	0 ∈ ℕ ∧ S : ℕ → ℕ	: ℕ is a {0,S}-algebra
(H0)	∀n∈ℕ, Sn ≠ 0	: 0 ∉ Im S
(Inj)	∀n,p∈ℕ, Sn = Sp ⇒ n = p	: S is injective
(Ind)	∀A⊂ℕ, (0∈A ∧ ∀n∈A, Sn∈A) ⇒ A = ℕ	: 0 generates ℕ (induction axiom).

More constant symbols can be defined from there:
1 = S0
2 = S1 = SS0
and so on.

This completes our axioms of set theory up to the strength of Mc Lane set theory, which essentially forms (with the axiom of choice) the most commonly used foundational theory for mathematics. It will imply the existence of term algebras with any language.
This definition of ℕ may be criticized as circular, for involving concepts of algebra we initially introduced using ℕ as set of possible arities of symbols. But these concepts may be re-written up to this point without this use, by restriction to the use of symbols with small arities (0, 1, 2).

Recursively defined sequences

Each n∈ℕ represents a function symbol Sⁿ defined by a unary S-term and interpreted in each S-algebra (E,f) as a function fⁿ ∈ E^E.

A sequence of elements of a set E, is a family u : ℕ → E. It is called recursive when it is a morphism for a given S-algebra structure on E, i.e. defined by choices of a∈E and f∈E^E :

u ∈ Mor_{0,S}(ℕ, (E,(a,f))) ⇔ (∀n∈ℕ, u_n = fⁿ(a))
⇔ (u₀ = a ∧ ∀n∈ℕ, u_Sn = f(u_n)).

Intuitively, each fⁿ(a) abbreviates the term with n occurrences of the function symbol f, and variable a :

f⁰(a) = a
f¹(a) = f(a)
f²(a) = f(f(a))

The sequence (fⁿ) is itself recursively defined by f⁰ = Id_E ∧ ∀n∈ℕ, f^Sn = f ⚬ fⁿ.
In particular, f¹ = f and f² = f⚬f.
Generally for any f∈E^E, g∈E^X, the recursive sequence h : ℕ → E^X defined by h₀ = g ∧ ∀n∈ℕ, h_Sn = f⚬h_n, is h_n = fⁿ⚬g.

Addition

The basic properties of addition and multiplication in ℕ can be seen as obvious consequences of the role of natural numbers as measures ("cardinals") of finite sets, which will be defined in 4.6. But let us prove them independently here as an exercise.
Like any unary term algebra, ℕ forms a monoid (ℕ, 0, +), whose actions are the sequences (fⁿ) for any transformation f.
It is commutative as it is generated by the singleton {1} (which commutes with itself). Thus the side won't matter, but conventions basically present it as acting on the right, defining addition as n+p = S^p(n), or recursively as

n + 0 = n
∀p∈ℕ, n+S(p) = S(n+p)
n+1 = S(n+0) = Sn

The action axiom ∀f∈ E^E, ∀n,p∈ℕ, f^n+p = f^p⚬fⁿ is deducible like for any term algebra:
(fⁿ⁺⁰ = fⁿ ∧ ∀p∈ℕ, f^n+Sp = f^S(n+p) = f⚬f^n+p) ∴ ∀a∈E, (p ↦ f^n+p(a)) ∈ Mor(ℕ,(E, fⁿ(a),f)).
The associativity of + (coming as a property of monoids) will be used implicitly, omitting parenthesis.

Inversed recursion and integers

Let -ℕ = {-n | n∈ℕ} (where -n is read "minus n") a copy of ℕ, structured as a partial {0,S}-algebra where 0_-ℕ = -0 and S_-ℕ is the copy of S_ℕ^-1. As a relational system, it works like ℕ with S transposed. Thus, it has a unique {0,S}-morphism to any {0,S}-system E where 0_E is a singleton and S_E is both injective and surjective. In particular this holds for {0,S}-algebras (E,(a,f)) where f = S_E ∈ 𝔖_E, forming the negative recursion

(fⁿ(a))_n∈-ℕ ∈ Mor_{0,S}(-ℕ, (E,(a,f)))
∀n∈ℕ, f^-n = (f^-1)ⁿ

giving back the notation f^-1 for the inverse of f as a particular case.
Let us define the system of integers as ℤ = ℕ ∪ -ℕ assuming ℕ ⋂ -ℕ = {0} = {-0}, with the union of structures from ℕ and -ℕ. It is easy to see that S_ℤ ∈ 𝔖_ℤ, and that ℤ is a {0,S}-algebra and the coproduct of ℕ and -ℕ in the category of partial {0,S}-algebras.
Thus, (ℤ,0) is an egg in the category of {S}-algebras (E,f) where f ∈ 𝔖_E. This defines a monoid structure (0,+) on ℤ, extending the one on ℕ, and acting on each such (E,f) by the unique (fⁿ)_n∈ℤ ∈ Mor_{0,S}(ℤ, (E^E, Id_E, ⚬⃗(f)). It is also the unique morphism of monoid from ℤ to E^E sending 1 to f.
Moreover,

+_ℤ is commutative.
∀n∈ℕ, (-n)+n = 0 = n+(-n), thus ∀f ∈ 𝔖_E, {fⁿ | n∈ℤ} ⊂ 𝔖_E.

Proof of 1. : ℤ is generated by {-1, 1}, while (-1) + 1 = 1 + (-1) = 0 because f ∈ 𝔖_E ⇒ f^-1 ⚬ f = f ⚬ f^-1 = Id_E.

Proof of 2. : from ∀n∈ℕ, n+1 = 1+n comes

∀f∈E^E, f^Sn = f⚬fⁿ = fⁿ⚬f
∀f,g∈ E^E, g⚬f = Id_E ⇒ ∀n∈ℕ, gⁿ⚬fⁿ = Id_E
∀f∈𝔖_E, ∀n∈ℕ, fⁿ∈𝔖_E ∧ (fⁿ)^-1 = f^-n

Each result can also be deduced from the other:
1.⇒2. : by symmetry between ℕ and -ℕ, (-n)+n ∈ ℕ ⇔ n+(-n) ∈ -ℕ.
2.⇒1. from the commutativity of +_ℕ and a previous result.

Following the French terminology, the subsets ℕ and -ℕ of ℤ will be respectively called the sets of positive and negative integers.

Multiplication

The following definition and properties of multiplication in ℕ also hold in ℤ (where it is an extension of the one in ℕ).
Let us define it as n⋅p = (S^p)ⁿ(0) (this choice of sides fits common language, unlike the usual one from the literature, until the proof of commutativity removes the distinction). Recursively,

the latter coming as (S^x)^Sy(0) = S^x((S^x)^y(0)).
Then generally, ∀f∈E^E, f^n⋅p = (f^p)ⁿ. Proof :

_n∈ℕ

S^p

f^m

_m∈ℕ

S^p

E^E

f^p

^n⋅p

_n∈ℕ

E^E

f^p

^0⋅p

_n∈ℕ

^n⋅p

_n∈ℕ

f^p

ⁿ

_n∈ℕ

∀n∈ℕ, n⋅0 = 0 (easy to check by induction).
Right distributivity ∀x,y,z∈ℕ, (x+y)⋅z = x⋅z + y⋅z comes by induction on y, or as (S^z)^x+y = (S^z)^y⚬(S^z)^x.
Left distributivity ∀x,y,z∈ℕ, x⋅(y+z) = x⋅y + x⋅z comes by induction on x using commutativity of +.
In particular ∀x,y∈ℕ, x⋅Sy = (x⋅y)+x. As the transpose of ⋅ fits the recursive definition of ⋅, both are equal : multiplication is commutative.

An elegant approach to multiplication is to see (ℕ, 1, ⋅) as the monoid End_{{0, +}} ℕ. It will be developed later.

4.5. Presburger Arithmetic

First-order theories of arithmetic

After the formalization of ℕ in set theory, let us review theories of first-order arithmetic, which means first-order theories describing ℕ as their only type, and interpreting induction as a schema of axioms by second-order universal elimination (thus with A only ranging over classes of that theory). First-order theories with 2 types ℕ and ℘(ℕ), called second-order arithmetic, will be studied in Part 5.

Unlike the set theoretical framework, first-order logic cannot define recursive sequences for arithmetic from the mere successor function. In particular it cannot proceed the recursive definitions of addition and multiplication, as will be clear from the description of non-standard models (4.9). So, different choices of language give non-equivalent versions of first-order arithmetic, with non-equivalent versions of the schema of induction:

Bare arithmetic, with the mere symbols 0 and S, is very poor.
Presburger arithmetic, detailed below, is arithmetic with addition. It still cannot define multiplication.
Full arithmetic, named Z₁, admits symbols 0,S, + and ⋅ , and all axioms of their definitions. By tedious tricks it can define all recursive sequences, and generally the sequence defined by any condition of the form u_n+1 = f(u_n,n) expressed in its language.

Presburger arithmetic

Its usual presentations have primitive symbols 0, +, and either 1 or S which are definable from each other (1 = S0 and Sx = x+1) ; actually + would suffice to define all.
Among its equivalent formalizations, a very short one which superficially looks weakest, describes the set ℕ* of nonzero natural numbers, with symbols 1 (constant) and + (binary operation), and axioms :

∀x,y∈ℕ, x + (y+1) = (x+y*) + 1	(A1) : + is associative on 1
∀A⊂ℕ, (1∈A ∧ ∀x,y∈A, x+y∈A) ⇒ A=ℕ	(Min)
∀x,y∈ℕ*, x + y ≠ y	(F)

The axiom (Min), expressed for simplicity in set theory where it makes 1 a generator of (ℕ*,+), is then reduced to a first-order axiom schema by second-order universal elimination. It will be seen equivalent to the axiom of induction, both set theoretically and as axiom schemas.

(A1) is a particular case of the associativity axiom (As).

Conversely, (A1 ∧ Min) ⇒ (As) :

a,b

(As ∧ Min) ⇒ commutativity, as deduced from 1∈C({1}).

Now take ℕ = ℕ*∪{0} where 0∉ℕ*, to which + is extended as ∀n∈ℕ, 0+n = n+0 = n. This extension is still commutative and associative.
With S_ℕ defined as x↦ x+1, obviously (H0) holds. As x+0 = x ≠ 0,

(F) ⇔ (∀x∈ℕ*, ∀y∈ℕ, x+y ≠ y)

(Inj ∧ Ind ∧ A1) ⇒ (F) because ∀x∈ℕ*, {y∈ℕ | x+y ≠ y} ∈ Sub_{0,S}ℕ .
The proof of the converse (F ⇒ Inj) using other axioms, will come from the study of the order relation.

Equivalence of axiom schemas

Let (Ind₁) be the statement ∀A⊂ℕ*, (1∈A ∧ (∀x∈A, x+1 ∈A)) ⇒ A=ℕ*.
(Ind₁) ⇒ (Ind) and (Ind₁) ⇒ (Min) are obvious
(Ind) ⇒ (Ind₁) either by using A∪{0}, or {x∈ℕ|x+1∈A} once noted that (Ind) ⇒ Im S = ℕ*.
Proof of (A1 ∧ Min) ⇒ (Ind) in set theory (ignoring our previous definition of ℕ):

⃗

Proof of (As ∧ Min) ⇒ (Ind) directly convertible to first-order logic:

_{0,S}

So, all possible axiom pairs are equivalent: (A1 ∧ Min) ⇔ (As ∧ Min) ⇔ (As ∧ Ind) ⇔ (A1 ∧ Ind), and imply commutativity.

The order relation

In Presburger arithmetic, let us define binary relations ≤ and < as

x<y ⇔ ∃z∈ℕ*, y = x+z
x≤y ⇔ ∃z∈ℕ, y = x+z

(A1 ∧ Ind) implies

< is transitive
∀x,y∈ℕ, x≤y ⇔ (x<y ∨ x=y)
∀x∈ℕ*,∃y∈ℕ, x = y + 1 (ℕ is a surjective {0,S}-algebra)
∀x,y∈ℕ, x<y ⇔ x+1≤y
∀A⊂ℕ, A≠∅ ⇒ ∃x∈A, ∀y∈A, x≤y (meaning a schema of formulas)
∀x,y∈ℕ, x≤y ∨ y≤x
∀x,y,z∈ℕ, x<y ⇒ x+z < y+z

Proofs :

using (As), x < y < z ⇒ (∃n,p∈ℕ*, z = y+p = x+n+p) ⇒ x < z.
obvious from definitions;
by (Ind)
from 3.
x≤y ⇒ (x+1≤y ∨ x=y)
0∈{x∈ℕ |∀y∈A, x≤y}=B
∀x∈B, x+1∈B ∨ x∈A
A⋂B = ∅ ⇒ (B = ℕ ∴ A = A⋂B = ∅)
from 5. with A={x,y}. Or using 3.,
A = {x∈ℕ | ∀y∈ℕ, x<y ∨ x=y ∨ y<x} ⇒ (0∈A ∧ ∀x∈A, x+1 ∈ A)
y = x+n ⇒ y+z = x+z+n ∎

(A1 ∧ Ind ∧ F) implies

< is a strict total order, associated with the total order ≤
∀x,y,z∈ℕ, x<y ⇔ x+z < y+z
+ is cancellative, which gives (Inj) as a particular case.

Proof. (F) gives irreflexivity ; other conditions for a strict total order come from 1. and 6.
There must be one true formula among (x<y), (x = y), (y<x), which by 7. respectively imply (x+z<y+z), (x+z = y+z), (y+z<x+z). But only one of the latter can be true, thus each implication must be an equivalence. Cancellativity on one side extends to the other side by commutativity. ∎

The smallest element of any nonempty A ⊂ ℕ is denoted min A.

In the set theoretical ℕ, this order coincides with the order between terms with either language {S} or {0,S}. (The proof is easy).
Also, for each n ∈ℕ, the set <⃗(n) ⊂ ℕ can be seen as recursively defined by
¬(n<0) ∧ ∀x∈ℕ, n < Sx ⇔ (x = n ∨ n < x)
which can also be written
∀x∈ℕ, x ∈ ≤⃗(n) ⇔ (x = n ∨ ∃y∈ ≤⃗(n), Sy=x).
(Its existence in ℘(ℕ) can be deduced by induction on n, more simply than other recursions)

Parity

Presburger arithmetic can qualify a number n∈ℕ as even by ∃x∈ℕ, n = x+x; and odd by ∃x∈ℕ, n=x+x+1.
Obviously, if n is even then n+1 is odd; if n is odd then n+1 is even thanks to commutativity. Then it can be shown by (Ind) that any n must be either even or odd, and by also using (Inj) that it cannot be both and that this x is unique.

Taken as primitive with a relevant set of axioms, the order relation can form a variant of first-order arithmetic more expressive than bare arithmetic, but less than Presburger arithmetic, as it suffices to define S but cannot suffice to define + (nor parity) and cannot be defined from S in first-order logic.

Images of recursive sequences

In set theory with axiom of infinity, the image of a recursive sequence u = (fⁿ(a))_n∈ℕ by a transformation f∈X^X is a trajectory Y = 〈a〉_f = Im u of an action of (ℕ,0,+) on E.
As + is commutative, Y gets a monoid structure with identity a and generator f(a), that is a quotient of ℕ.

The injectivity of u (that would make it an isomorphism from ℕ to Y) is equivalent to the injectivity (H0) ∧ (Inj) of the {0,S}-algebra structure of Y.
If u is not injective, let y = min{z∈ℕ | ∃x<z, u_x = u_z}. Then the restriction of u to <⃖(y) is bijective to Y (its image is Y because it is {0,S}-stable). Then there is a unique x<y such that u_x = u_y, and this data of x and y determine the isomorphism class of Y.

A cycle of a transformation f∈X^X, is a non-free trajectory Y = 〈a〉_f for some a∈X, such that, equivalently (using the above defined x and y),

x = 0
Inj f_|Y
f_|Y ∈ 𝔖_Y
f^y(a) = a
a∈ Im f_|Y
(f_|Y)^y = Id_Y
∀b∈Y, Y = 〈b〉_f

Then this y, independent of the chosen generator a of Y, is called the period of Y.
If f is a permutation then every cycle of f is also a cycle of f^-1 with the same period.

4.6. Finiteness and countability

To review equivalent forms of the axiom of infinity, we need to work without assuming it.

Cardinals

A set A is said to be smaller (or of smaller size or of smaller cardinality) than a set B, if there exists an injection from A to B. This predicate is a preorder on the class of sets (as injections form a category on the class of sets).
By the Cantor-Bernstein theorem, the equivalence predicate it defines on the class of sets coincides with the predicate ≈ of equinumerosity between sets, defined by the existence of a bijection : A ≈ B ⇔ ∃f∈B^A, f : A ↔ B.
Equinumerous sets are said to "have the same cardinal". The cardinal |A| of any set A, intuitively meant as the equivalence class of A by ≈ in the class of all sets, can be formalized for the modest use we shall make of that concept, as A itself where any direct or indirect use of "equality" between cardinals (never mixed with other elements) actually refers to the use of ≈. This practice is a particular case of development of a theory (here, set theory), as will be detailed in 4.11.

The {0,S}-structure on powersets

Let us give to the class of sets the meta-structure with language Σ = {0,S} first introduced to define ℕ, defined by

0 = {∅}
S(A,B) ⇔ ∃x∈B, A = B\{x}.

By restriction, this gives a Σ-structure to the powerset ℘(E) of any set E.

The quotient Σ-system K_E = ℘(E)/≈ is functional, surjective and injective, and Dom S_{K_E} ∪ {|E|} = K_E.

S_{K_E}

elsewhere

^-1

∀A,B,B' ⊂E, ∀x∈B, (A = B\{x} ∧ f : B ↔ B' ) ⇒ f_|A : A ↔ B'\{f(x)}

transposition

If F⊂E then the natural injection from K_F to K_E is a Σ-embedding (like any injective morphism from a surjective system to an injective one). Let us abusively treat these natural injections as inclusions (K_F ⊂ K_E), which means to let these sets play the role of sets of cardinals. For any family of considered sets, seeing them as subsets of their union lets us proceed the study of their cardinals by this tool.

Finite cardinals

For any set E, the set Min_Σ K_E is called the set of finite cardinals from K_E.
This predicate of finiteness for cardinals in K_E is independent of E. Proof:

K_F

K_E

_Σ

K_F

_Σ

K_E

K_F

_Σ

K_E

K_F

_Σ

K_E

_Σ

K_E

K_F

_Σ

K_E

_Σ

K_F

Any infinite cardinal is greater than any finite one. Proof:

_Σ

K_E

S_{K_E}

_Σ

K_E

Axiom of infinity

In set theory with the powerset, the following statements are equivalent expressions of the axiom of infinity:

There exists an injective algebra whose language has at least a constant and a non-constant.
There exists a non-surjective injection of a set to itself; equivalently an injective {0,S}-algebra, or a set ℕ.
There exists an infinite set.
For any countable algebraic language there is a (countable) injective algebra
Any consistent first-order theory with a countable language has a (countable) model

4. ⇒ 2. ⇒ 1.; for 1. ⇒ 2. : from a non-constant operation s there, x↦s(k↦x) is an injective, non-surjective transformation.

We shall prove 2. ⇔ 3. below (finite cardinalities), and 2. ⇒ 4. ⇒ 5. in the section on the completeness theorem.

We may regard 5. ⇒ 4. as relative to the natural assumption that the concept of injective algebra forms a consistent theory. This theory is essentially the part of set theory which describes tuples. So, if a language L is included in a model of that simple part of set theory, then there exists an injective L-algebra. If such a model U is standard, this takes the form ^LU ⊂ U, making (U, Id_^LU) an injective L-algebra. ∎
The infinity of ℕ easily implies the uniqueness of the number of elements of a finite set, as its non-uniqueness would give that set a strict injection into itself, and thus an injection of ℕ into it.
2. ⇒ 3. will come from the infinity of ℕ.
For 3. ⇒ 2. we shall pick a different set, as the existence of an injective {0,S}-algebra structure on any infinite set would need the axiom of choice.

More properties of finite sets

Between finite cardinals, the order as cardinals coincides with the natural order in ℕ.

If E is finite, e∈ Min K_E, and K_E is a ground term with root e representing the number of elements of E
The set of finite subsets of E can be equivalently defined as Min_Σ ℘(E). A set is finite if it is a finite subset of itself (otherwise it is infinite).
For any sets E⊂F, (E is finite) ⇔ (E is a finite subset of F).

Equivalently, a set E is finite if it has a bijection to the graph of S in a ground Σ-term n (which can be seen as a unary {S}-term). This may as well be seen as a bijection to its domain {0,...,n-1} or to its image {1,...,n}.

Assuming the axiom of infinity, the finiteness of a set E is equivalent to the existence of a number n∈ℕ and a bijection from V_n = {x∈ℕ | x<n} to E; or a surjection from V_n to E, which is then bijective precisely when n takes its smallest value (the number of elements of E).

Without ℕ, either ℘(℘(E)) or (with more work) E^E can be used to express the finiteness of a set E, and to rebuild ℕ from E if E is infinite.

Any left cancellative element of a finite monoid is invertible. Thus any finite cancellative monoid is a group.
Proof: it acts as an injective transformation of a finite set, thus a permutation, by which the preimage of e is a right inverse, thus an inverse.

Countable sets

It is easy to check that any condensed ground Σ-draft is either ∅ or a ground Σ-term or a ground term Σ-algebra (only two other languages have this property, apart from those with no constant).
A set is called countable if, equivalently,

It has an injection to a condensed ground Σ-draft
It has a bijection with a condensed ground Σ-draft
A condensed ground Σ-draft has a surjection to it

1.⇒2. by restricting the order, once condensed ground {0,S}-drafts are described from their order structure; another method is given below.
2.⇒(1.∧ 3.) ; 3.⇒1. without AC, by picking the smallest element of each preimage.

So, countable sets are either finite or in bijection with ℕ. The latter, on which there exists a possible structure of ground term {0,S}-algebra, will be called countably infinite. However we still need to prove that they are indeed infinite.

Countability of ℕ×ℕ

y\x
0
1
2
3
4

0
0
1
4
9
16

1
2
3
5
10
17

2
6
7
8
11
18

3
12
13
14
15
19

A natural bijection between ℕ×ℕ and ℕ is defined by (x,y) ↦ (y < x ? x⋅x + y : y⋅y + x + y).

y\x	0	1	2	3	4
0	0	1	3	6	10
1	2	4	7	11
2	5	8	12
3	9	13

Another bijection b₂ : ℕ×ℕ → ℕ is defined as b₂(x,y) = T_x+y+y from the sequence (T_n) of triangular numbers defined by 2⋅T_n = n⋅(n+1) (see properties of parity) or equivalently by T₀=0 ∧ ∀n∈ℕ, T_n+1=T_n+n+1. This makes ℕ×ℕ a ground term (0,S)-algebra where

0=(0,0)
S(0,y)=(y+1,0)
S(x+1,y)=(x,y+1)

which recursively defines b₂^-1(z), where x+y is the only n such that T_n≤z<T_n+1.

Countability of finite sequences of integers

If there is a surjection f from ℕ to a set E then E is countable, as an inverse injection can be defined by associating to each x∈E its smallest preimage by f.

Fixing a bijection b₂ from ℕ² to ℕ, for any integer n>0 we can recursively define a bijection b_n:ℕⁿ→ℕ, for example:
b₁=Id_ℕ
∀n>0,∀u∈ℕⁿ, b_Sn(u) = b₂(b_n(u),u_n)
(which gives back b₂ as the particular case of b_n for n=2).
Also the set ℕ^(ℕ) = {u∈ ℕ^ℕ | ∃k∈ ℕ, ∀n>k, u_n=0} is countable. There are different ways to see it:

A sequence of bijections h_n : ℕ↔ E_n, gives a surjection ∐_n∈ℕ E_i ≈ ℕ×ℕ → U=⋃_n∈ℕ E_n so that taking E_n=ℕⁿ we find that U=ℕ^(ℕ) is countable.
With the decomposition in prime numbers
As the binary representation maps ℕ to the set 2^(ℕ) of finite subsets of ℕ, it also maps ℕ^(ℕ) to the set 2^(ℕ×ℕ) of finite subsets of ℕ×ℕ.

But this construction of ℕ^(ℕ)) and similarly simple candidate expressions of H as a countable set, assumes recursion, thus cannot be used to define recursion by the above method. To actually provide a definition of recursion that does not assume it, requires a different expression of an enumerated H. The representation of sequences which was used by Godel as a basis to define recursion, actually forms a different set.

4.7. The Completeness Theorem

Existence of countable term algebras

For any countable algebraic language L with at least a constant and a non-constant, finding a countable ground term L-algebra (necessarily infinite) amonts to defining a ground term L-algebra structure on ℕ, which is a bijection from ^Lℕ to ℕ.
Splitting L as C∪D where C is the set of constants, and D is the rest of symbols, we have ^Lℕ = C∪(^Dℕ). But C and D are countable (either finite or infinite), and ^Dℕ is a union over D of disjoint sets with explicitly definable bijections with ℕ (such as the b_{n_s} we saw). In any case, a bijection between C∪(^Dℕ) and ℕ can be found without problem.
In practice, such bijections from ^Lℕ to ℕ happen to be ground term L-algebra structures because ∀(s,x)∈^Dℕ, ∀i<n_s, x_i<s(x) which would be contradicted by the smallest element outside Min_L ℕ. ∎

Interpretation of first-order formulas

Trying to extend the formal construction of the interpretation of terms in algebras, to the case of formulas interpreted in systems, the difficulty is to cope with the interpretation of quantifiers (or generally binders, if we wish to still generalize).
A possibility, is to switch to a view over all variables as bound throughout the formula: from the concept of interpretation h_v∈E^T of a term T in an algebra E for every interpretation v of its set V of variables, the family (h_v)_v∈E^V is re-curried into a function from T to E^{E^V}. So, a first-order formula interpreted in a system E can be understood as a term interpreted in an algebra with maybe two base types : the sets Op_E and Rel_E of all operations and all relations in E; or split as two sequences of types, Op_E⁽ⁿ⁾ and Rel_E⁽ⁿ⁾. We might not need the full sets of these, but at least, an algebra of these (a subset stable by all needed logical operations).
We took unions over ℕ for the case we would need to see "all possible formulas" as terms interpreted in one same algebra.

The Completeness Theorem

Let us finally prove the Completeness theorem previously commented in 1.1, 1.9, 1.10, 1.B and 1.C.

Theorem. First-order logic has a proving system both sound and complete in the following equivalent senses

Any consistent first-order theory T with countable language has a countable model.
Any formula true in all countable models of such a theory is deducible from its axioms.

Sketch of proof of the first statement (implicitly suggesting a possible form of a proving system, but ignoring efficiency issues; the axiom of choice is not used and is anyway out of topic):

Reduce T to a single-type theory T₁ simulating the use of types by relevant unary predicate symbols and axioms, but without requiring all objects to belong to some type. Thus, the axiom (∃x, 1) can be added to T₁ without harm (it still allows types to be empty).
Then re-write axioms of T₁ in prenex form, that is chains of quantifiers applied to quantifier-free formulas (using equivalences from 2.5 and 2.6 simplified for a nonempty model).
Interpret the symbol = in axioms as an ordinary predicate symbol, with the axioms of equality [1.11].

Replace each occurrence of ∃ in an axiom by the use of an additional operator symbol K, for example

(∀x,x', ∃y, ∀z, A(x,x',y,z)) ⇢ (∀x,x',∀z, A(x,x',K(x,x'), z)).

Let M be the ground term algebra over the language of operator symbols enriched as just described. Reinterpreting all ∀ in axioms as the use of axiom schemas (one axiom for each replacement of variables by a tuple of elements of M), gives a propositional theory T₂ (its axioms are composed of logical connectives over Boolean variables), whose set of Boolean variables is ^PM where P is the set of predicate symbols in the theory plus the "equality" symbol.
Observe that T₂ is still consistent, as any contradiction in T₂ (= finite set of axioms not satisfied by any tuple of Boolean values of their variables) would provide a contradiction in T₁, as follows:

From all subterms occurring in used axioms of T₂, list without repetition all those whose root is a symbol S which was added to the language when converting some axiom ∀x,∃y,∀z,∃u, A(x,y,z,u) of T₁, into the axiom ∀x,z, A(x,K(x),z, S(x,z)) in T₂. Successively replace them all by variables in T₁: for terms S(t₀,t₁) and S(t₀,t₂) (where t₀, t₁, t₂ ∈ M), the reasoning in T₁ will say
"Let y such that ∀z,∃u, A(t₀,y,z,u);
let u₁ such that A(t₀,y,t₁,u₁);
let u₂ such that A(t₀,y,t₂,u₂);"
and use the contradiction in T₂ where y replaces K(t₀), u₁ replaces S(t₀,t₁) and u₂ replaces S(t₀,t₂).

Recursively by a chosen enumeration of ^PM as a countable set, add one by one to the axioms, each of these Boolean variables if it is consistent with previously accepted axioms, so that all get values without contradiction.
Then the quotient of M by the equivalence relation of "equality", forms a model (with the "true equality"). ∎

As this construction depends on an arbitrary order between formulas, different choices give different models, which may be non-isomorphic and even have different properties reflecting the undecidability of the theory's formulas.

Deduction of the second statement from the first :

T⊢ A
⇔ T ∪ {¬A} is inconsistent
⇔ T ∪ {¬A} has no countable model
⇔ A is true in all countable models of T. ∎

Skolem's Paradox

Comparing the results of the completeness theorem with Cantor's theorem gives this paradox :
If set theory is consistent then it has countable models, though it sees some sets there, such as ℘(ℕ), as uncountable.

A countable model U interprets the powerset "℘(E)" of an infinite "set" E∈U as the "set" P of all objects in U which play the role of subsets of E (but may differ from these when U is non-standard); as P is meta-countable, it cannot exhaust the uncountable meta-powerset of "all" subsets of E.
As P is meta-countable, it externally has a bijection with ℕ or U; but such bijections "do not exist" (have no representatives) as objects in U.

Rigorously speaking, when the powerset axiom says that the class of subsets of E is a set written ℘(E), it can only determine the interpretation of the functor ℘ relatively to (depending on) the universe where its open quantifiers are interpreted: «all subsets of E existing in this universe are in ℘(E)». In the other interpretation of what it means for a class to be a set, this would mean that the universe already contains "really all" subsets of E, forming the supposedly "true" set ℘(E) which will stay fixed in any further expansion of the universe. However, for the powerset of an infinite set, this wish cannot be expressed in first-order logic nor any other conceivable mathematical formalism: there is no way to talk about meta-sets that "cannot be found" in this universe but would «exist elsewhere» (in any mysterious bigger universe) to exclude them from there. (Only one aspect of this idea can be formalized as the axiom schema of specification).

4.8. More recursion tools

Rebuilding recursion

Let us work in a first-order theory just assumed to have 3 types ℕ, E and H⊂E^ℕ, the language and axioms of arithmetic for ℕ with the schema of induction over expressible subsets of ℕ, and the axiom that H provides "all finite sequences"

∀n∈ℕ, ∀u∈H, ∀y∈E, ∃v∈H, v_n=y ∧ ∀k<n, v_k = u_k

The following existential result has a simple proof by induction making no use of uniqueness:

∀R⊂ℕ×E², ∀n∈ℕ, (∀i<n, ∀x∈E, ∃y∈E, R(i,x,y)) ⇒ ∀a∈E, ∃u∈H, u₀=a ∧ ∀i<n, R(i,u_i,u_i+1).

Its simplicity only goes for this case, only generalizable to conditions of the form R(n,(u_k)_k<n,u_n), while the case of algebraic terms is also true but more difficult to prove.
As a particular case, comes the finite choice theorem written as

∀R⊂ℕ×E, ∀n∈ℕ, (∀i<n, ∃y∈E, R(i,y)) ⇒ ∃u∈H, ∀i<n, R(i,u_i).

With f∈E^ℕ×E the restriction of such u to numbers ≤ n is also unique by induction, from which the x∈E^ℕ such that x₀=a ∧ ∀n∈ℕ, x_n+1 = f(n,x_n), can be defined by its graph

{(n,u_n) | (n,u)∈ℕ×H, u₀=a ∧ ∀i<n, u_i+1 = f(i,u_i)}

As H contains any finite sequence expressible in the theory, this definition of recursion turns out to be "the recursion which the theory can express" independently of the particular choice of H.
The construction of ℕ^(ℕ) from the last section, and similarly simple candidate expressions of H as a countable set, assumes recursion, thus cannot be used to define recursion by the above method. To actually provide a definition of recursion that does not assume it, requires an independent expression of an enumerated H, which is harder but finally possible. Such a construction was achieved by Godel as part of his work to prove the incompleteness theorem.

Another proof of recursion

By seeing morphisms as subalgebras, we can write another construction of recursive sequences u∈Mor_(0,S)(ℕ,(E,a,f)), as follows.

Let M be the minimal subalgebra of ℕ×E_a,f, and let A={n∈ℕ | ∃!x∈E, (n,x)∈M}.
As M is a minimal (0,S)-algebra, M = {(0,a)}∪ Im S_M.
Substituting this into the definition of A we get

∀p∈ℕ, p∈A ⇔ (∃!y∈E, (p=0 ∧ y=a)∨∃(n,x)∈M, (p=Sn ∧ y=f(x))).

From 0 ∉ Im S we get 0∈A, and

∀n∈ℕ, Sn∈A ⇔ ∃!y∈E, ∃(n',x)∈M, (Sn=Sn' ∧ y=f(x)).

From the injectivity of S we get

∀n∈ℕ, Sn∈A ⇔ ∃!y∈E, ∃x∈E, (n,x)∈M∧ y=f(x).

Thus (∀n∈A, Sn∈A), so that A = ℕ, i.e. M is the graph of a function u ∈ E^ℕ. As M ∈ Sub(ℕ×E_a,f), we conclude u ∈ Mor_(0,S)(ℕ,E_a,f).

Injectivity lemma

If E is a surjective partial L-algebra and F is an injective L-system then ∀f ∈Mor_L(E,F),

A = {x∈E | ∀y∈E, f(x) = f(y) ⇒ x=y} ∈ Sub_LE.
B = {y∈F | !: f^•(y)} ∈ Sub_LF
Thus if more precisely E, F are algebras, {y∈F | ∃!: f^•(y)} ∈ Sub_LF.

While they are almost the same, let us write separate proofs.
The condition f ∈Mor_L(E,F) is read Im f ⊂ Dom ψ_F ∧ ^Lf_{|Dom φ_E} = ψ_F⚬f⚬φ_E.

∀x∈^LA⋂Dom φ_E, ∀y∈E, ∃z∈φ_E^•(y),
f(φ_E(x)) = f(y) ⇒ ^Lf(x) = ψ_F(f(φ_E(x))) = ψ_F(f(y)) = ψ_F(f(φ_E(z))) = ^Lf(z)
⇒ x = z ⇒ φ_E(x) = y.
∀y∈F_*(^LB), ψ_F(y) ∈ ^LB ∴ (!z∈Dom φ_E, ψ_F(y) = ^Lf(z) = ψ_F(f(φ_E(z))))
∴ !x∈Im φ_E=E, y = f(x).∎

A more general form of recursion

Some useful sequences need recursive definitions where the term defining u_Sn uses not only u_n but also n itself. Somehow it would work all the same, but trying to directly adapt to this case the proof we gave would require to define some special generalizations of previous concepts, and specify their resulting properties. To simplify, let us proceed another way, similar to the argument in Halmos's Naive Set Theory, but generalized.
For any algebraic language L, let us introduce a general concept of "recursive condition" for functions u : E → F, where, instead of a draft, E is first assumed to be an L-algebra (then a ground term algebra to conclude).
The version we saw was formalized by giving the term in the recursive definition as an L-algebra structure on F, φ_F: L⋆F → F, then expressing the request for u to satisfy this condition as u∈Mor(E,F), namely

∀(s,x)∈L⋆E, u(s_E(x)) = φ_F(s,u০x).

Let us generalize this as u(s_E(x)) = φ(s,x,u০x) which by the canonical bijection Dom φ ≡ ∐_s∈L E^n_s×F^n_s ≡ ∐_s∈L (E×F)^n_s = L⋆(E×F) can be written using h : L⋆(E×F) → F such that ∀(s,x,y)∈ Dom φ, φ(s,x,y) = h(s,x×y), as

u(s_E(x)) = h(s,x×(u০x)).

As ∀u∈F^E, x×(u০x) = (Id_E×u)০x, this becomes the second component of the formula Id_E×u ∈ Mor(E, E×F) when giving E×F the structure φ_E×F = (φ_E০π_L)×h.
The first component (φ_E০π_L) we give to φ_E×F, makes π∈ Mor(E×F, E) and makes tautological the first component of the formula Id_E×u ∈ Mor(E, E×F), namely
Id_E(s_E(x)) = φ_E(s,x) = (φ_E০π_L)(s,x×(u০x)).
It is then possible to conclude by re-using the previous result of existence of interpretations:

If E is a closed term L-algebra then ∃!f ∈ Mor(E, E×F), which is of the form Id_E×u because π০f ∈ Mor(E, E) ∴ π০f = Id_E.

But one can do without it, based on the following property of this L-algebra E×F:

∀u∈F^E, Id_E×u ∈ Mor_L(E, E×F) ⇔ Gr u ∈ Sub_L(E×F)

Indeed the defining formulas of both sides coincide. To see it otherwise,

⇒ is a case of image of an algebra by a morphism, Gr u = Im (Id_E×u).
For the converse, the inverse of the bijective morphism π_{|Gr u} ∈ Mor_L(Gr u, E) is a morphism Id_E×u ∈ Mor_L(E, Gr u) ⊂ Mor_L(E, E×F).

This reduces the issue to the search of subalgebras of E×F which are graphs of functions from E to F.
Now if E is a ground term L-algebra then M = Min_L(E×F) is one of them because π_|M∈ Mor_L(M, E) from a surjective algebra to a ground term algebra must be bijective.
Any other subalgebra of E×F must include M, thus to stay functional it must equal M. ∎

4.9. Non-standard models of Arithmetic

Standard and non-standard numbers

Since the induction axiom needed to determine the isomorphism class of ℕ depends on ℘(ℕ), the relativity of ℘ allows a diversity of non-isomorphic models of arithmetic respectively defined by the diverse models of any consistent first-order theory T which contains arithmetic. Many theories may be picked in the role of T, such as a set theory seen as a first-order theory, thus involving other notions than numbers (to define more sets of numbers than arithmetically defined classes). All we shall assume here is that it has a notion N of "natural number" with a constant 0, a function symbol S, and axioms making N a model of bare arithmetic (in any model of T), thus a bijective {0,S}-algebra.

A number in N is called standard if it belongs to the minimal subalgebra N₀ = Min_{0,S}N, isomorphic to ℕ by the unique {0,S}-morphism (embedding, n↦S_Nⁿ(0_N)) from ℕ to N. So, it represents an element of ℕ, which may be called a meta-number.
A model N of arithmetic is called standard if it is a minimal {0,S}-algebra, which can be equivalently written in 3 ways:

It satisfies the full induction axiom (in terms of powerset);
It is isomorphic to ℕ (also called the standard model of arithmetic);
All its elements are standard.

We shall abbreviate some works of first-order logic by set theoretical notations as follows :

Each meta-number n∈ℕ will be confused with the ground term "Sⁿ(0)" and its value as a standard number;
The quantifier "∀n∈ℕ" at the root of a formula, means to declare a schema of one formula per value of n in ℕ;
For each n∈ℕ, the quantifier ∀x<n (resp. ∃x<n) can equivalently be read using x as an object (∀x, x<n ⇒), or as a string of conjunctions (resp. disjunctions) of n copies of the sub-formula with each value of x as a meta-number lower than n.

Existence of non-standard models

The non-standardness of N is expressible by axioms on an additional structure, forming an extended theory T'. One way is to add

A constant symbol k with range N (which may need the axiom k∈N);
The schema of axioms n ≠ k for each n∈ℕ, making the number k non-standard.

If T is consistent then T' is also consistent (and cannot prove more), as any finite list of axioms of T' can be satisfied in any model of T (by interpreting k as a different number). Thus, T' also has a model, which is a model of T where N is non-standard.

This means that no consistent extension of arithmetic can ever force its model of arithmetic to be standard, i.e. require induction to apply to the full powerset of N, which it cannot ensure to exhaust by its classes. Candidate "standardness" predicates may be introduced but still cannot be forced to coincide with the true standardness (N₀), which may stay a meta-set beyond all classes.

Any model of arithmetic obtained as described in the proof of the completeness theorem from any consistent extension of arithmetic, will necessarily be non-standard, as it cannot even be elementarily equivalent to ℕ according to the truth undefinability theorem. Skolem's paradox still holds in two ways:

the meta-countable set P interpreting "℘(ℕ)", whose elements serve as subsets of N, still cannot exhaust ℘(N) which is also meta-uncountable (on the meta level, bijections between sets N and ℕ induce bijections between their powersets, which preserve uncountability);
the image of P projected to ℘(ℕ) by taking preimages by the embedding from ℕ to N, does not fill ℘(ℕ) either.

Non-standard models behave much like standard ones, as they satisfy all consequences of induction. Their precise range of diversity depends on the chosen theory of arithmetic.

Non-standard models of bare arithmetic

The only "addition" definable from S, is the meta-operation adding any n∈ℕ to any x∈N as x + n = Sⁿ(x), that is the meta-sequence of functions (Sⁿ)_n∈ℕ each defined in the theory by a unary S-term n. The consequences of the induction schema in bare arithmetic can be summed up as

The bijectivity of N as a {0,S}-algebra, which implies the bijectivity of S on the set of non-standard numbers
The schema of formulas ∀n∈ℕ*, ∀x∈N, x + n ≠ x which is a weak version of (F).

As the restriction of S to the meta-set N\N₀ of non-standard numbers of any model is a permutation, N\N₀ is a ℤ-set. The last formula obliges this ℤ-set to be free. Conversely, the disjoint union of ℕ with any free ℤ-set, forms a model of bare arithmetic.
Applying both ways the definition of the order from the partial meta-operation of addition, gives two ordering results with non-standard numbers:

Each ℤ-orbit of non-standard numbers has its own total order.
Each non-standard number is greater than each standard number, thus may be called «infinitely big».

Models of arithmetic with order, are formed as models of bare arithmetic with any choice of a total order on the partition of non-standard numbers in ℤ-orbits. Thus, bare arithmetic cannot suffice to define the order. But arithmetic with order cannot suffice to define addition either, as its non-standard models may either admit no or many corresponding interpretations of addition.

Non-standard models of Presburger Arithmetic

Such models satisfy all theorems of Presburger Arithmetic. In particular they have a well-defined total order, by which any non-standard number is greater than any standard number, and any non-empty class of numbers has a smallest element (which the meta-set of non-standard numbers hasn't).

Here are some more details only for the sake of illustration, which may be skipped.

There is also a meta-operation (sequence of functions) of multiplication of an x∈N by a standard number n∈ℕ : n⋅x = x+...+x (with n occurrences of x)
Beyond commutativity, associativity and the seen properties of the order, the last independent consequence of the axiom schema of induction constraining non-standard models, is the possibility of Euclidean division by any nonzero standard number (generalizing the results on parity) :

∀d∈ℕ*, ∀x∈N, ∃q∈N, ∃r<d, x = d⋅q + r

thanks to ∀d∈ℕ*, ∀q∈N, d⋅(q+1) = d⋅q+d which is a schema of theorems in Presburger arithmetic.
Moreover this (q,r) is unique; q = x:d∈N is called the quotient and r is called the rest of the division of an x∈N by a d∈ℕ*, so that

∀q∈N, q=x:d ⇔ d⋅q ≤ x< d⋅(q+1) ⇔ ∃ r<d, x = d⋅q + r.

The concept of model of Preburger arithmetic generated by a set (of non-standard numbers, since standard ones have no generating effect), is defined as that of subalgebra generated by a subset for the "algebra" (not exactly an algebra, but...) with language completed, like in the proof of the completeness theorem, by additional operation symbols reflecting all existence properties deduced from the axiom schema of induction:

The subtraction of a number by a lower number (or the absolute value of subtraction), which was implicit in the definition of the order and the proof that it is total;
The sequence of functions (x↦ x:d) of Euclidean division by each d∈ℕ*.

They can be conceived either in the abstract (by evaluating relations arbitrarily, like in the proof of the completeness theorem) or as a subset of a given model (interpreting expressions there).
In particular, in any model of Preburger arithmetic generated by a single element (non-standard k∈N), the set of non-standard elements is the set of all values of expressions of the form (a⋅k):d +b where a,d ∈ℕ* and b∈ℤ (the cases where a,d are relative primes suffice).
The predicate of divisibility of x∈N by a d∈ℕ*, is defined as the case when the rest cancels: d|x ⇔ (∃q∈N, x = d⋅q)

The possible shapes of these models with respect to k (classes of isomorphisms preserving k, described using it), are classified by the sequence (r_d)_d∈ℕ* of rests of the division of k by all standard numbers d. The possible sequences are those which satisfy not only r_d<d but also the compatibility formulas : ∀n,d ∈ℕ*, ∃h, r_d⋅n = d⋅h + r_d (where in fact h<n). The simplest one is where all r_d are 0, i.e. where k is divisible by every standard number (but the distinguishing property of this isomorphism class of models with 1 generator, "there exists a number divisible by every standard number", is inexpressible in Presburger arithmetic).

The non-standard models generated by 2 non-standard numbers k,k' can be split into the following classification depending on what may be intuitively described as the (standard) real number which k/k' is infinitely close to:

k/k' may be infinitely large or infinitely small;
It may be close to an irrational number
But the cases when it is close to a rational number a/b for standard numbers a, b, are reducible to other cases:

If the difference between b⋅k and a⋅k' is standard then the model is actually generated by only one element (any non-standard element is a generator).
If this difference is non-standard then, replacing one generator by this difference, reduces this model to the first case.

4.10. Developing theories : definitions

Given a theory T (with types, structures and axioms), a development of T is a extended theory T' = T∪(pack of extra components) which remains «equivalent» to T. This concept has 3 aspects:

In which sense is T' «equivalent» to T :
- For any extension T' of T, any model of T' defines by restriction a unique model of T ;
- Such a T' is qualified a development of T if conversely, any model of T has a «unique extension» as a model of T', in a sense we need to specify.
A list of developments schemes, where each is the data of a pair (X,Y) of theoretical components (a sub-theory X of a theory Y) so that any morphism from X to a theory T allows to add to T a copy of the rest of Y beyond its sub-theory X identified as its image in T. A list of developments schemes is sufficient when any legitimate development is producible by a succession of steps each described by one of these schemes.
On the formal side: how, for each such scheme, the use of extra components in works (expressions, proofs...) with T' can be translated into (usually longer) works with T, and thus be seen as abbreviations of them.

There are 3 levels of development with their respective schemes:

A proof only adds one ground formula to the axioms, as this formula is shown to be a theorem (deduced from some of the previous axioms).
A definition adds both a new structure symbol and a new axiom.
A construction adds altogether a new type, new structure symbol(s) and new axiom(s).

The use of a theorem as an available axiom for the proofs of the next theorems, abbreviates a repetition of its proof as a part of the next proofs (with possible substitutions of variables by terms to which the theorem is applied).
When inserting theories in the set theoretical framework, constructions become set theoretical definitions, which is why the distinction between definitions and constructions is not usually clarified in the literature.

The Galois connection (Mod,Tru)

For any language L, the truth relation between ground formulas and systems defines a Galois connection (Mod,Tru), where Mod gives the class of models of any set A of ground formulas taken as axioms, while Tru gives the set of true formulas which all given systems have in common. Replacing the class of all systems by the set of all L-structures on ℕ, leaves unchanged and thus gives definite sense to the closure Tru০Mod. All theorems of a theory belong to the closure Tru০Mod(A) of axioms, which is the condition for a formula to leave unchanged Mod A when it is added to A.

Schemes of definitions

We have 3 schemes of definitions:

A formula φ defines a new relation symbol R with arguments the free variables of φ, with the new axiom ∀(variables), R(variables)⇔φ(variables). The use of R can be replaced by the use of φ as a sub-formula of other formulas (replacing the free variables of φ by the terms used as arguments of R).
A term t defines an operation symbol S, with the axiom ∀(variables), S(variables)=t(variables). This S abbreviates, and can be replaced by, its defining term t.
Given a formula φ(variables, x), if we have (∀(variables),∃!x, φ(variables, x)) then it defines a new operation symbol S, with the new axiom
∀(variables), φ(variables, S(variables)). This scheme may be split in 2 steps : the definition of a relation symbol R by φ (by the first scheme); then a reduced version of the last scheme directly using R instead of a formula φ. This case was discussed with the uniqueness quantifiers (2.4).

Extending models by undefined structures

Thanks to the completeness theorem, if every model of T can be extended as a model of a T' = (T∪{R,A}) where R is a new symbol and A is a new (set of) axiom(s), then every ground formula of T provable in T', is also provable in T.
The converse may be wrong: a theory T may be extensible to a theory T' unable to prove more of its formulas, but to which not all its models are extensible. Here are some examples from non-standard models of arithmetic :

T is any arithmetic and T' expresses non-standardness by letting R be a constant (number), and A the axiom schema making it non-standard.
The other expression of non-standardness by a unary predicate symbol R which may mean the standardness predicate in a non-standard model, as A says the the induction axiom fails on R: R(0) ∧ (∀x, R(x) ⇒ R(S(x))) ∧ (∃x, ¬R(x))
T = bare arithmetic; T' = Presburger arithmetic
T = Presburger arithmetic; T' = full arithmetic (with addition and multiplication)

Definitions extend models

For any theory T with a model E, extensions T' of T by an additional axiom and/or structure symbol may have different effects:

If T' only has a new axiom A then E may stay or not as a model of T', depending whether A is true or false in E; it always does if A is provable from previous axioms;
If T' only has a new symbol then E may have several extensions as models of T' by giving this symbol arbitrary interpretations, except for special cases
Definitions precisely combine a new symbol R and a new axiom A, so that any model of T has a unique extension to an interpretation of R that satisfies A, forming a model of the extended theory (T∪{R,A}).

The uniqueness of the extension for every E, means that in the doubly extended theory (T∪{R,A,R₁,A₁}) where (R₁,A₁) is a copy of (R,A), one can prove

∀(variables), R⇔R₁

Definitions preserve sets of isomorphisms

For any isomorphism f between models E and F of T, with respective extensions as models E' and F' of a theory T' extending T by further structures and axioms, f may stay or not as an isomorphism between E' and F', i.e. preserve the interpretation of the new structure(s). It will be an isomorphism whenever the additional axiom(s) form a definition, determining the new structure(s) and thus making it invariant by isomorphisms. Thus, adding a defined structure to a theory preserves the sets of isomorphisms between its models.
In particular, any relation defined by an expression with language L⊂R without parameters is invariant by automorphisms, so is in its closure Inv(Aut_L E). This means that adding it to L preserves Aut_L E.
As we saw how the switch from models to isomorphisms provides similarities between development levels from proofs to definitions, we shall now see it also providing similarities from definitions to constructions, making sense to how constructions also essentially preserve theories.

4.11. Constructions

Diverse construction schemes let new types play the roles of sets defined in set theory by operations between sets named by previous types:

From a type E and a unary relation φ on E we can construct a type F in the role of the subset {x∈E| φ(x)}, with a function symbol j:F→E in the role of canonical injection, with axioms

∀_Fx,y, j(x) = j(y) ⇒ x=y meaning that j is injective
∀_Ex, φ(x)⇔∃_Fy, j(y)=x meaning that Im j = {x∈E| φ(x)}

From 2 types E,F a type G is constructed in the role of the sum E⊔F, by function symbols i:E→G , j:F→G and axioms

∀_Ex,y, i(x) = i(y) ⇒ x=y
∀_Fx,y, j(x) = j(y) ⇒ x=y
∀_Gx, (∃_Ey, i(y) = x)⇎(∃_Fy, j(y)=x)

A type K_n of n constants (for any integer n) can be constructed ex-nihilo, with n constant symbols k₁, ... k_n, the axiom (∀_Kx, x=k₁ ∨ ... ∨ x=k_n) and for every 0<i<j≤n the axiom k_i≠k_j.
From 2 types E,F, a type G is constructed in the role of the product E×F, by function symbols i:G→E , j:G→F, a binary operation symbol *:E×F→G and the axiom

∀_Ex, ∀_Fy, ∀_Gz, z = x*y ⇔ (x=i(z)∧y=j(z))
A variable of type G (or argument in a structure symbol), abbreviates a copy of the given list of variables (here with types (E,F))
A structure with values of type G, abbreviates a pack of 2 structures (to be used together as arguments of another structure, in the place of an argument with type G).

Each of the following schemes suffices to produce both others through the use of previous ones:

From a type E, a binary relation R on E and the theorem making R an equivalence relation (∀_Ex,y, xRy ⇔ (∀_Ez,xRz⇔yRz)), we can construct a type Q in the role of the quotient E/R with a function symbol q:E→Q, and axioms

∀_Ex,y, q(x)=q(y) ⇔ xRy
∀_Qx, ∃_Ey, q(y)=x

u,v

_Ex,y

xRy

u,v

equivalence relations and quotient

A type of unary relations: from 2 types E,F and a relation R between them, we construct the image Q of E in ℘(F) by the curried R, by 2 new structures

a function q:E→Q
a relation R' between Q and F,

∀_Qx,∃_Ey, q(y)=x
∀_Ex,∀_Fy, q(x)R' y ⇔xRy
∀_Qx,y, (∀_Fz, xR' z ⇔ yR' z) ⇒ x=y. Or equivalently,∀_Ex,y,(∀z∈F, xRz ⇔ yRz)⇒ q(x)=q(y)
The last 2 axioms are together equivalent to
∀_Ex,∀_Qy, (∀_Fz, yR' z ⇔xRz) ⇔ q(x) =y.

A type of structures K of any kind defined by a fixed expression with parameters, meaning all structures defined by this expression with all values of parameters. Its structures are:
- A quotient operation, from parameters to K
- An evaluator, interpreting K as a set of structures.

The last one can be obtained by composing constructions of products (to collapse bound variables as one, and the same for parameters), a definition and a type of unary relations or functions (either as particular relations, or by adapting the form of axioms, replacing ⇔ by = ).

A development scheme at each level looks like a component at the next level

Despite their differences of nature, the 3 levels of development have remarkable similarities:

A construction of a type of structures is given by a variable structure, that is a structure symbol with a selection of variables to play the role of parameters. Structures in the language are invariant, i.e. without parameter. This construction is worthless if the chosen symbol is without parameter (or behaving as such: independent of parameters) which would give singleton; or without other variable, as it would give the Boolean type.
Definitions look like axioms: a definition of a predicate is given by a formula with free variables; axioms are chosen from ground formulas. Any formula merely defining a Boolean constant, i.e. with value depending only on the model and not on variables, such as ground formulas or provable formulas (in the sense that any free variables are implicitly bound by ∀), do not play any effective role of structures of the described system (giving roles to objects). Predicates defined without using the language, are not structuring either.
As systems using axioms, proofs are similar to contradictions. A contradiction is a kind of proof with no theorem but 0 (False), which would be a very bad axiom. We might regard contradictions as a 4th kind of component of theories after types, symbols and axioms; not only the list of given contradictions should be empty for a theory to be of interest, but the set of possible contradictions should be empty as well.

How constructions preserve isomorphisms

From any theory T, consider any extension T' = T∪X∪R∪A where X is a type, R is a pack of structures and A is a pack of axioms. The existence of an extension of a given model of T as a model of T', is a matter of A being small enough for these X and R. Its uniquess would not make direct sense as any identical copy would also fit. But extensions may still be qualified as «essentially unique» (being all copies of each other) giving sense to how constructions preserve models, depending on the set of isomorphisms between them. For any models E' and F' of T' respective extensions of models E and F of T, and any isomorphism f : E → F,

An extension of f as an isomorphism from E' to F' will have to exist if A suffices to "determine" (make essentially unique) the extension of the model (interpretation of X and R) in the following sense: any two extensions are isomorphic (making unique the isomorphism class of extended models E'), by some isomorphism whose restriction to old types is the identity. Therefore in this case, any formulas in T' without free variable in the new type are provably equivalent to formulas expressed in T, may they be ground formulas (as they describe the extended model) or with some free variables in old types (as they may define structures on previous types, which the isomorphisms must preserve).
The uniqueness of the extension of f is a matter of T' having enough structures to prevent E' from having any more symmetry (nontrivial automorphism) than E. For example if R is empty, if the interpretations X and X' in E' and F' are equinumerous then any bijection between them extends f as an isomorphism between E' and F'; this is non-unique when X has at least 2 elements.
The extension T' of T is a construction of X in T, when it ensures both the existence and uniqueness of an extension of f to X as an isomorphism from E' to F'. This can be formally verified in the doubly extended theory T''= (T∪X∪R∪A∪X'∪R'∪A'), by defining without parameter, a function from X to X', and proving that it is a bijection extending Id_E as an isomorphism j between both models E' , E'' of T' obtained by restricting a model of T'' to copies (T∪X∪R∪A) and (T∪X'∪R'∪A') of T'.

Indeed in this case, any 2 isomorphisms g,h from E' to F' with the same restriction to E, will be equal because

The automorphism h⁻¹০g of E' extended by Id_X' forms an automorphism of the model (E'∪E'') of T''. As j is invariant, it stays equal to its image j০(h⁻¹০g)⁻¹ by this automorphism. Therefore h⁻¹০g = Id_E' , i.e. g=h.

What we can see here strictly preserved by constructions, is the class of isomorphisms as an abstract category. In particular, constructions leave unchanged the automorphism groups seen as abstract groups, providing more types on which they act.

4. Arithmetic and first-order foundations

4.1. Algebraic terms

Introduction

Drafts

Sub-drafts and terms

Categories of drafts

Images of drafts

Intepretations of drafts in algebras

4.2. Quotient systems

Quotients of relational systems

Quotients in concrete categories

Congruences

Quotients of modules

Intersections of congruences

Minimal congruences

Condensates of injective systems

4.3. Term algebras

Synonymous terms

Term algebras

Term algebras in injective algebras

Particular languages

Trajectories by transformation sets

Free monoids

4.4. Integers and recursion

The set ℕ

Recursively defined sequences

Addition

Inversed recursion and integers

Multiplication

4.5. Presburger Arithmetic

First-order theories of arithmetic

Presburger arithmetic

Equivalence of axiom schemas

The order relation

Parity

Images of recursive sequences

4.6. Finiteness and countability

Cardinals

The {0,S}-structure on powersets

Finite cardinals

Axiom of infinity

More properties of finite sets

Countable sets

Countability of ℕ×ℕ y\x 0 1 2 3 4 0 0 1 4 9 16 1 2 3 5 10 17 2 6 7 8 11 18 3 12 13 14 15 19

Countability of finite sequences of integers

4.7. The Completeness Theorem

Existence of countable term algebras

Interpretation of first-order formulas

The Completeness Theorem

Skolem's Paradox

4.8. More recursion tools

Rebuilding recursion

Another proof of recursion

Injectivity lemma

A more general form of recursion

4.9. Non-standard models of Arithmetic

Standard and non-standard numbers

Existence of non-standard models

Non-standard models of bare arithmetic

Non-standard models of Presburger Arithmetic

4.10. Developing theories : definitions

The Galois connection (Mod,Tru)

Schemes of definitions

Extending models by undefined structures

Definitions extend models

Definitions preserve sets of isomorphisms

4.11. Constructions

A development scheme at each level looks like a component at the next level

How constructions preserve isomorphisms

Countability of ℕ×ℕ

y\x
0
1
2
3
4

0
0
1
4
9
16

1
2
3
5
10
17

2
6
7
8
11
18

3
12
13
14
15
19