4.6. Finiteness

Equinumerosity and cardinals

Given any small category with a set O of objects, one may consider its objects "up to isomorphism", by taking the quotient of O by the relation ∼ of isomorphism. Precisely, an attribute may be attached to objects, by saying that two objects have the same [something] if and only if they are isomorphic. This is formalized as a function K from O to some other set, whose equivalence relation is ∼, while no other detail of this function matters. But, the use of this function may as well be seen as an abbreviation of, or simulated by, the use of objects themselves, where the formula "K(A) = K(B)" means to abbreviate AB (the existence of an isomorphism between A and B), and more sophisticated uses follow from there (for example, an inclusion XY ⊂ Im K can be reinterpreted with XO, YO as ∀AX, ∃BY, AB).
This may be generalized to any category beyond the small ones, as justified either in terms of that abbreviation/simulation, or by the context of use where the non-smallness of the category has no impact, since objects are only handled "some fixed set of them at a time", i.e. interpretable as the use of some small category, no matter which one precisely. Another possible justification would be to change the formalization of set theory to admit another primitive class of objects beyond sets and functions; this would be an extension or development of set theory, in the sense further discussed in 4.11).

In particular for the category of sets, the equivalence predicate ≈ of equinumerosity between sets, is defined by the existence of a bijection :

AB ⇔ ∃fBA, f : AB

Equinumerous sets are said to "have the same cardinal" : |A| = |B| ⇔ AB where |A| denotes the cardinal of a set A.

On the "class" of cardinals, structures of addition, successor, multiplication and exponentiation are naturally defined from the relevant operations between sets,

0 = |∅|
1 = |{x}| for any x
|A| + |B| = |AB|
S(a,b) ⇔ a+1 =b
|A| ⋅ |B| = |A×B|
|A||B| = |AB|

with their famous equality properties deducible from the existences of canonical bijections (2.8).
Cardinals are naturally ordered by : |A| ≤ |B| if there exists an injection from A to B. Indeed it is obviously a preorder (composites of injections are injections), while its antisymmetry is given by the Cantor-Bernstein theorem. For any sets A, B, C,

|A| + |B| = |C| ⇔ (∃DC, |B| = |D| ∧ |A| = |C\D|) ⇒ (∃DC, |A| + |D| = |C|) ⇔ |A| ≤ |C|

Also, |A| ≤ |℘(A)| because (Ax↦{x}) : A ↪ ℘(A), while |A| ≠ |℘(A)| by Cantor's theorem (2.7).

For any sets A, B, if |A| + 1 = |B| + 1 then |A| = |B|.

Proof (simpler than done elsewhere).

More generally, if |A| + 1 ≤ |B| + 1 then |A| ≤ |B|.
Indeed if |C| + |A| + 1 = |B| + 1 for some C, then also |C| + |A| = |B|, i.e. |A| ≤ |B|. ∎

Another easy result is

|A| ≤ |B| ⇔ (|A| = |B| ∨ |A| + 1 ≤ |B|)

Finiteness

Let us give to any powerset ℘(E), the {0,1,S,+}-structure defined by restriction of the natural meta-{0,1,S,+}-structure on the class of sets:

0 = {∅}
1℘(E) = 1E = {{x}|xE}
setA,B, ASB ⇔ ∃xB, A = B\{x}
setA,B,C, +((A,B), C) ⇔ (AB = ∅ ∧ C = AB)

It forms a partial {0,+}-algebra, embedded in the product monoid ℕE (if ℕ exists), namely identifiable to {h∈ℕE | Im h ⊂ {0,1}}.
Its component 0 may also be written as the constant 0 = ∅ when using the convention for algebras.
The cardinality function forms a {0,1,S,+}-morphism from ℘(E) to the class of cardinals.
If +((A,B), C) then

(A) ∈ MorS(℘(B),℘(C))
∅⌈SEBASEC ⇒ (∅⌈SEA ⇒ ∅⌈SEC)
1E{0,+} ⊂ 〈0〉S = Min{0,S} ℘(E) ⊂ Min{0,1,+}℘(E)

Tbe set 〈0〉S = Min{0,1,+}℘(E) = 〈1E{0,+} is the set of finite subsets of E.
So, a set FE is finite when F ∈ Min{0,1,+}℘(E) independently of the chosen E such that FE:

(+((A,B), C) ∧ CF) ⇒ (AFBF)
Min{0,1,+}℘(F) = ℘(F) ∩ Min{0,1,+}℘(E)

Finiteness only depends on cardinality, since any bijection between E and F defines an {0,1,+}-isomorphism between ℘(E) and ℘(F). For this reason, it will also qualify cardinals: a finite cardinal is a cardinal of finite sets.
If FE and E is finite then F is finite because

(F) ∈ Mor{0,+}(℘(E),℘(F))
(F)[1E] ⊂ 〈1F{0}
(F)[〈1E{0,+}] ⊂ 〈∩(F)[1E]〉{0,+} ⊂ 〈1F{0,+}

Any finite union of finite sets is finite.

Proof. In the binary case, the finiteness of A and B implies that of AB = A∪(B\A), where B\AB is finite.
The general case of a union U = ⋃xE Ax where E and each Ax are finite, follows by

{FE|⋃xF Ax ∈ Min{0,1,+}℘(U)} ∈ Sub{0,1,+}℘(E) ∎

Any term over a language where all symbols have finite arities, is finite.
The proof involves the stability of the set of roots of finite sub-terms.∎

Any finite product of finite sets is finite.

Proof. A product of two finite sets is a finite union of finite sets. The general case follows like for unions. ∎

In particular, The finite choice theorem (announced in 2.10) is deduced in the same way.

For any finite set A ⊂ Dom f,

  1. f[A] is finite,
  2. |f[A]| ≤ |A|.
Proof:
  1. f[A] = ⋃xA {f(x)}
  2. The finite choice theorem then gives the existence of a section of f|A.
A section of f|A can also be explicitly defined when A = Vn, by y ↦ min f(y).

ℕ and finite cardinals

Let us keep working independently of the axiom of infinity, and now bring tools to express equivalent forms of it.

A set E is called Dedekind-infinite if, equivalently

  1. |E|+1 ≤ |E|
  2. |ℕ| ≤ |E|
  3. |E|+1 = |E|
Proof 1.⇒2. : |E|+1 ≤ |E| means there exists f:EE and x ∈ ∁E Im f, which implies 〈{x}〉f ≈ ℕ (as a unary {f}-term algebra).
2.⇒3. : if ℕ ⊂ E then the {0,S}-structure of ℕ extended by IdE\ℕ forms a bijection |E|+1 = |E|.∎

Dedekind-infinity implies infinity, i.e. ℕ is infinite.
Proof: But, the converse depends on a version of the axiom of choice, as will be discussed in 5.5.

Any finite cardinal is comparable with any other cardinal, i.e. |E| ≤ |F| ∨ |F| ≤ |E| whenever E is finite.

Proof : for any fixed set F, the class of sets E such that |E| ≤ |F| ∨ |F| ≤ |E| is {0,S}-stable.∎

Such reasonings to deduce a property of any set E by stability in the class of sets (or of cardinals), are justified as implicit uses of stability in the system ℘(E).
Likewise, the class of cardinals ≤ |E| can be expressed as a quotient set KE = ℘(E)/≈, whose {0,1,S,+}-structure (restriction of the one among cardinals) coincides with the quotient of the one of ℘(E). The natural surjection plays the role of the cardinality function

| |E : ℘(E) ↠ KE.

Let ME = Min{0,S} KE the set of finite cardinals of subsets of E (thanks to the formula ∀BF, f(B) = MinLEB = MinLF from 4.2).

As {0,S}-systems, KE and ME are functional, surjective and injective, and

KE = Dom SKE ∪ {|E|E}
Dom SME = ME ∩ Dom SKE
ME ⊂ Dom SME ∪ {|E|E}
|E|E ∉ Dom SME

Proof of the last formula: Hence as noted in 4.3, Hence the equivalence between several expressions of the axiom of infinity: If it holds (in a given universe) then the range of finite cardinals is expressible as the set ℕ, which may be identified with the ground {0,S}-term algebra ME for any infinite E.
Otherwise, namely in finite set theory, finite cardinals may still be identified as "natural numbers" conceived as synonymity classes of {0,S}-terms, which form a model of arithmetic out of the given model of finite set theory, but with a caveat explained below. Either case needs some technical justifications by the following results.

For any sets E, F such that |E| ≤ |F|, there is a (unique) natural embedding jE,F ∈ Mor{0,1,+}(KE,KF) defined as "the identity on cardinals", i.e.

AE, ∀f:EF, jE,F(|A|E) = |f[A]|F ≤ |Im f|F

Conversely, if E is finite and h ∈ Mor{0,S}(KE,KF) then |E| ≤ |F| and h = jE,F.
Proof. Denoting G = EF,

{jE,G , jF,Gh} ⊂ Mor{0,S}(KE,KG)
jE,G = jF,Gh
|E|G = jF,G(h(|E|E)) ≤ |F|G

hence h = jE,F because jF,G is injective.∎

Similarly if E is infinite, any h ∈ Mor{0,S}(ME,KF) also preserves cardinals (jF,Gh = jE,G |ME).
But, the result does not extend to KE with an infinite E. In details, h ∈ Mor{0,S}(KE,KF) neither implies h ∈ Mor{≤}(KE,KF) nor injectivity of h, nor Im h = ≤⃖(h(|E|E)). Taking all these as additional hypotheses, to deduce |E| ≤ |F| and h = jE,F also involves the use of the well-ordering theorem which is a difficult consequence of (equivalent to) the axiom of choice, out of scope at this stage.

So, the above essentially describes the cardinality concept as forming the unique meta-{0,S}-isomorphism from the class of finite cardinals, to the class of natural numbers conceived as synonymity classes of ground {0,S}-terms. Its inverse is explicitly given by the concept, already introduced in 2.3, of the family of sets (Vn)nD defined for any fixed ground {0,S}-draft D with its strict order < (transitive closure of SD), as

nD, Vn = <(n)

It is an {0,S}-embedding from D to ℘(D) (in particular, the term with root n is embedded into ℘(Vn)). Therefore by uniqueness of interpretations,

nD, |Vn| ≃ n.

Applying this to D = ME, shows that any finite AE is equinumerous with V|A|E. So, any finite A is in bijection with Vn where n = |A| in ℕ under the axiom of infinity, or anyway in some ground functional {0,S}-draft D serving as a range of natural numbers.

Under the axiom of infinity, the set of all finite subsets of ℕ is

F = ⋃n∈ℕ ℘(Vn)

The proof is easy, using F ∈ Sub{0,1,+}℘(E).
On the other hand in finite set theory, ℕ is only indirectly handled as seen above without being a set. Through such encoding, the range of all its subsets available as sets, matches the above definition of F. This still fills the role of ℕ, since the induction axiom (Ind) from 4.4 is actually equivalent to the following restricted version (assuming some basic properties of the order):

AF, ¬(0∈A ∧ ∀nA, SnA))

(Another short equivalent form of (Ind) is ∀AF, ∃n∈ℕ\A, VnA.)

Yet, this expression of arithmetic in finite set theory does not ensure all axioms of first-order arithmetic. Indeed, the schema of induction (obtained from (Ind) by second-order universal elimination) is normally meant to include the cases of subsets defined by formulas with quantifiers in ℕ, but these become open quantifiers when ℕ is not admitted as a set. To recognize the subsets so defined as sets in set theory, is the point of the axiom schema of specification, which we only introduced in 1.A without using it yet.

Cancellativity in finite monoids

Any left cancellative element of a finite monoid (M, e, •) is invertible. Thus, any finite left cancellative monoid is a group.

Proof: if xM is left cancellative, the transformation •(x) is injective in a finite set, thus also surjective. Being both left cancellative and right invertible (a monic retraction), x is invertible.∎

Free commutative monoids

After the concept of free monoid introduced in 4.3 and 4.4, let us introduce the concept of free commutative monoid, defined as a commutative monoid with some basis X in the category of commutative monoids. This is another use of the general concept of free modules, differing from that of free monoids by expanding the axioms list (beyond the axioms of monoid), which may be formalized by adding an element in the structure of an equational system representing these axioms. Then, a free commutative monoid with basis X, is the quotient of the free monoid M = ⋃n∈ℕ Xn with basis X, by the congruence generated by, so to speak, the interpretation of the commutativity axiom in M.
This congruence coincides with the equivalence relation

R = {(u,u⚬σ) | uM ∧ σ∈𝔖l(u)}

where 𝔖n is the symmetric group of Vn.

Indeed, it is easy to check that R is a congruence containing commutativity, i.e. M/R is a commutative monoid.
Less trivial is to check the converse: that it has basis X in the category of commutative monoids. In other words, for any commutative monoid (E,e,•) and f∈Mor(M,E), let us prove by induction on n∈ℕ that

uXn, ∀σ∈𝔖n, f(u) = f(u⚬σ)

For n=0 it is trivial. Assume it for a given n.
Any uXn+1 can be written u = vx for some vXn and xX. Then

∀σ∈𝔖n+1, ∃a,bM, u⚬σ = axb ∧ (ab)Rv
f(u) = f(v)•f(x) = f(ab)•f(x) = f(a)•(f(b)•f(x)) = f(a)•(f(x)•f(b)) = f(u⚬σ) ∎

The quotient from term algebras to free monoids formed a simplification on the structure of {e,•}-terms: the axioms of associativity and identity respectively removed the parenthesis and occurrences of the identity element from terms, turning them into strings uXn. Then the quotient MM/R from free monoids to free commutative monoids, reduces this structure further : it removes the dependence on a choice of total order on the domain Vn of the string, reducing this domain to a structureless set of n elements, and seeing two functions from finite sets to X as expressing the same element of M if they match by a bijection between their domains. In other words, M/R as construction of a free commutative monoid, can be naturally replaced by M'/R' where Such uM' can be thought of as a "commutative term" that is a variant of the concept of term, over an implicit "language of commutative monoids" instead of an ordinary algebraic language, that needs no concrete language symbol to occur in these "commutative terms", but where V plays the role of set of occurrences of variable symbols from X. Then, the interpretation of a commutative term u : VX in a commutative monoid (N,0,+) matching a given v : XN, is the unique f∈ Mor{0,+}(℘(V),N) such that ∀iV, f({i}) = vu. Its end (root) value (previously called the long composite) is called the sum of the family vu, and written

∑(vu) = f(V)

But if the target commutative monoid is named and denoted multiplicatively as (N,1,⋅) then the result is called the product of the family vu, and written ∏(vu).

Another construction of free commutative monoid with basis X with no more stuff meant to be quotiented away, consists in the submonoid ℕ(X) of the product monoid ℕX, made of all h∈ℕX such that {xX | h(x) ≠ 0} is finite. Such h is said to have finite support.
The natural morphism (quotient) M' ↠ ℕ(X) is given by defining the image h∈ℕ(X) of uM' by

h = (Xx ↦ |u(x)|).

Its role by its morphisms to commutative monoids N, is expressible by the same operator (binder) ∑ but with domain N(X) instead of NV:

∑(vu) = ∑xX v(x)⋅h(x)
∏(vu) = ∏xX v(x)h(x)

More generally, any coproduct K = ∐iI Ei in the category of commutative monoids, is canonically isomorphic to the submonoid P = ∏(iI) Ei of ∏iI Ei made of families with finite support. Expressing each monoid by the formalism of its definition, this canonical isomorphism is naturally written (∑iI) ∈ Mor (P,K), whose inverse δ ∈ Mor (K,P) is (a generalization of) the Kronecker delta symbol, namely a two-indexed (i,kI) family of morphisms, defined using the conditional operator (2.4) as

iI, δik = (i=k ? IdEi : 0ik) ∈ Mor (Ei,Ek)

where 0ik is the constant morphism giving 0 everywhere (the 0 of the product monoid EkEi).

This can be seen as a construction of the coproduct of the Ei as a role given to P, by letting the ji take value in P by composition with δ

iI, ji = ⊓kI δik ∈ Mor (Ei,P)

while any morphism g ∈ Mor (P,N) to any commutative monoid N, can be written as a coproduct of a family of morphisms fi ∈ Mor (Ei,N), composed with ∑iI

xP, g(x) = ∑iI fi(xi)

Maybe the least trivial (still easy) step here, is to check that ∑iI is indeed a {+}-morphism.

Numbers of injections

For any finite sets X, Y, the set Inj(X,Y) of injections from X to Y is finite (subset of the finite set YX), and its cardinal only depends on those of X and Y. Let us denote

P(|Y|,|X|) = |Inj(X,Y)|

Then obviously ∀n∈ℕ,

P(n,0) = 1
P(n,1) = n
k∈ℕ, n<kP(n,k) = 0

Now let AX, B = X\A and πA : Inj(X,Y) → Inj(A,Y) defined by ff|A. Then

g∈ Inj(A,Y), πA(g) ≈ Inj(B, Y\Im g)

by the bijection ff|B. Hence with |A|=i=|Im g|, |B|= k and |Y|= n+i,

n,k,i ∈ℕ, P(n+i, k+i) = P(n,k)⋅P(n+i, i)

In particular,

P(n+1, k+1) = P(n,k)⋅(n+1)
P(n+i, i+1) = nP(n+i, i)

The last one can also be written

n,k ∈ℕ, P(n,k+1) = (nk)⋅P(n,k)

by extension to n<kP(n,k+1)= 0 = P(n,k).
Either formula, together with P(n,0) = 1 and the 0 value for n<k, forms a recursive definition of P(n,k). Either way, this implies coincidence with the definition of the falling factorial (among other possible notations and names) on a larger domain

n∈ℤ,∀k∈ℕ, P(n,k) = ∏i<k (ni)

where i<k abbreviates iVk.

The above formulas keep validity for negative n except that P(n,k) ≠ 0. Namely, it is conveniently expressed there by the rising factorial

n∈ℤ,∀k∈ℕ, P+(n,k) = ∏i<k (n+i) = P(n+k−1,k)
P(-n,k) = P+(n,k)⋅(-1)k

The factorial comes as a particular case :

n∈ℕ, n! = |𝔖n| = P(n,n) = P+(1,n) >0

The falling and rising factorials can be expressed from it : ∀n,k ∈ℕ,

n!⋅P+(n+1, k) = n!⋅P(n+k,k) = (n+k)!

Numbers of combinations

For any set X and any k∈ℕ, a k-combination of X is a subset AX such that |A| = k. The number of k-combinations of Vn (and thus of any set of n elements) is denoted C(n,k). This easily implies by Inj(Vk,Vn)∋f ↦ Im f, the following formula which we extend as a definition of C on a larger domain:

n∈ℤ,∀k∈ℕ, k!⋅C(n,k) = P(n,k)

Indeed P(n,k) is still a multiple of k! for negative n, as it repeats its values from positive n for the same k. Namely, let us similarly define ∀n∈ℤ,∀k∈ℕ,

C+(n,k) = C(n+k−1,k)
C(-n,k) = C+(n,k)⋅(-1)k

including C+(0,0) = C(0,0) = 1 from the latter formula.

As 0! = 1! = 1 we generally have C(n,0) = C+(n,0) = 1 and C(n,1) = n.
As A ↦ ∁X A defines a bijective correspondence between k-combinations and n-combinations of Vn+k,

n,k∈ℕ, C(n+k,k) = C(n+k,n) = C+(n+1,k) = C+(k+1,n)

The first equality is more usually written knC(n,k) = C(n,nk).

Given n=|X| and k,l∈ℕ, the number of possible data of disjoint A,BX with |A| = k and |B| = l, can be expressed in 3 ways according to which of A, B or AB is chosen first. This gives the following double equality (its validity for all n∈ℤ follows either by relating cases of negative n to those of positive n, or by deducing the general formula from numerical definitions after multiplication by k!⋅l!) :

n∈ℤ,∀k,l∈ℕ, C(nk,l)⋅C(n,k) = C(nl,k)⋅C(n,l) = C(k+l,l)⋅C(n,k+l)

In particular when l = 1

n∈ℤ,∀k∈ℕ, (nk)⋅C(n,k) = nC(n−1,k) = (k+1)⋅C(n,k+1)

From there follows

n∈ℤ,∀k∈ℕ*, C(n,k) = C(n−1,k−1) + C(n−1,k)

Proof : kC(n−1,k) = (nk)⋅C(n−1,k−1) = k⋅(C(n,k) − C(n−1,k−1)) ∎
This proof has 2 variants, each failing to cover some cases due to multiplication by zero:

nC(n−1,k) = (nk)⋅C(n,k) = n⋅(C(n,k) − C(n−1,k−1))
(nk)⋅C(n−1,k) = nC(n−1,k) − kC(n−1,k) = (nk)⋅(C(n,k) − C(n−1,k−1))

The form of these versions together suggest an idea of possible failure on n=k=0 for an extension of C(n,k) to negative k. Two natural extensions may be considered. The one giving 0 for all negative k, preserves C(n,k) = C(n−1,k−1) + C(n−1,k) but breaks C(n,k) = C(n,nk). Another, preserves the latter but breaks the former. We shall not use either.
There is of course the simple proof for n,k∈ℕ*, using that a k-combination of Vn is either a k-combination of Vn−1, or ({n−1} ∪ a k−1-combination of Vn−1).

The number of strictly monotone g:VkVn is C(n,k).
Proof : g bijectively matches the k-combination Im g of Vn. ∎

The number of monotone f:VkVn is C+(n,k).
Proof : f bijectively matches strictly monotone g:VkVn+k−1 by g(x) = f(x) + x. ∎

Monotone f:VkVn+1, monotone g:VnVk+1, and k-combinations A of Vn+k, are so bijectively related by

x<k, ∀y<n, f(x) ≤ yx < g(y) ⇔ x < |AVx+y+1|

The number of h:Vn → ℕ such that ∑h = k is C+(n,k).
Proof : h bijectively matches monotone f:VkVn by ∀x<n, h(x) = |f(x)|. ∎

The number of h:Vn → ℕ such that ∑hk is C+(n+1,k).
Proof : they are bijectively h = h'|Vn for h':Vn+1 → ℕ such that ∑h' = k. ∎

The number of h:Vk → ℕ such that ∑h < n is C+(n,k).
Proof : h bijectively matches monotone f:VkVn by ∀x<k, f(x) = ∑ix h(i). ∎

Related wikipedia pages:


Back to homepage : Set theory and foundations of mathematics
1. First foundations of mathematics
2. Set theory
3. Algebra 1
4. Arithmetic and first-order foundations
4.1. Algebraic terms
4.2. Quotient systems
4.3. Term algebras
4.4. Integers and recursion
4.5. Presburger Arithmetic
4.6. Finiteness
4.7. Countability and completeness
4.8. More recursion tools
4.9. Non-standard models of Arithmetic
4.10. Developing theories : definitions
4.11. Constructions
4.A. The Berry paradox
5. Second-order foundations
6. Foundations of Geometry