Infinity and the axiom of choice

5.5. The Undecidability of AC

In 1938, Gödel proved the relative consistency of AC by showing that any universe includes one where AC is true (with the same ∈ but a different powerset). It is defined as the class L of "constructible" sets, obtained by successively adding sets definable from previously defined ones, in such a way that it forms a universe. There, each nonempty set is only made of "constructed" objects, thus has a "first constructed element" that can be used as a choice. Any exception to AC in the external universe, in the form of a family of sets (A_i) with empty product, is absent from L because for some i, the whole set A_i is disjoint from L (it has no constructed element).

In 1966, Paul Cohen proved the relative unprovability of AC. The proof is even harder, so we cannot sketch it here. Let us only explain how, from a constructible universe, it is possible to produce different ones. Since Cantor's theorem holds in L while each construction step of "adding all definable sets" to a given countable part of universe can only add countably many sets, an uncountable infinity of steps is needed to fill uncountable sets; therefore, sub-universes with a non-standard countable model of this "uncountable" hierarchy of construction times, will interpret "constructibility" differently, regarding as "unconstructible" some objects whose time of construction is not represented in this model.

Well-ordering and the axiom of choice

We shall show later that the axiom of choice implies the existence of a well-ordering on all sets. This is a bit difficult to prove but the converse is simple:

If all sets can be well-ordered then the axiom of choice can be deduced in the following way. If ∀x∈E, ∃y∈F, xRy, then with a well-ordering on F we can define f from E to F by f(x) = the smallest y such that xRy.

Thus without the axiom of choice we cannot assume that ZF-ordinals which are cardinals represent all cardinals.

Axiom of dependent choice (DC)

The axiom of dependent choice is expressible by the following equivalent formulas over every nonempty set E

∀R⊂ℕ×E², (∀n∈ℕ, ∀x∈E, ∃y∈E, R(n,x,y)) ⇒ ∀a∈E, ∃u∈E^ℕ, u₀=a ∧ ∀n∈ℕ, R(n,u_n,u_n+1).
∀R⊂E², Dom R = E ⇒ ∀a∈E, ∃u ∈ E^ℕ, u₀=a ∧ ∀n∈ℕ, u_n R u_n+1.
∀R⊂E², Dom R = E ⇒ ∃u ∈ E^ℕ, ∀n∈ℕ, u_n R u_n+1.
Any relation not well-founded has a descending sequence (while the converse was already seen)

3. ⇒ 1. can be proven in the above context with H, by taking the set defined like the above graph, {(n,u_n) | (n,u)∈ℕ×H, u₀=a ∧ ∀i<n, R(i,u_i,u_i+1)} with the relation between (n,x) and (p,y) defined as p=n+1 ∧ R(n,x,y)
Being not well founded, means ∃A⊂X, A≠∅ ∧ Im(R∩(A×A))=A.
3. ⇒ 4. as any descending sequence of R∩(A×A) in A is a descending sequence of R in E
4. ⇒ 3. with A=E∎
DC implies AC_ℕ (as clear on 1.), with no converse.
DC as expressed by 2. can be deduced from AC_E that narrows R to a function (∃f∈ E^E, ∀x∈E, xRf(x)) which gives such a u by recursion.

Sometimes DC needs to be expressed in the generalized form ∀n∈ℕ, (u_k)_k<n R u_n. It can be obtained from the above version by replacing E by the set of finite sequences in E. This substitution would affect the first deduction of DC from AC: the change of Dom R no more lets AC_E suffice as a justification. Still it keeps the second justification, from the choice function on Im ⃗R which remains a set of nonempty subsets of E.
DC is equivalent to a form of the Lowenheim Skolem theorem, stating that any system with countable language has a countable elementary subsystem. To deduce this statement from DC, the elementary countable subsystem comes as the union of a sequence of countable subsystems, each chosen to satisfy the truth of formulas with parameters in the previous one. The converse is quite easy. Reference : paper by Asaf Karagila - message and paper by Christian Espindola.
In practice, DC, or even AC_ℕ, may resolve most practical mathematical questions which depend on AC. The full AC only becomes important for higher, more abstract questions of set theory beyond ordinary purposes. It is then usually accepted as true for the questions that depend on it, as it intuitively feels more true and usually leads to rather uniform and effective consequences. Any statement of its falsity would beg for details about which kind of exceptions may it have, so as to reduce the margin of undecidability which occurs in its absence. But as (unlike the powerset) the axiom of choice is unnecessary for the core (vital) constructions at the foundation of mathematics, we shall generally do without it in this work.

Injections of ℕ into infinite sets

The existence of an injection from ℕ to E (equivalent to that of a non-surjective injection from E to E) implies that E is infinite. But the converse involves AC either way:

DC gives the existence of a sequence such that ∀n∈ℕ, u_n ∉ {u_k}_k<n (by a choice function on the set of complements of all finite subsets of E);
AC_ℕ suffices: if there exists a sequence t where each t_n is an injection from V_n+1 to E, then there exists an injective sequence u such that
u_n = t_n(min{i|∀k<n, t_n(i) ≠ u_k}).

Counter-examples to the axiom of choice

AC is valid for any family of sets (A_i) having a choice tool over "this kind of" sets A_i (that is a fixed formula which from A_i can pick one of its elements), for every i except possibly a finite set of them (for which the finite choice theorem applies). This limits the possible counterexamples to AC to the rest of cases, that is families of sets where an infinity of them are without choice tool. The simplest sets with no choice tool, are mainly

Sets of several pure elements (these do not exist in ZF where all elements are sets)
Infinite subsets of ℘(ℕ) (precisely, infinite sets of unconstructible subsets of ℕ).
Subsets of ℘( ℘(ℕ)), more precisely sets of such infinite subsets of ℘(ℕ).

Indeed, in a finite set of subsets of ℕ, you may choose "the smallest one" for a total order roughly defined as the order between real numbers between 0 and 1, seeing subsets of ℕ as binary expressions of these numbers; but an infinite set of them needs not contain any smallest one for this order: if an infinity of them contain 0 while an infinity don't, you can select those which don't, then continue choosing between those which contain 1 and those which don't, but after an infinity of such exclusion steps you may end up with nothing at all.
Exceptions to AC_ℕ may appear:

By the existence of an infinite set into which ℕ has no injection (a universe with such an infinite set of pure elements is easy to make; I am not sure of this possibility in a universe of ZF, which has no pure element).
in the form of a formula F(x,y) with variables x∈ℕ and y∈ ℘(ℕ) such that ∀x∈ℕ, ∃y∈ ℘(ℕ), F(x,y), where the formula F has at least 3 quantifiers (according to this reference, at least for second-order arithmetic, I don't know otherwise)
℘(ℕ) may be a countable union of countable sets ! while AC_ℕ implies that any countable union of countable sets is countable, which ℘(ℕ) cannot be.

AC vs measurability

Here is an important example of negation of AC while DC is accepted. The Lebesgue measure of the sets of reals is a way to giving each set its "length" (a real number or infinity) with 2 properties: countable additivity (the measure of a countable union of pairwise disjoint sets is the sum of their measures) and translation invariance. Specialists found the relative consistency of set theory with DC and the statement that all sets of reals are Lebesgue measurable. But this statement contradicts AC for the following reason. AC gives the existence of a choice function of the partition of ℘(ℕ) defined by the equivalence relation of finiteness of difference (A,B ↦ (A∆B is finite)), which can be seen in [0,1] as the equivalence relation of differing by a dyadic rational number. (This use of AC is illustrated by this video in French).
But the image of this function, that is a set made of one representative per equivalence class, cannot be Lebesgue measurable (a candidate measure can neither be zero nor non-zero to fit both properties of Lebesgue measurability).
Actually, it is a very similar use of AC to this one that is used in the construction of the Banach Tarski paradox, where the non-mesurability of sets is just more spectacular.

Set theory and foundations of mathematics
1. First foundations of mathematics
2. Set theory (continued)
3. Algebra 1
4. Arithmetic and first-order foundations
5. Second-order foundations

5.1. Second-order structures and invariants
5.2. Second-order logic
5.3. Well-foundedness
5.4. Ordinals and cardinals
5.5. Undecidability of the axiom of choice
5.6. Second-order arithmetic
5.7. The Incompleteness Theorem

More philosophical notes :

Gödelian arguments against mechanism : what was wrong and how to do instead
Philosophical proof of consistency of the Zermelo-Fraenkel axiomatic system