Proof 1 The idea of the proof is essentially due to Zermelo and is to prove the following weak form of
Zorn's lemma, from the
axiom of choice. (Zorn's lemma itself also follows from this weak form.) The maximal principle follows from the above since the set of all chains in P satisfies the above conditions. By the axiom of choice, we have a function f : \mathfrak{P}(P) - \{ \emptyset \} \to P such that f(S) \in S for the power set \mathfrak{P}(P) of P. For each C \in F, let C^* be the set of all x \in P - C such that C \cup \{ x \} is in F. If C^* = \emptyset, then let \widetilde{C} = C. Otherwise, let :\widetilde{C} = C \cup \{ f(C^*) \}. Note C is a maximal element if and only if \widetilde{C} = C. Thus, we are done if we can find a C such that \widetilde{C} = C. Fix a C_0 in F. We call a subset T \subset F a
tower (over C_0) if • C_0 is in T. • The union of each totally ordered subset T' \subset T is in T, where "totally ordered" is with respect to set inclusion. • For each C in T, \widetilde{C} is in T. There exists at least one tower; indeed, F itself is a tower. Let T_0 be the intersection of all towers, which is again a tower. Now, we shall show T_0 is totally ordered. We say a set C is
comparable in T_0 if for each A in T_0, either A \subset C or C \subset A. Let \Gamma be the set of all sets in T_0 that are comparable in T_0. We claim \Gamma is a tower. The conditions 1. and 2. are straightforward to check. For 3., let C in \Gamma be given and then let U be the set of all A in T_0 such that either A \subset C or \widetilde{C} \subset A. We claim U is a tower. The conditions 1. and 2. are again straightforward to check. For 3., let A be in U. If A \subset C, then since C is comparable in T_0, either \widetilde{A} \subset C or C \subset \widetilde{A} . In the first case, \widetilde{A} is in U. In the second case, we have A \subset C \subset \widetilde{A}, which implies either A = C or C = \widetilde{A}. (This is the moment we needed to collapse a set to an element by the axiom of choice to define \widetilde{A}.) Either way, we have \widetilde{A} is in U. Similarly, if C \subset A, we see \widetilde{A} is in U. Hence, U is a tower. Now, since U \subset T_0 and T_0 is the intersection of all towers, U = T_0, which implies \widetilde{C} is comparable in T_0; i.e., is in \Gamma. This completes the proof of the claim that \Gamma is a tower. Finally, since \Gamma is a tower contained in T_0, we have T_0 = \Gamma, which means T_0 is totally ordered. Let C be the union of T_0. By 2., C is in T_0 and then by 3., \widetilde C is in T_0. Since C is the union of T_0, \widetilde C \subset C and thus \widetilde C = C. \square The existence of a C in F such that \widetilde{C} = C is also immediately given by the
Bourbaki–Witt theorem (cf. ), which says that F \to F, \, C \mapsto \widetilde{C} has a fixed point. On the other hand, the above proof actually establishes the following as a special case, which is somewhat of independent interest. Indeed, if such g exists, we define \widetilde{C} = C \cup \{ g(C) \} and the above C gives a contradiction as C = \widetilde{C}. This lemma in turns immediately implies the Bourbaki–Witt theorem; see Bourbaki–Witt theorem#Proof 3. For an alternative proof (of Bourbaki–Witt), see also the proof of Theorem 3.1. (Bourbaki-Witt) at https://ncatlab.org/nlab/show/Zorn's+lemma What we call a tower here is the same as an s-inductive set there. That proof is exactly the same as the proof of Zorn's lemma in Lang's
Algebra.
Proof 2 The
Bourbaki–Witt theorem states :Let X be a nonempty poset in which each chain has a least upper bound (i,e.,
supremum). Then each function f : X \to X such that f(x) \ge x for every x in X has a fixed point. The above theorem itself does not rely on the
axiom of choice. However, together with the axiom of choice, it can be used to prove the Hausdorff maximal principle as follows. Take X to be the set of all chains in a poset P, which itself is a poset with respect to set inclusion. It is nonempty since it includes the empty set. Also, the union of a chain \mathcal{C} \subset X is a least upper bound: it is clearly an upper bound and if U is an upper bound of \mathcal{C}, then \cup \mathcal{C} \subset U. Now, for each chain C in P, let C^* be the set of all strict upper bounds of C. Let :g : \mathfrak{P}(P) - \{ \emptyset \} \to P be a
choice function whose existence is ensured by the axiom of choice and then define f : X \to X by :f(C)\mathrel{\mathop:}=\begin{cases}C, &\text{if}\ C\ \text{is maximal}\\ C\cup \{g(C^*)\}, &\text{if}\ C\ \text{is not maximal}\end{cases} By the Bourbaki-Witt theorem, there exists an element C in X such that f(C) = C and this C is a maximal chain in P. \square
Proof from the well-ordering theorem Let P' be the set of all chains in P. By the
well-ordering theorem, we find a well-ordering \preceq on P. We shall construct the function :f : P \to P' recursively with respect to \preceq as follows. For an element x in P, suppose we are given an arbitrary function :g : \{ y \in P \mid y \prec x \} \to P'. Then let :f(x) = (\cup \operatorname{im}(g)) \cup \{ x \} if the set on the right is totally ordered; i.e., an element of P' and f(x) = \emptyset otherwise. If g itself is also required to satisfy the above recursive condition, then the
transfinite recursion theorem ensures this defines the function f uniquely (in a nutshell, if x is a \preceq-least element, the domain of g above is the empty set and thus f(x) is uniquely determined and in general, the recursion condition ensures the unique g is used to define f(x).) We note • The image of f is totally ordered, with respect to set inclusion. • If D is a chain containing \cup \{ f(y) \mid y \prec x, \, x \in D \}, then x \in f(x) for each x \in D. Indeed, (1) holds since if y \prec x, then :f(x) = (\cup \{ f(y) \mid y \prec x \}) \cup \{ x \} which contains f(y) as a subset. (2) holds since :f(\{ y \mid y \prec x \}) \cup \{ x \} \subset D is a chain as it is a subset of a chain. Finally, by (1), the union C of the image of f is a chain and it is maximal by (2), since if D \supset C is another chain, then x \in f(x)\subset C for each x \in D. \square ==Notes==