This article was translated from Chinese by ChatGPT and may contain some errors.
This article is not a strict academic paper and may contain some errors and omissions. Please forgive me.
❖Introduction
Kurt Gödel’s several famous theorems form the cornerstone of modern logic. Among them, Gödel’s (first and second) Incompleteness Theorems are widely acclaimed.
For a long time, many people, including the author, have found the proof of Gödel’s Incompleteness Theorem to be very cumbersome, with very complex technical details, so it is quite difficult to learn. At the ∞-type Café Summer School in 2023, teacher Ocua gave a very elegant and concise comprehensive lecture on the proof of Gödel’s Incompleteness Theorem. This lecture made me realize that Gödel’s Incompleteness Theorem is actually closely related to computability, or recursion theory.
In this article, I will combine my understanding of Gödel’s Incompleteness Theorem to introduce a simplest (perhaps not strict) proof of Gödel’s Incompleteness Theorem, trying to make ordinary programmers understand it after learning simple first-order logic.
❖First-Order Logic
First-order logic (First order logic, FOL) is the logical system discussed in this article. First-order logic has syntax and semantics. If the reader has not learned first-order logic in school, then you can understand it this way:
- First-order logic is a language, similar to our common programming languages, it has a set of formalized syntax (Syntax)
- The
sentences
of first-order logic can be seen as a syntax tree (Abstract Syntax Tree), or an algebraic data type (ADT) - This syntax tree has two layers, one is
proposition
(Statement), and the other isexpression
(Expression) - The expression layer includes variables, functions (calls), etc., , , are all expressions
- The proposition layer is composed of expressions, and expressions are combined with equal signs or relations to form propositions, such as ,
- Propositions can be connected with logical connectors to express specific meanings, such as is also a proposition. Such logical connectors also have , ,
- The free variables in the proposition (free variable) are bound with quantifiers to get new propositions. For example,
- The
- This language of first-order logic needs to specify its semantics, which is implemented through the model (Model). The related content is more complicated and will not be discussed for the time being.
- First-order logic has many proof systems, such as Hilbert System, these proof systems are independent of semantics. Simply put, a proof system can give a proposition like 1 through
calculation
, where represents a set of propositions, that is, a bunch of propositions, and represents the proposition to be proved. - When we use first-order logic, we use first-order theory (Theory). A first-order theory consists of a symbol table and a set of axioms . The symbol table specifies all the symbols that may appear in the proposition, including constants, function symbols, and relation symbols; the axiom set specifies the self-evident propositions in this theory, they are the starting point of the proof system2. For example, the famous Peano arithmetic number can be used as a first-order theory, its symbol table has:
- Constant symbol
- Function symbol
- Function symbol
- Function symbol (representing successor)
- Relation symbol
Our introduction to first-order logic ends here. First-order logic has very rich mathematical properties, which are the main discussion objects of mathematical logic courses. If the reader feels uncomfortable during the reading process, you might as well find a reliable mathematical logic book, such as [1, 2], etc., to learn some related knowledge.
❖Gödel’s Incompleteness Theorem
Completeness
is a property of a first-order theory. A first-order theory is complete, which means:
For any proposition constructed from the symbol set of this theory, either or
In other words, any proposition in this theory can either be proved or refuted.
Notice that this is a purely syntactic concept, and it seems to require specifying a particular proof system. However, various first-order logic proof systems, if not specially constructed, are equivalent in proof power. And because of the existence of Gödel’s completeness theorem, if and only if , so this is actually also a semantic concept. In this way, we generally do not need to specify a proof system. In the formalization of Gödel’s incompleteness theorem [3], such specification may be necessary.
For a first-order theory, we naturally hope that it is complete. For example, in Peano arithmetic, the following intuitive propositions can be proved:
And correspondingly, propositions that are not intuitively valid can be refuted.
If the first-order theory is simple enough, incompleteness is easy to achieve. For example, consider a first-order theory with a symbol table of and axioms of , it is obvious that:
Intuitively, this is because the axiom set of the theory is too small. Adding an axiom (often called expansion
, extension) to can make a complete theory.
For any theory , if can neither be proved nor refuted in this theory (called is independent of ), then adding or to the axioms of can eliminate this incompleteness.
This seems to reveal a beautiful picture: we can build all mathematics on some complete first-order theory, in this ultimate theory, any proposition can either be proved or refuted. In this way, the vague and chaotic mathematical language can be completely replaced by clear and neat first-order logic…
Disappointingly, in 1931, Gödel proved in his published paper Über formal unentscheidbare Sätze der Principia Mathematica und verwandter Systeme I
[13] that as long as a formal system includes elementary arithmetic3, there must be propositions that can neither be proved nor disproved. Note that adding axioms is of no use, because after adding axioms, the antecedent of the incompleteness theorem still holds, and the consequent can still be derived. 4
This is equivalent to sentencing the beautiful picture we imagined above to death. Of course, we are programmers, not mathematicians, what does this have to do with us?
❖Decidability
After Gödel published his paper in 1931, Church and Turing respectively proposed another major issue about first-order logic–the negative proof of decidability in 1936-1937.
As mentioned earlier, the propositions of first-order logic can be seen as a syntax tree (or string), and the first-order theory is a symbol table and a set of axioms. Under certain conditions (such as the recursive axiomatization to be discussed below), these can all be seen as data in the program.
Obviously we will ask: Is there a program that determines whether a proposition can be proved in ?
The answer to this question is negative: there is no such program. We will leave the proof method to the next section. This fact is a bit counter-intuitive, because the proof in an axiom system, theoretically speaking, can be enumerated.
How to enumerate? Consider Peano arithmetic and the Hilbert system. In the axioms of Peano arithmetic, the induction axiom is infinite, but it has a fixed pattern:
What needs to be enumerated here are and . goes without saying, is all predicates, its syntax is also determined, and can be generated by fixed rules. However, writing a gen
function directly is quite complicated. We can borrow Gödel’s method here.
First of all, the symbol set of Peano arithmetic is infinite, but we can divide it into two areas:
- Ten logical symbols (after simplification)
- Infinite variable symbols (denoted as , , , …)
By encoding the symbols of these two areas in order, we can give encoding and decoding functions. The encoding function maps symbols to natural numbers, and the decoding function maps natural numbers to symbols.
1# put '∀' to 1 , instead of 0
2LOGIC_SYM = [ 'dummy', '∀', '0', 'S', '+', '×', '(', ')', '¬', '→', '=' ]
3
4def decode(n):
5 if n <= len(LOGIC_SYM):
6 return LOGIC_SYM[n]
7 else:
8 # append '$' to variable head
9 return f"$v_{n - len(LOGIC_SYM)}"
10
11def encode(sym):
12 if sym[0] == '$':
13 # is variable, start with '$'
14 index = sym[3:]
15 return int(index) + len(LOGIC_SYM)
16 elif sym in LOGIC_SYM and sym != 'dummy':
17 return LOGIN_SYM.index(sym)
18 else:
19 # error
20 return -1
Similarly, a proposition (that is, a string, a list of characters) can also be encoded and decoded. The technique used by Gödel is as follows:
Here, represents the encode(aᵢ)
we just defined. And represents the th prime number
1from sympy import nextprime, Pow, factorint
2
3def encode_str(stmt):
4 pᵢ = 2
5 res = 1
6 for aᵢ in stmt:
7 res *= Pow(pᵢ, encode(aᵢ))
8 pᵢ = nextprime(pᵢ)
9 return res
10
11def decode_str(n):
12 """
13 n must be of form `Πᵢ pᵢ^{aᵢ}, aᵢ ≠ 0`, or this function is undefined
14 """
15 res = []
16 factors = factorint(n)
17 # factors is like { pᵢ : aᵢ }
18 for pᵢ, aᵢ in sorted(factors.items()):
19 res.append(decode(aᵢ))
20 return res
The range of decode_str
includes all strings. In this way, the traversal of strings can be implemented by the traversal of natural numbers.
A string may not be a valid proposition. Therefore, a function is needed to judge whether a string is a valid proposition. As long as we have this function, we can enumerate all propositions:
1def is_valid_stmt(stmt):
2 # this function is hard to write
3 pass
4
5def gen_all_stmts():
6 n = 0
7 while True:
8 stmt = decode_str(n)
9 if is_valid_stmt(stmt):
10 # do something here
11 n += 1
The function is_valid_stmt
is similar to the parser of a programming language. Although it is difficult to write, it can definitely be written.
Similarly, we can enumerate all proofs (that is, a list of propositions), and then construct a function to judge whether a proposition can be proved:
1def is_valid_proof(stmt_l):
2 pass
3
4def encode_proof(proof):
5 # like encode_str()
6 pass
7
8def decode_proof(n):
9 pass
10
11def can_prove(ψ):
12 n = 0
13 while True:
14 stmt_l = decode_proof(n)
15 if is_valid_proof(stmt_l):
16 stmt_last = stmt_l[-1]
17 if stmt_last == ψ:
18 return True
19 n += 1
For the program can_prove
, the following observations are made:
- If the last one in the list is the to be proved, then obviously, is established.
- Our enumeration traverses all proofs, so if , that is, there is a proof that meets the requirements, then that proof will definitely be found.
- If , this algorithm will not stop at all, it will keep looping.
This is very similar to the situation of the halting problem, which is called semi-decidability
in computability theory. Semi-decidability is the core concept used in our proof below, and it is worth discussing carefully.
A set5 is semi-decidable, if and only if there exists a program , such that for all , .
The key to semi-decidability is that it does not discuss whether the program halts when . Although if , that is, when the program outputs , according to if and only if, . But if the input , whether halts is not mentioned in the definition. Therefore, a decidable set must be a semi-decidable set, but a semi-decidable set is not necessarily a decidable set.
The halting problem is semi-decidable, not decidable
The proof is simple, the program that exists for semi-decidability is constructed as follows:
1def p(prog, x):
2 exec(prog, x)
3 return True
If prog(x)
halts, then the interpreter exec(prog, x)
also halts.
Semi-decidability has several equivalent definitions:
- Recursively enumerable (r.e.). A set is recursively enumerable, if and only if or there exists a recursive total function , such that , that is, is the range of .
- The partial characteristic function of is partially recursive.
- is the range of some partial recursive function.
What is a partial function
? Simply put, it is a function that is not defined on some inputs. A program that does not halt on some inputs, its image
, that is, is a partial function.
These equivalences need to be proven, you can refer to [1, 5].
The construction of is_valid_stmt
and is_valid_proof
requires that there exists an algorithm to determine whether a string is in the axiom set. This puts requirements on the axiom set. In fact, in the conditions of Gödel’s incompleteness theorem, there is a requirement that the axiom set needs to be a recursive set, which is equivalent to being decidable, and it adds the restriction that it must halt on any input to the requirement of semi-decidability.
❖Proof of Gödel’s Incompleteness Theorem
We already know that, at least for Peano arithmetic , the problem of is semi-decidable. Based on this result and a little programming trick, we can immediately get Gödel’s incompleteness theorem.
First, assume that is complete, that is, for any , or . Let the semi-decider of be , a decider of can be constructed. The construction is as follows:
1from multiprocessing.pool import ThreadPool
2
3def can_prove_input(ψ):
4 can_prove(ψ)
5 return ψ
6
7def p_total(ψ):
8 pool = ThreadPool(processes=2)
9 result = pool.apply_async(can_prove_input, (ψ, ¬ψ))
10 which = result.get()
11 if which == ψ:
12 return True
13 else:
14 return False
Unexpectedly, the key to this program is concurrency! Another name for concurrency is non-deterministic computation
. In fact, I just want to simulate non-deterministic computation with this method. (That is, simultaneously
calculate can_prove_input(ψ)
and can_prove_input(¬ψ)
)
Analyze carefully, consider completeness, and discuss it in different cases:
- If , then
can_prove_input(ψ)
will halt,result.get()
returnsψ
, so the entire function returnsTrue
. Note that at this timecan_prove_input(¬ψ)
will not halt, soresult.get()
will definitely returnψ
. - If , then
can_prove_input(¬ψ)
will halt,result.get()
returns¬ψ
, the entire function returnsFalse
. According to the consistency of , at this time.
Therefore, is a decider of . In this way, is decidable, which contradicts the undecidable result. Therefore, the assumption is not established, is not complete.
Readers may not be convinced: Isn’t this a trick? Where does Gödel know about multithreading? In fact, non-deterministic Turing machines and deterministic Turing machines are equivalent, and multithreading can also be simulated by single-threading. However, we can still give a more normal
.
1def can_prove_n(ψ, n):
2 n0 = 0
3 while n0 <= n:
4 stmt_l = decode_proof(n0)
5 if is_valid_proof(stmt_l):
6 stmt_last = stmt_l[-1]
7 if stmt_last == ψ:
8 return True
9 n += 1
10 return False
11
12def p_total(ψ):
13 n = 0
14 while True:
15 if can_prove_n(ψ, n):
16 return True
17 elif can_prove_n(¬ψ, n):
18 return False
19
20 n += 1
Here, we have changed the original can_prove(ψ)
to can_prove_n(ψ, n)
, which is whether can be proved within steps. In this way, can_prove_n(ψ, n)
will definitely halt, so it can be directly called in p_total
for sequential calculation.
❖Proof of Undecidability
From the above discussion, we can see:
- Although semi-decidability requires a relatively complex program, it is overall intuitive
- With semi-decidability and undecidability, we can directly obtain Gödel’s incompleteness theorem
The key to the problem is how to obtain the undecidability of . This article will not strictly provide a proof, but will simply introduce the idea of the proof.
❖Using Church’s Results
To prove undecidability, we must introduce a relatively complex concept–representability. Gödel himself does not seem to have directly used this concept [4], but used an equivalent form of it.
Church first proved in his 1936 paper [11] that there exists an undecidable problem (although some scholars are not very satisfied with his proof), this problem is:
Given , does hold?
Here, refers to closed lambda terms
, refers to the β-equivalence relation, roughly speaking, there is a path between and composed of β-reduction and β-abstraction. His article further pointed out that any ω-consistent system cannot construct a decider. [11, 12]
If we want to follow Church’s proof, then, we first need to arithmetize . That is to say, we need to find a relation on integers such that if and only if , where represents encoding. We obviously need to have some good representation, because we want to find a sentence in such that for any
The contains , which refers to the symbol string corresponding to the natural number in the meta-language in , that is,
is a determined, sentence with two free variables. In this way:
- Assuming is decidable, for any ,
- Input , $\psi = \mathsf{\neg P(\sharp a, \sharp b)} $ in the decider ,
- Return
Obviously, this program has decided the problem that Church proved to be undecidable. In fact, if is representable in , then the decision problem of can be reduced to the decidability of .
However, I personally think it is a troublesome job to find .
❖Using Kleene’s results
In Gödel’s 1931 paper, he proved that all primitive recursive functions are representable. Regarding the representability of functions, textbooks [1] and materials [9] have different opinions. Material [9] equates the representability of functions with the representability of relations. We temporarily adopt the latter’s statement6, a function is representable if and only if there exists a , such that
Kleene proved the normal form theorem:
There exists a three-element primitive recursive function and a one-element primitive recursive function , such that any partial recursive function can be represented as:
This theorem seems quite puzzling. In fact, it describes a system similar to an interpreter:
1def U(x):
2 # some calculation
3 pass
4
5def C(e, y, z):
6 # some calculation
7 pass
8
9def μ(f):
10 n = 0
11 while f(n) != 0:
12 n += 1
13 return n
14
15def f(n):
16 return n * 2
17
18def f_another(n):
19 # note: e may not be 10, we just use a fixed value here
20 return U(μ(lambda z: C(10, n, n)))
21
22# ∀ n, f(n) == f_anthor(n)
Here, can be roughly understood as the result after the program (recursive function) is encoded. Through this theorem, Kleene used the diagonal method to prove the following theorem:
Not every partial recursive function is potentially recursive
The so-called potentially recursive
means that for a partial recursive function , there exists a total function such that where is defined. Intuitively, this theorem is similar to not all semi-decidable sets are decidable
.
Now suppose there is a decider for . According to Gödel’s proof, there is a formula that represents . Then, a new super interpreter can be constructed that can find the corresponding total function for all partial recursive functions . Consider any with value , construct as follows:
1def f(n):
2 if p_t(∀ z. ¬C(e₁, n, z, 0)):
3 # g(n) can never halt
4 # return a random value, doesn't matter
5 return 0
6 else:
7 # g(n) will halt
8 # return g(n)
9 return U(μ(lambda z: C(e₁, n, z)))
Kleene’s theorem guarantees that the only way does not halt is . Therefore, since itself is a primitive recursive function that can be represented in , once is decidable, whether halts is decidable for any .
❖Comparison with Gödel’s original proof
Gödel’s original proof [1, 13] is quite complex and can be summarized as:
- Prove that all primitive recursive functions are representable.
- Arithmetize the syntax of first-order logic and elementary arithmetic with primitive recursive functions. We solve this by writing
can_prove_n(ψ, n)
; Gödel defines elementary recursive functions and corresponding elementary recursive relations. He wrote 45 recursive relations and finally defined provability (the famous Bew). - Since Bew is a primitive recursive relation, it is representable. In this way, a formula can be found to represent it.
- Gödel proved the fixed-point theorem: for any unary formula , a sentence can be found such that .
- Therefore, there exists such that . Note that this sentence constructs a proposition similar to the liar paradox.
- Using ω-completeness, whether assuming or , a contradiction will be obtained. In this way, and are independent.
As can be seen, our proof did not avoid 1. and 2., which are the most complex steps. However, semi-decidability is relatively simple to understand, and if it is brought up at the moment, the proof will be very short. 1. and 2. are actually introduced to prove semi-decidability.
❖Further Reading
- If you want to learn Gödel’s original proof, please read [1] or Gödel’s original paper [13]. Professor Yang Ruizhi’s book is very well written.
- The idea of this proof seems to originate from Kleene, you can refer to his work [10] and these notes [9].
- This article [4] also introduces the incompleteness theorem with this idea.
- Professor ocau’s Agda formalization is worth referring to, but his lecture videos are not public (for unknown reasons). His idea may come from this Coq formalization work [6], you can read this paper as a substitute.
- A complete Coq formalization of Gödel’s incompleteness theorem [3] (the above paper is comprehensive and has a big difference from what is discussed here)
- There are two books about Gödel’s incompleteness theorem [2, 8] that are quite good, you can choose to read some chapters.
- For knowledge related to recursive functions, you can refer to [1, 10] and Professor Song’s [7]. Similarly, concepts related to computability can refer to [5]
❖References
- [1] Mathematical Logic: Proof and Its Limits
- [2] An Introduction to Gödel’s Theorems
- [3] Essential Incompleteness of Arithmetic Verified by Coq
- [4] Gödel’s Incompleteness Phenomenon—Computationally
- [5] Computability and Complexity: From a Programming Perspective.
- [6] Gödel’s Theorem Without Tears: Essential Incompleteness in Synthetic Computability
- [7] Guide to Computational Models
- [8] Incompleteness and Computability: An Open Introduction to Gödel’s Theorems
- [9] Kleene’s proof of Gödel’s Theorem
- [10] Introduction to Metamathematics
- [11] An Unsolvable Problem of Elementary Number Theory
- [12] A. Church on Computability
- [13] On Formally Undecidable Propositions of Principia Mathematica and Related Systems I