Is There Something Wrong with Proving Gödel’s Incompleteness Using Multithreading?

WARNING

This article was translated from Chinese by ChatGPT and may contain some errors.

WARNING

This article is not a strict academic paper and may contain some errors and omissions. Please forgive me.

Introduction

Kurt Gödels several famous theorems form the cornerstone of modern logic. Among them, Gödels (first and second) Incompleteness Theorems are widely acclaimed.

For a long time, many people, including the author, have found the proof of Gödels Incompleteness Theorem to be very cumbersome, with very complex technical details, so it is quite difficult to learn. At the ∞-type Café Summer School in 2023, teacher Ocua gave a very elegant and concise comprehensive lecture on the proof of Gödels Incompleteness Theorem. This lecture made me realize that Gödels Incompleteness Theorem is actually closely related to computability, or recursion theory.

In this article, I will combine my understanding of Gödels Incompleteness Theorem to introduce a simplest (perhaps not strict) proof of Gödels Incompleteness Theorem, trying to make ordinary programmers understand it after learning simple first-order logic.

First-Order Logic

First-order logic (First order logic, FOL) is the logical system discussed in this article. First-order logic has syntax and semantics. If the reader has not learned first-order logic in school, then you can understand it this way:

  1. First-order logic is a language, similar to our common programming languages, it has a set of formalized syntax (Syntax)
    • The sentences of first-order logic can be seen as a syntax tree (Abstract Syntax Tree), or an algebraic data type (ADT)
    • This syntax tree has two layers, one is proposition (Statement), and the other is expression (Expression)
    • The expression layer includes variables, functions (calls), etc., xx, f(x)f(x), g(x,y)g(x, y) are all expressions
    • The proposition layer is composed of expressions, and expressions are combined with equal signs or relations to form propositions, such as f(x1)=f(x2)f(x_1) = f(x_2), P(x1,f(x2))P(x_1, f(x_2))
    • Propositions can be connected with logical connectors to express specific meanings, such as f(x1)=f(x2)P(x1,f(x2))f(x_1) = f(x_2) \land P(x_1, f(x_2)) is also a proposition. Such logical connectors also have \rightarrow, \lor, ¬\neg
    • The free variables in the proposition (free variable) are bound with quantifiers to get new propositions. For example, x1x2.f(x1)=f(x2)\forall x_1 \exists x_2. f(x_1) = f(x_2)
  2. This language of first-order logic needs to specify its semantics, which is implemented through the model (Model). The related content is more complicated and will not be discussed for the time being.
  3. First-order logic has many proof systems, such as Hilbert System, these proof systems are independent of semantics. Simply put, a proof system can give a proposition like Γψ\Gamma \vdash \psi 1 through calculation, where Γ\Gamma represents a set of propositions, that is, a bunch of propositions, and ψ\psi represents the proposition to be proved.
  4. When we use first-order logic, we use first-order theory (Theory). A first-order theory consists of a symbol table SS and a set of axioms TT. The symbol table specifies all the symbols that may appear in the proposition, including constants, function symbols, and relation symbols; the axiom set specifies the self-evident propositions in this theory, they are the starting point of the proof system2. For example, the famous Peano arithmetic number can be used as a first-order theory, its symbol table has:
    • Constant symbol 00
    • Function symbol ++
    • Function symbol ×\times
    • Function symbol SS (representing successor)
    • Relation symbol \le

Our introduction to first-order logic ends here. First-order logic has very rich mathematical properties, which are the main discussion objects of mathematical logic courses. If the reader feels uncomfortable during the reading process, you might as well find a reliable mathematical logic book, such as [1, 2], etc., to learn some related knowledge.

Gödels Incompleteness Theorem

Completeness is a property of a first-order theory. A first-order theory TT is complete, which means:

For any proposition ψ\psi constructed from the symbol set of this theory, either TψT \vdash \psi or T¬ψT \vdash \neg \psi

In other words, any proposition in this theory can either be proved or refuted.

Notice that this is a purely syntactic concept, and it seems to require specifying a particular proof system. However, various first-order logic proof systems, if not specially constructed, are equivalent in proof power. And because of the existence of Gödels completeness theorem, TψT \vDash \psi if and only if TψT \vdash \psi, so this is actually also a semantic concept. In this way, we generally do not need to specify a proof system. In the formalization of Gödels incompleteness theorem [3], such specification may be necessary.

For a first-order theory, we naturally hope that it is complete. For example, in Peano arithmetic, the following intuitive propositions can be proved:

  • n.nS(n)\forall n. n \le S(n)
  • n1,n2.n1+n2=n2+n1\forall n_1, n_2 . n_1 + n_2 = n_2 + n_1

And correspondingly, propositions that are not intuitively valid can be refuted.

If the first-order theory is simple enough, incompleteness is easy to achieve. For example, consider a first-order theory AA with a symbol table of {c,e}\{ c, e \} and axioms of \emptyset, it is obvious that:

  • A⊬c=eA \not\vdash c = e
  • A⊬ceA \not\vdash c \neq e

Intuitively, this is because the axiom set of the theory is too small. Adding an axiom (often called expansion, extension) c=ec = e to AA can make AA a complete theory.

For any theory TT, if ψ\psi can neither be proved nor refuted in this theory (called ψ\psi is independent of TT), then adding ψ\psi or ¬ψ\neg \psi to the axioms of TT can eliminate this incompleteness.

This seems to reveal a beautiful picture: we can build all mathematics on some complete first-order theory, in this ultimate theory, any proposition can either be proved or refuted. In this way, the vague and chaotic mathematical language can be completely replaced by clear and neat first-order logic

Disappointingly, in 1931, Gödel proved in his published paper Über formal unentscheidbare Sätze der Principia Mathematica und verwandter Systeme I [13] that as long as a formal system TT includes elementary arithmetic3, there must be propositions that can neither be proved nor disproved. Note that adding axioms is of no use, because after adding axioms, the antecedent of the incompleteness theorem still holds, and the consequent can still be derived. 4

This is equivalent to sentencing the beautiful picture we imagined above to death. Of course, we are programmers, not mathematicians, what does this have to do with us?

Decidability

After Gödel published his paper in 1931, Church and Turing respectively proposed another major issue about first-order logicthe negative proof of decidability in 1936-1937.

As mentioned earlier, the propositions of first-order logic can be seen as a syntax tree (or string), and the first-order theory is a symbol table and a set of axioms. Under certain conditions (such as the recursive axiomatization to be discussed below), these can all be seen as data in the program.

Obviously we will ask: Is there a program p\texttt{p} that determines whether a proposition ψ\psi can be proved in TT?

The answer to this question is negative: there is no such program. We will leave the proof method to the next section. This fact is a bit counter-intuitive, because the proof in an axiom system, theoretically speaking, can be enumerated.

How to enumerate? Consider Peano arithmetic and the Hilbert system. In the axioms of Peano arithmetic, the induction axiom is infinite, but it has a fixed pattern:

yk.((φ(0,yk) (φ(x,yk)φ(S(x),yk)))x.φ(x,yk)) \begin{aligned} \forall \vec{y_k}. (& (\varphi(0, \vec{y_k})\ \land \\ & (\varphi(x, \vec{y_k}) \rightarrow \varphi(S(x), \vec{y_k}))) \\ & \rightarrow \forall x. \varphi(x, \vec{y_k})) \end{aligned}

What needs to be enumerated here are kk and φ\varphi. kk goes without saying, φ\varphi is all predicates, its syntax is also determined, and can be generated by fixed rules. However, writing a gen function directly is quite complicated. We can borrow Gödels method here.

First of all, the symbol set of Peano arithmetic is infinite, but we can divide it into two areas:

  • Ten logical symbols (after simplification)
  • Infinite variable symbols (denoted as v0v_0, v1v_1, v2v_2, …)

By encoding the symbols of these two areas in order, we can give encoding and decoding functions. The encoding function maps symbols to natural numbers, and the decoding function maps natural numbers to symbols.

1# put '∀' to 1 , instead of 0
2LOGIC_SYM = [ 'dummy', '∀', '0', 'S', '+', '×', '(', ')', '¬', '→', '=' ]
3
4def decode(n):
5    if n <= len(LOGIC_SYM):
6        return LOGIC_SYM[n]
7    else:
8        # append '$' to variable head
9        return f"$v_{n - len(LOGIC_SYM)}"
10
11def encode(sym):
12    if sym[0] == '$':
13        # is variable, start with '$'
14        index = sym[3:]
15        return int(index) + len(LOGIC_SYM)
16    elif sym in LOGIC_SYM and sym != 'dummy':
17        return LOGIN_SYM.index(sym)
18    else:
19        # error
20        return -1

Similarly, a proposition (that is, a string, a list of characters) can also be encoded and decoded. The technique used by Gödel is as follows:

[a1,a2,a3,...,an]=p1a1×p2a2××pnan \sharp [ a_1, a_2, a_3, ..., a_n ] = p_1^{\sharp a_1} \times p_2^{\sharp a_2} \times \cdots \times p_n^{\sharp a_n}

Here, ai\sharp a_i represents the encode(aᵢ) we just defined. And pip_i represents the iith prime number

1from sympy import nextprime, Pow, factorint
2
3def encode_str(stmt):
4    pᵢ = 2
5    res = 1
6    for aᵢ in stmt:
7        res *= Pow(pᵢ, encode(aᵢ))
8        pᵢ = nextprime(pᵢ)
9    return res
10
11def decode_str(n):
12    """
13    n must be of form `Πᵢ pᵢ^{aᵢ}, aᵢ ≠ 0`, or this function is undefined
14    """
15    res = []
16    factors = factorint(n)
17    # factors is like { pᵢ : aᵢ }
18    for pᵢ, aᵢ in sorted(factors.items()):
19        res.append(decode(aᵢ))
20    return res

The range of decode_str includes all strings. In this way, the traversal of strings can be implemented by the traversal of natural numbers.

A string may not be a valid proposition. Therefore, a function is needed to judge whether a string is a valid proposition. As long as we have this function, we can enumerate all propositions:

1def is_valid_stmt(stmt):
2    # this function is hard to write
3    pass
4
5def gen_all_stmts():
6    n = 0
7    while True:
8        stmt = decode_str(n)
9        if is_valid_stmt(stmt):
10            # do something here
11        n += 1

The function is_valid_stmt is similar to the parser of a programming language. Although it is difficult to write, it can definitely be written.

Similarly, we can enumerate all proofs (that is, a list of propositions), and then construct a function to judge whether a proposition can be proved:

1def is_valid_proof(stmt_l):
2    pass
3
4def encode_proof(proof):
5    # like encode_str()
6    pass
7
8def decode_proof(n):
9    pass
10
11def can_prove(ψ):
12    n = 0
13    while True:
14        stmt_l = decode_proof(n)
15        if is_valid_proof(stmt_l):
16            stmt_last = stmt_l[-1]
17            if stmt_last == ψ:
18                return True
19        n += 1

For the program can_prove, the following observations are made:

  • If the last one in the list is the ψ\psi to be proved, then obviously, TψT \vdash \psi is established.
  • Our enumeration traverses all proofs, so if TψT \vdash \psi, that is, there is a proof that meets the requirements, then that proof will definitely be found.
  • If T⊬ψT \not\vdash \psi, this algorithm will not stop at all, it will keep looping.

This is very similar to the situation of the halting problem, which is called semi-decidability in computability theory. Semi-decidability is the core concept used in our proof below, and it is worth discussing carefully.

A set5 SS is semi-decidable, if and only if there exists a program p\texttt{p}, such that for all eSe \in S, p(e)=True\texttt{p}(e) = \texttt{True}.

The key to semi-decidability is that it does not discuss whether the program halts when e∉Se \not\in S. Although if p(e)=True\texttt{p}(e) = \texttt{True}, that is, when the program outputs True\texttt{True}, according to if and only if, eSe \in S. But if the input e1∉Se_1 \not\in S, whether p(e)\texttt{p}(e) halts is not mentioned in the definition. Therefore, a decidable set must be a semi-decidable set, but a semi-decidable set is not necessarily a decidable set.

The halting problem is semi-decidable, not decidable

The proof is simple, the program that exists for semi-decidability is constructed as follows:

1def p(prog, x):
2    exec(prog, x)
3    return True

If prog(x) halts, then the interpreter exec(prog, x) also halts.

Semi-decidability has several equivalent definitions:

  1. Recursively enumerable (r.e.). A set SS is recursively enumerable, if and only if S=S = \emptyset or there exists a recursive total function ff, such that S={y  x.f(x)=y}S = \{ y\ |\ \exists x. f(x) = y \}, that is, SS is the range of ff.
  2. The partial characteristic function of SS is partially recursive.
  3. SS is the range of some partial recursive function.

What is a partial function? Simply put, it is a function that is not defined on some inputs. A program that does not halt on some inputs, its image, that is, {(x,y)  xA,p(x)=y}\{ (x, y)\ |\ x \in A, \texttt{p}(x) = y \} is a partial function.

These equivalences need to be proven, you can refer to [1, 5].

The construction of is_valid_stmt and is_valid_proof requires that there exists an algorithm to determine whether a string is in the axiom set. This puts requirements on the axiom set. In fact, in the conditions of Gödels incompleteness theorem, there is a requirement that the axiom set needs to be a recursive set, which is equivalent to being decidable, and it adds the restriction that it must halt on any input to the requirement of semi-decidability.

Proof of Gödels Incompleteness Theorem

We already know that, at least for Peano arithmetic PA\mathsf{PA}, the problem of PAψ\mathsf{PA} \vdash \psi is semi-decidable. Based on this result and a little programming trick, we can immediately get Gödels incompleteness theorem.

First, assume that PA\mathsf{PA} is complete, that is, for any ψ\psi, PAψ\mathsf{PA} \vdash \psi or PA¬ψ\mathsf{PA} \vdash \neg \psi. Let the semi-decider of PA\mathsf{PA} be p\texttt{p}, a decider p_total\texttt{p\_total} of PA\mathsf{PA} can be constructed. The construction is as follows:

1from multiprocessing.pool import ThreadPool
2
3def can_prove_input(ψ):
4    can_prove(ψ)
5    return ψ
6
7def p_total(ψ):
8    pool = ThreadPool(processes=2)
9    result = pool.apply_async(can_prove_input, (ψ, ¬ψ))
10    which = result.get()
11    if which == ψ:
12        return True
13    else:
14        return False

Unexpectedly, the key to this program is concurrency! Another name for concurrency is non-deterministic computation. In fact, I just want to simulate non-deterministic computation with this method. (That is, simultaneously calculate can_prove_input(ψ) and can_prove_input(¬ψ))

Analyze p_total\texttt{p\_total} carefully, consider completeness, and discuss it in different cases:

  • If PAψ\mathsf{PA} \vdash \psi, then can_prove_input(ψ) will halt, result.get() returns ψ, so the entire function returns True. Note that at this time can_prove_input(¬ψ) will not halt, so result.get() will definitely return ψ.
  • If PA¬ψ\mathsf{PA} \vdash \neg\psi, then can_prove_input(¬ψ) will halt, result.get() returns ¬ψ, the entire function returns False. According to the consistency of PA\mathsf{PA}, PA⊬ψ\mathsf{PA} \not\vdash \psi at this time.

Therefore, p_total\texttt{p\_total} is a decider of PAψ\mathsf{PA} \vdash \psi. In this way, PA\mathsf{PA} is decidable, which contradicts the undecidable result. Therefore, the assumption is not established, PA\mathsf{PA} is not complete.

Readers may not be convinced: Isnt this a trick? Where does Gödel know about multithreading? In fact, non-deterministic Turing machines and deterministic Turing machines are equivalent, and multithreading can also be simulated by single-threading. However, we can still give a more normal p_total\texttt{p\_total}.

1def can_prove_n(ψ, n):
2    n0 = 0
3    while n0 <= n:
4        stmt_l = decode_proof(n0)
5        if is_valid_proof(stmt_l):
6            stmt_last = stmt_l[-1]
7            if stmt_last == ψ:
8                return True
9        n += 1
10    return False
11
12def p_total(ψ):
13    n = 0
14    while True:
15        if can_prove_n(ψ, n):
16            return True
17        elif can_prove_n(¬ψ, n):
18            return False
19        
20        n += 1

Here, we have changed the original can_prove(ψ) to can_prove_n(ψ, n), which is whether ψ\psi can be proved within nn steps. In this way, can_prove_n(ψ, n) will definitely halt, so it can be directly called in p_total for sequential calculation.

Proof of Undecidability

From the above discussion, we can see:

  • Although semi-decidability requires a relatively complex program, it is overall intuitive
  • With semi-decidability and undecidability, we can directly obtain Gödels incompleteness theorem

The key to the problem is how to obtain the undecidability of PA\mathsf{PA}. This article will not strictly provide a proof, but will simply introduce the idea of the proof.

Using Churchs Results

To prove undecidability, we must introduce a relatively complex conceptrepresentability. Gödel himself does not seem to have directly used this concept [4], but used an equivalent form of it.

Church first proved in his 1936 paper [11] that there exists an undecidable problem (although some scholars are not very satisfied with his proof), this problem is:

Given a,bΛa, b \in \Lambda^{\circ}, does aβba \approx_{\beta} b hold?

Here, Λ\Lambda^{\circ} refers to closed lambda terms, β\approx_{\beta} refers to the β-equivalence relation, roughly speaking, there is a path between aa and bb composed of β-reduction and β-abstraction. His article further pointed out that any ω-consistent system cannot construct a decider. [11, 12]

If we want to follow Churchs proof, then, we first need to arithmetize β\approx_{\beta}. That is to say, we need to find a relation PP on integers such that P(M,N)P(\sharp M, \sharp N) if and only if MβNM \approx_{\beta} N, where \sharp represents encoding. We obviously need PP to have some good representation, because we want to find a sentence P(x,y)\mathsf{P(x, y)} in PA\mathsf{PA} such that for any n1,n2n_1, n_2

P(n1,n2)PAP(n1,n2)¬P(n1,n2)PA¬P(n1,n2) \begin{aligned} P(n_1, n_2) &\rightarrow \mathsf{PA \vdash P(n_1, n_2)} \\ \neg P(n_1, n_2)& \rightarrow \mathsf{PA \vdash \neg P(n_1, n_2)} \end{aligned}

The P(n1,n2)\mathsf{P(n_1, n_2)} contains n1\mathsf{n_1}, which refers to the symbol string corresponding to the natural number n1n_1 in the meta-language in PA\mathsf{PA}, that is,

SSSSn1times 0 \underbrace{S S S \cdots S}_{n_1 times}\ 0

P(x,y)\mathsf{P(x, y)} is a determined, PA\mathsf{PA} sentence with two free variables. In this way:

  • Assuming PA\mathsf{PA} is decidable, for any a,bΛa, b \in \Lambda^{\circ},
  • Input φ=P(a,b)\varphi = \mathsf{P(\sharp a, \sharp b)}, $\psi = \mathsf{\neg P(\sharp a, \sharp b)} $ in the decider p\texttt{p},
  • Return p(φ)\texttt{p}(\varphi)

Obviously, this program has decided the problem that Church proved to be undecidable. In fact, if PP is representable in TT, then the decision problem of PP can be reduced to the decidability of TT.

However, I personally think it is a troublesome job to find P\mathsf{P}.

Using Kleenes results

In Gödels 1931 paper, he proved that all primitive recursive functions are representable. Regarding the representability of functions, textbooks [1] and materials [9] have different opinions. Material [9] equates the representability of functions with the representability of relations. We temporarily adopt the latters statement6, a function ff is representable if and only if there exists a P\mathsf{P}, such that

f(m)=nTP(m,n)f(m)nT¬P(m,n) \begin{aligned} f(\vec{m}) = n \rightarrow T \vdash \mathsf{P(\vec{m}, n)} \\ f(\vec{m}) \neq n \rightarrow T \vdash \mathsf{\neg P(\vec{m}, n)} \end{aligned}

Kleene proved the normal form theorem:

There exists a three-element primitive recursive function C(x,y,z)C(x, y, z) and a one-element primitive recursive function U(x)U(x), such that any partial recursive function ff can be represented as:
f(n)=U(μz[C(e,n,z)]) f(n) = U(\mu z[C(e, n, z)])

This theorem seems quite puzzling. In fact, it describes a system similar to an interpreter:

1def U(x):
2    # some calculation 
3    pass
4
5def C(e, y, z):
6    # some calculation
7    pass
8
9def μ(f):
10    n = 0
11    while f(n) != 0:
12        n += 1
13    return n
14
15def f(n):
16    return n * 2
17
18def f_another(n):
19    # note: e may not be 10, we just use a fixed value here
20    return U(μ(lambda z: C(10, n, n)))
21
22# ∀ n, f(n) == f_anthor(n)

Here, ee can be roughly understood as the result after the program (recursive function) is encoded. Through this theorem, Kleene used the diagonal method to prove the following theorem:

Not every partial recursive function is potentially recursive

The so-called potentially recursive means that for a partial recursive function gg, there exists a total function ff such that f(n)=g(n)f(n) = g(n) where gg is defined. Intuitively, this theorem is similar to not all semi-decidable sets are decidable.

Now suppose there is a decider p_t\texttt{p\_t} for TT. According to Gödels proof, there is a formula C(e,n,z,t)\mathsf{C(e, n, z, t)} that represents C(e,n,z)C(e, n, z). Then, a new super interpreter can be constructed that can find the corresponding total function ff for all partial recursive functions gg. Consider any gg with ee value e1e_1, construct ff as follows:

1def f(n):
2    if p_t(∀ z. ¬C(e₁, n, z, 0)):
3        # g(n) can never halt
4        # return a random value, doesn't matter
5        return 0
6    else:
7        # g(n) will halt
8        # return g(n)
9        return U(μ(lambda z: C(e₁, n, z)))

Kleenes theorem guarantees that the only way gg does not halt is z.C(e1,n,z)0\forall z. C(e_1, n, z) \neq 0. Therefore, since CC itself is a primitive recursive function that can be represented in TT, once TT is decidable, whether g(n)g(n) halts is decidable for any nn.

Comparison with Gödels original proof

Gödels original proof [1, 13] is quite complex and can be summarized as:

  1. Prove that all primitive recursive functions are representable.
  2. Arithmetize the syntax of first-order logic and elementary arithmetic with primitive recursive functions. We solve this by writing can_prove_n(ψ, n); Gödel defines elementary recursive functions and corresponding elementary recursive relations. He wrote 45 recursive relations and finally defined provability (the famous Bew).
  3. Since Bew is a primitive recursive relation, it is representable. In this way, a formula P(ψ)\mathsf{P(\sharp\psi)} can be found to represent it.
  4. Gödel proved the fixed-point theorem: for any unary formula ψ(x)\psi(x), a sentence σ\sigma can be found such that Tσψ(σ)T \vdash \sigma \leftrightarrow \psi(\mathsf{\sharp\sigma}).
  5. Therefore, there exists σ\sigma such that Tσ¬P(σ)T \vdash \sigma \leftrightarrow \neg \mathsf{P(\sharp \sigma)}. Note that this sentence constructs a proposition similar to the liar paradox.
  6. Using ω-completeness, whether assuming TσT \vdash \sigma or T¬σT \vdash \neg \sigma, a contradiction will be obtained. In this way, σ\sigma and TT are independent.

As can be seen, our proof did not avoid 1. and 2., which are the most complex steps. However, semi-decidability is relatively simple to understand, and if it is brought up at the moment, the proof will be very short. 1. and 2. are actually introduced to prove semi-decidability.

Further Reading

  • If you want to learn Gödels original proof, please read [1] or Gödels original paper [13]. Professor Yang Ruizhis book is very well written.
  • The idea of this proof seems to originate from Kleene, you can refer to his work [10] and these notes [9].
  • This article [4] also introduces the incompleteness theorem with this idea.
  • Professor ocaus Agda formalization is worth referring to, but his lecture videos are not public (for unknown reasons). His idea may come from this Coq formalization work [6], you can read this paper as a substitute.
  • A complete Coq formalization of Gödels incompleteness theorem [3] (the above paper is comprehensive and has a big difference from what is discussed here)
  • There are two books about Gödels incompleteness theorem [2, 8] that are quite good, you can choose to read some chapters.
  • For knowledge related to recursive functions, you can refer to [1, 10] and Professor Songs [7]. Similarly, concepts related to computability can refer to [5]

References