Universal programs#
We discuss universal programs: these are programs that take a program as an inputs and (act as if they) run the input program.
#@title
!python -m pip install -U setuptools
!python -m pip install -U git+https://github.com/lmoss/onesharp.git@main
from onesharp.interpreter.interpreter import *
Definition
A word \(u\) is universal if
for all words \(x\).
This means that if running \(x\) on all empty registers eventually gives some output in R1 (and all other registers are empty), then that same output would be the result of running \(u\) with \(p\) in R1 and all other registers empty. In the other direction, if running \(x\) with all empty registers gives some output, say \(y\), in R1 (and all registers empty at the end), then we would get the same output \(y\) in R1 when we run \(u\) with \(x\) in R1 and all other registers empty.
Let’s take a look at such a program and try it out.
u = universal
Examples
\([\![u]\!](\one\hash) \simeq \one\) because \([\![\one\hash]\!](\ )\simeq \one\).
\([\![u]\!](\one)\!\uparrow\) because \(\one\) is not a program.
More examples
You might try to figure out what the next two should be and then confirm your guesses by running them.
\([\![u]\!](\one\hash\one\hash\hash)\)
\([\![u]\!](\one\hash \one\hash\hash \one\hash \one\hash \hash \one\hash \hash+ u)\)
Let’s write a universal program#
The next goal is for you to write a universal program
yourself, either alone or with others.
The next sections guide you, using a series of exercises.
We hasten to make two remarks: first of all, it would be more instructive for you to forget about the rest of this lesson entirely and to just write your own universal program. But most readers would probably appreciate the hints and will find enough of a challenge in the construction of a long working program.
Second, the sections to come do not guide you to write a program that is exactly like \(u\) above. The program \(u\) above was designed to be as short as possible. In fact, we leave you with a challenge: write a universal program with the smallest number of instructions. The one above has 300.
Simplification#
We are going to simplify the requirements in order to make it easier to write a universal program. So we’ll outline how to write a program that is close to a universal program, but not quite a universal program.
We’ll write a program \(u\) that acts like a universal program, but only for programs which use only R1, R2, and R3. In more detail:
Let \(p\) be any
program of 1#
which only uses R1, R2, and R3.
Let \(x\) be any word on our alphabet {1#}
,
including possibly the empty word. Then the following are equivalent:
(A’) If \(p\) is started with all empty registers, then eventually we halt with \(x\) in R1.
(B’) If \(u\) is started with \(p\) in R1 and with the empty word in all other registers, then eventually we halt with \(x\) in R1.
Once again, the work of this section sketches a construction of
such a program \(u\). You will need to do much of the work yourself
in actually getting the program from the sketch. And even when we
have \(u\), it will not be what we really want, since
the resulting program \(u\) will only behave right on programs which use the first
three registers.
Later on, we’ll come
back and drop this simplification to get a program which is truly
universal.
At this point, we are ready to sketch a method for you to write a universal program subject to this simplification.
We are going to use pseudo-registers as follows:
R1: the input program p |
R2: an instruction number 1n |
R3: the nth instruction of p |
R4: the contents of the real R1 |
R5: the contents of the real R2 |
R6: the contents of the real R3 |
That is, we are going to simulate the computation of a register machine on a program p by using six registers. Our universal program u mimics what a register machine would do. The only difference is that one step of a real register machine would be done by *many* steps of our universal program.
The registers above hold what is sometimes called a snapshot
of a register machine run.
This means that they hold all of the relevant information about a register
machine computation at one particular point in time: they tell what program is
running, which instruction would be run next and what number instruction it is,
and also the contents of all of the registers that the program is using.
Now a real register machine goes from snapshot to snapshot in one step,
but the universal machine that we are building will take many steps to
go from one snapshot to the next.
Our universal program is composed of a few sub-programs. We shall illustrate the ideas behind several sub-programs by making a large chart. This chart shows just one example of how a universal program could work, when we take p to be the program
Let's parse this program and number the instructions:
1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 |
1## | 1##### | 111111111### | 11111### | 11# | 11## | 11## | 111111#### | 11# |
10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 |
11## | 111111111#### | 11##### | 111111### | 111### | 1## | 1111#### | 1# | 111111#### |
What we are going to show is tables of what the various
registers show after different parts of a run of a universal program
u on this program 1# + write.
We hasten to add that the universal program exhibited above
does not conform to the tables. (This is because
registers work differently in it. Also, the program
above is designed to work even without our simplifying
assumption.)
Here are the tables; we'll have some words below about them.
R1: the input program p
p
p
p
p
p
p
p
p
p
p
p
p
p
R2: an instruction number 1n
1
1
11
11
11111
11111
16
16
17
17
18
18
R3: the nth instruction of p
1##
1#####
11#
11##
11##
16 ####
R4: the contents of R1
#
#
R5: the contents of R2
1
1
1#
1#
1##
1##
R6: the contents of R3
continuing
R1: the input program p
p
p
p
p
p
p
p
p
p
p
p
p
R2: an instr number 1n
11
11
111
111
112
112
114
114
117
117
118
118
R3: the nth instruction of p
1#####
19 ###
11#####
111###
1#
16 ####
R4: the contents of R1
1
1
R5: the contents of R2
1##
1##
1##
1##
1##
1##
1##
1##
##
##
##
##
R6: the contents of R3
and going further in the move2,1 part
at the end of write
R1: the input program p
p
p
p
p
p
p
p
p
p
p
p
p
R2: an instruction number 1n
112
112
115
115
116
116
112
112
115
115
116
116
R3: the nth instruction of p
11#####
1##
14####
11#####
1##
14####
R4: the contents of R1
1
1
1
1
1#
1#
1#
1#
1#
1#
1##
1##
R5: the contents of R2
##
##
#
#
#
#
#
#
R6: the contents of R3
and concluding
R1: the input program p
p
p
p
p
p
1##
R2: an instruction number 1n
112
112
113
113
119
R3: the nth instruction of p
11#####
16 ###
R4: the contents of R1
1##
1##
1##
1##
1##
R5: the contents of R2
R6: the contents of R3
As you can see, there are four different colors of the columns.
orange: starting out by pre-processing the input (if desired) and putting 1 in R2 |
beige: put into R3 the nth instruction of the program in R1, where 1n is in R2 |
green: execute the action in R3, but using R4 instead of R1, R5 instead of R2, and R6 instead of R3 |
yellow: finishing up by moving and clearing the relevant registers |
The colors indicate the sub-programs of u.
The columns themselves indicate the register contents when one
of the sub-programs begins.
The first color shown is orange.
This is the the easiest to explain. The orange sub-program could be 11#.
This indicates that at the
very beginning of the execution of
the input program, the instruction to look at first is instruction 1. But if you decide to pre-process the input program (see below), then the orange sup-program would do this as well.
The beige columns alternate with the green columns.
The purpose of the beige program is to take the input program
(in R1) and the next instruction number (R2), and to
look up the appropriate
instruction and put it into R3. But we need to be sure that the contents
of R1 and R2 are not destroyed at the end. The way to understand
the beige columns is that they tell the contents of the various registers
at the beginning of the operation of the "beige
sub-program". You will need to write that sub-program.
It is a fairly straightforward modification of a program which
takes an input program q and a number n and gives the nth instruction of
q. But you need to be
sure that the output goes in R3, and that the input is preserved.
The main action of the steps is done in the green columns;
more precisely,
the columns shown are the beginnings of the
"green" sub-program, and then the results of that sub-program
are the beige columns which follow.
The exact action depends on what kind of instruction is in R3 in
the given green column. For example, if R3 has `≈, then
in the next beige column we add a # to R5 (not R2: it is R5
that models what is in R2 of the real machine when we run
program). We also must add 1 to R2 in simulating the execution
of 11#, since after 11#, we go to the very next instruction.
This is how the green columns work when we execute an instruction
11#. The other cases are left for you to think about.
Note also that each time the green sub-program runs, R3 is emptied
at the end.
The last column color is the yellow one at the very end.
Before the yellow one starts up, the beige program
has determined (in some sense) that
our program $p$ has 18 instructions and we have
come to a point where we ask for instruction 19. Thus,
the input program has halted. We therefore want to
take the contents of R1 as run by the input program
(this is the word in R4), and move this to R1.
Also, we would like to clear out the contents of
whichever registers are not empty. We do this so that
the program we are writing will itself come to a good halt
on its input.
It might seem silly to have R1 unchanged throughout.
But this isn't really what is happening at all.
In the course of the beige subprograms, parts of R1 are copied
to other registers, and the copied back. So at the
end of the beige sub-programs, we have the original
program p back in R1.
Similarly, it might seem weird to have R6 completely unused in running a universal program on p. This is due to the fact that our program p only uses R1 and R2. If we worked with an example program that used R3, then R6 would not have remained empty in the tables.
Coding unboundedly many registers into one#
The simplified form of the universal program that we just saw could be modified to work on programs with any fixed finite number of registers. But we want one universal programs that works on all programs, without a pre-established bound. For this, we need a new idea.
We now call upon a technique introduced in a separate notebook in this course, the technique of two-by-two encoding. That notebook shows how we can add an end of register marker to R1, and the work with a simple encoding of symbols by pairs. We took the end of register marker to be ##
, and we encoded 1
by 11
and #
by 1#
.
All of that work was for R1 alone. But we could do the same work for all of the other registers.
Following this, the idea is to simply run all of the encoded registers into one.
For an example of what we mean, suppose that we were dealing with the following snapshot:
R1: 11# | R2: | R3: ## | R4: | R5: | R6: 1 | R7: # | R8: ##1 |
(We understand also that all of the other registers are empty.)
Encoding things as we have explained allows us to code the eight registers, as in the table below:
Contents | R1: 11# | R2: | R3: ## | R4: | R5: | R6: 1 | R7: # | R8: ##1 |
Encoded as | 11111### | ## | 1#1### | ## | ## | 11## | 1# ## | 1#1#11## |
One important point is that the empty registers still get an end-of-register marker ##
.
Therefore we would encode the contents of these registers by the single word
11 11 1# ## ## 1# 1# ## ## ## 11 ## 1# ## 1# 1# 11 ##
Without spaces, this is
11111#####1#1#######11##1###1#1#11##
The point is that with some additional work, we can use this single word as an encoded form of all the registers in our program.

Group project#
Click here for a suggestion on how groups could work together to write a universal program.
Every instructor and every class will be different, here is one plan that has worked for me.
Make groups of 4 or 5 people. If someone prefer to work alone, then it would be ok to do so, but I think it would be better to work with others.
Here is what each group needs to do:
- Make a plan on how to divide up the work of writing the different sub-programs. The parts are not all equally hard. The beige program could be the hardest, especially if you follow my comment below on it.
- You will need to meet, or at least to email each other, so that you can be sure that the different sub-programs do what they are supposed to.
- You also will need to have a plan on how to assemble everything together. The various sub-programs will need to "talk to each other" via goto statments. One good way is to use the Sanity parser. And one way that probably does not work well would be to simply put two flowcharts together.
- The group overall will need to turn in a program along with an explanation of the various parts. You should say clearly what the parts do, and also give some examples of how they run. If you have made flowcharts, it would be good to hand those in as well.
- You should test your overall program. It would be good to run your program on 1#, on self, and on 1#1## + u.
- At various points, you will be tempted to worry about what would happen if the input program isn't tidy. (For example, what should your program do if the input program is just 1111#### ?) Strictly speaking, the universal program should itself diverge on this input. But for our purposes, it's fine to ignore this point. You need not worry about this at all, only about the case when the input program is tidy.
- Above, we worked with the simplifying assumption that the input program only uses R1, R2, and R3. You should get a program u that woks on all input programs.
- It is important to write well. Your goal should be to write something that could be understood by someone else.

Universal programs of more than one argument#
Here is a more general result based on what we have seen up to now.
Theorem 5
For every natural number \(n\) there is a program \(u_n\) such that for all words \(p\), \(x_1\), \(\ldots\), \(x_n\),
What we have seen is an explicit program \(u_0\).
The proof of the general fact is an exercise using \(u_0\) and the \(s^m_n\) Theorem.
Exercise 49
What happens if we run the universal program \(u\) on the self-writing program \(\self\)?
Incidentally, running \(u\) on \(\self\) takes 22,376,608 steps, whereas running \(\self\) on empty registers takes 3,322.
Exercise 50
Show that the sets \(K\) and \(K_0\) are computably enumerable, where
Exercise 51
Write a program trade
which looks like it trades places with its input in R1. (Of course, a program cannot literally trade places with its input. But here is what we mean: running trade with p in R1 and all other registers empty does the same thing as running p with trade in R1 and all other registers empty.)
[Hint: You will need a universal program, and also an application of the Recursion Theorem.]
Exercise 52
Write a program self1
which writes itself to R1 but only uses R1 itself (no other registers).
This exercise requires a trick, since there is no program like diag
which only uses one register.
[Hint: you will need the technique of coding several registers into R1.]
An extra property of the universal program#
For use in several places, we need a technical remark about the universal prorgram \(u\) which we can arrange. That is, the remark below is not a consequence of the specification of \(u\) that we started with. It is not even a consequence of the outline of \(u\) which we provided. But if you write \(u\) the way we suggest, then the property below is either true, or one can arrange for it to hold.
Attention
We may take \(u\) to have the following property: when run on a word \(x\) in R1, the word in R1 is not empty at any point other than at the very end.
Only one more register than the inputs#
The work in this section can be used to prove the following result.
Proposition 4
For every \(n\) and every program \(p\), there is a program \(q\) such that
For all words \(x_1\), \(\ldots\), \(x_n\),
\(q\) uses R1, \(\ldots\), R\(n\), R\((n+1)\), and no other registers
\(q\) may be obtained computably from \(p\) and \(n\).
Taking \(n = 0\), it shows that when running a register machine program \(p\) on no inputs, we may assume that the program uses only one register. In particular, we have an undecidability result:
Proposition 5
Let \(A\) be the set of register machine programs which are tidy and which only use R1.
Let \(K^1\) be \(\{ p \in A : [\![p]\!](\ )\!\!\downarrow\}\)
Then \(K_0 \leq_m K^1\).
We are going to prove that \(K_0\) is undecdiable in the section on the halting problem. It follows from the last result that the halting problem for register machine programs which are tidy and which only use R1 is undecidable.
We are going to put the proposition just above to good use when we prove the undecidability of Post’s correspondence problem and of tiling.