Universality There is a
universal constructive martingale
d. This martingale is universal in the sense that, given any constructive martingale
d, if
d succeeds on a sequence, then
d succeeds on that sequence as well. Thus,
d succeeds on every sequence in RANDc (but, since
d is constructive, it succeeds on no sequence in RAND). (Schnorr 1971) There is a constructive null cover of RANDc. This means that all effective tests for randomness (that is, constructive null covers) are, in a sense, subsumed by this
universal test for randomness, since any sequence that passes this single test for randomness will pass all tests for randomness. (Martin-Löf 1966) Intuitively, this universal test for randomness says "If the sequence has increasingly long prefixes that can be increasingly well-compressed on this universal Turing machine", then it is not random." -- see next section.
Construction sketch: Enumerate the effective null covers as ((U_{m, n})_n)_m. The enumeration is also effective (enumerated by a modified universal Turing machine). Now we have a universal effective null cover by diagonalization: (\cup_{n}U_{n, n+k+1})_k.
Passing randomness tests If a sequence fails an algorithmic randomness test, then it is algorithmically compressible. Conversely, if it is algorithmically compressible, then it fails an algorithmic randomness test.
Construction sketch: Suppose the sequence fails a randomness test, then it can be compressed by lexicographically enumerating all sequences that fails the test, then code for the location of the sequence in the list of all such sequences. This is called "enumerative source encoding". Conversely, if the sequence is compressible, then by the
pigeonhole principle, only a vanishingly small fraction of sequences are like that, so we can
define a new test for randomness by "has a compression by this universal Turing machine". Incidentally, this is the
universal test for randomness. For example, consider a binary sequence sampled IID from the
Bernoulli distribution. After taking a large number N of samples, we should have about M \approx pN ones. We can code for this sequence as "Generate all binary sequences with length N, and M ones. Of those, the i-th sequence in lexicographic order.". By
Stirling approximation, \log_2 \binom{N}{pN} \approx N H(p) where H is the
binary entropy function. Thus, the number of bits in this description is 2(1 + \epsilon) \log_2 N + (1 + \epsilon) N H(p) + O(1) The first term is for prefix-coding the numbers N and M. The second term is for prefix-coding the number i. (Use
Elias omega coding.) The third term is for prefix-coding the rest of the description. When N is large, this description has just \sim H(p) N bits, and so it is compressible, with compression ratio \sim H(p) . In particular, the compression ratio is exactly one (incompressible) only when p = 1/2. (Example 14.2.8 )
Impossibility of a gambling system Consider a casino offering fair odds at a roulette table. The roulette table generates a sequence of random numbers. If this sequence is algorithmically random, then there is no lower semi-computable strategy to win, which in turn implies that there is no computable strategy to win. That is, for any gambling algorithm, the long-term log-payoff is zero (neither positive nor negative). Conversely, if this sequence is not algorithmically random, then there is a lower semi-computable strategy to win.
Examples •
Chaitin's halting probability Ω is an example of a random sequence. • Every random sequence is not computable. • Every random sequence is
normal, satisfies the
law of large numbers, and satisfies all Turing-computable properties satisfied by an IID stream of uniformly random numbers. (Theorem 14.5.2 )
Relation to the arithmetic hierarchy • RANDc (the
complement of RAND) is a
measure 0 subset of the set of all infinite sequences. This is implied by the fact that each constructive null cover covers a measure 0 set, there are only
countably many constructive null covers, and a countable union of measure 0 sets has measure 0. This implies that RAND is a measure 1 subset of the set of all infinite sequences. • The class RAND is a \Sigma^0_2 subset of Cantor space, where \Sigma^0_2 refers to the second level of the
arithmetical hierarchy. This is because a sequence
S is in RAND if and only if there is some open set in the universal effective null cover that does not contain
S; this property can be seen to be definable by a \Sigma^0_2 formula. • There is a random sequence which is \Delta^0_2, that is, computable relative to an oracle for the Halting problem. (Schnorr 1971) Chaitin's
Ω is an example of such a sequence. • No random sequence is
decidable,
computably enumerable, or
co-computably-enumerable. Since these correspond to the \Delta_1^0, \Sigma_1^0, and \Pi_1^0 levels of the
arithmetical hierarchy, this means that \Delta_2^0 is the lowest level in the arithmetical hierarchy where random sequences can be found. • Every sequence is
Turing reducible to some random sequence. (Kučera 1985/1989,
Gács 1986). Thus there are random sequences of arbitrarily high
Turing degree. == Relative randomness ==