The mathematics behind quantum computing

The first of two feature columns on this subject.

Quantum computing

Quantum computing may be just around the corner or it may be, for all practical purposes, permanently out of reach: the physics needed for a useful quantum computer has not yet been discovered, and may in fact not exist.

A quantum computer, real or potential, is essentially different from an adding machine. Whereas the dials in Pascal's A.D. 1645 brass calculator always line up to read out exactly one 6-digit number, the set of qubits in a quantum register exist in a superposition of states: when the register is interrogated one of these states is read out, with a definite probability, and the remaining information is lost. Because the register can simultaneously be "in" a huge number of states, a huge number of calculations may be carried out simultaneously. But the inputs to a quantum computer must be organized to take advantage of superposition, and the calculating process must force the probabilistic output to give useful information. This is the problem of programming a quantum computer, and in certain important and interesting cases it has been solved. One of the quantum computing algorithms, a factorization algorithm due to Peter Shor, will be the focus of these two columns.

The factorization problem

Communication security today is almost universally ensured by the use of RSA encryption. This method relies on the inaccessibility of large prime factors of a large composite number. The problem is an artificial one: the encrypter takes two (or more) large primes and multiplies them. The decrypter tries to work backwards from the product to the factors. It is hard work. The largest number factored so far ( "RSA-640") had 193 decimal digits and took "approximately 30 2.2GHz-Opteron-CPU years," over five months of calendar time. At that rate a 1024-bit number, the size currently recommended by a commercial cryptology site, would take on the order of 10¹⁴⁵ years ("bit" is short for "binary digit;" each additional bit contributes a factor of 2 to the size of the calculation). The site adds: "For more security or if you are paranoid, use 2048 or even 4096 bits."

A quantum computer of suitable size could factor these large numbers in a much shorter time. For a 1024-bit number, Shor's Algorithm requires on the order of 1024³, about one billion, operations. I do not have any information on how quickly quantum operations can be executed, but if each one took one second our factorization would last 34 years. If a quantum computer could run at the speed of today's electronic computers (100 million instructions per second and up) then factorization of the 1024-bit number would be a matter of seconds.

A note on "suitable size." To run Shor's Algorithm on a 1024-bit number requires two quantum registers, one of 2048 qubits and one of 1024. These qubits all have to be "in coherence," so that the totality of their states behaves as a single, entangled state. (More about entanglement later). It was reported as an extraordinary technical feat when IBM scientists in 2001 constructed a coherent quantum register with 7 qubits and used it with Shor's Algorithm to factor 15.

This column and next month's will present a description of Shor's Factorization Algorithm in terms appropriate for a general mathematical audience. This month will cover the number-theoretical underpinning along with the Discrete and Fast Fourier Transforms. These convert factorization into a frequency-detection problem which is structured so that it can be adapted for quantum computation. The problem itself has a definite answer but takes exponential time to get there. Next month's column will address the more specifically quantum-computational aspects of the algorithm, including superposition and entanglement, leading up to Shor's ingenious quantum jiu-jitsu, which forces the quantum read-out, with high probability, to give a useful answer. That probability is high enough, and the running time on a suitable machine would be short enough, for the calculation to be repeated until the unknown factors are produced.

Number Theory and Fourier Analysis

The light from a star can be split into a spectrum, where the characteristic frequencies of its elements can be detected. Analogously, a composite number N can be made to generate a spectrum, from which its factors can be calculated.

Choose a number a relatively prime to N, and make the list of integer powers of a modulo N: a, a², a³, ... . If a and N are relatively prime, it follows from a theorem of Euler that this list will eventually include the number 1. (Euler's Theorem says specifically that if φ(N) is the number of positive integers less that N which are coprime to N then a^φ(N) is congruent to 1 modulo N). Suppose this happens for an even power of a, say a^2b = 1 mod N, i.e. a^2b – 1 = 0 mod N. This means that (a^b + 1) (a^b – 1) is a multiple of N; if it is one times N we have our factors; otherwise the Euclidean Algorithm will speedily find a common factor of, say, a^b + 1 and N.

These examples are trivially simple, but illustrate the phenomenon:

Take N = 85 and a = 19. The powers of 19 mod 85 are 19, 21, 59, 16, 49, 81, 9, 1, 19, 21, ...; in particular 19⁸ = 1 mod 85. We deduce that 19⁴ + 1 = 17 and 19⁴ – 1 = 15 both have common factors with 85. In fact the first is a factor and the second has 5 as common divisor with 85.
Take N = 85 and a = 33. The powers of 33 mod 85 are 33, 69, 67, 1, 33, 69, 67, 1, ...; in particular 33⁴ = 1 mod 85. We deduce that 33² + 1 = 70 and 33² – 1 = 68 both have common factors with 85. In fact the first yields 5 and the second 17.
Note that φ(85) = 68, so 68 would always work; but this number cannot be calculated a priori: you have to know the prime factorization 85 = 17 x 5, and use the rule φ(pq) = (p–1)(q–1) for p and q prime.

How do we get our hands on b? Just examining the terms of the sequence and waiting for 1 to show up takes too long, because the length of the list is commensurate with the number N we want to factor: it increases exponentially with the number, say n, of bits used to write N. But think of the stream of numbers a mod N, a² mod N, a³ mod N, ... as the light emitted by N. If we can find the frequency, or equivalently the period, with which this sequence repeats itself, we can use the equivalence a^j = a^k <=> a^j–k = 1 (modulo N), and find a factorization of N as above. We will see that on a quantum computer, this computation requires a number of steps increasing only as a polynomial in n. That is the key to quantum factorization.

Discrete Fourier Transforms

Fourier transforms detect periodicity; we will find b using a quantum adaptation of the Fast Fourier Transform, a speeded-up version of the Discrete Fourier Transform, which itself arises from adapting to sequences the Fourier Series designed to work with continuous functions.

The (complex) Fourier coefficients of a real-valued function f(x) defined on [0,2A] are

Notes:

Writing e^{i nπx/A} as cos nπx/A + i sin nπx/A and c_n = a_n + i b_n gives the usual sine and cosine coefficients, except that c₀ is twice the usual a₀.
The factor 1/ allows the indentity , which will be important in our later quantum computations.

Suppose now we think of a sequence f = f₀, f₁, ... , f_2A–1 (A now an integer), as the set of values at x = 0, x = 1, x = 2, ... , x = 2A–1 of a function f(x) defined on the interval [0,2A], and that we define a new c_n by replacing the Fourier integral with a left hand sum with 2A equal subdivisions of length 1. This sum only involves the elements of f:

. Notes:

e^{i (2A+k)mπ/A} = e^{i 2mπ} e^{i kmπ/A} = e^{i kmπ/A}, so coefficients indexed 2A and higher give no extra information.
The sequence c of coefficients c₀, ... , c_2A–1 is the Discrete Fourier Transform of the sequence f.
Essentially the same calculation retrieves f from c: (note the minus sign in the exponents).

Transform of a periodic sequence

Suppose that f has period p, i.e. f_m+p = f_m for every value of m; the simplest case to analyze is when p is a divisor of 2A, say p = 2A/k, k an integer. In that case, if n is not a multiple of k (green boxes in the examples below) the periodic reappearances of a sequence item f_m are multiplied by coefficients which cycle through a complete set of roots of 1, and thereby add up to zero. The explicit formula is:

So the only chance for a non-zero Fourier coefficient is when n is a multiple of k, (and then, up to a factor, c₀, c_k, c_2k, ... , c_(p–1)k are the 0th, 1st, 2nd, ... , (p–1)-st Fourier coefficients of the sequence f restricted to a single period).

Fig. 1. Discrete Fourier Transform analysis of the sequence given by the first 16 powers of 19 (modulo 85), a sequence with period 8 and frequency 16/8 = 2. When n is a multiple of 2 (e.g. orange boxes) every re-occurrence of a sequence item gets the same coefficient, and c_n has a chance of being nonzero. Otherwise (e.g. green boxes) it is zero. The complex numbers in the matrix are graphically represented as unit vectors. Each c_n is the weighted vector sum of the entries in the column above it, weights coming from the f_k column on the right, with an overall factor of 1/ = 1/4. The c_n have been uniformly scaled to fit in the picture.

Fig. 2. Discrete Fourier Transform analysis of the sequence given by the first 16 powers of 33 (modulo 85). This sequence has period 4 and frequency 16/4=4. The non-zero c_n only occur for n a multiple of 4. Graphic conventions are the same as in Fig. 1.

Why powers of 2?

I have chosen for illustration a number (85) which generates power residue sequences with periods which are powers of 2, and I have chosen to analyze a length (16) of sequence which is also a power of 2. The second choice is not immaterial: the Fast Fourier Transform we will use, and its quantum adaptation, both require it. The first choice is only for covenience in generating a small example. Shor shows that to get a reliable read on the power residue period in general, in the problem of factoring N, one must analyze sequences of a length 2ⁿ between N and N². So to analyze a number like 85 realistically, we would have to work with sequences of length 128.

The Fast Fourier Transform

The Fast Fourier Transform which adapts directly to quantum computation is the Radix-2 Cooley-Tukey algorithm, invented in 1965 (reinvented, actually, since it later turned out to have been known to Gauss). To control the width of this presentation, I am going to work with the order-8 Discrete Fourier Transform, rather than order-16 as above. Taking ω = e^iπ/4= 2^–1/2(1 + i) as our primitive 8th root of 1, the 0-7th powers of ω are

1, e^iπ/4, e^iπ/2, e^3iπ/4, e^iπ, e^5iπ/4, e^3iπ/2, e^7iπ/4,

1, ω, i, iω, –1, –ω, –i, –iω,

the numbers represented graphically above as

, , , , , , , .

The transform c of an arbitrary length-8 sequence f = f₀, f₁, ..., f₇, given by

, would be produced from the array


1	1	1	1	1	1	1	1	f₀
1	ω	i	iω	–1	–ω	–i	–iω	f₁
1	i	–1	–i	1	i	–1	–i	f₂
1	iω	–i	ω	–1	–iω	i	–ω	f₃
1	–1	1	–1	1	–1	1	–1	f₄
1	–ω	i	–iω	–1	ω	–i	iω	f₅
1	–i	–1	i	1	–i	1	i	f₆
1	–iω	–i	–ω	–1	iω	i	ω	f₇
c₀	c₁	c₂	c₃	c₄	c₅	c₆	c₇

by setting the last row to be f₀ times the first plus ... plus f₇ times the eighth, the whole sum divided by . This calculation would require eight multiplications and seven additions for each item in c, or 120 operations in all. Working with a sequence of length 2ⁿ, the number of operations would be 2ⁿ (2ⁿ⁺¹ – 1).

Our Fast Fourier Transform leads in n steps from the elements f₀, ... f_2ⁿ–1 of the input sequence to the coefficients c₀, ... c_2ⁿ–1 of its Discrete Fourier Transform. We illustrate the process here for n = 3. Each column represents a successive step of the calculation.

Line
0
1
2
3
4
5
6
7

Start
f₇
f₆
f₅
f₄
f₃
f₂
f₁
f₀

Step 1
f₃+f₇
f₂+f₆
f₁+f₅
f₀+f₄
f₃–f₇
f₂–f₆
f₁–f₅
f₀–f₄

Step 2
f₁+f₅ + f₃+f₇
f₀+f₄ + f₂+f₆
f₁+f₅ – f₃–f₇
f₀+f₄ – f₂–f₆
f₁–f₅ + i(f₃–f₇)
f₀–f₄ + i(f₂–f₆)
f₁–f₅ – i(f₃–f₇)
f₀–f₄ – i(f₂–f₆)

Step 3
[f₀+f₄ + f₂+f₆] + [f₁+f₅ + f₃+f₇]
[f₀+f₄ + f₂+f₆] – [f₁+f₅ + f₃+f₇]
[f₀+f₄ – f₂–f₆] + i[f₁+f₅ – f₃–f₇]
[f₀+f₄ – f₂–f₆] – i[f₁+f₅ – f₃–f₇]
[f₀–f₄ + i(f₂–f₆)] + ω[f₁–f₅ + i(f₃–f₇)]
[f₀–f₄ + i(f₂–f₆)] – ω[f₁–f₅ + i(f₃–f₇)]
[f₀–f₄ – i(f₂–f₆)] + iω[f₁–f₅ – i(f₃–f₇)]
[f₀–f₄ – i(f₂–f₆)] – i ω[f₁–f₅ – i(f₃–f₇)]

c₀
=

c₄
=

c₂
=

c₆
=

c₁
=

c₅
=

c₃
=

c₇

Notes:

In this presentation of the algorithm the elements of the sequence f are ordered backwards and the elements of the transformed sequence c appear in "bit reversed" order: c_n appears on line k where (in binary) k is n written backwards; e.g. c₆ (110) appears on line 3 (011).
Working with a sequence of length 2ⁿ the computation table has 2ⁿ rows; in each row are n columns requiring computation; each entry is calculated as a linear combination of two entries from the previous column, and so requires two multiplications and a sum; altogether 3n operations per row for a total of 3n2ⁿ operations. Contrasting this with the 2ⁿ (2ⁿ⁺¹ – 1) operations to implement the Discrete Fourier Transform shows that the Fast Fourier Transform is fast indeed.

Why does this work?

The most transparent explanation I know uses properties of polynomial long division. Here I will specalize to the example at hand.

For a sequence f of length 8, the expression for the nth Fourier coefficient can be read as the value at e^inπ/4 of the polynomial f(x) = f₀ + f₁ x + ... + f₇ x⁷: .
If a polynomial p(x) is divided by the monic, degree-1 polynomial x–a, the remainder is exactly the number p(a).
So c_n is (1/) times the remainder when f(x) is divided by (x– e^inπ/4). The shift in focus from evaluation of a polynomial to calculation of remainders is the key to the economies of the Fast Fourier Transform.
The remainder when f(x) is divided by x⁴– 1 is (f₃ + f₇)x³ + (f₂ + f₆)x² + (f₁ + f₅)x + (f₀ + f₄), and its remainder upon division by x⁴+ 1 is (f₃ – f₇)x³ + (f₂ – f₆)x² + (f₁ – f₅)x + (f₀ – f₄). And in general, if a_2n–1x^2n–1 + ... + a₀ is divided by xⁿ – c, the remainder is a_n–1x^n–1 + ... + a₀ + c (a_2n–1x^n–1 + ... + a_n). This is what makes calculation of remainders especially simple when the overall degree is a power of 2.
Since x⁴ – 1 = (x² – 1)(x² + 1) = (x – 1)(x + 1)(x – i)(x + i), it follows that f(x) and p(x) = (f₃ + f₇)x³ + (f₂ + f₆)x² + (f₁ + f₅)x + (f₀ + f₄) have the same remainder when divided by any one of the factors (x – 1), (x + 1), (x – i) or (x + i).
- Because if f(x) = (x⁴ – 1) q(x) + p(x) and for example p(x) = (x – i)q'(x) + a, then f(x) = [(x – 1)(x + 1) (x – i)(x + i)]q(x) + (x – i)q'(x) + a also has remainder a when divided by (x – i).
Similarly, since x⁴ + 1 = (x² – i)(x² + i) = (x – ω)(x + ω) (x – iω)(x + iω), the degree-3 polynomial (f₃ – f₇)x³ + (f₂ – f₆)x² + (f₁ – f₅)x + (f₀ – f₄) can be used, via its remainders, to calculate f(ω), etc.
Returning to the Fast Fourier Transform calculation
we can understand Step 1 as transforming the degree-7 polynomial f into two degree-3 polynomials (red boxes) each of which can produce, via its remainders, half of the c_n. Then Step 2 transforms each of the degree-3 polynomials into two degree-1 polynomials (green boxes), each of which can produce a quarter of the c_n, and Step 3 transforms each of the degree-1 polynomials into two degree-0 polynomials, which are their own remainders.

Continued in next month's Feature Column.

References

For the Fast Fourier Transform, I have followed Henry Laufer's text Discrete Mathematics and Applied Modern Algebra, Prindle, Weber & Schmidt, Boston 1984.

The Wikipedia entry on Fast Fourier Transform gives an alternative explanation; it also provides references to Cooley and Tukey's 1965 paper as well as to the spot in Gauss' Nachlass where he develops the technique, both items in its extensive bibliography on the ancient and recent history of the algorithm.

References on the "Quantum Fourier Transform" and on Shor's Algorithm:

D. Coppersmith, An Approximate Fourier Transform Useful in Quantum Factoring, IBM Research Report 07/12/94, available as arXiv:quant-ph/0201067.

A. Ekert and R. Jozsa, Quantum computation and Shor's factoring algorithm, Reviews of Modern Physics 68 (1996) 733-753

Peter W. Shor, Algorithms for Quantum Computation: In: Proceedings, 35th Annual Symposium on Foundations of Computer Science, Santa Fe, NM, November 20--22, 1994, IEEE Computer Society Press, pp. 124--134. An expanded version is available, under the title
Polynomial-Time Algorithms for Prime Factorization and Discrete Logarithms on a Quantum Computer, as arXiv:quant-ph/9508027.