
Construction of hard-core predicates


It seems much harder to find examples of hard-core predicates than of one-way functions. The latter can be obtained from discrete logarithms, subset-sum problems, and other sources. It is difficult to prove that a property cannot be guessed with probability much larger than $1/2$, which makes the following result of Goldreich and Levin17 interesting.

Theorem 33   Let $f$ be a one-way function which maps sequences of bits of length $n$ to sequences of length $n$. For each $S\subset\{1,\dots, n\}$ and each $n$-bit $x$, define $B(S,x)$ to be true if the number of 1's among the bits of $x$ in the positions specified by $S$ is even. There is no efficient algorithm which can guess $B(S,x)$, given $S$ and $f(x)$, with probability much greater than $1/2$.
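
As a concrete illustration (the code and names below are mine, not part of the original notes), $B(S,x)$ is simply the parity of the bits of $x$ selected by $S$; a minimal Python sketch, using 0-based positions:

\begin{verbatim}
def B(S, x):
    """Theorem 33's predicate: True iff the bits of x in the positions
    listed in S contain an even number of 1's (x is a list of bits,
    S a collection of 0-based positions)."""
    return sum(x[i] for i in S) % 2 == 0

# x = 1011: positions {0, 2} select bits 1 and 1 (even sum),
# positions {0, 1} select bits 1 and 0 (odd sum).
assert B({0, 2}, [1, 0, 1, 1]) is True
assert B({0, 1}, [1, 0, 1, 1]) is False
\end{verbatim}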

This result does not rule out the possibility that, for some specific $S$, it may be possible to guess $B(S,x)$ efficiently. However, it does rule out, for example, guessing $B(S,x)$ with probability $.6$ for $10\%$ of all possible $S$ and with probability $.5$ for the remaining $90\%$: that would make us correct about $B(S,x)$ overall with probability $.51$.
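Spelled out, the overall success probability is the weighted average

\begin{displaymath}0.10\times 0.6+0.90\times 0.5=0.06+0.45=0.51.\end{displaymath}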

To prove this, we shall consider the following scenario: We are trying to determine an unknown $x\in \{0,1\}^{n}$. We have an oracle $A(S)$ which is supposed to tell us $B(S,x)$. The oracle may lie, but must tell the truth more than half the time. Theorem 34 says we can efficiently enumerate a not-too-large set which probably includes $x$.

Theorem 34   Suppose that $A(S)=B(S,x)$ for at least $1/2+\epsilon$ of all $S\subset\{1,\dots, n\}$. We can enumerate a set $U\subset \{0,1\}^{n}$, in time polynomial in $n$ and $\epsilon^{-1}$, such that $x\in U$ with probability close to 1.

To deduce Theorem 33 from Theorem 34, suppose we had an efficient algorithm for guessing $B(S,x)$ from $S$ and $f(x)$; with $f(x)$ fixed, running it on each $S$ gives an oracle $A(S)$ of the kind required by Theorem 34. We obtain $U$, which cannot be too large given the time bound. Evaluating $f$ at every element of $U$ and keeping those elements mapped to $f(x)$ identifies the correct $x$ with high probability, which would mean $f$ is not a one-way function.
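
A minimal sketch of this last step, assuming a candidate list U as produced by Theorem 34 (the names f, y, and U are placeholders, not from the notes):

\begin{verbatim}
def invert(f, y, U):
    """Given y = f(x) and a candidate list U that contains x with high
    probability, return a preimage of y (with high probability, x itself)."""
    for candidate in U:
        if f(candidate) == y:
            return candidate
    return None  # U missed x; this happens only with small probability
\end{verbatim}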

The construction of $U$ proceeds in stages. At stage $k$ ($1\le k\le n$), we enumerate a set $U_k\subset\{0,1\}^{k}$ which includes (with probability close to 1) the first $k$ bits of $x$.

We will consider $k$ fixed for the rest of this section. Define $L=\{1,\ldots, k\}$ and $R=\{k+1,\ldots, n\}$. If $\alpha$ and $\beta$ are true or false, $\alpha==\beta$ is defined to be 1 if $\alpha$ and $\beta$ are both true or both false, and 0 otherwise. If $h$ is defined for all $S\subset L$, we will use $\mathop{\rm Avg}_{S}h(S)$ to represent the average value of $h$, in other words $2^{-k}\sum_Sh(S)$, with similar definitions for other collections of $S$. The hypothesis of Theorem 34 can be written as

\begin{displaymath}\mathop{\rm Avg}_{S\subset\{1,\ldots, n\}}\Bigl(A(S)==B(S,x)\Bigr)\ge1/2+\epsilon\end{displaymath}

Let $v\in \{0,1\}^{k}$. If there is a $w\in \{0,1\}^{n-k}$ with $x=v\circ w$ [i.e., $v$ is the first $k$ bits of $x$], then

\begin{displaymath}1/2+\epsilon\le\mathop{\rm Avg}_{D\subset R}\left(\mathop{\rm Avg}_{C\subset L}\Bigl(A(C\cup D)==B(C\cup D,v\circ w)\Bigr)\right)\end{displaymath}


\begin{displaymath}{\rm Define}\quad T(v,D)=\mathop{\rm Avg}_{C\subset L}\Bigl(A(C\cup D)==B(C,v)\Bigr)\end{displaymath}

Recall that $B(S,x)$ is true if and only if the sum of the bits of $x$ corresponding to $S$ is even; in particular, $B(C\cup D,v\circ w)$ is true exactly when $B(C,v)$ and $B(D,w)$ are both true or both false. This implies

\begin{displaymath}\mathop{\rm Avg}_{C\subset L}\Bigl(A(C\cup D)==B(C\cup D,v\circ w)\Bigr)=\left\{\begin{array}{ll}T(v,D)&{\rm if\ }B(D,w){\rm\ is\ true}\\ 1-T(v,D)&{\rm if\ }B(D,w){\rm\ is\ false}\end{array}\right.\end{displaymath}

Thus, if $x=v\circ w$, then (since in either case the inner average is at most $1/2+\vert T(v,D)-1/2\vert$)
\begin{displaymath}
\mathop{\rm Avg}_{D\subset R}\Bigl(\vert T(v,D)-1/2\vert\Bigr)\ge\epsilon
\end{displaymath} (5)

Testing whether a given $v$ satisfies (5) exactly would require looking at all possible $C$ and $D$, which would take too much time. However, as in section 7.3, we can take not-too-large random samples from the possible $C$ and $D$, and with high probability correctly decide whether $v$ satisfies (5). Let $N$ be the number of different $D$ used in the sampling.
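
A hedged sketch of this sampling test follows; the oracle A, the sample sizes, and the slack $\epsilon/2$ are illustrative assumptions on my part, chosen in the spirit of section 7.3 rather than fixed by the notes.

\begin{verbatim}
import random

def B(S, x):
    """Parity predicate: True iff the bits of x at the positions in S
    have an even sum."""
    return sum(x[i] for i in S) % 2 == 0

def random_subset(positions):
    """A uniformly random subset of the given positions."""
    return {i for i in positions if random.random() < 0.5}

def estimate_T(A, v, D, L, num_C=200):
    """Monte-Carlo estimate of T(v,D) = Avg over C in L of (A(C u D) == B(C,v))."""
    hits = 0
    for _ in range(num_C):
        C = random_subset(L)
        if A(C | D) == B(C, v):
            hits += 1
    return hits / num_C

def passes_test(A, v, L, R, eps, num_D=50, num_C=200):
    """Estimate Avg over D in R of |T(v,D) - 1/2| from N = num_D random D,
    and accept v if the estimate is at least eps/2 (the slack allows for
    sampling error)."""
    total = sum(abs(estimate_T(A, v, random_subset(R), L, num_C) - 0.5)
                for _ in range(num_D))
    return total / num_D >= eps / 2
\end{verbatim}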

At stage $k-1$, we have obtained $U_{k-1}$, which includes the first $k-1$ bits of $x$ with high probability. To create $U_k$, we take each member of $U_{k-1}$, append 0 and 1 to it, and use sampling to identify which of the resulting $k$-bit strings satisfy (5).

This process would take too much time if the number of strings doubled at each step. To complete the proof, we use Lemma 35 to show that, for each fixed $D$, the number of $v$ with $\vert T(v,D)-1/2\vert\ge\epsilon$ is not too large: since each such $v$ contributes at least $\epsilon^2$ to a sum which equals $1/4$, there are at most $(2\epsilon)^{-2}$ such $v$. In order to be included in $U_k$, $v$ must satisfy $\vert T(v,D)-1/2\vert\ge\epsilon$ for at least one of the $N$ sets $D$ used in the sample, which gives a bound of $N(2\epsilon)^{-2}$ on $\vert U_k\vert$.
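
Putting the stages together, here is a rough sketch of the whole enumeration (the names are mine; keep stands for whatever sampling test for (5) is used, for instance passes_test from the previous sketch with A and $\epsilon$ fixed):

\begin{verbatim}
def enumerate_candidates(n, keep):
    """Stage-by-stage construction of U from Theorem 34.  At stage k each
    surviving (k-1)-bit prefix is extended by 0 and by 1, and only the
    extensions passing the sampling test keep(v, L, R) for (5) survive."""
    U = [[]]                              # U_0: just the empty prefix
    for k in range(1, n + 1):
        L, R = range(k), range(k, n)      # known prefix / unknown suffix positions
        U = [prefix + [b]
             for prefix in U
             for b in (0, 1)
             if keep(prefix + [b], L, R)]
    return U                              # with high probability, x is in U
\end{verbatim}

For instance, combined with the earlier sketch one could call enumerate_candidates(n, lambda v, L, R: passes_test(A, v, L, R, eps)).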

Lemma 35   For any $D$, $\sum_v\bigl(T(v,D)-1/2\bigr)^2=1/4$, where the sum is over all $v\in \{0,1\}^{k}$.


Proof: Throughout the proof, $D$ is fixed and may be treated as a constant. We argue by induction on $k$. When $k=0$, the only $v$ is the empty string, $T(v,D)=\bigl(A(D)==B(\emptyset,v)\bigr)$ is either 0 or 1, and the sum is $1/4$. For the inductive step, define $L'=\{1,\ldots, k-1\}$. The function $A$ may be represented by a pair $A_1,A_2$ with

\begin{displaymath}A_1(C\cup D)=A(C\cup D)\qquad A_2(C\cup D)=A(C\cup\{k\}\cup D)\qquad{\rm for\ all\ }C\subset L'.\end{displaymath}

\begin{displaymath}{\rm Define}\quad T_i(v',D)=\mathop{\rm Avg}_{C\subset L'}\Bigl(A_i(C\cup D)==B(C,v')\Bigr)\quad{\rm for\ }v'\in\{0,1\}^{k-1}.\end{displaymath}

Splitting the average over $C\subset L$ according to whether $k\in C$, and noting that $B(C\cup\{k\},v)$ equals $B(C,v')$ if $v_k=0$ and its negation if $v_k=1$ (where $v'$ denotes the first $k-1$ bits of $v$), we obtain

\begin{displaymath}T(v,D)=\left\{\begin{array}{ll}\frac{1}{2}\Bigl(T_1(v',D)+T_2(v',D)\Bigr)&{\rm if\ }v_k=0\\[4pt]\frac{1}{2}\Bigl(T_1(v',D)+\bigl(1-T_2(v',D)\bigr)\Bigr)&{\rm if\ }v_k=1\end{array}\right.\end{displaymath}

We divide $\{0,1\}^{k}$ according to whether $v_k=0$ or 1 to obtain

\begin{eqnarray*}
\sum_v\bigl(T(v,D)-1/2\bigr)^2&=&\sum_{v_k=0}\left(\frac{T_1(v',D)+T_2(v',D)}{2}-\frac{1}{2}\right)^2+\sum_{v_k=1}\left(\frac{T_1(v',D)+1-T_2(v',D)}{2}-\frac{1}{2}\right)^2\\
&=&\sum_{v'}\frac{1}{4}\left[\Bigl(\bigl(T_1(v',D)-\frac{1}{2}\bigr)+\bigl(T_2(v',D)-\frac{1}{2}\bigr)\Bigr)^2+\Bigl(\bigl(T_1(v',D)-\frac{1}{2}\bigr)-\bigl(T_2(v',D)-\frac{1}{2}\bigr)\Bigr)^2\right]\\
&=&\sum_{v'}\frac{1}{2}\left[\bigl(T_1(v',D)-\frac{1}{2}\bigr)^2+\bigl(T_2(v',D)-\frac{1}{2}\bigr)^2\right]
=\frac{1}{2}\left[\frac{1}{4}+\frac{1}{4}\right]=\frac{1}{4},
\end{eqnarray*}

where the last equality applies the inductive hypothesis to $A_1$ and $A_2$.
\quad\mbox{\vrule height 8pt width 4pt}



Footnotes

17. O. Goldreich and L. Levin, ``A Hard-Core Predicate for all One-Way Functions,'' ACM Symposium on Theory of Computing (1989).
