next up previous
Next: Phase Transition Up: Unstructured Problems Previous: Unstructured Problems

Theory

For this class of problems, the phase transition behavior is illustrated in Fig. 3. Specifically, this shows the cost to solve the problem with a simple chronological backtrack search. The cost is given in terms of the number of search nodes considered until a solution is found. The minimum cost, for a search that proceeds directly to a solution with no backtrack is tex2html_wrap2970 . The parameter distinguishing underconstrained from overconstrained problems is the ratio tex2html_wrap_inline2856 of the number of nogoods m at level 2 given by the constraints to the number of items N.

  figure888
Figure 3:  The solid curves show the classical backtrack search cost for randomly generated problems with a prespecified solution as a function of beta=m/N for N=10 (gray) and 20 (black) and L=N/2. Here m is the number of nogoods selected at level 2 of the search lattice. The cost is the average number of backtrack steps, starting from the empty set, required to find the first solution to the problem, averaged over 1000 problems. The error bars indicate the standard deviation of this estimate of the average value, and in most cases are smaller than the size of the plotted points. For comparison, the dashed curves show the probability for having a solution in randomly generated problems, ranging from 1 at the left to 0 at the right.

Even for these relatively small problems, a peak in the average search cost is evident. Moreover, this peak is near the transition region where random problemsgif change from mostly soluble to mostly insoluble. A simple, but approximate, theoretical value for the location of the transition is given by the point where the expected number of solutions is equal to one [50, 53]. Applying this to the class of problems considered here is straightforward. Specifically, there are tex2html_wrap2692 complete sets for the problem, as given by Eq. 1. A particular set s of size L will be good, i.e., a solution, only if none of the nogoods selected for the problem are a subset of s. Hence the probability it will be a solution is given by

equation902

because there are tex2html_wrap2972 sets of size 2 from which to choose the m nogoods specified directly by the constraints. The average number of solutions is then just tex2html_wrap2973 . If we set tex2html_wrap2974 and tex2html_wrap2693 , for large N this becomes

equation920

where tex2html_wrap2976 . The predicted transition pointgif is then given by

equation931

which is tex2html_wrap2977 for the case considered here (i.e., tex2html_wrap2336 ). This closely matches the location of the peak in the search cost for problems with prespecified solution, as shown in Fig. 3, but is about 20% larger than the location of the step in solubilitygif. Furthermore, the theory predicts there is a regime of polynomial average cost for sufficiently few constraints [28]. This is determined by the condition that the expected number of goods at each level in the lattice is monotonically increasing. Repeating the above argument for smaller levels in the lattice, we find that this condition holds up to

equation940

which is tex2html_wrap2979 for tex2html_wrap2336 .

While these estimates are only approximate, they do indicate that the class of random soluble problems defined here behaves qualitatively and quantitatively the same with respect to the transition behavior as a variety of other, perhaps more realistic, problem classes. This close correspondence with the theory (derived for the limit of large problems), suggests that we are observing the correct transition behavior even with these relatively small problems. Moreover the above approximate theoretical argument suggests that the average cost of general classical search methods scales exponentially with the size of the problem over the full range of tex2html_wrap2981 . Thus this provides a good test case for the average behavior of the quantum algorithm. As a final observation, it is important to obtain a sufficient number of samples, especially near the transition region. This is because there is considerable variation in problems near the transition, specifically a highly skewed distribution in the number of solutions. In this region, most problems have few solutions but a few have extremely many: enough in fact to give a substantial contribution to the average number of solutions even though such problems are quite rare.


next up previous
Next: Phase Transition Up: Unstructured Problems Previous: Unstructured Problems

Tad Hogg
Wed Mar 20 16:29:17 PST 1996