Misplaced Pages

Shanks's square forms factorization

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
This article includes a list of general references, but it lacks sufficient corresponding inline citations. Please help to improve this article by introducing more precise citations. (March 2015) (Learn how and when to remove this message)

Shanks' square forms factorization is a method for integer factorization devised by Daniel Shanks as an improvement on Fermat's factorization method.

The success of Fermat's method depends on finding integers x {\displaystyle x} and y {\displaystyle y} such that x 2 y 2 = N {\displaystyle x^{2}-y^{2}=N} , where N {\displaystyle N} is the integer to be factored. An improvement (noticed by Kraitchik) is to look for integers x {\displaystyle x} and y {\displaystyle y} such that x 2 y 2 ( mod N ) {\displaystyle x^{2}\equiv y^{2}{\pmod {N}}} . Finding a suitable pair ( x , y ) {\displaystyle (x,y)} does not guarantee a factorization of N {\displaystyle N} , but it implies that N {\displaystyle N} is a factor of x 2 y 2 = ( x y ) ( x + y ) {\displaystyle x^{2}-y^{2}=(x-y)(x+y)} , and there is a good chance that the prime divisors of N {\displaystyle N} are distributed between these two factors, so that calculation of the greatest common divisor of N {\displaystyle N} and x y {\displaystyle x-y} will give a non-trivial factor of N {\displaystyle N} .

A practical algorithm for finding pairs ( x , y ) {\displaystyle (x,y)} which satisfy x 2 y 2 ( mod N ) {\displaystyle x^{2}\equiv y^{2}{\pmod {N}}} was developed by Shanks, who named it Square Forms Factorization or SQUFOF. The algorithm can be expressed in terms of continued fractions or in terms of quadratic forms. Although there are now much more efficient factorization methods available, SQUFOF has the advantage that it is small enough to be implemented on a programmable calculator. Shanks programmed it on an HP-65, made in 1974, which has storage for only nine digit numbers and allows only 100 steps/keystrokes of programming. There are versions of the algorithm that use little memory and versions that store a list of values that run more quickly.

In 1858, the Czech mathematician Václav Šimerka used a method similar to SQUFOF to factor ( 10 17 1 ) / 9 {\displaystyle (10^{17}-1)/9} = {\displaystyle =} 11111111111111111 {\displaystyle 11111111111111111} = {\displaystyle =} 2071723 5363222357 {\displaystyle 2071723\cdot 5363222357} .

Algorithm

Note This version of the algorithm works on some examples but often gets stuck in a loop.

This version does not use a list.

Input: N {\displaystyle N} , the integer to be factored, which must be neither a prime number nor a perfect square, and a small positive integer, k {\displaystyle k} .

Output: a non-trivial factor of N {\displaystyle N} .

The algorithm:

Initialize i = 0 , P 0 = k N , Q 1 = 1 , Q 0 = k N P 0 2 . {\displaystyle i=0,P_{0}=\lfloor {\sqrt {kN}}\rfloor ,Q_{-1}=1,Q_{0}=kN-P_{0}^{2}.}

Repeat

i = i + 1 , b i = P 0 + P i 1 Q i 1 , P i = b i Q i 1 P i 1 , Q i = Q i 2 + b i ( P i 1 P i ) {\displaystyle i=i+1,b_{i}=\left\lfloor {\frac {P_{0}+P_{i-1}}{Q_{i-1}}}\right\rfloor ,P_{i}=b_{i}Q_{i-1}-P_{i-1},Q_{i}=Q_{i-2}+b_{i}(P_{i-1}-P_{i})}

until Q i {\displaystyle Q_{i}} is a perfect square at some odd value of i {\displaystyle i} .

Start the second phase (reverse cycle).

Initialize b 0 = P 0 P i Q i {\displaystyle b_{0}=\left\lfloor {\frac {P_{0}-P_{i}}{\sqrt {Q_{i}}}}\right\rfloor } , Q 1 = Q i {\displaystyle Q_{-1}={\sqrt {Q_{i}}}} , and P 0 = b 0 Q i + P i {\displaystyle P_{0}=b_{0}{\sqrt {Q_{i}}}+P_{i}} , where P 0 , P i {\displaystyle P_{0},P_{i}} , and Q i {\displaystyle Q_{i}} are from the previous phase. The b 0 {\displaystyle b_{0}} used in the calculation of P 0 {\displaystyle P_{0}} is the recently calculated value of b 0 {\displaystyle b_{0}} .

Set i = 0 {\displaystyle i=0} and Q 0 = k N P 0 2 Q 1 {\displaystyle Q_{0}={\frac {kN-P_{0}^{2}}{Q_{-1}}}} , where P 0 {\displaystyle P_{0}} is the recently calculated value of P 0 {\displaystyle P_{0}} .

Repeat

i = i + 1 , b i = P 0 + P i 1 Q i 1 , P i = b i Q i 1 P i 1 , Q i = Q i 2 + b i ( P i 1 P i ) {\displaystyle i=i+1,b_{i}=\left\lfloor {\frac {P_{0}+P_{i-1}}{Q_{i-1}}}\right\rfloor ,P_{i}=b_{i}Q_{i-1}-P_{i-1},Q_{i}=Q_{i-2}+b_{i}(P_{i-1}-P_{i})}

until P i = P i 1 . {\displaystyle P_{i}=P_{i-1}.}

Then if f = gcd ( N , P i ) {\displaystyle f=\gcd(N,P_{i})} is not equal to 1 {\displaystyle 1} and not equal to N {\displaystyle N} , then f {\displaystyle f} is a non-trivial factor of N {\displaystyle N} . Otherwise try another value of k {\displaystyle k} .

Shanks' method has time complexity O ( N 4 ) {\displaystyle O({\sqrt{N}})} .

Stephen S. McMath wrote a more detailed discussion of the mathematics of Shanks' method, together with a proof of its correctness.

Example

Let N = 11111 {\displaystyle N=11111}

Q 1 = 1 {\displaystyle Q_{-1}=1}

Cycle forward
i {\displaystyle i} b i {\displaystyle b_{i}} P i {\displaystyle P_{i}} Q i {\displaystyle Q_{i}}
0 {\displaystyle 0} {\displaystyle } 105 {\displaystyle 105} 86 {\displaystyle 86}
1 {\displaystyle 1} 2 {\displaystyle 2} 67 {\displaystyle 67} 77 {\displaystyle 77}
2 {\displaystyle 2} 2 {\displaystyle 2} 87 {\displaystyle 87} 46 {\displaystyle 46}
3 {\displaystyle 3} 4 {\displaystyle 4} 97 {\displaystyle 97} 37 {\displaystyle 37}
4 {\displaystyle 4} 5 {\displaystyle 5} 88 {\displaystyle 88} 91 {\displaystyle 91}
5 {\displaystyle 5} 2 {\displaystyle 2} 94 {\displaystyle 94} 25 {\displaystyle 25}

Here Q 5 = 25 {\displaystyle Q_{5}=25} is a perfect square, so the first phase ends.

For the second phase, set Q 1 = 25 = 5 {\displaystyle Q_{-1}={\sqrt {25}}=5} . Then:

Reverse cycle
i {\displaystyle i} b i {\displaystyle b_{i}} P i {\displaystyle P_{i}} Q i {\displaystyle Q_{i}}
0 {\displaystyle 0} 2 {\displaystyle 2} 104 {\displaystyle 104} 59 {\displaystyle 59}
1 {\displaystyle 1} 3 {\displaystyle 3} 73 {\displaystyle 73} 98 {\displaystyle 98}
2 {\displaystyle 2} 1 {\displaystyle 1} 25 {\displaystyle 25} 107 {\displaystyle 107}
3 {\displaystyle 3} 1 {\displaystyle 1} 82 {\displaystyle 82} 41 {\displaystyle 41}
4 {\displaystyle 4} 4 {\displaystyle 4} 82 {\displaystyle 82} {\displaystyle }

Here P 3 = P 4 = 82 {\displaystyle P_{3}=P_{4}=82} , so the second phase ends. Now calculate g c d ( 11111 , 82 ) = 41 {\displaystyle gcd(11111,82)=41} , which is a factor of 11111 {\displaystyle 11111} .

Thus, N = 11111 = 41 271 {\displaystyle N=11111=41\cdot 271} .

Example implementation

Below is an example of C function for performing SQUFOF factorization on unsigned integer not larger than 64 bits, without overflow of the transient operations.

#include <inttypes.h>
#define nelems(x) (sizeof(x) / sizeof((x)))
const int multiplier = {1, 3, 5, 7, 11, 3*5, 3*7, 3*11, 5*7, 5*11, 7*11, 3*5*7, 3*5*11, 3*7*11, 5*7*11, 3*5*7*11};
uint64_t SQUFOF( uint64_t N )
{
    uint64_t D, Po, P, Pprev, Q, Qprev, q, b, r, s;
    uint32_t L, B, i;
    s = (uint64_t)(sqrtl(N)+0.5);
    if (s*s == N) return s;
    for (int k = 0; k < nelems(multiplier) && N <= UINT64_MAX/multiplier; k++) {
        D = multiplier*N;
        Po = Pprev = P = sqrtl(D);
        Qprev = 1;
        Q = D - Po*Po;
        L = 2 * sqrtl( 2*s );
        B = 3 * L;
        for (i = 2 ; i < B ; i++) {
            b = (uint64_t)((Po + P)/Q);
            P = b*Q - P;
            q = Q;
            Q = Qprev + b*(Pprev - P);
            r = (uint64_t)(sqrtl(Q)+0.5);
            if (!(i & 1) && r*r == Q) break;
            Qprev = q;
            Pprev = P;
        };
        if (i >= B) continue;
        b = (uint64_t)((Po - P)/r);
        Pprev = P = b*r + P;
        Qprev = r;
        Q = (D - Pprev*Pprev)/Qprev;
        i = 0;
        do {
            b = (uint64_t)((Po + P)/Q);
            Pprev = P;
            P = b*Q - P;
            q = Q;
            Q = Qprev + b*(Pprev - P);
            Qprev = q;
            i++;
        } while (P != Pprev);
        r = gcd(N, Qprev);
        if (r != 1 && r != N) return r;
    }
    return 0;
}

References

  1. Lemmermeyer, F. (2013). "Václav Šimerka: quadratic forms and factorization". LMS Journal of Computation and Mathematics. 16: 118–129. doi:10.1112/S1461157013000065.
  2. (Riesel 1994:189)
  3. "Daniel Shanks' Square Forms Factorization". 2004. CiteSeerX 10.1.1.107.9984.

External links

Number-theoretic algorithms
Primality tests
Prime-generating
Integer factorization
Multiplication
Euclidean division
Discrete logarithm
Greatest common divisor
Modular square root
Other algorithms
  • Italics indicate that algorithm is for numbers of special forms
Category: