Misplaced Pages

Chakravala method

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
(Redirected from Chakravala) Cyclic algorithm to solve indeterminate quadratic equations

The chakravala method (Sanskrit: चक्रवाल विधि) is a cyclic algorithm to solve indeterminate quadratic equations, including Pell's equation. It is commonly attributed to Bhāskara II, (c. 1114 – 1185 CE) although some attribute it to Jayadeva (c. 950 ~ 1000 CE). Jayadeva pointed out that Brahmagupta's approach to solving equations of this type could be generalized, and he then described this general method, which was later refined by Bhāskara II in his Bijaganita treatise. He called it the Chakravala method: chakra meaning "wheel" in Sanskrit, a reference to the cyclic nature of the algorithm. C.-O. Selenius held that no European performances at the time of Bhāskara, nor much later, exceeded its marvellous height of mathematical complexity.

This method is also known as the cyclic method and contains traces of mathematical induction.

History

Chakra in Sanskrit means cycle. As per popular legend, Chakravala indicates a mythical range of mountains which orbits around the Earth like a wall and not limited by light and darkness.

Brahmagupta in 628 CE studied indeterminate quadratic equations, including Pell's equation

x 2 = N y 2 + 1 , {\displaystyle \,x^{2}=Ny^{2}+1,}

for minimum integers x and y. Brahmagupta could solve it for several N, but not all.

Jayadeva and Bhaskara offered the first complete solution to the equation, using the chakravala method to find for x 2 = 61 y 2 + 1 , {\displaystyle \,x^{2}=61y^{2}+1,} the solution

x = 1766319049 , y = 226153980. {\displaystyle \,x=1766319049,y=226153980.}

This case was notorious for its difficulty, and was first solved in Europe by Brouncker in 1657–58 in response to a challenge by Fermat, using continued fractions. A method for the general problem was first completely described rigorously by Lagrange in 1766. Lagrange's method, however, requires the calculation of 21 successive convergents of the simple continued fraction for the square root of 61, while the chakravala method is much simpler. Selenius, in his assessment of the chakravala method, states

"The method represents a best approximation algorithm of minimal length that, owing to several minimization properties, with minimal effort and avoiding large numbers automatically produces the best solutions to the equation. The chakravala method anticipated the European methods by more than a thousand years. But no European performances in the whole field of algebra at a time much later than Bhaskara's, nay nearly equal up to our times, equalled the marvellous complexity and ingenuity of chakravala."

Hermann Hankel calls the chakravala method

"the finest thing achieved in the theory of numbers before Lagrange."

The method

From Brahmagupta's identity, we observe that for given N,

( x 1 x 2 + N y 1 y 2 ) 2 N ( x 1 y 2 + x 2 y 1 ) 2 = ( x 1 2 N y 1 2 ) ( x 2 2 N y 2 2 ) {\displaystyle (x_{1}x_{2}+Ny_{1}y_{2})^{2}-N(x_{1}y_{2}+x_{2}y_{1})^{2}=(x_{1}^{2}-Ny_{1}^{2})(x_{2}^{2}-Ny_{2}^{2})}

For the equation x 2 N y 2 = k {\displaystyle x^{2}-Ny^{2}=k} , this allows the "composition" (samāsa) of two solution triples ( x 1 , y 1 , k 1 ) {\displaystyle (x_{1},y_{1},k_{1})} and ( x 2 , y 2 , k 2 ) {\displaystyle (x_{2},y_{2},k_{2})} into a new triple

( x 1 x 2 + N y 1 y 2 , x 1 y 2 + x 2 y 1 , k 1 k 2 ) . {\displaystyle (x_{1}x_{2}+Ny_{1}y_{2}\,,\,x_{1}y_{2}+x_{2}y_{1}\,,\,k_{1}k_{2}).}

In the general method, the main idea is that any triple ( a , b , k ) {\displaystyle (a,b,k)} (that is, one which satisfies a 2 N b 2 = k {\displaystyle a^{2}-Nb^{2}=k} ) can be composed with the trivial triple ( m , 1 , m 2 N ) {\displaystyle (m,1,m^{2}-N)} to get the new triple ( a m + N b , a + b m , k ( m 2 N ) ) {\displaystyle (am+Nb,a+bm,k(m^{2}-N))} for any m. Assuming we started with a triple for which gcd ( a , b ) = 1 {\displaystyle \gcd(a,b)=1} , this can be scaled down by k (this is Bhaskara's lemma):

a 2 N b 2 = k ( a m + N b k ) 2 N ( a + b m k ) 2 = m 2 N k {\displaystyle a^{2}-Nb^{2}=k\Rightarrow \left({\frac {am+Nb}{k}}\right)^{2}-N\left({\frac {a+bm}{k}}\right)^{2}={\frac {m^{2}-N}{k}}}

Since the signs inside the squares do not matter, the following substitutions are possible:

a a m + N b | k | , b a + b m | k | , k m 2 N k {\displaystyle a\leftarrow {\frac {am+Nb}{|k|}},b\leftarrow {\frac {a+bm}{|k|}},k\leftarrow {\frac {m^{2}-N}{k}}}

When a positive integer m is chosen so that (a + bm)/k is an integer, so are the other two numbers in the triple. Among such m, the method chooses one that minimizes the absolute value of m − N and hence that of (m − N)/k. Then the substitution relations are applied for m equal to the chosen value. This results in a new triple (a, b, k). The process is repeated until a triple with k = 1 {\displaystyle k=1} is found. This method always terminates with a solution (proved by Lagrange in 1768). Optionally, we can stop when k is ±1, ±2, or ±4, as Brahmagupta's approach gives a solution for those cases.

Brahmagupta's composition method

In AD 628, Brahmagupta discovered a general way to find x {\displaystyle x} and y {\displaystyle y} of x 2 = N y 2 + 1 , {\displaystyle x^{2}=Ny^{2}+1,} when given a 2 = N b 2 + k {\displaystyle a^{2}=Nb^{2}+k} , when k is ±1, ±2, or ±4.

k = ±1

Using Brahmagupta's identity to compose the triple ( a , b , k ) {\displaystyle (a,b,k)} with itself:

( a 2 + N b 2 ) 2 N ( 2 a b ) 2 = k 2 {\displaystyle (a^{2}+Nb^{2})^{2}-N(2ab)^{2}=k^{2}} {\displaystyle \Rightarrow } ( 2 a 2 k ) 2 N ( 2 a b ) 2 = k 2 {\displaystyle (2a^{2}-k)^{2}-N(2ab)^{2}=k^{2}}

The new triple can be expressed as ( 2 a 2 k , 2 a b , k 2 ) {\displaystyle (2a^{2}-k,2ab,k^{2})} .

Substituting k = 1 {\displaystyle k=-1} gives a solution:

x = 2 a 2 + 1 , y = 2 a b {\displaystyle x=2a^{2}+1,y=2ab}

For k = 1 {\displaystyle k=1} , the original ( a , b ) {\displaystyle (a,b)} was already a solution. Substituting k = 1 {\displaystyle k=1} yields a second:

x = 2 a 2 1 , y = 2 a b {\displaystyle x=2a^{2}-1,y=2ab}

k = ±2

Again using the equation, ( 2 a 2 k ) 2 N ( 2 a b ) 2 = k 2 {\displaystyle (2a^{2}-k)^{2}-N(2ab)^{2}=k^{2}} {\displaystyle \Rightarrow } ( 2 a 2 k k ) 2 N ( 2 a b k ) 2 = 1 {\displaystyle \left({\frac {2a^{2}-k}{k}}\right)^{2}-N\left({\frac {2ab}{k}}\right)^{2}=1}

Substituting k = 2 {\displaystyle k=2} ,

x = a 2 1 , y = a b {\displaystyle x=a^{2}-1,y=ab}

Substituting k = 2 {\displaystyle k=-2} ,

x = a 2 + 1 , y = a b {\displaystyle x=a^{2}+1,y=ab}

k = 4

Substituting k = 4 {\displaystyle k=4} into the equation ( 2 a 2 4 4 ) 2 N ( 2 a b 4 ) 2 = 1 {\displaystyle ({\frac {2a^{2}-4}{4}})^{2}-N({\frac {2ab}{4}})^{2}=1} creates the triple ( a 2 2 2 , a b 2 , 1 ) {\displaystyle ({\frac {a^{2}-2}{2}},{\frac {ab}{2}},1)} .

Which is a solution if a {\displaystyle a} is even:

x = a 2 2 2 , y = a b 2 {\displaystyle x={\frac {a^{2}-2}{2}},y={\frac {ab}{2}}}

If a is odd, start with the equations ( a 2 ) 2 N ( b 2 ) 2 = 1 {\displaystyle ({\frac {a}{2}})^{2}-N({\frac {b}{2}})^{2}=1} and ( 2 a 2 4 4 ) 2 N ( 2 a b 4 ) 2 = 1 {\displaystyle ({\frac {2a^{2}-4}{4}})^{2}-N({\frac {2ab}{4}})^{2}=1} .

Leading to the triples ( a 2 , b 2 , 1 ) {\displaystyle ({\frac {a}{2}},{\frac {b}{2}},1)} and ( a 2 2 2 , a b 2 , 1 ) {\displaystyle ({\frac {a^{2}-2}{2}},{\frac {ab}{2}},1)} . Composing the triples gives ( a 2 ( a 2 3 ) ) 2 N ( b 2 ( a 2 1 ) ) 2 = 1 {\displaystyle ({\frac {a}{2}}(a^{2}-3))^{2}-N({\frac {b}{2}}(a^{2}-1))^{2}=1}

When a {\displaystyle a} is odd,

x = a 2 ( a 2 3 ) , y = b 2 ( a 2 1 ) {\displaystyle x={\frac {a}{2}}(a^{2}-3),y={\frac {b}{2}}(a^{2}-1)}

k = -4

When k = 4 {\displaystyle k=-4} , then ( a 2 ) 2 N ( b 2 ) 2 = 1 {\displaystyle ({\frac {a}{2}})^{2}-N({\frac {b}{2}})^{2}=-1} . Composing with itself yields ( a 2 + N b 2 4 ) 2 N ( a b 2 ) 2 = 1 {\displaystyle ({\frac {a^{2}+Nb^{2}}{4}})^{2}-N({\frac {ab}{2}})^{2}=1} {\displaystyle \Rightarrow } ( a 2 + 2 2 ) 2 N ( a b 2 ) 2 = 1 {\displaystyle ({\frac {a^{2}+2}{2}})^{2}-N({\frac {ab}{2}})^{2}=1} .

Again composing itself yields ( ( a 2 + 2 ) 2 + N a 2 b 2 ) 4 ) 2 N ( a b ( a 2 + 2 ) 2 ) 2 = 1 {\displaystyle ({\frac {(a^{2}+2)^{2}+Na^{2}b^{2})}{4}})^{2}-N({\frac {ab(a^{2}+2)}{2}})^{2}=1} {\displaystyle \Rightarrow } ( a 4 + 4 a 2 + 2 2 ) 2 N ( a b ( a 2 + 2 ) 2 ) 2 = 1 {\displaystyle ({\frac {a^{4}+4a^{2}+2}{2}})^{2}-N({\frac {ab(a^{2}+2)}{2}})^{2}=1}

Finally, from the earlier equations, compose the triples ( a 2 + 2 2 , a b 2 , 1 ) {\displaystyle ({\frac {a^{2}+2}{2}},{\frac {ab}{2}},1)} and ( a 4 + 4 a 2 + 2 2 , a b ( a 2 + 2 ) 2 , 1 ) {\displaystyle ({\frac {a^{4}+4a^{2}+2}{2}},{\frac {ab(a^{2}+2)}{2}},1)} , to get

( ( a 2 + 2 ) ( a 4 + 4 a 2 + 2 ) + N a 2 b 2 ( a 2 + 2 ) 4 ) 2 N ( a b ( a 4 + 4 a 2 + 3 ) 2 ) 2 = 1 {\displaystyle ({\frac {(a^{2}+2)(a^{4}+4a^{2}+2)+Na^{2}b^{2}(a^{2}+2)}{4}})^{2}-N({\frac {ab(a^{4}+4a^{2}+3)}{2}})^{2}=1} {\displaystyle \Rightarrow } ( ( a 2 + 2 ) ( a 4 + 4 a 2 + 1 ) 2 ) 2 N ( a b ( a 2 + 3 ) ( a 2 + 1 ) 2 ) 2 = 1 {\displaystyle ({\frac {(a^{2}+2)(a^{4}+4a^{2}+1)}{2}})^{2}-N({\frac {ab(a^{2}+3)(a^{2}+1)}{2}})^{2}=1} {\displaystyle \Rightarrow } ( ( a 2 + 2 ) [ ( a 2 + 1 ) ( a 2 + 3 ) 2 ) ] 2 ) 2 N ( a b ( a 2 + 3 ) ( a 2 + 1 ) 2 ) 2 = 1 {\displaystyle ({\frac {(a^{2}+2)}{2}})^{2}-N({\frac {ab(a^{2}+3)(a^{2}+1)}{2}})^{2}=1} .

This give us the solutions

x = ( a 2 + 2 ) [ ( a 2 + 1 ) ( a 2 + 3 ) 2 ) ] 2 y = a b ( a 2 + 3 ) ( a 2 + 1 ) 2 {\displaystyle x={\frac {(a^{2}+2)}{2}}y={\frac {ab(a^{2}+3)(a^{2}+1)}{2}}}

(Note, k = 4 {\displaystyle k=-4} is useful to find a solution to Pell's Equation, but it is not always the smallest integer pair. e.g. 36 2 52 5 2 = 4 {\displaystyle 36^{2}-52*5^{2}=-4} . The equation will give you x = 1093436498 , y = 151632270 {\displaystyle x=1093436498,y=151632270} , which when put into Pell's Equation yields 1195601955878350801 1195601955878350800 = 1 {\displaystyle 1195601955878350801-1195601955878350800=1} , which works, but so does x = 649 , y = 90 {\displaystyle x=649,y=90} for N = 52 {\displaystyle N=52} .

Examples

n = 61

The n = 61 case (determining an integer solution satisfying a 2 61 b 2 = 1 {\displaystyle a^{2}-61b^{2}=1} ), issued as a challenge by Fermat many centuries later, was given by Bhaskara as an example.

We start with a solution a 2 61 b 2 = k {\displaystyle a^{2}-61b^{2}=k} for any k found by any means. In this case we can let b be 1, thus, since 8 2 61 1 2 = 3 {\displaystyle 8^{2}-61\cdot 1^{2}=3} , we have the triple ( a , b , k ) = ( 8 , 1 , 3 ) {\displaystyle (a,b,k)=(8,1,3)} . Composing it with ( m , 1 , m 2 61 ) {\displaystyle (m,1,m^{2}-61)} gives the triple ( 8 m + 61 , 8 + m , 3 ( m 2 61 ) ) {\displaystyle (8m+61,8+m,3(m^{2}-61))} , which is scaled down (or Bhaskara's lemma is directly used) to get:

( 8 m + 61 3 , 8 + m 3 , m 2 61 3 ) . {\displaystyle \left({\frac {8m+61}{3}},{\frac {8+m}{3}},{\frac {m^{2}-61}{3}}\right).}

For 3 to divide 8 + m {\displaystyle 8+m} and | m 2 61 | {\displaystyle |m^{2}-61|} to be minimal, we choose m = 7 {\displaystyle m=7} , so that we have the triple ( 39 , 5 , 4 ) {\displaystyle (39,5,-4)} . Now that k is −4, we can use Brahmagupta's idea: it can be scaled down to the rational solution ( 39 / 2 , 5 / 2 , 1 ) {\displaystyle (39/2,5/2,-1)\,} , which composed with itself three times, with m = 7 , 11 , 9 {\displaystyle m={7,11,9}} respectively, when k becomes square and scaling can be applied, this gives ( 1523 / 2 , 195 / 2 , 1 ) {\displaystyle (1523/2,195/2,1)\,} . Finally, such procedure can be repeated until the solution is found (requiring 9 additional self-compositions and 4 additional square-scalings): ( 1766319049 , 226153980 , 1 ) {\displaystyle (1766319049,\,226153980,\,1)} . This is the minimal integer solution.

n = 67

Suppose we are to solve x 2 67 y 2 = 1 {\displaystyle x^{2}-67y^{2}=1} for x and y.

We start with a solution a 2 67 b 2 = k {\displaystyle a^{2}-67b^{2}=k} for any k found by any means; in this case we can let b be 1, thus producing 8 2 67 1 2 = 3 {\displaystyle 8^{2}-67\cdot 1^{2}=-3} . At each step, we find an m > 0 such that k divides a + bm, and |m − 67| is minimal. We then update a, b, and k to a m + N b | k | , a + b m | k | {\displaystyle {\frac {am+Nb}{|k|}},{\frac {a+bm}{|k|}}} and m 2 N k {\displaystyle {\frac {m^{2}-N}{k}}} respectively.

First iteration

We have ( a , b , k ) = ( 8 , 1 , 3 ) {\displaystyle (a,b,k)=(8,1,-3)} . We want a positive integer m such that k divides a + bm, i.e. 3 divides 8 + m, and |m − 67| is minimal. The first condition implies that m is of the form 3t + 1 (i.e. 1, 4, 7, 10,… etc.), and among such m, the minimal value is attained for m = 7. Replacing (abk) with ( a m + N b | k | , a + b m | k | , m 2 N k ) {\displaystyle \left({\frac {am+Nb}{|k|}},{\frac {a+bm}{|k|}},{\frac {m^{2}-N}{k}}\right)} , we get the new values a = ( 8 7 + 67 1 ) / 3 = 41 , b = ( 8 + 1 7 ) / 3 = 5 , k = ( 7 2 67 ) / ( 3 ) = 6 {\displaystyle a=(8\cdot 7+67\cdot 1)/3=41,b=(8+1\cdot 7)/3=5,k=(7^{2}-67)/(-3)=6} . That is, we have the new solution:

41 2 67 ( 5 ) 2 = 6. {\displaystyle 41^{2}-67\cdot (5)^{2}=6.}

At this point, one round of the cyclic algorithm is complete.

Second iteration

We now repeat the process. We have ( a , b , k ) = ( 41 , 5 , 6 ) {\displaystyle (a,b,k)=(41,5,6)} . We want an m > 0 such that k divides a + bm, i.e. 6 divides 41 + 5m, and |m − 67| is minimal. The first condition implies that m is of the form 6t + 5 (i.e. 5, 11, 17,… etc.), and among such m, |m − 67| is minimal for m = 5. This leads to the new solution a = (41⋅5 + 67⋅5)/6, etc.:

90 2 67 11 2 = 7. {\displaystyle 90^{2}-67\cdot 11^{2}=-7.}
Third iteration

For 7 to divide 90 + 11m, we must have m = 2 + 7t (i.e. 2, 9, 16,… etc.) and among such m, we pick m = 9.

221 2 67 27 2 = 2. {\displaystyle 221^{2}-67\cdot 27^{2}=-2.}
Final solution

At this point, we could continue with the cyclic method (and it would end, after seven iterations), but since the right-hand side is among ±1, ±2, ±4, we can also use Brahmagupta's observation directly. Composing the triple (221, 27, −2) with itself, we get

( 221 2 + 67 27 2 2 ) 2 67 ( 221 27 ) 2 = 1 , {\displaystyle \left({\frac {221^{2}+67\cdot 27^{2}}{2}}\right)^{2}-67\cdot (221\cdot 27)^{2}=1,}

that is, we have the integer solution:

48842 2 67 5967 2 = 1. {\displaystyle 48842^{2}-67\cdot 5967^{2}=1.}

This equation approximates 67 {\displaystyle {\sqrt {67}}} as 48842 5967 {\displaystyle {\frac {48842}{5967}}} to within a margin of about 2 × 10 9 {\displaystyle 2\times 10^{-9}} .

Notes

  1. ^ Hoiberg & Ramchandani – Students' Britannica India: Bhaskaracharya II, page 200
  2. Kumar, page 23
  3. Plofker, page 474
  4. ^ Goonatilake, page 127 – 128
  5. Cajori (1918), p. 197

    "The process of reasoning called "Mathematical Induction" has had several independent origins. It has been traced back to the Swiss Jakob (James) Bernoulli, the Frenchman B. Pascal and P. Fermat, and the Italian F. Maurolycus. By reading a little between the lines one can find traces of mathematical induction still earlier, in the writings of the Hindus and the Greeks, as, for instance, in the "cyclic method" of Bhaskara, and in Euclid's proof that the number of primes is infinite."

  6. Gopal, Madan (1990). K.S. Gautam (ed.). India through the ages. Publication Division, Ministry of Information and Broadcasting, Government of India. p. 79.
  7. O'Connor, John J.; Robertson, Edmund F., "Pell's equation", MacTutor History of Mathematics Archive, University of St Andrews
  8. Kaye (1919), p. 337.
  9. ^ John Stillwell (2002), Mathematics and its history (2 ed.), Springer, pp. 72–76, ISBN 978-0-387-95336-6
  10. "Pell's equation". Maths History. Retrieved 2021-06-14.
  11. Datta and Singh (1962). History of Hindu Mathematics : A Source Book Parts I and II. Asia Publishing House. pp. 157–160. ISBN 8180903907.
  12. The example in this section is given (with notation Q n {\displaystyle Q_{n}} for k, P n {\displaystyle P_{n}} for m, etc.) in: Michael J. Jacobson; Hugh C. Williams (2009), Solving the Pell equation, Springer, p. 31, ISBN 978-0-387-84922-5

References

External links

Number-theoretic algorithms
Primality tests
Prime-generating
Integer factorization
Multiplication
Euclidean division
Discrete logarithm
Greatest common divisor
Modular square root
Other algorithms
  • Italics indicate that algorithm is for numbers of special forms
Categories: