4.4 Applications to the theory of numbers

Let it be required to solve

by integral x and y.

Obviously, Equation (1) cannot be solved, if there is a common factor of a and b other than ± 1, whence assume that a and b are relatively prime. Since a : b can be represented by an even continued fraction (cf. 4.2.1 Theorem 1).

Hence one obtains the integral solutions

Solve by integral x and y Pell's Equation*

* Solved by Rahmagupta (born 598 A.D.) and independently by Fermat (1857). The name has no historical justification, but is commonly used. (L.E. Dickson, History of the theory of Numbers, Vol. II; B.B. Datta and A.N. Singh, History of Hindu Mathematics, Part II.

It follows from 4.3.5 that ); on applying the corollary, it is found that

Hence, if n is even (x, y) = (Pkn, Qkn) and if odd, (x,y) = (P2kn,Q2kn) are solutions for every positive integral k.

Example:

This method yields the solutions

4.5* Continued fractions with elements f(x)

*This section may be omitted during a first reading.

The general method of continued fractions, the elements of which have been given in 4.1, can be applied to different fields K. In 4.2, 4.3, 4.4, the field of real numbers has been chosen for K and the expansion of a positive real number into a. continued fraction with integral elements studied. Thus, the positive real numbers formed the system A and the Integral numbers the domain S, referred to in the general theory. The expansion of positive numbers into continued fractions has yielded the opportunity for an approximation of real numbers by rational numbers. There arises now the question, whether in a similar manner power series in terms of an indeterminate x-1 can be expanded into continued fractions with polynomials in x as their elements and whether by this method the power series can be approximated by quotients of polynomials. Approximation always means that for a certain entity, which is to be approximated, an infinite sequence of approximating entities can be constructed such that the differences between the approximated and the approximating entities can be made arbitrarily small. Thus, an approximation implies that these differences are measurable and a sequence approximating an entity may be made non-approximating when a different method of measurement of the differences is adopted. In the theory to be given here, the power series are measured by their degrees - integral numbers; the measurement does not imply that attention has been given to convergence. The theory is therefore so general that nothing must be assumed regarding the field of the coefficients of the power series. On the other hand, this generality makes it necessary to define afresh sums, products, etc., of power series. This definition must be done in such a manner that it agrees with the definitions for the corresponding operations on polynomials for the case when only a finite number of coefficients differ from zero.

4.5.1 The field B: Let F be an arbitrary field and x an indeterminate not included in F. The elements of F will be denoted by a, b, c, d with or without subscripts, the elements of the ring F[x] by

The integral domain F[x] will be chosen as S. In order to get a new system A, create the new elements denoted by Greek letters

in the following manner:

This purely formal definition means that there corresponds to every sequence of coefficients from F with fixed decreasing integral subscripts

;

one of the new elements and this element will not be changed if one places before an a finite set of null coefficients. The addition and multiplication of Elements (2) will now be defined in such a manner that the elements (2) for which ak = 0, form for k < 0 a sub-ring isomorphic to F[x].

Definition: Let

where

then

where

This definition is obviously independent of null coefficients which have been placed ahead, the commutative, associative and distributive laws apply, and subtraction is defined by

Hence, for the null elements, every coefficient is 0. If for the coefficients of y(x) one has the conditions bk = 0 for k ¹ 0, then in "]

The elements (2) form a ring B and those elements, for which ak>0 = 0, form a sub-ring for which addition and multiplication have been defined in the same manner as for polynomials. Hence, there is an isormorphism I by which this sub-ring becomes isomorphic to F[x]. Let f(x) ¹ 0, then f(x) has at least one coefficient ¹ 0 ; let n be the highest index of the non-vanishing coefficients, then

and n is said to be the degree of f(x). From (4) follows directly:

The degree of a product is equal to the sum of the degrees of the factors.
The degree of a sum of elements of different degrees is equal to the maximum degree of the summands.

This remark shows that a product of two elements ¹ 0 cannot be equal to 0. Hence the ring B is an integral domain. Now, identify the elements of F[x] with the corresponding elements (2). For polynomials ¹ 0 of F[x], the degree tallies with that of B, but the zero element, which is a polynomial of degree -1 in F[x], has no degree when considered as an element of B.

The elements

for k ¹ 0 are identified with b0 and

Let

then

Every field with B contains the quotient field Q of F[x]. The elements of B, which are quotients of elements of F[x], form a ring which is isomorphic to a sub-ring of Q. The elements of this ring will therefore be identified with the corresponding elements of Q. Thus, yk(x) is identified with x-k and the finite sum with the symbolic sum where ak = 0 for k < -m.

Using this notation, one can extend the algorithm of division of polynomials to the elements of B.

Let f(x) and y(x) be of degree n and m, respectively, where n ³ m, and an and and bm their highest coefficients,

Repetition of this procedure yields

and eventually one obtains an enumerable set of elements ck of F for k £ n - m, so that

and

is of degree np+1 < np.

Let

then wp(x) is of degree np+1 - m, whence y(x)wp(x) is of degree np+1,

of degree < n for every n, whence this difference is 0. Hence f(x)=y(x)c(x). As y(x) was assumed to be an arbitrary element ¹ 0 of B, one has the

Theorem: The set B of the elements (2) is a field containing the quotient field Q of F[x].

4.5.2 Expansion of the elements of B into continued fractions: The general theory of continued fractions which has been explained in 4.1 can be applied to the expansion of elements of the type f (x), if the field B is taken for K, the domain F[x] for S and the system of the elements with non-negative degree taken for the system A. It is readily proved that those elements satisfy the conditions required for elements of a system A (cf. 4.1.1). Naturally, an element has a positive degree if and only if its inverse has a negative degree.

Now,

Hence either

where degree (y(x) > 0.

The. representation (1) of f (x) is uniquely determined, whence there exists an expansion of f (x) into a continued fraction. The first element s1 = f(x) may have any degree ³ 0 or it may be the zero element, whereas the second element s2 (if any) can be assumed to have a positive degree; the same applies to s2,s3,···. If this is assumed regarding the degrees, the expansion is uniquely determined , whence follows the

Theorem: The elements 4.5.1 (2) can be represented in one and only one manner by a continued fraction (s1, s2, ···), where si is a polynomial in x, the degree of which is > 0 for i > 1.

The degree of a polynomial s has just the properties of the function N(s) in 4.1.2. From that section and the preceding theorem follows the

Corollary: Finite continued fractions represent the elements of Q and every element of Q is represented by a finite continued fraction.

4.5.3 Approximation by rational functions: The elements of B can be approximated by finite continued fractions, the elements of which are polynomials. The following theory of approximation has great analogy with the approximation of real numbers by finite continued fractions with integral elements explained in 4.2; the same formulae of the general theory (cf. 4.1) will be applied. Whereas the real numbers have been approximated in such a way that the absolute value of the error was made smaller than any positive number, the elements of B will he approximated with an error the degree of which diverges to -¥. If the field F of the coefficients consists of numbers, the functions of x, which are the elements of B, are numerically approximated by the continued fractions for absolute values of x which are sufficiently high. Thus, the method can be used for numerical approximation of the asymptotic behaviour of those functions.

Let an element a1 of B be expanded into a continued fraction (s1, s2, ···), which satisfies the conditions of the preceding theorem, and

Hence d1, d2, ··· are positive integers. Apply the notation of 4.1; then, since ai=si+1,

degree ai = di . (2)

Moreover, since ai =a1:a i+1, it follows that

whence

By the general formulae

whence, for i = 2, 3, ···, Qi has a positive degree. By mathematical induction, it follows that these degrees increase steadily with i so that

Now,

whence this formula with (3) and (4) yields

The right hand side of (5) can also be expressed by - degree (Qk,Qk-1), whence it is equal to degree This show that, when a1 is approximated by its k-th convergent, the degree of the error is the same as the degree of the difference between the k-th and the (k + 1)-th convergent. It is now easy to prove a theorem analogous to that for the approximation of real numbers in 4.2.2

Theorem:

Proof: Set

As b is the sum of two elements of B, where the-first term has a higher degree than the second term, its degree is the same as the degree of the first term, whence follows from (5)

On the other hand, bg(x) Qk is a polynomial in x and differs from zero, whence

substituting the value of the degree b from (6), one obtains

Exercise:

Represent this function by a continued fraction and approximate it by rational functions.

4.5.4 Continued fractions with polynomial elements: In order to develop a theory of expansion of the elements of B in continued fractions analogous to the corresponding theory for real numbers, it must be shown that every infinite continued fraction of the type considered here is indeed the expansion of an element of B. This objective is assisted by the

Lemma: Let a = (s1 ···), a ' = (s'1 ···) be finite or infinite continued fractions, let m be the lowest subscript for which sm ¹ s'm or for which sm, but not s'm exists and (s1, ··· , sm) = A, (s'1, ··· , s'm) = A', then

Proof: Wthout loss of generality, assume that degree sm = r ³ degree s'm=r'. The ordinary notation will be used for the convergents of a, while those of a ' will be distinguished by an apostrophe. Then

Now,

Two different cases must be considered:

1. Let r = r'; as degree (s'm - sm) ³ 0,

2. Let r > r'; then degree (s'm - sm) = r, and therefore

Hence, in each case,

Now, by (1),

and

Hence,

degree (A')a - a') = degree [(A - A') - (A - a) + (A' - a')] = degree (A - A'),

since the degree of the first one of the three summands is larger than the degrees of the other two.

Theorem: Let s1, s2, ··· be an infinite set of polynomials of F[x] and for i > 1, degree s > 0, then there exists a continued fraction (s1, s2, ···).

Proof: The set s1, s2, ··· determines uniquely the values P1, ··· , Q1, ··· and PN:QN = (s1, ··· , sN). Let 1< n < N ; by the preceding lemma,

degree (PN:QN-Pn:Qn)=degree(1/Pn+1:Qn+1-Pn:Qn)=degree (1/QnQn-1)=-kn,

where kn increases to infinity with n.

The coefficients bk are independent of N. As kn increases with the subscript n, one gets an infinite set bm, ··· , , ···, which determines

Finally, it has to be proved that f(x) = (s1, s2, ···). Let f(x) = (s'1, s'2, ···) and m be the smallest subscript for which sm ¹ s'm, then, by the lemma, for every n>m

since both sides are equal to the degree ( (s'1, ··· ,s'm) - (s1, ··· ,sm)). However,

is of degree - kn which decreases infinitely with n. Thus, there is no subscript like m, whence follows the theorem.

4.6 Continued fractions with rational elements: Let S be the field of rational numbers, then every finite continued fraction

represents a rational number, but an infinite continued fraction determines a number by approximation, if and only if

converges. I the sum is convergent,

determines a real number equal to (2). Hence a necessary condition for the convergence of (3) is

If each of the numbers Qn are either > or < 0, or have alternating signs, the sum (2) is an alternating sum; hence, in these cases, the continued fraction converges if |Qn Qn-1| increases steadily to infinity.

4.6.1 Convergence of continued fractions:

For the continued fractions considered above, certain criteria of convergence will now be established. In these investigations, every sequence (series, continued fractions) which does not converge to a real number will be said to be divergent.

Theorem 1: If S|sn| is convergent, 4.6 (3) diverges.

Proof:. At first, it will be shown by mathematical induction that

Since Q1 = 1, Q2 = s2, the formula holds for n < 3. If (1) is true for n < m, it follows from

that

If Ssj converges, the infinite product P(1 +|sj|) converges to a positive number Q, and |Qn| < Q holds for every value of n, whence 4.6 (4) above does apply and the continued fraction diverges.

Let sj > 0 for j > 1. By mathematical induction, it follows from Q1 = 1, Q2 = s2 > 0, Qn = snQn-1 + Qn-2 that each number Qi > 0.

4.6 (2) is therefore an alternating series the absolute values of the elements of which decrease steadily. This series converges therefore if and only if 4.6 (4) is satisfied. These considerations lead to

Theorem 2: Let si > 0 for i > 1, then the continued fraction 4.6 (3) converges if and only if Ssi diverges.

Proof: If converges, the continued fraction diverges, as has been proved by the preceding theorem.

Let Ssi diverge, then Ssi ® ¥. Since Qi > 0, Q1 = 1, Q2n+1=s2n+1Q2n+Q2n+1, one finds by mathematical induction that Q2n+1³1. Hence

Q2n=s2nQ2n-1+Q2n-2³s2n+1+Q2n-2

and, since Q2 = s2n, it follows by mathematical induction that

If the sum (2) diverges with steadily increasing n,

whence 4.6 (4) is satisfied. However, if (2) converges, then Ssi+1 diverges, whence Q2n+1 ³ s2(s2 + ··· + s2n+1) diverges with increasing n. Thus 4.6 (4) applies in every case and this condition implies the convergence of the continued fraction in the case under consideration, whence follows the theorem.

4.6.2 Test of irrationality: The preceding investigations can be used to prove the irrationality of certain numbers.

Let s1 = 0, s ³ 1. As Ssi ® ¥, the continued fraction converges by the preceding theorem. The limit lies between the convergents with odd and even subscripts, i.e., between P1 : Q1 = 0 and P2 : Q2 = 1: s2. It will now be shown that this value is irrational

Let a1 = ( s1, s2, ··· ) be rational, say a1 = a2/a1, where a1, a2 are integral, then a1 > a2 and a2/a1 = - 1/(s2 + a2), whence a2 = a1/a2 - s2 = a2/a1, where a3 = a1 - s2a2 is integral. Since a2 = (0, s2, ···), one has 0 < a2 < 1, whence a2> a3 > 0. In the same manner, one finds

This procedure must end after a finite number of steps, as a sequence a1>a2>a3>a4> ··· of integral positive numbers cannot be continued indefinitely. Since the continued fraction has been assumed to be infinite, the assumption of a1 to be rational leads to a contradiction. Hence a1 is irrational.

Example: If

then

and by the identities

by mathematical induction,

whence

and the continued fraction is irrational. Its value is

 

i.e., the number e is irrational.

last next