Page:EB1911 - Volume 17.djvu/938

MAXIMA AND MINIMA

the action of gravity, proposed in 1696, gave rise to a new kind of maximum and minimum problem in which we have to find a curve and not points on a given curve. From these problems arose the “Calculus of Variations.” (See Variations, Calculus of.)

The only general methods of attacking problems on maxima and minima are those of the differential calculus or, in geometrical problems, what is practically Fermat’s method. Some problems may be solved by algebra; thus if y = ƒ(x) ÷ φ(x), where ƒ(x) and φ(x) are polynomials in x, the limits to the values of y may be found from the consideration that the equation yφ(x) − ƒ(x) = 0 must have real roots. This is a useful method in the case in which φ(x) and ƒ(x) are quadratics, but scarcely ever in any other case. The problem of finding the maximum product of n positive quantities whose sum is given may also be solved algebraically, thus. If a and b are any two real unequal quantities whatever, {1/2(a + b)}² > ab, so that we can increase the product, leaving the sum unaltered, by replacing each of two unequal terms by half their sum; and so long as any two of the quantities are unequal we can increase the product. Now, the quantities being all positive, the product cannot be increased without limit and must somewhere attain a maximum, and no other form of the product than that in which they are all equal can be the maximum, so that the product is a maximum when they are all equal. Its minimum value is obviously zero. If the restriction that all the quantities shall be positive is removed, the product can be made equal to any quantity, positive or negative. So other theorems of algebra, which are stated as theorems on inequalities, may be regarded as algebraic solutions of problems on maxima and minima.
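The replacement argument for the product of positive quantities may be illustrated numerically. The following is only a sketch in Python: the function names, the starting values and the choice of always averaging the largest and smallest terms are assumptions made for illustration, since the text allows any two unequal terms to be taken.

```python
from math import prod

def smooth_once(values):
    """Replace the smallest and largest entries by half their sum.

    The sum of the list is unchanged. Choosing the two extreme entries is
    an illustrative assumption; any two unequal entries would serve.
    Assumes the list has at least two entries.
    """
    vals = sorted(values)
    mean = (vals[0] + vals[-1]) / 2
    return [mean] + vals[1:-1] + [mean]

def smooth(values, steps=40):
    """Apply the replacement repeatedly and record the product at each stage."""
    products = [prod(values)]
    for _ in range(steps):
        values = smooth_once(values)
        products.append(prod(values))
    return values, products

start = [1.0, 2.0, 3.0, 6.0]        # hypothetical data: sum 12, product 36
final, products = smooth(start)
print(products[:4])                  # the product rises at each stage
print(final, products[-1])           # entries approach 3, product approaches 3**4
```

With these values the sum stays at 12 throughout, while the product climbs from 36 towards 81, the value attained when all four quantities are equal.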

For purely geometrical questions the only general method available is practically that employed by Fermat. If a quantity depends on the position of some point P on a curve, and if its value is equal at two neighbouring points P and P′, then at some position between P and P′ it attains a maximum or minimum, and this position may be found by making P and P′ approach each other indefinitely. Take for instance the problem of Regiomontanus “to find a point on a given straight line which subtends a maximum angle at two given points A and B.” Let P and P′ be two near points on the given straight line such that the angles APB and AP′B are equal. Then ABPP′ lie on a circle. By making P and P′ approach each other we see that for a maximum or minimum value of the angle APB, P is a point in which a circle drawn through AB touches the given straight line. There are two such points, and unless the given straight line is at right angles to AB the two angles obtained are not the same. It is easily seen that both angles are maxima, one for points on the given straight line on one side of its intersection with AB, the other for points on the other side. For further examples of this method together with most other geometrical problems on maxima and minima of any interest or importance the reader may consult such a book as J. W. Russell’s A Sequel to Elementary Geometry (Oxford, 1907).
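The construction for the problem of Regiomontanus can be checked numerically. The sketch below, in Python, takes the given straight line to be the x-axis and uses two hypothetical points A and B on the same side of it. It locates the points of contact by the tangent-secant relation OP² = OA · OB, where O is the intersection of AB with the given line; this relation is supplied here and is not stated in the text. A direct search along the line on each side of O is then used for comparison.

```python
from math import atan2, hypot, sqrt

def angle_apb(p, a, b):
    """Angle APB in radians, i.e. the angle subtended at p by A and B."""
    ax, ay = a[0] - p[0], a[1] - p[1]
    bx, by = b[0] - p[0], b[1] - p[1]
    cross = ax * by - ay * bx
    dot = ax * bx + ay * by
    return abs(atan2(cross, dot))

def tangent_points(a, b):
    """Points where circles through A and B touch the x-axis.

    Assumes A and B lie on the same side of the axis at different heights,
    so that line AB meets the axis at a point O outside the segment AB.
    """
    (x1, y1), (x2, y2) = a, b
    x0 = x1 - y1 * (x2 - x1) / (y2 - y1)     # abscissa of O
    t = sqrt(hypot(x1 - x0, y1) * hypot(x2 - x0, y2))   # OP = sqrt(OA * OB)
    return x0, (x0 - t, 0.0), (x0 + t, 0.0)

def best_on(lo, hi, a, b, n=20001):
    """Abscissa in [lo, hi] giving the largest angle, by direct search."""
    xs = [lo + (hi - lo) * i / (n - 1) for i in range(n)]
    return max(xs, key=lambda x: angle_apb((x, 0.0), a, b))

A, B = (1.0, 1.0), (3.0, 4.0)                # hypothetical points
x0, p_left, p_right = tangent_points(A, B)
print(p_left, angle_apb(p_left, A, B))       # maximum for points left of O
print(p_right, angle_apb(p_right, A, B))     # maximum for points right of O
print(best_on(x0 - 20, x0, A, B), best_on(x0, x0 + 20, A, B))
```

Since AB is not at right angles to the line in this instance, the two maxima differ, as the text asserts.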

The method of the differential calculus is theoretically very simple. Let u be a function of several variables x₁, x₂, x₃, . . . x_n, supposed for the present independent; if u is a maximum or minimum for the set of values x₁, x₂, x₃, . . . x_n, and u becomes u + δu when x₁, x₂, x₃, . . . x_n receive small increments δx₁, δx₂, . . . δx_n, then δu must have the same sign for all possible values of δx₁, δx₂, . . . δx_n.

Now

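$$\delta u = \sum \frac{\delta u}{\delta x_1}\,\delta x_1 + \tfrac{1}{2}\left\{ \sum \frac{\delta^2 u}{\delta x_1^2}\,\delta x_1^2 + 2\sum \frac{\delta^2 u}{\delta x_1\,\delta x_2}\,\delta x_1\,\delta x_2 + \dots \right\} + \dots$$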

The sign of this expression in general is that of Σ(δu/δx₁)δx₁, which cannot be one-signed when x₁, x₂, . . . x_n can take all possible values, for a set of increments δx₁, δx₂ . . . δx_n, will give an opposite sign to the set −δx₁, −δx₂, . . . −δx_n. Hence Σ(δu/δx₁)δx₁ must vanish for all sets of increments δx₁, . . . δx_n, and since these are independent, we must have δu/δx₁ = 0, δu/δx₂ = 0, . . . δu/δx_n = 0. A value of u given by a set of solutions of these equations is called a “critical value” of u. The value of δu now becomes

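$$\tfrac{1}{2}\left\{ \sum \frac{\delta^2 u}{\delta x_1^2}\,\delta x_1^2 + 2\sum \frac{\delta^2 u}{\delta x_1\,\delta x_2}\,\delta x_1\,\delta x_2 + \dots \right\};$$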

for u to be a maximum or minimum this must have always the same sign. For the case of a single variable x, corresponding to a value of x given by the equation du/dx = 0, u is a maximum or minimum as d²u/dx² is negative or positive. If d²u/dx² vanishes, then there is no maximum or minimum unless d³u/dx³ vanishes, and there is a maximum or minimum according as d⁴u/dx⁴ is negative or positive. Generally, if the first differential coefficient which does not vanish is of even order, there is a maximum or minimum according as this is negative or positive. If it is of odd order, there is no maximum or minimum.
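The rule for a single variable can be stated as a short procedure. The following Python sketch applies it to polynomials only, represented by their lists of coefficients; that restriction, and the names used, are assumptions made so that the differential coefficients can be computed exactly.

```python
def derivative(coeffs):
    """Derivative of a polynomial [c0, c1, c2, ...], where c_k multiplies x**k."""
    return [k * c for k, c in enumerate(coeffs)][1:]

def evaluate(coeffs, x):
    """Value of the polynomial at x, by Horner's scheme."""
    result = 0
    for c in reversed(coeffs):
        result = result * x + c
    return result

def classify(coeffs, x0):
    """Classify x0 by the first differential coefficient that does not vanish there."""
    d = derivative(coeffs)
    order = 1
    while d:
        value = evaluate(d, x0)
        if value != 0:
            if order == 1:
                return "not critical"      # du/dx does not vanish
            if order % 2 == 1:
                return "neither"           # odd order: no maximum or minimum
            return "minimum" if value > 0 else "maximum"
        d = derivative(d)
        order += 1
    return "constant"                      # every differential coefficient vanishes

print(classify([0, 0, -1], 0))        # u = -x**2  -> maximum
print(classify([0, 0, 0, 1], 0))      # u = x**3   -> neither
print(classify([0, 0, 0, 0, 1], 0))   # u = x**4   -> minimum
```

Integer coefficients keep the arithmetic exact, so that the test for a vanishing value is reliable; with floating-point data a tolerance would be needed.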

In the case of several variables, the quadratic

$$\sum \frac{\delta^2 u}{\delta x_1^2}\,\delta x_1^2 + 2\sum \frac{\delta^2 u}{\delta x_1\,\delta x_2}\,\delta x_1\,\delta x_2 + \dots$$

must be one-signed. The condition for this is that the series of discriminants

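$$a_{11},\quad \begin{vmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{vmatrix},\quad \begin{vmatrix} a_{11} & a_{12} & a_{13} \\ a_{21} & a_{22} & a_{23} \\ a_{31} & a_{32} & a_{33} \end{vmatrix},\ \dots$$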

where a_pq denotes δ²u/δx_pδx_q, should be all positive, if the quadratic is always positive, and alternately negative and positive, if the quadratic is always negative. If the first condition is satisfied the critical value is a minimum, if the second it is a maximum. For the case of two variables the conditions are

$$\frac{\delta^2 u}{\delta x_1^2}\cdot\frac{\delta^2 u}{\delta x_2^2} > \left(\frac{\delta^2 u}{\delta x_1\,\delta x_2}\right)^2$$

for a maximum or minimum at all and δ²u/δx₁² and δ²u/δx₂² both negative for a maximum, and both positive for a minimum. It is important to notice that by the quadratic being one-signed is meant that it cannot be made to vanish except when δx₁, δx₂, . . . δx_n all vanish. If, in the case of two variables,

$$\frac{\delta^2 u}{\delta x_1^2}\cdot\frac{\delta^2 u}{\delta x_2^2} = \left(\frac{\delta^2 u}{\delta x_1\,\delta x_2}\right)^2$$

then the quadratic is one-signed unless it vanishes, but the value of u is not necessarily a maximum or minimum, and the terms of the third and possibly fourth order must be taken account of.
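The condition on the series of discriminants lends itself to direct computation. The following is a minimal sketch in Python, assuming that the second differential coefficients a_pq at the critical point are already known and supplied as a square symmetric array; the function names and the label "undecided" are illustrative assumptions.

```python
def det(m):
    """Determinant by expansion along the first row (adequate for small arrays)."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, entry in enumerate(m[0]):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * entry * det(minor)
    return total

def discriminants(a):
    """The series a11, |a11 a12; a21 a22|, ... of leading determinants."""
    return [det([row[:k] for row in a[:k]]) for k in range(1, len(a) + 1)]

def classify_critical(a):
    """Apply the sign conditions; 'undecided' covers every other case."""
    d = discriminants(a)
    if all(x > 0 for x in d):
        return "minimum"
    if all((x < 0) if k % 2 == 1 else (x > 0) for k, x in enumerate(d, start=1)):
        return "maximum"
    return "undecided"

print(classify_critical([[2, 0], [0, 2]]))      # minimum
print(classify_critical([[-2, 1], [1, -2]]))    # maximum
print(classify_critical([[2, 0], [0, 0]]))      # undecided: the equality case
```

The last array is an instance of the equality case for two variables, in which the terms of higher order must be examined before anything can be concluded.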

Take for instance the function u = x² − xy² + y³. Here the values x = 0, y = 0 satisfy the equations δu/δx = 0, δu/δy = 0, so that zero is a critical value of u, but it is neither a maximum nor a minimum although the terms of the second order are (δx)², and are never negative. Here δu = δx² − δxδy² + δy³, and by putting δx = 0 or an infinitesimal of the same order as δy², we can make the sign of δu depend on that of δy³, and so be positive or negative as we please. On the other hand, if we take the function u = x² − xy² + y⁴, x = 0, y = 0 make zero a critical value of u, and here δu = δx² − δxδy² + δy⁴, which is always positive, because we can write it as the sum of two squares, viz. (δx − 1/2δy²)² + 3/4δy⁴; so that in this case zero is a minimum value of u.
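The contrast between the two functions may be verified by evaluating them near the origin. The Python sketch below takes the first function with the cubic term y³, the reading under which its terms of the second order reduce to (δx)²; the grid of small increments is an arbitrary assumption.

```python
def u_cubic(x, y):
    """u = x^2 - x y^2 + y^3: critical at the origin, but no maximum or minimum."""
    return x * x - x * y * y + y ** 3

def u_quartic(x, y):
    """u = x^2 - x y^2 + y^4: a minimum at the origin."""
    return x * x - x * y * y + y ** 4

def as_squares(x, y):
    """The same quartic written as a sum of two squares."""
    return (x - 0.5 * y * y) ** 2 + 0.75 * y ** 4

small = [k / 1000 for k in range(-50, 51)]   # increments between -0.05 and 0.05

# With x = 0 the cubic function reduces to y^3, which takes both signs.
cubic_signs = {u_cubic(0.0, y) > 0 for y in small if y != 0}

# The quartic function is never negative on the grid and vanishes at the origin.
quartic_min = min(u_quartic(x, y) for x in small for y in small)

print(cubic_signs)
print(quartic_min)
```

A finite grid cannot prove the second result, which rests on the sum of two squares; the search merely agrees with it.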

A critical value usually gives a maximum or minimum in the case of a function of one variable, and often in the case of several independent variables, but not all maxima and minima, particularly absolutely greatest and least values, are necessarily critical values. If, for example, x is restricted to lie between the values a and b and φ′(x) = 0 has no roots in this interval, it follows that φ′(x) is one-signed as x increases from a to b, so that φ(x) is increasing or diminishing all the time, and the greatest and least values of φ(x) are φ(a) and φ(b), though neither of them is a critical value. Consider the following example: A person in a boat a miles from the nearest point of the beach wishes to reach as quickly as possible a point b miles from that point along the shore. The ratio of his rate of walking to his rate of rowing is cosec α. Where should he land?

Here let AB be the direction of the beach, A the nearest point to the boat O, and B the point he wishes to reach. Clearly he must land, if at all, between A and B. Suppose he lands at P. Let the angle AOP be θ, so that OP = a sec θ, and PB = b − a tan θ. If his rate of rowing is V miles an hour his time will be (a sec θ)/V + (b − a tan θ) sin α/V hours. Call this T. Then to the first power of δθ, δT = (a/V) sec²θ (sin θ − sin α)δθ, so that if the angle AOB > α, δT and δθ have opposite signs from θ = 0 to θ = α, and the same signs from θ = α to θ = AOB. Thus when AOB is > α, T decreases from θ = 0 to θ = α, and then increases, so that he should land at a point distant a tan α from A, unless a tan α > b. When this is the case, δT and δθ have opposite signs throughout the whole range of θ, so that T decreases as θ increases, and he should row direct to B. In the first case the minimum value of T is also a critical value; in the second case it is not.
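Both cases of the boat problem can be reproduced numerically. In the Python sketch below the numerical values of a, b and α are hypothetical, V is taken as 1, and a direct search over θ from 0 to the angle AOB is used only to confirm the rule derived above.

```python
from math import atan, cos, pi, sin, tan

def travel_time(theta, a, b, alpha, v=1.0):
    """T = (a sec theta)/V + (b - a tan theta) sin(alpha)/V."""
    return a / cos(theta) / v + (b - a * tan(theta)) * sin(alpha) / v

def best_landing(a, b, alpha):
    """Distance AP at which to land: a tan(alpha), unless that exceeds b."""
    return min(a * tan(alpha), b)

def brute_force_theta(a, b, alpha, n=20001):
    """Angle in [0, AOB] giving the least time, by direct search."""
    theta_max = atan(b / a)
    grid = [theta_max * i / (n - 1) for i in range(n)]
    return min(grid, key=lambda th: travel_time(th, a, b, alpha))

alpha = pi / 6                                  # walking is twice as fast as rowing

# First case: a tan(alpha) < b, so the least time is a critical value.
print(best_landing(1.0, 2.0, alpha), tan(brute_force_theta(1.0, 2.0, alpha)))

# Second case: a tan(alpha) > b, so he rows direct to B (an end value).
print(best_landing(1.0, 0.3, alpha), tan(brute_force_theta(1.0, 0.3, alpha)))
```

In the second case the search ends at the extremity θ = AOB, where the least value of T is not a critical value.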

The greatest and least values of the bending moments of loaded rods are often at the extremities of the divisions of the rods and not at points given by critical values.

In the case of a function of several variables, x₁, x₂, . . . x_n, not independent but connected by m functional relations u₁ = 0, u₂ = 0, . . ., u_m = 0, we might proceed to eliminate m of the variables; but Lagrange’s “Method of undetermined Multipliers” is more elegant and generally more useful.

We have δu₁ = 0, δu₂ = 0, . . ., δu_m = 0. Consider instead of δu, what is the same thing, viz., δu + λ₁δu₁ + λ₂δu₂ + . . . + λ_m δu_m, where λ₁, λ₂, . . . λ_m, are arbitrary multipliers. The terms of the first order in this expression are