Home Laboratory Computer Lab Theses

3.4 Piecewise π’ž1 extremal functions

In all the problems examined so far, the functions defining the class for optimization were required to be continuously differentiable that is xβˆˆπ’ž1[t1,t2]nx . Yet, it is natural to wonder whether cornered trajectories, i.e., trajectories represented by piecewise continuously differentiable functions, might not yield improved results. Besides improvement, it is also natural to wonder whether those problems of the calculus of variations which do not have extremals in the class of π’ž1 functions actually have extremals in the larger class of piecewise π’ž1 functions.

Definition 3.10: Piecewiseπ’ž1 functions

A real-valued function x^βˆˆπ’ž[a,b] is said to be piecewise π’ž1 , denoted x^βˆˆπ’ž^1[a,b] , if there is a finite (irreducible) partition a=c0< c1<β‹―<cN+1=b such that x^ may be regarded as a function in π’ž1[ck,ck+1] for each k=0,1,…⁑,N. When present, the interior points c1,…⁑,cN are called corner points of x^ .

PIC

Figure 3.8:: Illustration of a piecewise continuously differentiable function x^βˆˆπ’ž^1[a,b] (thick red line), and its derivative x^Λ™ (dash-dotted red line); without corners, x^ may resemble the continuously differentiable function xβˆˆπ’ž1[a,b] (dashed blue curve).

Some remarks are in order. Observe first that, when there are no corners, then x^βˆˆπ’ž1[a,b] . Further, for any x^βˆˆπ’ž^1[a,b],x^Λ™ is defined and continuous on [a,b] except at its corner points c1,…⁑,cN where it has distinct limiting values x^Λ™(ckΒ±) ; such discontinuities are said to be simple, and x^Λ™ is said to be piecewise continuous on [a,b] denoted x^Λ™βˆˆπ’ž^[a,b] . Figure 3.8 illustrates the effect of the discontinuities of x^Λ™ in producing corners on the graph of x^ . Without these corners, x^ might resemble the π’ž1 function x whose graph is presented for comparison. In particular, each piecewise π’ž1 function is ”almost” π’ž1 , in the sense that it is only necessary to round out the corners to produce the graph of a π’ž1 function. These considerations are formalized by the following Lemma:

Lemma 3.4: Smoothing of Piecewiseπ’ž1 Functions

Let x^βˆˆπ’ž^1[a,b] . Then, for each Ξ΄>0, there exists xβˆˆπ’ž1[a,b] such that x≑x^ except in a neighborhood ℬδ(ck) of each corner point of x^ . Moreover, βˆ₯xβˆ’x^βˆ₯βˆžβ‰€Γ‚Ξ΄ where Γ‚ is a constant determined by x^ .

Likewise, we shall consider the class π’ž^1[a,b]nx of nx -dimensional vector valued analogue of π’ž^1[a,b], consisting of those functions x^βˆˆπ’ž^1[a,b]nx with components x^jβˆˆπ’ž^1[a,b],j=1,…⁑,nx. The corners of such x^ are by definition those of any one of its components x^j. Note that the above lemma can be applied to each component of a given x^, and shows that x^ can be approximated by a xβˆˆπ’ž1[a,b]nx which agrees with it except in prescribed neighborhoods of its corner points. Both real valued and real vector valued classes of piecewise π’ž1 functions form linear spaces of which the subsets of π’ž1 functions are subspaces. Indeed, it is obvious that the constant multiple of one of these functions is another of the same kind, and the sum of two such functions exhibits the piecewise continuous differentiability with respect to a suitable partition of the underlying interval [a,b] . Since π’ž^1[a,b]βŠ‚π’ž[a,b] , we have:

βˆ₯xβˆ₯∞:=max⁑a≀t≀b|x(t)|

defines a norm on π’ž^1[a,b]. Moreover,

βˆ₯xβˆ₯1,∞:=max⁑a≀t≀b|x(t)|+sup⁑tβˆˆβ‹ƒk=0N(ck,ck+1)|αΊ‹(t)|

can be shown to be another norm on π’ž^1[a,b], with a=c0<c1<β‹―<cN<cN+1= b being a suitable partition for x^ . (The space of vector valued piecewise π’ž1 functions π’ž^1[a,b]nx can also be endowed with the norms βˆ₯β‹…βˆ₯∞ and βˆ₯β‹…βˆ₯1,∞). By analogy to the linear space of π’ž1 functions, the maximum norms βˆ₯β‹…βˆ₯∞ and βˆ₯β‹…βˆ₯1,∞ are referred to as the strong norm and the weak norm, respectively; the functions which are locally extremal with respect to the former [latter] norm are said to be strong [weak] extremal functions.

3.4.1 The Weierstrass-Erdmann Corner Conditions

A natural question that arises when considering the class of π’ž^1 functions is whether a (local) extremal point for a functional in the class of π’ž1 functions also gives a (local) extremal point for this functional in the larger class of π’ž^1 functions. We state the following Theorem:

Theorem 3.8:π’ž^1 Extremals vs.π’ž1 Extremals

If x⋆ gives a [local] extremal point for the functional:

F[x]:=∫t1t2f(t,x(t),xΛ™(t))dt

on π’Ÿ:={xβˆˆπ’ž1[t1,t2]nx:x(t1)=x1,x(t2)=x2} with
fβˆˆπ’ž([t1,t2]×ℝ2nx) then x⋆ also gives a [local] extremal point for F
on π’Ÿ^:={x^βˆˆπ’ž^1[t1,t2]nx:x^(t1)=x1,x^(t2)=x2} with respect to the same βˆ₯β‹…βˆ₯∞ or βˆ₯β‹…βˆ₯1,∞ norm.

On the other hand, a functional F may not have π’ž1 extremals, but be extremized by a π’ž^1 function. We shall first seek for weak (local) extremals x^β‹†βˆˆπ’ž^1[t1,t2]nx, i.e., extremal trajectories with respect to some weak neighborhood ℬδ1,∞(x^⋆) .

  • Observe that x^βˆˆβ„¬Ξ΄1,∞(x^⋆) if and only if x^=x^⋆+Ξ±ΞΎ^ for ΞΎ^βˆˆπ’ž^1[t1,t2]nx and a sufficiently small Ξ± . In characterizing (weak) local extremals for the functional
    F[x^]:=∫t1t2f(t,x^(t),x^Λ™(t))dt

    on π’Ÿ^:={x^βˆˆπ’ž^1[t1,t2]nx:x^(t1)=x1,x^(t2)=x2}, where f and its partials fx,fxΛ™ are continuous on [t1,t2]×ℝ2nx, one can therefore duplicate the analysis of the previous section. This is done by splitting the integral into a finite sum of integrals with continuously differentiable integrands, then differentiating each under the integral sign. Overall, it can be shown that a (weak, local) extremal x^β‹†βˆˆπ’ž^1[t1,t2]nx must be stationary in intervals excluding corner points, i.e. the Euler equation is satisfied on [t1,t2] except at corner points c1,…⁑,cN of x^⋆ .

  • Likewise, both Legendre second-order necessary conditions and convexity sufficient conditions can be shown to hold on intervals excluding corners points of a π’ž^1 extremal.
  • Finally, transversality conditions corresponding to the various free end-point problems remain the same. To see this most easily, e.g., in the case where freedom is permitted only at the right end-point, suppose that x^β‹†βˆˆπ’ž^1[t1,t2]nx gives a local extremal for π’Ÿ^, and let cN be the right-most corner point of x^⋆ . Then, restricting comparison to those competing x^ having their right-most corner point at cN and satisfying x^(cN)=x^⋆(cN) , it is seen that the corresponding directions (ΞΎ^,Ο„) must utilize the end-point freedom exactly as for π’ž1 functions. Thus, resulting in identical boundary conditions.

Besides necessary conditions of optimality on intervals excluding corner points c1,…⁑,cN of a local extremal x^β‹†βˆˆπ’ž^1[t1,t2]nx, the discontinuities of x^˙⋆ which are permitted at each ck are restricted. These are the so-called first Weierstrass-Erdmann corner conditions.

Theorem 3.9: First Weierstrass-Erdmann Corner Condition

Let x^⋆(t) be a (weak) local extremal of the problem to minimize the functional

F[x^]:=∫t1t2f(t,x^(t),x^Λ™(t))dt

on π’Ÿ^:={x^βˆˆπ’ž^1[t1,t2]nx:x^(t1)=x1,x^(t2)=x2}, where f and its partials fx,fxΛ™ are continuous on [t1,t2]×ℝ2nx . Then, at every (possible) corner point c∈[t1,t2] of x^⋆, we have:

βˆ‚f(c,x⋆(c),x˙⋆(cβˆ’)βˆ‚xΛ™βŠ€β‘=βˆ‚f(c,x⋆(c),x˙⋆(c+)βˆ‚xΛ™βŠ€β‘

where x^˙⋆(cβˆ’) and x^˙⋆(c+) denote the left and right time derivative of x^⋆ at c respectively.

Proof. Integrating both sides of Euler equation between t and t1 for each component i=1,…⁑,nx gives:

∫t1tddtβˆ‚fβˆ‚αΊ‹idt=∫t1tβˆ‚fβˆ‚xdtβ†’βˆ‚fβˆ‚αΊ‹i=∫t1tβˆ‚fβˆ‚xidt+Ci,

therefore the function g(t):=βˆ‚f(t,x^⋆(t),x^˙⋆(t)βˆ‚αΊ‹i is continuous at each t∈(t1,t2) even though x^Λ™(t) may be discontinuous at that point. That is g(cβˆ’)=g(c+) . Moreover, βˆ‚fβˆ‚αΊ‹i being continuous in its 1+2nx arguments, x^(t) being continuous at c , and x^Λ™(t) having finite limits x^Λ™(cΒ±) at c we get:

βˆ‚f(c,x^(c),x^Λ™(cβˆ’))βˆ‚αΊ‹i=βˆ‚f(c,x^(c),x^Λ™(c+))βˆ‚αΊ‹i

for each i=1,…⁑,nx β–‘

Remark 3.16

The first Weierstrass-Erdmann condition of Theorem 3.9 shows that the discontinuities of x^Λ™ which are permitted at corner points of a local extremal trajectory x^β‹†βˆˆπ’ž^1[t1,t2]nx are those which preserve the continuity of fxΛ™ . Likewise, it can be shown that the continuity of the Hamiltonian H:=fβˆ’βˆ‚fβˆ‚xΛ™xΛ™ must be preserved at corner points of x^⋆ that is:

H(c,x^(c),x^Λ™(cβˆ’))=H(c,x^(c),x^Λ™(c+))

which yields the so-called second Weierstrass-Erdmann corner condition.

Example 3.12. Consider the problem to minimize the functional:

F[x]=βˆ«βˆ’11x2(t)(1βˆ’αΊ‹(t))2dts.t.xβˆˆπ’Ÿ,

whereπ’Ÿ:={xβˆˆπ’ž1[βˆ’1,1]:x(βˆ’1)=0,x(1)=1} . The lagrangianf=x2(t)(1βˆ’αΊ‹(t)2) is independent of the independent variablet , we thus have the constancy of the Hamiltonian on a stationary trajectory. Thus:

H=fβˆ’βˆ‚fβˆ‚αΊ‹αΊ‹=x2(t)(1βˆ’αΊ‹(t))2βˆ’[2x(t)2(αΊ‹(t)βˆ’1)]αΊ‹(t)=cβˆ€β‘t∈[βˆ’1,1]

for some constantc . Upon semplification, we get:

x(t)2(1βˆ’αΊ‹(t)2)=cβˆ€β‘t∈[βˆ’1,1],

in order to solve this differential equation we make the substitutionu(t):=x(t)2 and thenceuΛ™(t):=2x(t)αΊ‹(t) , we get the somewhat simpler equationuΛ™(t)2=4(u(t)βˆ’c) that can be solved by separation of variable and has general solution:

u(t):=(t+k)2+c

wherek is a constant of integration. In turn, substituing backx2(t)=u(t) we conclude that a stationary pointxΒ― must be of the form:

xΒ―(t)2=(t+k)2+c.

In particular, the boundary conditionsx(βˆ’1)=0 andx(1)=1 produce constantsc=βˆ’(34)2 andk=14 . However, the resulting stationary function:

xΒ―(t)=(t+14)2βˆ’(34)2=(t+1)(tβˆ’12)

is defined only fortβ‰₯12 ortβ‰€βˆ’1 . Thus, there is no stationary function for the Lagrangianf inπ’Ÿ . Next, we turn to the problem of minimizingF in the larger setπ’Ÿ^:={x^βˆˆπ’ž^1[βˆ’1,1]:x^(βˆ’1)=0,x^(1)=1 . Suppose thatx^⋆ is a local minimizer forF onπ’Ÿ^ . Then, by the Weierstrass-Erdmann condition, we must have:

βˆ‚f(c,x(c),αΊ‹(cβˆ’)βˆ‚αΊ‹=βˆ‚f(c,x(c),αΊ‹(c+)βˆ‚αΊ‹
βˆ’2x^⋆(c)[1βˆ’x^˙⋆(cβˆ’)]=βˆ’2x^⋆(c)[1βˆ’x^˙⋆(c+)]

which gives:

x^⋆(c)[x^˙⋆(c+)βˆ’x^˙⋆(cβˆ’)]=0,

by definition of corner points, we must havex^Λ™(c+)β‰ x^Λ™(cβˆ’) , hence corner points are only allowed at thosec∈(βˆ’1,1) such thatx^⋆(c)=0 . Observe that sincef=x2(t)(1βˆ’αΊ‹(t))2>0βˆ€β‘xβ‰ 0,αΊ‹β‰ 1 and thus the functional is bounded below by0 . This means that if we are able to find axΒ―βˆˆπ’Ÿ^ function that satisfies the necessary conditions and achieveF[xΒ―]=0 thenxΒ―=x⋆ is the global optimum of the problem. From the boundary condition we have thatxΒ―(βˆ’1)=0 andxΒ―(1)=1 and from the Weierstrass-Erdmann condition we have that we can only have corner points at time instants such thatxΒ―(c)=0 . We can then construct a functionxΒ― that is0 fromβˆ’1 to0 where it has a corner point and grows linearly with constant derivative equal to1 up to1 so thatxΒ―(1)=1 . More precisely:

x¯⋆(t)={0,βˆ’1≀t≀0t,0<t≀1

such a function achieves the global minimumF[xΒ―]=0 and satisfies the necessary conditions (i.e. it has a corner point at0 wherexΒ―(0)=0 ) and thence it is the unique global minimum point forF onπ’Ÿ^ . The global optimum is plotted in Figure 3.9.

pict

Figure 3.9:: Globally minimizing trajectory, note the corner point c=0
.

Corollary 3.1: Absence of corner points

Consider the problem to minimize the functional:

F[x^]:=∫t1t2f(t,x^(t),x^Λ™(t))dt

on π’Ÿ^:={x^βˆˆπ’ž^1[t1,t2]nx:x^(t1)=x1,x^(t2)=x2} . If βˆ‚fβˆ‚xΛ™(t,y,z) is a strictly monotone function of zβˆˆβ„nx (or, equivalently, f(t,y,z) is a convex function in z on ℝnx ), for each (t,y)∈[t1,t2]Γ—Rnx , then an extremal solution x^⋆(t) cannot have corner points.

Proof. By the first Weierstrass-Erdmann corner condition, at a corner point holds:

βˆ‚f(c,x⋆(c),x˙⋆(cβˆ’)βˆ‚xΛ™βŠ€β‘=βˆ‚f(c,x⋆(c),x˙⋆(c+)βˆ‚xΛ™βŠ€β‘.

Let’s define vector-valued function k(z):=βˆ‚f(c,x⋆(c),z)βˆ‚xΛ™βŠ€β‘ . By hypothesis k is strictly monotone in z and thus it cannot assume twice the same value. The Weierstrass-Erdmann condition can be written in terms of k as:

k(x^Λ™(cβˆ’))=k(x^Λ™(c+))β†’k(z1)=k(z2)

but by definition of corner point z1β‰ z2 thus contradicting the strictly monotonicity of k . β–‘

Example 3.13 (Minimum Path Problems). Consider again the problem in Example 3.8that is to minimize the distance between two fixed points, namelyA=(x1,y1) andB=(x2,y2) in the(x,y) -plane. We have shown that extremal trajectories for this problem correspond to straight lines. But could we have extremal trajectories with corner points? The answer is no, the lagrangian isf=1+αΊ‹2 andβˆ‚fβˆ‚αΊ‹=αΊ‹1+αΊ‹2 is a strictly monotone function inαΊ‹ hence we can apply Corollary 3.1and conclude that extremal trajectories for this problem cannot have corner points.

3.4.2 Weierstrass’ Necessary Conditions: Strong Minima

The GΓ’teaux derivatives of a functional are obtained by comparing its value at a point x with those at points x+Ξ±ΞΎ in a weak norm neighborhood. In contrast to these (weak) variations, we now consider a new type of (strong) variations whose smallness does not imply that of their derivatives. In the scalar case 6 , we consider variations Ε΄βˆˆπ’ž^1[t1,t2] defined as:

Ε΄(t):={v(tβˆ’Ο„+Ξ΄)ifΟ„βˆ’Ξ΄β‰€t≀τv(βˆ’Ξ΄(tβˆ’Ο„)+Ο„)ifτ≀t<Ο„+Ξ΄0otherwise

Where Ο„βˆˆ(t1,t2) and v and Ξ΄ are positive real coefficients such that Ο„βˆ’Ξ΄>t1 and Ο„+Ξ΄<t2 .

PIC

Figure 3.10:: Strong variation Ε΄(t) and its time derivative Ε΄Λ™(t)

Note that strong variations of this kind depend on three parameters v,Ξ΄,Ο„ . Informally speaking Ο„ is the point at which the perturbation is centered while Ξ΄ determines the extension in time of the perturbation. Note that the conditions Ο„βˆ’Ξ΄>t1 and Ο„+Ξ΄<t2 constrain the variation to lie within the open interval (t1,t2) . The parameter v modulates the magnitude of the variation and of its derivative. We are now ready to state a set of necessary conditions for a strong local minimum, whose proof is based on the foregoing class of variations.

Theorem 3.10: Weierstrass’ Necessary Condition

Consider the problem to minimize the functional:

F[x^]:=∫t1t2f(t,x^(t),x^Λ™(t))dt

on π’Ÿ^:={x^βˆˆπ’ž^1[t1,t2]nx:x^(t1)=x1,x^(t2)=x2}. Suppose x^⋆(t),t1≀t≀t2, gives a strong (local) minimum for F on π’Ÿ^ . Then,

β„°(t,x^⋆,x^˙⋆,v):=f(t,x^⋆,x^˙⋆+v)βˆ’f(t,x^⋆,x^˙⋆)βˆ’βˆ‚f(t,x^⋆,x^˙⋆)βˆ‚xΛ™vβ‰₯0

at every t∈[t1,t2] and for each vβˆˆβ„nx . ( β„° is referred to as the excess function of Weierstrass ).

Proof. For the sake of clarity, we shall present and prove this condition for scalar functions x^βˆˆπ’ž^[t1,t2] only. Let x^Ξ΄(t):=x^⋆(t)+Ε΄(t) . Note that both Ε΄ and x^⋆ being π’ž^1 functions, so is x^Ξ΄ . These smoothness conditions are sufficient to calculate F[x^Ξ΄] , as well as its derivative with respect to Ξ΄ at Ξ΄=0 . Note that x^Ξ΄ and x^⋆ differ only in the interval [Ο„βˆ’Ξ΄,Ο„+Ξ΄] . Hence, by the definition of Ε΄ , we have:

F[x^Ξ΄]βˆ’F[x^⋆]=βˆ«Ο„βˆ’Ξ΄Ο„+Ξ΄(f(t,x^Ξ΄(t),x^Λ™Ξ΄(t))βˆ’f(t,x^⋆(t),x^˙⋆(t)))dt=βˆ«Ο„Ο„+Ξ΄f(t,x^Ξ΄(t),x^Λ™Ξ΄(t))dt+βˆ«Ο„βˆ’Ξ΄Ο„f(t,x^Ξ΄(t),x^Λ™Ξ΄(t))dt+βˆ’βˆ«Ο„βˆ’Ξ΄Ο„+Ξ΄f(t,x^⋆(t),x^˙⋆(t))dt,

note that we have splitted the integral in the points of discontinuities of the derivative of Ε΄(t) . The differential quotient is:

F[x^Ξ΄]βˆ’F[x^⋆]Ξ΄=1Ξ΄βˆ«Ο„βˆ’Ξ΄Ο„(f(t,x^⋆(t)+v(tβˆ’Ο„+Ξ΄),x^˙⋆(t)+v)βˆ’f(t,x^⋆(t),x^˙⋆(t)))dt+1Ξ΄{βˆ«Ο„Ο„+Ξ΄f(t,x^⋆(t)+v(βˆ’Ξ΄(tβˆ’Ο„)+Ο„),x^˙⋆(t)βˆ’vΞ΄)dt+βˆ’βˆ«Ο„Ο„+Ξ΄f(t,x^⋆(t),x^˙⋆(t))dt}=I1Ξ΄+I2Ξ΄.
I10=lim⁑δ→0I1Ξ΄=lim⁑δ→01Ξ΄{βˆ’βˆ«Ο„Ο„βˆ’Ξ΄(f(t,x^⋆(t)+v(tβˆ’Ο„+Ξ΄),x^˙⋆(t)+v)dt+βˆ’βˆ«Ο„Ο„βˆ’Ξ΄f(t,x^⋆(t),x^˙⋆(t)))dt}==f(Ο„,x^⋆(Ο„),x^˙⋆(Ο„)+v)βˆ’f(Ο„,x^⋆(Ο„),x^˙⋆(Ο„)),

where we have applied Theorem 3.6. In order to analyse the second term we define g(t):=v(βˆ’(tβˆ’Ο„)+Ξ΄) , its time-derivative is Δ‘(t)=βˆ’v and thus we have:

I20=lim⁑δ→0I2Ξ΄=lim⁑δ→01Ξ΄{βˆ«Ο„Ο„+Ξ΄f(t,x^⋆(t)+Ξ΄g(t),x^˙⋆(t)+Ξ΄Δ‘)dt+βˆ’βˆ«Ο„Ο„+Ξ΄f(t,x^⋆(t),x^˙⋆(t))dt},

using the first-order Taylor series expansion in Ξ΄ under the integral sign we have:

f(t,x^⋆(t)+Ξ΄g(t),x^˙⋆(t)+Ξ΄Δ‘)==f[x^⋆]+βˆ‚f[x^⋆]βˆ‚xg(t)Ξ΄+βˆ‚f[x^⋆]βˆ‚αΊ‹Δ‘(t)Ξ΄+o(Ξ΄)

where the arguments have been compressed for notational simplicity. Therefore we have:

lim⁑δ→0I2Ξ΄=lim⁑δ→01Ξ΄βˆ«Ο„Ο„+Ξ΄(βˆ‚f[x^⋆]βˆ‚xg(t)+βˆ‚f[x^⋆]βˆ‚αΊ‹Δ‘(t)+o(Ξ΄))dt,

upon integration by parts of the term involving Δ‘(t) we obtain:

lim⁑δ→0I2Ξ΄=lim⁑δ→0{1Ξ΄βˆ«Ο„Ο„+Ξ΄(βˆ‚f[x^⋆]βˆ‚xβˆ’ddtβˆ‚f[x^⋆]βˆ‚αΊ‹)g(t)dt++1Ξ΄(βˆ‚f[x^⋆]βˆ‚αΊ‹g)|ττ+Ξ΄+o(Ξ΄)Ξ΄},

note that the term βˆ‚f[x^⋆]βˆ‚xβˆ’ddtβˆ‚f[x^⋆]βˆ‚αΊ‹=0 since Euler equations are necessary condition for optimality. Then by definition of small o we have lim⁑δ→0o(Ξ΄)Ξ΄=0 , finally the last term reduces to:

1Ξ΄(βˆ‚f[x^⋆]βˆ‚αΊ‹g)|ττ+Ξ΄=βˆ’βˆ‚f[x^⋆(Ο„)]βˆ‚αΊ‹v,

since g(Ο„)=0 and g(Ο„+Ξ΄)=βˆ’vΞ΄ . Since x^⋆ is a local strong minimizer we have F[x^Ξ΄]β‰₯F[x^]⋆ , for all sufficiently small Ξ΄ , and thence also in the limit

0≀lim⁑δ→0F[x^Ξ΄]βˆ’F[x^⋆]Ξ΄=I10+I20,

and thus:

f(Ο„,x^⋆(Ο„),x^˙⋆(Ο„)+v)βˆ’f(Ο„,x^⋆(Ο„),x^˙⋆(Ο„))βˆ’βˆ‚f(Ο„,x^⋆(Ο„),x^˙⋆(Ο„)βˆ‚αΊ‹vβ‰₯0

for every Ο„βˆˆβ„ and vβˆˆβ„ . β–‘

Example 3.14 (Minimum path problem II). Consider again the problem in Example 3.8that is to minimize the distance between two fixed points, namelyA=(x1,y1) andB=(x2,y2) in the(x,y) -plane. We have shown that extremal trajectories for this problem correspond to straight lines joiningA andB , that is:

y⋆(x)=C1x+C2

whereC1=y2βˆ’y1x2βˆ’x1 andC2=y1 . We now ask the question whethery⋆(x) is a strong minimum for that problem? The Weierstress excess function is:

β„°(x,y⋆,ẏ⋆,v)=1+(C1+v)2βˆ’1+C12βˆ’C1v1+(C1)2,

note that in this simple case it can be regarded as a function ofC1 andv only. The excess function is plotted as a function ofv for different values ofC1 in Figure 3.11. It is easily checked that it is always nonnegative thereforey⋆ is also a strong local minimum for the minimum path problem. In fact, we note that the functiong(z)=1+z2 is convex 7 . The Weierstrass condition is equivalent to:

g(v+C1)βˆ’g(C1)βˆ’dgdz|z=C1vβ‰₯0

That holds true for everyC1,vβˆˆβ„ being precisely the first-order characterization of convexity ofg .

pict

Figure 3.11:: Weierstress excess function for the minimum path problem as a function of v for different coefficients C1 .

The following corollary indicates that the Weierstrass condition is also useful to detect strong (local) minima in the class of π’ž1 functions.

Corollary 3.2: Weierstrass’ Necessary Condition

Consider the problem to minimize the functional:

min⁑x(t)F[x]=∫t1t2f(t,x(t),xΛ™(t))dts.t.xβˆˆπ’Ÿ

where the functional space is defined as π’Ÿ={xβˆˆπ’ž1[t1,t2]such thatx(t1)=x1,x(t2)=x2} and f:ℝ×ℝnx×ℝnx→ℝ a continuously differentiable function. Suppose that xβ‹†βˆˆπ’Ÿ gives a (local) minimum for F on π’Ÿ .
Then :

β„°(t,x^⋆,x^˙⋆,v):=f(t,x^⋆,x^˙⋆+v)βˆ’f(t,x^⋆,x^˙⋆)βˆ’βˆ‚f(t,x^⋆,x^˙⋆)βˆ‚xΛ™vβ‰₯0

at every t∈[t1,t2] and for each vβˆˆβ„nx .

Proof. By Theorem 3.8 we have that x⋆ is a local strong minimizer on π’Ÿ^ as well, therefore Theorem 3.10 holds. β–‘

Remark 3.17: Weierstrass’ Condition and Convexity

It is readily seen that the Weierstrass condition of Theorem 3.10 is satisfied automatically when the Lagrangian function f(t,y,z) is partially convex (and continuously differentiable) in zβˆˆβ„nx , for each (t,y)∈[t1,t2]×ℝnx .

Remark 3.18: Weierstrass’ Condition and Pontryagin’s Maximum Principle

Interestingly enough, the Weierstrass’ condition can be rewritten as:

f(t,x⋆(t),x˙⋆(t)+v)βˆ’βˆ‚f(t,x⋆(t),x˙⋆(t)+v)βˆ‚xΛ™(x˙⋆+v)β‰₯f(t,x⋆(t),x˙⋆(t))βˆ’βˆ‚f(t,x⋆(t),x˙⋆(t))βˆ‚xΛ™x˙⋆

which given the definition of Hamiltonian gives:

H(t,x⋆(t),x˙⋆(t)+v)≀H(t,x⋆(t),x˙⋆(t))

for each t∈[t1,t2] and for each vβˆˆβ„nx . This necessary condition prefigures Pontryagin’s Maximum Principle in optimal control theory.