Home Laboratory Computer Lab Theses

3.2 Class of functions and optimality criteria

Having defined an objective functional F [ x ] and the set of admissible trajectories x ( t ) ∈ π’Ÿ βŠ‚ 𝒳 , one must then decide about the class of functions with respect to which the optimization shall be performed. The traditional choice in the calculus of variations is to consider the class of continuously differentiable functions, e.g., π’ž 1 [ t 1 , t 2 ] . Yet, as shall be seen later on, an optimal solution may not exist in this class. In response to this, a more general class of functions shall be considered, such as the class π’ž ^ 1 [ t 1 , t 2 ] of piecewise continuously differentiable functions.

At this point, we need to define what is meant by a minimum (or a maximum) of F [ x ] on π’Ÿ . Similar to finite-dimensional optimization problems we shall say that F assumes its minimum value at x ⋆ provided that

F [ x ⋆ ] ≀ F [ x ] βˆ€ x ∈ π’Ÿ .

This assignment is global in nature and may be made without consideration of a norm (or, more generally, a distance). Yet, the specification of a norm permits an analogous description of the local behavior of F [ x ] at a point x ⋆ ∈ π’Ÿ . In particular, x βˆ— is said to be a local minimum for F [ x ] in π’Ÿ , relative to the norm β€– β‹… β€– , if

βˆƒ Ξ΄ > 0  such that  F [ x ⋆ ] ≀ F [ x ] , βˆ€ x ∈ ℬ Ξ΄ ( x ⋆ ) ∩ π’Ÿ

with ℬ Ξ΄ ( x ⋆ ) : = { x ∈ 𝒳 : β€– x βˆ’ x ⋆ β€– < Ξ΄ } . Unlike finite-dimensional linear spaces, different norms are not necessarily equivalent in infinite-dimensional linear spaces, in the sense that x ⋆ may be a local minimum with respect to one norm but not with respect to another.

Having chosen the class of functions of interest as π’ž 1 [ t 1 , t 2 ] , several norms can be used. Maybe the most natural choice for a norm on π’ž 1 [ t 1 , t 2 ] is

β€– x β€– 1 , ∞ : = max a ≀ t ≀ b | x ( t ) | + max a ≀ t ≀ b | αΊ‹ ( t ) |

since π’ž 1 [ t 1 , t 2 ] endowed with β€– β‹… β€– 1 , ∞ is a Banach space. Another choice is to endow π’ž 1 [ t 1 , t 2 ] with the maximum norm of continuous functions:

β€– x β€– ∞ : = max a ≀ t ≀ b | x ( t ) | .

The maximum norms, β€– β‹… β€– ∞ and β€– β‹… β€– 1 , ∞ are called the strong norm and the weak norm, respectively. Similarly, we shall endow π’ž 1 [ t 1 , t 2 ] n x with the strong norm β€– β‹… β€– ∞ and the weak norm β€– β‹… β€– 1 , ∞ :

β€– x β€– ∞ : = max a ≀ t ≀ b β€– x ( t ) β€– β€– x β€– 1 , ∞ : = max a ≀ t ≀ b β€– x ( t ) β€– + max a ≀ t ≀ b βˆ₯ x Λ™ ( t ) βˆ₯

where β€– x ( t ) β€– stands for any norm in ℝ n x . The strong and weak norms lead to the following definitions for a local minimum:

Definition 3.3: Strong Local Minimum, Weak Local Minimum

x ⋆ ∈ π’Ÿ is said to be a strong local minimum for F [ x ] in π’Ÿ if

βˆƒ Ξ΄ > 0  such that  F [ x ⋆ ] ≀ F [ x ] , βˆ€ x ∈ ℬ Ξ΄ ∞ ( x ⋆ ) ∩ π’Ÿ .

Likewise, x ⋆ ∈ π’Ÿ is said to be a weak local minimum for F [ x ] in π’Ÿ if

βˆƒ Ξ΄ > 0  such that  F [ x ⋆ ] ≀ F [ x ] , βˆ€ x ∈ ℬ Ξ΄ 1 , ∞ ( x ⋆ ) ∩ π’Ÿ .

Remark 3.2

Note that for strong (local) minima, we completely disregard the values of the derivatives of the comparison elements x ∈ π’Ÿ . That is, a neighborhood associated to the topology induced by β€– β‹… β€– ∞ has many more curves than in the topology induced by β€– β‹… β€– 1 , ∞ . In other words, a weak minimum may not necessarily be a strong minimum. Consider the three curves y 0 , y 1 , y 2 illustrated in Figure 3.3. y 0 and y 2 are close in the strong sense that is according to β€– β‹… β€– ∞ but not in a weak sense. While y 0 and y 1 are close both according to β€– β‹… β€– ∞ and β€– β‹… β€– 1 , ∞ . a a Note that actually to take the β€– β‹… β€– ∞ of y 2 we should make it π’ž 1 hence smoothing the corners

PIC

Figure 3.3:: Closeness in weak and strong sense

These important considerations are illustrated in the following example.

Example 3.4 (A Weak Minimum that is Not a Strong One). Consider the problem P to minimize the functional

F [ x ] : = ∫ 0 1 αΊ‹ 2 ( 1 βˆ’ αΊ‹ 2 ) d t

on π’Ÿ : = { x ∈ π’ž ^ 1 [ 0 , 1 ] : x ( 0 ) = x ( 1 ) = 0 } . We first show that the function x Β― ( t ) = 0 , 0 ≀ t ≀ 1 , is a weak local minimum for P in the topology induced by β€– β‹… β€– 1 , ∞ . Consider the open ball of radius 1 centered at x Β― , i.e. ℬ 1 1 , ∞ ( x Β― ) . For every x ∈ ℬ 1 1 , ∞ ( x Β― ) , we have

αΊ‹ ( t ) ≀ 1 , βˆ€ t ∈ [ 0 , 1 ]

hence F [ x ] β‰₯ 0 . This proves that x Β― is a local minimum for P since F [ x Β― ] = 0 . In the topology induced by βˆ₯ β‹… βˆ₯ ∞ , on the other hand, the admissible trajectories x ∈ ℬ Ξ΄ ∞ ( x Β― ) are allowed to take arbitrarily large values αΊ‹ ( t ) , 0 ≀ t ≀ 1 . Consider the sequence of functions defined by:

x k ( t ) : = { 1 k + 2 t βˆ’ 1  if  1 2 βˆ’ 1 2 k ≀ t ≀ 1 2 1 k βˆ’ 2 t + 1  if  1 2 ≀ t ≀ 1 2 + 1 2 k 0  otherwise 

clearly, x k ∈ π’ž ^ 1 [ 0 , 1 ] and x k ( 0 ) = x k ( 1 ) = 0 for each k β‰₯ 1 , i.e., x k ∈ π’Ÿ . Moreover

βˆ₯ x k βˆ₯ = max 0 ≀ t ≀ 1 | x k ( t ) | = 1 k

meaning that for every Ξ΄ > 0 , there is a k β‰₯ 1 such that x k ∈ ℬ Ξ΄ ∞ ( x Β― ) . Finally

F [ x k ] = ∫ 0 1 αΊ‹ k ( t ) 2 ( 1 βˆ’ αΊ‹ k ( t ) 2 ) d t = [ βˆ’ 1 2 t ] 1 2 βˆ’ 1 2 k 1 2 1 1 = βˆ’ 1 2 k < 0

for each k β‰₯ 1 . Therefore, the trajectory x Β― cannot be a strong local minimum for P.

3.2.1 Existence of an Optimal Solution

Prior to deriving conditions that must be satisfied for a function to be a minimum (or a maximum) of a problem of the calculus of variations, one must ask the question whether such solutions actually exist for that problem. In the case of optimization in finite-dimensional Euclidean spaces, it has been shown that a continuous function on a nonempty compact set assumes its minimum (and its maximum) value in that set (see Weierstrass Theorem 2.1). It is possible to extend this result to continuous functionals on a nonempty compact subset of a normed function space. But as attractive as this solution to the problem of establishing the existence of maxima and minima may appear, it is of little help because most of the sets of interest are too ”large” to be compact.

The principal reason is that most of the sets of interest are not bounded with respect to the norms of interest. As just an example, the set
π’Ÿ : = { x ∈ π’ž 1 [ a , b ] : x ( a ) = x a , x ( b ) = x b } is clearly not compact with respect to the strong norm βˆ₯ β‹… βˆ₯ ∞ as well as the weak norm βˆ₯ β‹… βˆ₯ 1 , ∞ , for we can construct a sequence of curves in π’Ÿ (e.g., parabolic functions satisfying the boundary conditions) which have maximum values as large as desired. That is, the problem of minimizing, say, F [ x ] = βˆ’ βˆ₯ x βˆ₯ or F [ x ] = ∫ a b x ( t ) d t , on π’Ÿ , does not have a solution.

A problem of the calculus of variations that does not have a minimum is addressed in the following:

Example 3.5 ( A Problem with No Minimum ). Consider the problem P to minimize the functional:

F [ x ] : = ∫ 0 1 x ( t ) 2 + αΊ‹ ( t ) 2 d t  on  π’Ÿ : = { x ∈ π’ž 1 [ 0 , 1 ] : x ( 0 ) = 0 , x ( 1 ) = 1 } .

Observe first that for any admissible trajectory x ( t ) joining the two end-points, we have

F [ x ] = ∫ 0 1 x 2 + αΊ‹ 2 d t > ∫ 0 1 | αΊ‹ | d t β‰₯ ∫ 0 1 αΊ‹ d t = x ( 1 ) βˆ’ x ( 0 ) = 1

That is,

F [ x ] > 1 , βˆ€ x ∈ π’Ÿ ,  and  inf { F [ x ] : x ∈ π’Ÿ } β‰₯ 1 .

Now, consider the sequence of functions { x k } in π’ž 1 [ 0 , 1 ] defined by x k ( t ) : = t k . Then

F [ x k ] = ∫ 0 1 t 2 k + k 2 t 2 k βˆ’ 2 d t = ∫ 0 1 t k βˆ’ 1 t 2 + k 2 d t ≀ ∫ 0 1 t k βˆ’ 1 ( t + k ) d t = 1 + 1 k + 1

so we have F [ x k ] β†’ k β†’ ∞ 1 . Overall, we have thus shown that inf F [ x ] = 1 . But since F [ x ] > 1 for any x in π’Ÿ we know with certainty that F has no global minimizer on π’Ÿ .