A variety of different definitions of optimality are used in different contexts. It is important to understand fully each definition and the context within which it is appropriately used.
Definition 2.8: Infimum/Supremum
The infimum of , denoted as , provided it exists, is the greatest lower bound for , i.e. a number satisfying:
Similarly, the supremum of , denoted as , provided it exists, is the least upper bound for , i.e. a number satisfying:
The first question one may ask concerns the existence of infima and suprema in . In particular, one cannot prove that in every set bounded from above has a supremum and every set bounded from below has an infimum. This is an axiom, known as the axiom of completeness:
Axiom 2.1: of Completeness
If a nonempty subset of real numbers has an upper bound, then it has a least upper bound. If a nonempty subset of real numbers has a lower bound, it has a greatest lower bound.
Remark 2.6
It is important to note that the real number (respectively ), with a nonempty set in bounded from below (respectively from above), although it exist, need not be an element of .
Notation 2.1
Let be the image of the feasible set of an optimization problem under the objective function . Then, the notation
refers to the number . Likewise, the notation
refers to .
Clearly, the numbers and may not be attained by the value at any . This is illustrated in an example below.
Remark 2.7
By convention, the infimum of an empty set is while the supremum of an empty set is . That is, if the values are allowed, then infima and suprema always exist.
Consider the standard problem formulation
where denotes the feasible set. Any is said to be a feasible point. Conversely, any is said to be an infeasible point.
Definition 2.9: (Global) Minimum, Strict (Global) Minimum
A point is said to be a (global) a minimum of on if
(2.1) |
i.e. a minimum is a feasible point whose objective function value is less than or equal to the objective function value of all other feasible points. It is said to be a strict (global) minimum of on if
A (global) maximum is defined by reversing the inequality in the above definition:
Definition 2.10: (Global) Maximum, Strict (Global) Maximum
A point is said to be a (global) maximum of on if
(2.2) |
It is said to be a strict (global) maximum of on if
Remark 2.8
The important distinction between minimum (respectively maximum) and infimum (respectively supremum) is that the value must be attained at one or more points , whereas the value does not necessarily have to be attained at any points . Yet, if a minimum (respectively maximum) exists, then its optimal value will equal the infimum (respectively supremum).
Remark 2.9
Note also that if a minimum exists, it is not necessarily unique. That is, there may be multiple or even an infinite number of feasible points that satisfy the inequality 2.1 and are thus minima.
Notation 2.2
Since there is in general a set of points that are minima, the notation
is introduced to denote the set of minima. This is a (possibly empty) set a a The notation is also used by some authors. In this case, should be understood as a function returning a point that minimizes on . in .
Remark 2.10
A minimum is often referred to as an optimal solution, a global optimal solution, or simply a solution of the optimization problem. The real number is known as the (global) optimal value or optimal solution value.
Notation 2.3
Regardless of the number of minima, there is always a unique real number that is the optimal value (if it exists). The notation is used to refer to this real value.
Unless the objective function and the feasible set possess special properties (e.g. convexity), it is usually very hard to devise algorithms that are capable of locating or estimating a global minimum or a global maximum with certainty. This motivates the definition of local minima and maxima, which, by the nature of their definition in terms of local information, are much more convenient to locate with an algorithm.
Definition 2.11: Local Minimum, Strict Local Minimum
A point is said to be a local minimum of on if
A point is said to be a strict local minimum of on if
The qualifier ’local’ originates from the requirement that be a minimum only for those feasible points in a neighbourhood around the local minimum.
Remark 2.11
Trivially, the property of being a global minimum implies that is also a local minimum because a global minimum is local minimum with set arbitrarily large.
Definition 2.12: Local Maximum, Strict Local Maximum
A point is said to be a local maximum of on if
A point is said to be a strict local maximum of on if
The qualifier ’local’ originates from the requirement that be a maximum only for those feasible points in a neighbourhood around the local maximum.
Remark 2.12
Trivially, the property of being a global maximum implies that is also a local maximum because a global maximum is local maximum with set arbitrarily large.
Remark 2.13
It is important to note that the concept of a global minimum or a global maximum of a function on a set is defined without the notion of a distance (or a norm in the case of a vector space). In contrast, the definition of a local minimum or a local maximum requires that a distance be specified on the set of interest. In , norms are equivalent, and it is readily shown that local minima (respectively maxima) in ( , ) match local minima (respectively maxima) in (( , ), for any two arbitrary norms and in (e.g. the Euclidean norm and the infinite norm ). Yet, this nice property does not hold in linear functional spaces, as those encountered in problems of the calculus of variations and optimal control.
Figure ?? illustrates the various definitions of minima and maxima. Point is the unique global maximum; the objective value at this point is also the supremum. Points , , and are strict local minima because there exists a neighbourhood around each of these point for which , , or is the unique minimum (on the intersection of this neighbourhood with the feasible set ). Likewise, point is a strict local maximum. Point is the unique global minimum; the objective value at this point is also the infimum. Finally, point is simultaneously a local minimum and a local maximum because there are neighbourhoods for which the objective function remains constant over the entire neighbourhood; it is neither a strict local minimum, nor a strict local maximum.
Example 2.4. Consider the function
The point is a local minimum for
with value . The optimal value is and .
A crucial question when it comes to optimizing a function on a given set, is whether a minimizer or a maximizer exist for that function in that set. Strictly, a minimum or maximum should only be referred to when it is known to exist.
Figure ?? illustrates three instances where a minimum does not exist. In figure ??a, the infimum of over is given by , but since is not closed and, in particular, , a minimum does not exist. In figure ??b, the infimum of over is given by the limit of as approaches from the left, i.e. . However, because is discontinuous at , a minimizing solution does not exist. Finally, figure ??c illustrates a situation within which is unbounded over the unbounded set .
We now formally state and prove the result that if is nonempty, closed, and bounded, and if is continuous on , then, unlike the various situations of figure ??, a minimum exists.
Theorem 2.1: Weierstrass’ Theorem
Let be a nonempty compact set and let be continuous on . Then, the problem attains its minimum, that is, there exists a minimizing solution to this problem.
Proof. Since is continuous on and is both closed and bounded, is bounded below on . Consequently, since , there exists a greatest lower bound (see previous axiom). Now, let , and consider the set for . By the definition of an infimum, for each , and so we may construct a sequence of points by selecting a point for each . Since is bounded, there exists a convergent subsequence indexed by the set . Let denote its limit. By the closedness of , we have and by the continuity of on , since , we have . Hence, we have shown that there exist a solution such that , i.e. is a minimizing solution. □
Remark 2.14
The hypotheses of theorem 2.1 can be justified as follows:
Example 2.5. Theorem 2.1 establishes that a minimum (and a maximum) of
exists, since is a nonempty compact set and is a continuous function on . On the other hand, minima can still exist even though the set is not compact or the function is not continuous, for theorem 2.1 only provides a sufficient condition. This is the case for the problem
which has a minimum at .
Example 2.6. Consider the NLP problem
The objective function being continuous and the feasible region being nonempty, closed and bounded, the existence of a minimum to this problem directly follows from theorem 2.1 .