Pseudoconvex function

Article snapshot taken from Wikipedia under the Creative Commons Attribution-ShareAlike license.
This article is about the notion in convex analysis. For the notion in several complex variables, see pseudoconvex domain.

In convex analysis and the calculus of variations, both branches of mathematics, a pseudoconvex function is a function that behaves like a convex function with respect to finding its local minima, but need not actually be convex. Informally, a differentiable function is pseudoconvex if it is increasing in any direction where it has a positive directional derivative. The property must hold throughout the function's domain, not only for nearby points.

Formal definition

Consider a differentiable function $f : X \subseteq \mathbb{R}^{n} \rightarrow \mathbb{R}$, defined on a (nonempty) convex open set $X$ of the finite-dimensional Euclidean space $\mathbb{R}^{n}$. This function is said to be pseudoconvex if the following property holds:

$$\text{for all } x, y \in X: \quad \nabla f(x) \cdot (y - x) \geq 0 \Rightarrow f(y) \geq f(x).$$

Equivalently:

$$\text{for all } x, y \in X: \quad f(y) < f(x) \Rightarrow \nabla f(x) \cdot (y - x) < 0.$$

Here $\nabla f$ is the gradient of $f$, defined by:

$$\nabla f = \left( \frac{\partial f}{\partial x_{1}}, \dots, \frac{\partial f}{\partial x_{n}} \right).$$

Note that the definition may also be stated in terms of the directional derivative of $f$ in the direction given by the vector $v = y - x$. This is because, as $f$ is differentiable, this directional derivative is given by:

$$\frac{\partial f}{\partial v}(x) = \nabla f(x) \cdot v = \nabla f(x) \cdot (y - x).$$
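
The defining implication can be spot-checked numerically for a function of one variable. The sketch below is only an illustration: the helper name `is_pseudoconvex_on_grid`, the grid, and the tolerance are assumptions made here, not part of the definition. A grid check can refute pseudoconvexity by finding a counterexample, but it cannot prove it.

```python
import numpy as np

def is_pseudoconvex_on_grid(f, df, grid, tol=1e-12):
    """Test f'(x) * (y - x) >= 0  =>  f(y) >= f(x) on all grid pairs (1-D only).

    Hypothetical helper: returns False if a counterexample is found on the
    grid, True otherwise (which is evidence, not a proof).
    """
    x = grid[:, None]  # base points x, as a column
    y = grid[None, :]  # comparison points y, as a row
    premise = df(x) * (y - x) >= 0
    conclusion = f(y) >= f(x) - tol
    # The implication fails exactly where the premise holds but the conclusion does not.
    return not np.any(premise & ~conclusion)

# Grid built from integers so that x = 0 is represented exactly.
grid = np.arange(-200, 201) / 100.0

print(is_pseudoconvex_on_grid(lambda t: t**3 + t, lambda t: 3 * t**2 + 1, grid))  # True
print(is_pseudoconvex_on_grid(lambda t: t**3, lambda t: 3 * t**2, grid))          # False
```

The second call fails because of the pair $x = 0$, $y < 0$: the derivative vanishes at $0$, so the premise holds, yet $f(y) < f(0)$.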

Properties

Relation to other types of "convexity"

Every convex function is pseudoconvex, but the converse is not true. For example, the function $f(x) = x + x^{3}$ is pseudoconvex but not convex. Similarly, any pseudoconvex function is quasiconvex, but the converse is not true, since the function $f(x) = x^{3}$ is quasiconvex but not pseudoconvex. This can be summarized schematically as:

convex $\Rightarrow$ pseudoconvex $\Rightarrow$ quasiconvex

Figure: the functions $x^{3}$ (quasiconvex but not pseudoconvex) and $x^{3} + x$ (pseudoconvex and thus quasiconvex). Neither is convex.

To see that $f(x) = x^{3}$ is not pseudoconvex, consider its derivative at $x = 0$: $f'(0) = 0$. If $f(x) = x^{3}$ were pseudoconvex, we would have:

$$f'(0)(y - 0) = 0 \geq 0 \Rightarrow f(y) \geq f(0), \quad \forall\, y \in \mathbb{R}.$$

In particular, this should hold for $y = -1$. But it does not, since $f(-1) = (-1)^{3} = -1 < f(0) = 0$.
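
By contrast, a short worked argument (included here as an illustration) shows why $f(x) = x + x^{3}$ satisfies the definition while failing to be convex. Its derivative is strictly positive, so the sign of $f'(x)(y - x)$ is the sign of $y - x$, and $f$ is strictly increasing:

$$
\begin{aligned}
f'(x) &= 1 + 3x^{2} > 0 \quad \text{for all } x \in \mathbb{R}, \\
f'(x)(y - x) \geq 0 &\iff y \geq x \implies f(y) \geq f(x), \\
f''(x) &= 6x < 0 \quad \text{for } x < 0.
\end{aligned}
$$

The second line is the pseudoconvexity condition; the third shows that $f$ is not convex on the negative half-line.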

Sufficient optimality condition

For any differentiable function, Fermat's theorem gives a necessary condition of optimality: if $f$ has a local minimum at $x^{*}$ in an open domain, then $x^{*}$ must be a stationary point of $f$ (that is, $\nabla f(x^{*}) = 0$).

Pseudoconvexity is of great interest in optimization because the converse also holds for any pseudoconvex function: if $x^{*}$ is a stationary point of a pseudoconvex function $f$, then $f$ has a global minimum at $x^{*}$. Note that the result guarantees a global minimum, not only a local one.
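
The sufficiency claim follows directly from the formal definition. As a brief worked step, take $x = x^{*}$ in the defining implication: the premise holds trivially for every $y$ because the gradient vanishes.

$$
\nabla f(x^{*}) = 0 \;\implies\; \nabla f(x^{*}) \cdot (y - x^{*}) = 0 \geq 0 \quad \forall\, y \in X \;\implies\; f(y) \geq f(x^{*}) \quad \forall\, y \in X.
$$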

This last result is also true for a convex function, but it is not true for a quasiconvex function. Consider for example the quasiconvex function:

$$f(x) = \frac{e^{x}}{x^{2} + 1} + \frac{1}{e^{x}}.$$

This function is not pseudoconvex, but it is quasiconvex. Also, the point $x = 0$ is a critical point of $f$, as $f'(0) = 0$. However, $f$ does not have a global minimum at $x = 0$ (not even a local minimum).

Figure: example of a quasiconvex function that is not pseudoconvex. The function has a critical point at $x = 0$, but this is not a minimum.
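
The behaviour at the critical point can be confirmed with a few evaluations. This is a minimal sketch: the derivative formula in `df` was worked out by hand from the quotient rule for this illustration and is not quoted from the article.

```python
import math

def f(x):
    # The quasiconvex example: e^x / (x^2 + 1) + e^(-x)
    return math.exp(x) / (x**2 + 1) + math.exp(-x)

def df(x):
    # Hand-derived derivative: e^x (x - 1)^2 / (x^2 + 1)^2 - e^(-x)
    return math.exp(x) * (x - 1) ** 2 / (x**2 + 1) ** 2 - math.exp(-x)

print(df(0.0))                      # 0.0, so x = 0 is a critical point
print(f(-0.1) > f(0.0) > f(0.1))    # True: f keeps decreasing through x = 0
```

Since $f$ takes smaller values just to the right of $0$, the critical point is neither a local nor a global minimum.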

Finally, note that a pseudoconvex function may not have any critical point. Take for example the pseudoconvex function $f(x) = x^{3} + x$, whose derivative is always positive: $f'(x) = 3x^{2} + 1 > 0$ for all $x \in \mathbb{R}$.

Examples

An example of a function that is pseudoconvex, but not convex, is:

$$f(x) = \frac{x^{2}}{x^{2} + k}, \quad k > 0.$$

The figure shows this function for the case where $k = 0.2$. This example may be generalized to two variables as:

$$f(x, y) = \frac{x^{2} + y^{2}}{x^{2} + y^{2} + k}, \quad k > 0.$$

Figure: a pseudoconvex function that is not convex, $x^{2} / (x^{2} + 0.2)$.
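
Both claims about the one-variable example can be probed numerically. The sketch below assumes $k = 0.2$ as in the figure; the test points $1, 2, 3$, the grid, and the tolerance are arbitrary choices made for this illustration. The grid search can only fail to find a counterexample to pseudoconvexity; it does not prove the property.

```python
import numpy as np

k = 0.2  # value used in the figure

def f(x):
    return x**2 / (x**2 + k)

def df(x):
    # Derivative by the quotient rule: 2 k x / (x^2 + k)^2
    return 2 * k * x / (x**2 + k) ** 2

# Convexity would require f(2) <= (f(1) + f(3)) / 2; here the inequality is violated.
midpoint_violation = f(2.0) > 0.5 * (f(1.0) + f(3.0))
print(midpoint_violation)  # True

# Search all grid pairs for a violation of f'(x)(y - x) >= 0  =>  f(y) >= f(x).
grid = np.arange(-300, 301) / 100.0
x, y = grid[:, None], grid[None, :]
counterexamples = (df(x) * (y - x) >= 0) & (f(y) < f(x) - 1e-12)
print(counterexamples.any())  # False
```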

The previous example may be modified to obtain a function that is neither convex nor pseudoconvex, but is quasiconvex:

$$f(x) = \frac{|x|^{p}}{|x|^{p} + k}, \quad k > 0, \; p \in (0, 1).$$

The figure shows this function for the case where $k = 0.5$, $p = 0.6$. As can be seen, this function is not convex because it is concave on either side of the origin, and it is not pseudoconvex because it is not differentiable at $x = 0$.

Figure: a quasiconvex function that is neither convex nor pseudoconvex.

Generalization to nondifferentiable functions

The notion of pseudoconvexity can be generalized to nondifferentiable functions as follows. Given any function $f : X \rightarrow \mathbb{R}$, we can define the upper Dini derivative of $f$ by:

$$f^{+}(x, u) = \limsup_{h \to 0^{+}} \frac{f(x + hu) - f(x)}{h},$$

where $u$ is any unit vector. The function is said to be pseudoconvex if it is increasing in any direction where the upper Dini derivative is positive. More precisely, this is characterized in terms of the subdifferential $\partial f$ as follows:

For all $x, y \in X$: if $x^{*} \in \partial f(x)$ is such that $\langle x^{*}, y - x \rangle \geq 0$, then $f(x) \leq f(z)$ for all $z \in [x, y]$;

where $[x, y]$ denotes the line segment joining $x$ and $y$.
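
The upper Dini derivative can be estimated crudely by taking the largest difference quotient over a few small step sizes. This is only a rough sketch for functions of one variable: the helper `upper_dini` and its default step sizes are assumptions made here, and a finite sample cannot reproduce a true limit superior.

```python
def upper_dini(f, x, u, hs=None):
    """Crude estimate of f^+(x, u): max difference quotient over small h > 0.

    Hypothetical helper for 1-D functions; u should be +1.0 or -1.0.
    """
    if hs is None:
        hs = [10.0 ** (-k) for k in range(4, 10)]
    return max((f(x + h * u) - f(x)) / h for h in hs)

# f(x) = |x| is not differentiable at 0, but its Dini derivatives there exist.
print(upper_dini(abs, 0.0, 1.0))   # 1.0
print(upper_dini(abs, 0.0, -1.0))  # 1.0

# For |x|^p / (|x|^p + k) with k = 0.5, p = 0.6, the quotients at 0 keep growing
# as h shrinks, consistent with the function not being differentiable there.
def g(x):
    return abs(x) ** 0.6 / (abs(x) ** 0.6 + 0.5)

quotients = [(g(h) - g(0.0)) / h for h in (1e-2, 1e-4, 1e-6)]
print(quotients[0] < quotients[1] < quotients[2])  # True
```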

Related notions

A pseudoconcave function is a function whose negative is pseudoconvex. A pseudolinear function is a function that is both pseudoconvex and pseudoconcave. For example, linear-fractional programs have pseudolinear objective functions and linear inequality constraints. These properties allow linear-fractional problems to be solved by a variant of the simplex algorithm (of George B. Dantzig).
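
The pseudolinearity of a linear-fractional objective can be verified by a direct computation. The following worked derivation is a sketch under stated assumptions: the objective is written as $f(x) = (a \cdot x + b)/(c \cdot x + d)$ on a convex set where $c \cdot x + d > 0$, and the symbols $a, b, c, d$ are introduced here for illustration only.

$$
\begin{aligned}
\nabla f(x) &= \frac{a - f(x)\, c}{c \cdot x + d}, \\
\nabla f(x) \cdot (y - x) &= \frac{a \cdot (y - x) - f(x)\, c \cdot (y - x)}{c \cdot x + d} \\
&= \frac{f(y)(c \cdot y + d) - f(x)(c \cdot x + d) - f(x)(c \cdot y - c \cdot x)}{c \cdot x + d} \\
&= \bigl(f(y) - f(x)\bigr)\, \frac{c \cdot y + d}{c \cdot x + d}.
\end{aligned}
$$

Because both denominators are positive, $\nabla f(x) \cdot (y - x)$ has the same sign as $f(y) - f(x)$. Hence $\nabla f(x) \cdot (y - x) \geq 0$ implies $f(y) \geq f(x)$, and $\nabla f(x) \cdot (y - x) \leq 0$ implies $f(y) \leq f(x)$, so $f$ is both pseudoconvex and pseudoconcave.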

Given a vector-valued function $\eta$, there is a more general notion of $\eta$-pseudoconvexity and $\eta$-pseudolinearity, wherein classical pseudoconvexity and pseudolinearity pertain to the case when $\eta(x, y) = y - x$.
