Exponential transformation: Difference between revisions

From Cornell University Computational Optimization Open Textbook - Optimization Wiki
Jump to navigation Jump to search
(Add Hessian matrix (half))
No edit summary
Line 3: Line 3:
== Introduction ==
== Introduction ==


Exponential transformations are simple algebraic transformations of monomial functions through a variable substitution with an exponential variable. In computational optimization, exponential transformations are used for convexification of geometric programming constraints (posynomial) nonconvex optimization problems. Exponential transformation creates a convex function without changing the decision space of the problem. <ref name =":0"> D. Li and M. P. Biswal, [https://doi.org/10.1023/A:1021708412776 "Exponential Transformation in Convexifying a Noninferior Frontier and Exponential Generating Method]," ''Journal of Optimization Theory and Applications'', vol. 99, pp. 183–199, 1998.</ref> The transformation can then be verified using the Hessian positive definite test to confirm that it is now a convex function. Through the use of exponential transformations, the overall time to solve a Nonlinear Programming (NLP) or a Mixed Integer Nonlinear Programming (MINLP) problem is reduced and simplifies the solution space enough to utilize normal NLP/MINLP solvers.  
Exponential transformations are simple algebraic transformations of monomial functions through a variable substitution with an exponential variable. In computational optimization, exponential transformations are used for convexification of geometric programming constraints nonconvex optimization problems. The constraints for geometric programs are posynomial functions which are characterized by being positive polynomials.  


== Theory & Methodology ==
Exponential transformation creates a convex function without changing the decision space of the problem. <ref name =":0"> D. Li and M. P. Biswal, [https://doi.org/10.1023/A:1021708412776 "Exponential Transformation in Convexifying a Noninferior Frontier and Exponential Generating Method]," ''Journal of Optimization Theory and Applications'', vol. 99, pp. 183–199, 1998.</ref> This is done through a simple substitution of continuous variables with a natural exponent and simplifying binary variables by removing the exponents due to the binary nature of the variable. The transformation can then be verified using the Hessian positive definite test to confirm that it is now a convex function.


Exponential transformation can be applied to geometric programs. A geometric program is a mathematical optimization problem. Geometric Programs take the form of:
Using exponential transformations, the overall time to solve a Nonlinear Programming (NLP) or a Mixed Integer Nonlinear Programming (MINLP) problem is reduced and simplifies the solution space enough to utilize conventional NLP/MINLP solvers. This can be seen used in various real world optimization problem applications to simplify the solution space as real world applications have extensive quantities of constraints and variables.


<math>\begin{align}
== Theory & Methodology ==
\min & \quad f_0(x) \\
Exponential transformation is an algebraic transformation
s.t. & \quad f_i(x) \leq 1 \quad  i = 1,...,m  \\
& \quad g_i(x) = 1  \quad i = 1,...,p
\end{align}</math>


convexification of a non-inferior frontier - based on having a differentiable objective function<ref name=":0"></ref>
In non linear programming an exponential transformation are used for geometric programs which are composed of posynomial functions in the optimization function and constraints.


this would apply to any polynomial. A posynomial is defined as define polynomials:  
A posynomial function which can also be referred to as a posynomial is defined as a positive polynomial function. <ref name =":1"> S. Boyd, S. J. Kim, and L. Vandenberghe ''et al.'', [https://doi.org/10.1007/s11081-007-9001-7, "A tutorial on geometric programming]," ''Optimization and Engineering'', vol. 8, article 67, 2007.</ref>


Exponential transformation begins with a posynomial (Positive and Polynomial) noncovex function of the form <ref name =":1"> S. Boyd, S. J. Kim, and L. Vandenberghe ''et al.'', [https://doi.org/10.1007/s11081-007-9001-7, "A tutorial on geometric programming]," ''Optimization and Engineering'', vol. 8, article 67, 2007.</ref> :
Exponential transformation begins with a posynomial (Positive and Polynomial) noncovex function of the form <ref name =":1"> S. Boyd, S. J. Kim, and L. Vandenberghe ''et al.'', [https://doi.org/10.1007/s11081-007-9001-7, "A tutorial on geometric programming]," ''Optimization and Engineering'', vol. 8, article 67, 2007.</ref> :
Line 30: Line 27:


<math> f(u) = \sum_{k=1}^N c_k{{e}^{{u_1}{a_{1k}}}{e}^{{u_2}{a_{2k}}}....{e}^{{u_n}{a_{nk}}}} </math>
<math> f(u) = \sum_{k=1}^N c_k{{e}^{{u_1}{a_{1k}}}{e}^{{u_2}{a_{2k}}}....{e}^{{u_n}{a_{nk}}}} </math>
Exponential transformation can be applied to geometric programs. A geometric program is a mathematical optimization problem where the objective function is a posynomial which is minimized. For Geometric programs in standard form the objective must be a posynomial and the constraints must be posynomials <math> \leg 1 </math> or monomials equal to 1.
Geometric Programs in standard form is represented by:
<math>\begin{align}
\min & \quad f_0(x) \\
s.t. & \quad f_i(x) \leq 1  i = 1,....,m  \\
& \quad g_i(x) = 1  i = 1,....,p
\end{align}</math>


===Exponential Transformation in Computational Optimization===
===Exponential Transformation in Computational Optimization===
Exponential transformation can be used for convexification of any Geometric MINLP that meets the criteria of equation 1. This is done by turning the problem into a nonlinear convex optimization problem through exponential transformation. Using the exponential substitution detailed above, all continuous variables in the function are transformed while binary variables are not transformed.  
Exponential transformation can be used for convexification of any Geometric MINLP that meets the criteria for a geometric program. In a geometric program the approach to solving efficiently is taken by transforming the optimization problem into a convex nonlinear problem. This is done by turning the problem into a nonlinear convex optimization problem through exponential transformation. Using the exponential substitution detailed above, all continuous variables in the function are transformed while binary variables are not transformed. Through this transformation the constraints are also convex.
 
In a special case binary exponential transformation can also be applied where binary variables are linearized since their possible allocation is 0 or 1. Binary exponential transformation can be done by using the following replacement <math> y^n is substituted by y </math> REFERENCE TEXTBOOK


Additionally, as presented in Theorem 1 and accompanying proof in "Global optimization of signomial geometric programming using linear relaxation" by P. Shen, K. Zhang, given that a function is being minimized, it shows that after transformation all points on the transformed function are feasible in the original function and all objective values in the transformed function are the same or less than the original function. <ref name =":1"></ref> Also presented by Li and Biswal, the bounds of the problem are not altered through exponential transformation. <ref name=":0"></ref>
Additionally, as presented in Theorem 1 and accompanying proof in "Global optimization of signomial geometric programming using linear relaxation" by P. Shen, K. Zhang, given that a function is being minimized, it shows that after transformation all points on the transformed function are feasible in the original function and all objective values in the transformed function are the same or less than the original function. <ref name =":1"></ref> this creates a convex under estimator approach to the problem.
 
Also as presented by Li and Biswal, the bounds of the problem are not altered through exponential transformation. <ref name=":0"></ref>


== Numerical Example ==
== Numerical Example ==
Line 45: Line 56:
Substituting <math> x_1 = e^{u_1}, x_2 = e^{u_2} </math>
Substituting <math> x_1 = e^{u_1}, x_2 = e^{u_2} </math>


<math>{e^{{3}{u_1}}}*{e^{{-4}{u_2}}} + {e^{{2}{u_1}}} + {e^{{\frac{2}{3}}{u_2}}} \leq 4 </math>
<math>{e^{{3}{u_1}}}*{e^{{-4}{u_2}}} + {e^{{2}{u_1}}} + {{e^{u_2}}^{\frac{2}{3}}} \leq 4 </math>


Simplifying by exponent properties
Simplifying by exponent properties
Line 56: Line 67:


<math>\begin{align}
<math>\begin{align}
\min & \quad Z = 5{x_1^2}{x_2^8} + 2{x_1} + {x_2^3} + 5{y_1} + 2 {y_2^2} \\
\min & \quad Z = 5{x_1^2}{x_2^8} + 2{x_1} + \frac{x_2^3} + 5{y_1} + 2 {y_2^2} \\
s.t. & \quad {x_1} \leq 7{x_2^{0.2}} \\
s.t. & \quad {x_1} \leq 7{x_2^{0.2}} \\
& \quad 2{x_1^3} - y_1^2 \leq 1 \\
& \quad 2{x_1^3} - y_1^2 \leq 1 \\
Line 64: Line 75:
\end{align} </math>  
\end{align} </math>  


Using the exponential transformation to continuous variables <math>x_1, x_2 </math>  by substituting <math> x_1 = e^{u_1}</math> and <math> x_2 = e^{u_2} </math> described the problem becomes the following:
Using the exponential transformation to continuous variables <math>x_1, x_2, x_3 </math>  by substituting <math> x_1 = e^{u_1}</math> and <math> x_2 = e^{u_2} </math> described the problem becomes the following:


<math>\begin{align}
<math>\begin{align}
\min & \quad Z = 5{e^{2{u_1}}}{e^{8{u_2}}} + 2{e^{u_1}}{e^{2u_2}} + {e^{{3}{u_2}}} + 5{y_1} + 2 {y_2^2} \\
\min & \quad Z = 5{e^{2{u_1}}}{e^{8{u_2}}} + 2{e^{u_1}}{e^{2u_2}} + \frac{e^{{u_2}{3}}} + 5{y_1} + 2 {y_2^2} \\
s.t. & \quad {e^{u_1}} \leq 7{e^{0.2{u_2}}} \\
s.t. & \quad {e^{u_1}} \leq 7{e^{0.2{u_2}}} \\
& \quad 2{e^{3{u_1}}} - y_1 \leq 1 \\
& \quad 2{e^{3{u_1}}} - y_1^2 \leq 1 \\
& \quad e^{u_1} \geq 0 \\
& \quad e^{u_1} \geq 0 \\
& \quad e^{u_2} \leq 4 \\
& \quad e^{u_2} \leq 4 \\
Line 78: Line 89:


<math>\begin{align}
<math>\begin{align}
\min & \quad Z = 5{e^{2{u_1}}}{e^{8{u_2}}} + 2{e^{u_1}}{e^{2u_2}} + {e^{{3}{u_2}}} + 5{y_1} + 2 {y_2^2}  \\
\min & \quad Z = 5{e^{2{u_1}}}{e^{8{u_2}}} + 2{e^{u_1}}{e^{2u_2}} + \frac{e^{{u_2}{3}}} + 5{y_1} + 2 {y_2^2}  \\
s.t. & \quad u_1 \leq \ln 7 + 0.2{u_2} \\
s.t. & \quad u_1 \leq \ln 7 + 0.2{u_2} \\
& \quad 2{e^{3{u_1}}} - y_1 \leq 1 \\
& \quad 2{e^{3{u_1}}} - y_1^2 \leq 1 \\
& \quad {u_2} \leq \ln 4 \\
& \quad {u_2} \leq \ln 4 \\
& \quad y_1 = 0,1 \quad y_2 = 0,1
& \quad y_1 = 0,1 \quad y_2 = 0,1
Line 87: Line 98:
Where <math> u_1 </math> is unbounded due to logarithmic of 0 being indefinite.
Where <math> u_1 </math> is unbounded due to logarithmic of 0 being indefinite.


The transformed objective function can be shown to be convex through the positive-definite test of the Hessian. For the example above, the Hessian is as follows <ref> M. Chiang, [https://www.princeton.edu/~chiangm/gp.pdf "Geometric Programming for Communication Systems]," 2005. </ref>:
Additionally simplifying further for binary variables by substituting <math> {y_1}^2 with {y_1}  and {y_2}^2 with {y_2} </math> since <math> {y_2} </math> is either 0 or 1 and any exponents on the variable will not change the solution space:
 
<math>\begin{align}
\min & \quad Z = 5{e^{2{u_1}}}{e^{8{u_2}}} + 2{e^{u_1}}{e^{2u_2}} + \frac{e^{{u_2}{3}}} + 5{y_1} + 2 {y_2}  \\
s.t. & \quad u_1 \leq \ln 7 + 0.2{u_2} \\
& \quad 2{e^{3{u_1}}} - y_1 \leq 1 \\
& \quad {u_2} \leq \ln 4 \\
& \quad y_1 = 0,1 \quad y_2 = 0,1
\end{align}</math>


In order to prove the convexity of the transformed functions, the positive definite test of Hessian is used as defined in ''Optimization of Chemical Processes'' <ref> T. F. Edgar, D. M. Himmelblau, and L. S. Lasdon, ''Optimization of Chemical Processes'', McGraw-Hill, 2001.</ref>. This tests the Hessian defined as:
The transformed objective function can be shown to be convex through the positive-definite test of the Hessian. In order to prove the convexity of the transformed functions, the positive definite test of Hessian is used as defined in ''Optimization of Chemical Processes'' <ref> T. F. Edgar, D. M. Himmelblau, and L. S. Lasdon, ''Optimization of Chemical Processes'', McGraw-Hill, 2001.</ref>. This tests the Hessian defined as:


<math>  H(x) = H = \nabla^2f(x)</math>
<math>  H(x) = H = \nabla^2f(x)</math>
Line 103: Line 122:
for all <math> x \neq 0 </math>
for all <math> x \neq 0 </math>


for functions the Hessian is defined as:
For the example above, the Hessian is as follows <ref> M. Chiang, [https://www.princeton.edu/~chiangm/gp.pdf "Geometric Programming for Communication Systems]," 2005. </ref>:


<math>  
<math>  
Line 130: Line 149:
==Applications==
==Applications==


Currently, various applications of exponential transformation can be seen in published journal articles and industry practices. Due to the closeness with logarithmic transformation, usually a combination of the approaches are used in practical solutions.   
Currently, various applications of exponential transformation can be seen in published journal articles and industry practices. Many of these applications use exponential transformation to convexify their problem space. Due to the closeness with logarithmic transformation, usually a combination of the approaches is used in practical solutions.   


Global optimization of heat exchanger network synthesis problems with and without the isothermal mixing assumption:
=== Mechanical Engineering Applications ===
*As seen in eq(34) and (35) of the work by Björk and Westerlund, they employ an exponential transformation to convexify their optimization problem to employ a global optimization approach. <ref>K. J. Björk and T. Westerlund, [https://www.sciencedirect.com/science/article/abs/pii/S0098135402001291?casa_token=G7OVOrBKagoAAAAA:1hUCHkGascVlawR3OfBpolNXlFqPSBUhWL6MkVAhn-ofKVfF-CbhVK6ZfSCKQ7i6mRQ9MTaqf9Q, "Global optimization of heat exchanger network synthesis problems with and without the isothermal mixing assumption]," ''Computers & Chemical Engineering'', vol. 26, issue 11, pp. 1581-1593, 2002.</ref>
In the paper “Global optimization of heat exchanger network synthesis problems with and without the isothermal mixing assumption” a global optimization approach is explored for the synthesis of heat exchanger networks. As seen in eq(34) and (35) of the work by Björk and Westerlund, they employ an exponential transformation to convexify their optimization problem to employ a global optimization approach. <ref>K. J. Björk and T. Westerlund, [https://doi.org/10.1016/S0098-1354(02)00129-1, "Global optimization of heat exchanger network synthesis problems with and without the isothermal mixing assumption]," ''Computers & Chemical Engineering'', vol. 26, issue 11, pp. 1581-1593, 2002.</ref>




Electrical Engineering Application: <ref>C. Chu and D. F. Wong, [http://home.eng.iastate.edu/~cnchu/pubs/j08.pdf, "VLSI Circuit Performance Optimization by Geometric Programming]," ''Annals of Operations Research'', vol. 105, pp. 37-60, 2001.</ref>  
=== Electrical Engineering Application: ===
Applications for VLSI circuit performance optimization. In this application a special geometric program defined as unary geometric programs is presented. The unary geometric program is a posynomial as defined in the [[#Theory & Methodology|Theory & Methodology]] section. The unary geometric program is derived through a greedy algorithm which implements a logarithmic transformation within lemma 5. <ref> http://home.eng.iastate.edu/~cnchu/pubs/j08.pdf </ref> While this is not a specific exponential transformation example logarithmic transformations are within the same family and can also be used to convexify geometric programs.




Quadratic Geometric Programming Economics: <ref>T. R. Jefferson and C. H. Scott, [https://link.springer.com/content/pdf/10.1007/BF02591746.pdf, "Quadratic geometric programming with application to machining economics]," ''Mathematical Programming'', vol. 31, pp. 137-152, 1985.</ref>  
=== Machining Economics: ===
Applications in economics can be seen through geometric programming approaches. Examples and applications include to analyze the life of cutting tools in machining. In this approach they use exponential transformations to convexify the problem.
<ref> https://link.springer.com/content/pdf/10.1007/BF02591746.pdf </ref>


==Conclusion==
==Conclusion==

Revision as of 22:54, 13 December 2021

Author: Daphne Duvivier, Daniela Gil, Jacqueline Jackson, Sinclaire Mills, Vanessa Nobre (SYSEN 5800, Fall 2021)

Introduction

Exponential transformations are simple algebraic transformations of monomial functions through a variable substitution with an exponential variable. In computational optimization, exponential transformations are used for convexification of geometric programming constraints nonconvex optimization problems. The constraints for geometric programs are posynomial functions which are characterized by being positive polynomials.

Exponential transformation creates a convex function without changing the decision space of the problem. [1] This is done through a simple substitution of continuous variables with a natural exponent and simplifying binary variables by removing the exponents due to the binary nature of the variable. The transformation can then be verified using the Hessian positive definite test to confirm that it is now a convex function.

Using exponential transformations, the overall time to solve a Nonlinear Programming (NLP) or a Mixed Integer Nonlinear Programming (MINLP) problem is reduced and simplifies the solution space enough to utilize conventional NLP/MINLP solvers. This can be seen used in various real world optimization problem applications to simplify the solution space as real world applications have extensive quantities of constraints and variables.

Theory & Methodology

Exponential transformation is an algebraic transformation

In non linear programming an exponential transformation are used for geometric programs which are composed of posynomial functions in the optimization function and constraints.

A posynomial function which can also be referred to as a posynomial is defined as a positive polynomial function. [2]

Exponential transformation begins with a posynomial (Positive and Polynomial) noncovex function of the form [2] :

where and

A transformation of is applied [3]

The transformed function is presented as:

Exponential transformation can be applied to geometric programs. A geometric program is a mathematical optimization problem where the objective function is a posynomial which is minimized. For Geometric programs in standard form the objective must be a posynomial and the constraints must be posynomials Failed to parse (unknown function "\leg"): {\displaystyle \leg 1 } or monomials equal to 1.

Geometric Programs in standard form is represented by:

Exponential Transformation in Computational Optimization

Exponential transformation can be used for convexification of any Geometric MINLP that meets the criteria for a geometric program. In a geometric program the approach to solving efficiently is taken by transforming the optimization problem into a convex nonlinear problem. This is done by turning the problem into a nonlinear convex optimization problem through exponential transformation. Using the exponential substitution detailed above, all continuous variables in the function are transformed while binary variables are not transformed. Through this transformation the constraints are also convex.

In a special case binary exponential transformation can also be applied where binary variables are linearized since their possible allocation is 0 or 1. Binary exponential transformation can be done by using the following replacement REFERENCE TEXTBOOK

Additionally, as presented in Theorem 1 and accompanying proof in "Global optimization of signomial geometric programming using linear relaxation" by P. Shen, K. Zhang, given that a function is being minimized, it shows that after transformation all points on the transformed function are feasible in the original function and all objective values in the transformed function are the same or less than the original function. [2] this creates a convex under estimator approach to the problem.

Also as presented by Li and Biswal, the bounds of the problem are not altered through exponential transformation. [1]

Numerical Example

Reformulating to exponents

Substituting

Simplifying by exponent properties

Example of Convexification Application in MINLP

The following MINLP problem can take a convexification approach using exponential transformation:

Using the exponential transformation to continuous variables by substituting and described the problem becomes the following:

With additional logarithmic simplification through properties of natural logarithm:

Where is unbounded due to logarithmic of 0 being indefinite.

Additionally simplifying further for binary variables by substituting since is either 0 or 1 and any exponents on the variable will not change the solution space:

The transformed objective function can be shown to be convex through the positive-definite test of the Hessian. In order to prove the convexity of the transformed functions, the positive definite test of Hessian is used as defined in Optimization of Chemical Processes [4]. This tests the Hessian defined as:

to test that

where

for all

For the example above, the Hessian is as follows [5]:

In the example above, the Hessian is defined as:

Therefore, H(x) is positive-definite and strictly convex.

Applications

Currently, various applications of exponential transformation can be seen in published journal articles and industry practices. Many of these applications use exponential transformation to convexify their problem space. Due to the closeness with logarithmic transformation, usually a combination of the approaches is used in practical solutions.

Mechanical Engineering Applications

In the paper “Global optimization of heat exchanger network synthesis problems with and without the isothermal mixing assumption” a global optimization approach is explored for the synthesis of heat exchanger networks. As seen in eq(34) and (35) of the work by Björk and Westerlund, they employ an exponential transformation to convexify their optimization problem to employ a global optimization approach. [6]


Electrical Engineering Application:

Applications for VLSI circuit performance optimization. In this application a special geometric program defined as unary geometric programs is presented. The unary geometric program is a posynomial as defined in the Theory & Methodology section. The unary geometric program is derived through a greedy algorithm which implements a logarithmic transformation within lemma 5. [7] While this is not a specific exponential transformation example logarithmic transformations are within the same family and can also be used to convexify geometric programs.


Machining Economics:

Applications in economics can be seen through geometric programming approaches. Examples and applications include to analyze the life of cutting tools in machining. In this approach they use exponential transformations to convexify the problem. [8]

Conclusion

Exponential transformation is a useful method to convexify Geometric MINLP and obtain a global solution to the problem. Exponential transformation does not alter the bounds of the problem and allows for a convex objective function and constraints given that the conditions described within Theory & Methodology are satisfied. Geometric Programming transformation can be further explored through logarithmic transformation to address convexification.

References

  1. 1.0 1.1 D. Li and M. P. Biswal, "Exponential Transformation in Convexifying a Noninferior Frontier and Exponential Generating Method," Journal of Optimization Theory and Applications, vol. 99, pp. 183–199, 1998.
  2. 2.0 2.1 2.2 S. Boyd, S. J. Kim, and L. Vandenberghe et al., "A tutorial on geometric programming," Optimization and Engineering, vol. 8, article 67, 2007.
  3. I. E. Grossmann, "Review of Nonlinear Mixed-Integer and Disjunctive Programming Techniques," Optimization and Engineering, vol. 3, pp. 227–252, 2002.
  4. T. F. Edgar, D. M. Himmelblau, and L. S. Lasdon, Optimization of Chemical Processes, McGraw-Hill, 2001.
  5. M. Chiang, "Geometric Programming for Communication Systems," 2005.
  6. K. J. Björk and T. Westerlund, "Global optimization of heat exchanger network synthesis problems with and without the isothermal mixing assumption," Computers & Chemical Engineering, vol. 26, issue 11, pp. 1581-1593, 2002.
  7. http://home.eng.iastate.edu/~cnchu/pubs/j08.pdf
  8. https://link.springer.com/content/pdf/10.1007/BF02591746.pdf