McCormick envelopes: Difference between revisions

From Cornell University Computational Optimization Open Textbook - Optimization Wiki
Jump to navigation Jump to search
mNo edit summary
No edit summary
 
(95 intermediate revisions by 2 users not shown)
Line 1: Line 1:
'''Introduction'''
Author: Susan Urban (SYSEN 5800 Fall 2021)


Optimization of a non-convex function f(x) is challenging since it may have multiple locally optimal points and it can take a significant amount of time or effort to determine if the problem has no solution or if the solution is global. Gradient based solvers are unable to certify optimality1. Different techniques are used to address this challenge depending on the characteristics of the problem. One technique used is convex envelopes2:
== Introduction ==
The McCormick Envelope, originally developed by Dr. Garth McCormick, is a type of convex relaxation used for the optimization of bilinear <math>(e.g., x*y, e^xy + y, sin(x+y) - x^2)
</math> non-linear programming (NLP) problems<sup>1</sup>. Optimization of these non-convex functions f(x) is challenging since they may have multiple locally optimal solutions or no solution. It can take a significant amount of time, computing resources, and effort to determine if the solution is global or the problem has no feasible solution<sup>2</sup>. Different techniques are used to address this challenge depending on the characteristics of the problem.
== Theory, Methodology and Algorithmic Discussions ==
[[File:Image of updated estimators and envelopes.png|thumb|Figure 1: Relationships between the given function, concave over-estimators, convex under-estimators, a concave envelope, and a convex envelope. ]]
As McCormick noted, McCormick Envelopes are based on the key assumption that convex and concave envelopes can be constructed for the given function<sup>1</sup>. The concave envelope, and respectively the convex envelope, is the concave over-estimator and convex under-estimator for the given function providing the tightest fit to the given function<sup>1</sup>.  The envelope surrounds the given function, like an envelope encloses a letter, and limits the feasible solution space the most in comparison to all other concave over-estimators and convex under-estimators. Multiple concave over-estimators and multiple convex under-estimators may exist but there is only one concave envelope and one convex envelope for a given function and domain.  Figure 1 depicts the relationship between the given function f(x), multiple concave over-estimators, multiple convex under-estimators, a convex envelope, and a concave envelope. 


Given a non-convex function f(x), g(x) is a convex envelope of f(x) for X <math>\in</math> S if:
Each bilinear term is replaced with a new variable and four sets of constraints are added<sup>1</sup>. The non-linear programming is converted to a relaxed convex linear programming (LP) which can be more easily solved.


·      g(x) is convex under-estimator of f(x)
The LP solution gives a lower (L) bound and any feasible solution to the problem gives an upper (U) bound.


·      g(x)>=h(x) for all convex under-estimators h(x)
As noted by Scott et al, McCormick envelopes are effective since they are recursive, can be applied to a variety of applications, and are typically stronger than those resulting from convexification or linearization procedures<sup>3</sup>.


'''McCormick Envelopes'''
The following is a derivation of the McCormick Envelopes for a given bilinear function<sup>1</sup>:


In particular, for bilinear (e.g., x*y, x<sup>2</sup>) Non-Linear Programming (NLP) problems3, the McCormick Envelope is a type of convex relaxation used for optimization.  
<math>Let \ w = xy</math>


In case of an NLP, an LP relaxation is derived by replacing each bilinear term with a new variable and adding four sets of constraints. In the case of an MINLP, an MILP relaxation is derived. This strategy is known as McCormick relaxation.
<math>x^{L}\leq x\leq x^{U}</math>


The LP solution gives a lower bound and any feasible solution gives an upper bound.
<math>y^{L}\leq y\leq y^{U}</math>


As noted by Scott et al4, McCormick envelopes are attractive due to their recursive nature of their definition, which affords wide applicability and easy implementation computationally.  Furthermore, these relaxations are typically stronger than those resulting from convexification or linearization procedures.  
where <math>x^{L}, x^{U}, y^{L}, y^{U} </math>are   upper  and   lower  bound  values  for  <math>x</math> and <math>y</math>, respectively.


'''Derivation of McCormick Envelopes'''
<math>a=\left (x-x^{L} \right )</math>


<math>b=\left (y-y^{L} \right )</math>


a and b are both positive resulting in a positive product


'''Example: Convex Relaxation'''
<math>a * b\geq 0</math>


Good bounds are essential to focusing the feasible solution which may be obtained either by inspection or solving the optimization problem to minimize (maximize) x subject to the same constraints as the original problem.
<math>a * b=\left ( x-x^{L} \right )\left ( y-y^{L} \right )=xy-x^{L}y-xy^{L}+x^{L}y^{L}\geq 0 </math>


substituting w=xy and reformulating the inequality produces


<math>w\geq x^{L}y+xy^{L}-x^{L}y^{L}</math>


As noted by Hazaji, state-of-the-art global optimization solvers implement bound contraction techniques in order to improve this bounding procedure. Once bound propagation is completed,
Following the same sequence of steps, the remaining inequalities are produced:


domain partitioning becomes necessary. Spatial branch and bound schemes [6, 7] are among the most effective partitioning methods in global optimization.
<math>a=\left ( x^{U}-x \right )</math>


By splitting the domain of a given variable, the solver is able to divide the original domain into two smaller regions, further tightening the convex relaxations of each partition.
<math>b = \left ( y^{U} -y\right )</math>


'''Example: Numerical'''
<math>w\geq x^{U}y+xy^{U}-x^{U}y^{U}</math>


'''Application'''


'''Conclusion'''
<math>a=\left ( x^{U}-x\right )</math>


'''References'''
<math>b=\left ( y-y^{L} \right )</math>


# Castro, Pedro. "A Tighter Piecewise McCormick     Relaxation for Bilinear Problems." (n.d.): n. pag. 3 June 2014. Web.     6 June 2015. <<nowiki>http://minlp.cheme.cmu.edu/2014/papers/castro.pdf</nowiki>
<math>w\leq x^{U}y+xy^{L}-x^{U}y^{L}</math>
# You, F (2021). Notes for a lecture on Mixed Integer    Non-Linear Programming (MINLP). Archives for SYSEN 5800 Computational    Optimization (2021FA), Cornell University, Ithaca, NY.
 
# Dombrowski, J. (2015, June 7). Northwestern University    Open Text Book on Process Optimization, McCormick Envelopes Retrieved from    <nowiki>https://optimization.mccormick.northwestern.edu/index.php/McCormick_envelopes</nowiki>
 
#  Scott, J. K. Stuber, M. D. & Barton, P. I. (2011). Generalized McCormick Relaxations. Journal of Global Optimization, Vol. 51, Issue 4, 569-606 doi: 10.1007/s10898-011-9664-7 
<math>a=\left ( x-x^{L} \right )</math>
# Hijazi, H., Perspective Envelopes for Bilinear Functions, unpublished manuscript, The Australian National University, Canberra, Australia 4841.pdf
 
# Androulakis, I., Maranas, C., Floudas, C.: alphabb: A global optimization method for general constrained nonconvex problems. Journal of Global Optimization 7(4), 337{363 (1995)
<math>b=\left ( y^{U}-y \right )</math>
# Smith, E., Pantelides, C.: A symbolic reformulation/spatial branch-and-bound algorithm for the global optimisation of nonconvex fMINLPsg. Computers & Chemical Engineering 23(4), 457 { 478 (1999)  <br />
 
<math>w\leq xy^{U}+x^{L}y-x^{L}y^{U}</math>
 
The under-estimators of the function are represented by:
 
<math>w\geq x^{L}y+xy^{L}-x^{L}y^{L}</math>
 
<math>w\geq x^{U}y+xy^{U}-x^{U}y^{U}</math>
 
The over-estimators of the function are represented by:
 
<math>w\leq x^{U}y+xy^{L}-x^{U}y^{L}</math>
 
<math>w\leq xy^{U}+x^{L}y-x^{L}y^{U}</math>
 
The following shows the relaxation of a non-convex problem using McCormick Envelopes:
 
The original non-convex problem:
 
<math>\min Z= \textstyle \sum_{i=1} \sum_{j=1}  c_{i,j}x_i x_j +g_0(x)
</math>
 
<math>s.t. \sum_{i=1}\sum_{j=1} c_{i,j}^l x_i x_j + g_l (x) \leq 0,\forall l \in L
 
</math>
 
<math>x^ L \leq x \leq x^U
 
</math>
 
Replacing <math>u_{i,j} = x_i x_j</math>, we obtain a relaxed, convex problem using McCormick Envelopes:
 
<math>\min Z= \textstyle \sum_{i=1} \sum_{j=1}  c_{i,j} u_{i,j} +g_0(x)</math>
 
<math>s.t. \sum_{i=1}\sum_{j=1} c_{i,j}^l u_{i,j}+ g_l (x) \leq 0,\forall l \in L</math>
 
<math>u_{i,j} \geq x_i^L x_j + x_i x_j^L - {x_i}^L {x_j}^L, \forall  i,j \
 
 
 
</math>
 
<math>u_{i,j} \geq x_i^U x_j + x_i x_j^U - {x_i}^U {x_j}^U, \forall i,j \
 
 
</math>
 
<math>u_{i,j} \leq x_i^L x_j + x_i x_j^U - {x_i}^L {x_j}^U, \forall i,j \
 
</math>
 
<math>u_{i,j} \leq x_i^U x_j + x_i x_j^L - {x_i}^U {x_j}^L, \forall i,j \
 
</math>
 
<math>x^L \leq x \leq x^U,  \ \ u^L \leq u \leq u^U</math>
 
Similar steps are taken to derive the McCormick Envelopes for given functions with different formats e.g., e<sup>xy</sup>.
 
Good lower and upper bounds focus and minimize the feasible solution space; they reduce the number of iterations to find the optimal solution.
 
'''Piecewise McCormick Relaxation'''
 
As discussed by Hazaji, global optimization solvers focus initially on optimizing the lower and upper bounds, and when necessary, focus on domain partitioning<sup>4</sup>. By dividing the domain of a given variable into partitions or smaller regions, the solver is able to tailor and further tighten the convex relaxations of each partition. This strategy is known as the Piecewise McCormick relaxation<sup>5</sup>
 
== Example: Numerical ==
For the given problem,
 
<math>min \  Z = xy + 6x + y</math>
 
<math>s.t. \ xy \leq 18</math>
 
<math>0 \leq x \leq 10
</math>
 
<math>0 \leq y \leq 2</math>
 
 
 
<math>Let\  w = xy
</math>
 
<math>x^L = 0 
 
 
 
 
 
 
 
</math>
 
<math>x^U = 10</math>
 
<math>y^L = 0
 
 
 
</math>
 
<math>y^U = 2
 
 
</math>
 
Substituting these values into the original problem and the McCormick Envelopes, the problem is reformulated:
 
<math>min \ Z = w + 6x + y
</math>
 
<math>s.t. \ w \leq 18</math>
 
<math>w\geq x^{L}y+xy^{L}-x^{L}y^{L}</math>
 
<math>w \geq 0*y + x * 0 - 0*0
 
 
 
 
</math>
 
<math>w \geq 0
 
</math>
 
<math>w\geq x^{U}y+xy^{U}-x^{U}y^{U}</math>
 
<math>w \geq 10*y + x*2 - 10*2
 
 
 
 
</math>
 
<math>w \geq 10y + 2x - 20
 
 
 
 
</math>
 
<math>w\leq x^{U}y+xy^{L}-x^{U}y^{L}</math>
 
<math>w \leq 10*y + x*0 - 10*0
 
 
 
 
</math>
 
<math>w \leq 10y
 
 
 
 
</math>
 
<math>w\leq xy^{U}+x^{L}y-x^{L}y^{U}</math>
 
<math>w \leq x*2 + 0*y -0*2
 
 
 
</math>
 
<math>w \leq 2x
 
 
</math>
 
Using GAMS to solve the reformulated problem, the solution is z= -76.2, x=10, y=1.8.
 
== Applications ==
Bilinear functions occur in numerous engineering and natural science applications where McCormick Envelopes can be utilized.
 
For example, an application involves the modeling of dynamic biological processes based on experimental data<sup>2</sup>. The challenge for this model is establishing model parameter values that align or calibrate the model response with observed data. These model parameter values can be estimated using optimization techniques, including McCormick Envelopes, to minimize the difference between observed and simulated data. 
 
Another application involves the optimization of financial objectives and risk management for energy conversion for a large scale operation such as a college campus or a municipality<sup>6</sup>. This operation involves flexibility of energy sources and several facets of uncertainty involved with pricing. This model uses McCormick Envelopes to optimize the bilinear nonconvex components. 
 
A third application involves the optimized efficiency and reliability of the operation of the electricity grid. Optimal power flow (OPF) and unit commitment (UC) are two optimization problems that support the electricity market operations<sup>7</sup>. McCormick Envelopes are used to strengthen the second-order cone (SOC) relaxation of the alternate current optimal power flow (ACOPF)<sup>8</sup>.
== Conclusion ==
Non-convex NLPs are challenging to solve and may require a significant amount of time, computing resources, and effort to determine if the solution is global or the problem has no feasible solution.  McCormick Envelopes provide a relaxation technique for bilinear non-convex nonlinear programming problems.  McCormick Envelopes provide a straightforward technique of replacing each bilinear term with a new variable and adding four constraints. Due to the recursive nature of this technique, it may be applied to a wide variety of engineering and scientific applications involving bilinear terms. 
 
== References ==
# G. P. McCormick, "Computability of Global Solutions To Factorable Nonconvex Solutions: Part I: Convex Underestimating Problems," ''Mathematical Programming,'' vol. 10, pp. 147-175, 1976.
# A. Miro, C. Pozo, G. Guillen-Gosalbez, J. Egea and L. Jimenez, "Deterministic global optimization algorithm based on outer approximation for the parameter estimation of nonlinear dynamic biological systems," ''BMC Bioinformatics,'' vol. 13, 2012.
# J. K. Scott, M. D. Stuber and P. I. Barton, "Generalized McCormick Relaxations," ''Journal of Global Optimization,'' vol. 51, pp. 569-606, 2011.
# H. Hijazi, "[https://www.google.com/url?sa=t&rct=j&q=&esrc=s&source=web&cd=&ved=2ahUKEwjSr7eu8K70AhVYmHIEHUJzCVsQFnoECAIQAQ&url=http%3A%2F%2Fwww.optimization-online.org%2FDB_FILE%2F2015%2F03%2F4841.pdf&usg=AOvVaw1xf9B1f-EPw0mG1LOqRfm9 Perspective Envelopes for Bilinear Functions"]
# P. Castro, "Tightening piecewise McCormick relaxations for bilinear problems," ''Computers & Chemical Engineering'', vol. 72, pp. 300-311, 2015.
# J. Kantor and P. Mousaw, "A class of bilinear models for the optimization of energy conversion networks," ''Chemical Engineering Science,'' vol. 67'','' pp. 131-138, 2012.
# Jeremy Lin; Fernando H. Magnago, "Optimal Power Flow," in Electricity Markets: Theories and Applications , IEEE, 2017, pp.147-171, doi: 10.1002/9781119179382.ch6.
#B. Michael, A. Castillo, J. Watson and C. Laird, "Strengthened SOCP Relaxations for ACOPF with McCormick Envelopes and Bounds Tightening," ''Computer Aided Chemical Engineering,'' vol. 44'','' pp. 1555-1560, 2018.
#

Latest revision as of 07:21, 12 December 2021

Author: Susan Urban (SYSEN 5800 Fall 2021)

Introduction

The McCormick Envelope, originally developed by Dr. Garth McCormick, is a type of convex relaxation used for the optimization of bilinear non-linear programming (NLP) problems1. Optimization of these non-convex functions f(x) is challenging since they may have multiple locally optimal solutions or no solution. It can take a significant amount of time, computing resources, and effort to determine if the solution is global or the problem has no feasible solution2. Different techniques are used to address this challenge depending on the characteristics of the problem.

Theory, Methodology and Algorithmic Discussions

Figure 1: Relationships between the given function, concave over-estimators, convex under-estimators, a concave envelope, and a convex envelope.

As McCormick noted, McCormick Envelopes are based on the key assumption that convex and concave envelopes can be constructed for the given function1. The concave envelope, and respectively the convex envelope, is the concave over-estimator and convex under-estimator for the given function providing the tightest fit to the given function1. The envelope surrounds the given function, like an envelope encloses a letter, and limits the feasible solution space the most in comparison to all other concave over-estimators and convex under-estimators. Multiple concave over-estimators and multiple convex under-estimators may exist but there is only one concave envelope and one convex envelope for a given function and domain. Figure 1 depicts the relationship between the given function f(x), multiple concave over-estimators, multiple convex under-estimators, a convex envelope, and a concave envelope.

Each bilinear term is replaced with a new variable and four sets of constraints are added1. The non-linear programming is converted to a relaxed convex linear programming (LP) which can be more easily solved.

The LP solution gives a lower (L) bound and any feasible solution to the problem gives an upper (U) bound.

As noted by Scott et al, McCormick envelopes are effective since they are recursive, can be applied to a variety of applications, and are typically stronger than those resulting from convexification or linearization procedures3.

The following is a derivation of the McCormick Envelopes for a given bilinear function1:

where are   upper  and   lower  bound  values  for   and , respectively.

a and b are both positive resulting in a positive product

substituting w=xy and reformulating the inequality produces

Following the same sequence of steps, the remaining inequalities are produced:



The under-estimators of the function are represented by:

The over-estimators of the function are represented by:

The following shows the relaxation of a non-convex problem using McCormick Envelopes:

The original non-convex problem:

Replacing , we obtain a relaxed, convex problem using McCormick Envelopes:

Similar steps are taken to derive the McCormick Envelopes for given functions with different formats e.g., exy.

Good lower and upper bounds focus and minimize the feasible solution space; they reduce the number of iterations to find the optimal solution.

Piecewise McCormick Relaxation

As discussed by Hazaji, global optimization solvers focus initially on optimizing the lower and upper bounds, and when necessary, focus on domain partitioning4. By dividing the domain of a given variable into partitions or smaller regions, the solver is able to tailor and further tighten the convex relaxations of each partition. This strategy is known as the Piecewise McCormick relaxation5.

Example: Numerical

For the given problem,


Substituting these values into the original problem and the McCormick Envelopes, the problem is reformulated:

Using GAMS to solve the reformulated problem, the solution is z= -76.2, x=10, y=1.8.

Applications

Bilinear functions occur in numerous engineering and natural science applications where McCormick Envelopes can be utilized.

For example, an application involves the modeling of dynamic biological processes based on experimental data2. The challenge for this model is establishing model parameter values that align or calibrate the model response with observed data. These model parameter values can be estimated using optimization techniques, including McCormick Envelopes, to minimize the difference between observed and simulated data.

Another application involves the optimization of financial objectives and risk management for energy conversion for a large scale operation such as a college campus or a municipality6. This operation involves flexibility of energy sources and several facets of uncertainty involved with pricing. This model uses McCormick Envelopes to optimize the bilinear nonconvex components.

A third application involves the optimized efficiency and reliability of the operation of the electricity grid. Optimal power flow (OPF) and unit commitment (UC) are two optimization problems that support the electricity market operations7. McCormick Envelopes are used to strengthen the second-order cone (SOC) relaxation of the alternate current optimal power flow (ACOPF)8.

Conclusion

Non-convex NLPs are challenging to solve and may require a significant amount of time, computing resources, and effort to determine if the solution is global or the problem has no feasible solution. McCormick Envelopes provide a relaxation technique for bilinear non-convex nonlinear programming problems. McCormick Envelopes provide a straightforward technique of replacing each bilinear term with a new variable and adding four constraints. Due to the recursive nature of this technique, it may be applied to a wide variety of engineering and scientific applications involving bilinear terms.

References

  1. G. P. McCormick, "Computability of Global Solutions To Factorable Nonconvex Solutions: Part I: Convex Underestimating Problems," Mathematical Programming, vol. 10, pp. 147-175, 1976.
  2. A. Miro, C. Pozo, G. Guillen-Gosalbez, J. Egea and L. Jimenez, "Deterministic global optimization algorithm based on outer approximation for the parameter estimation of nonlinear dynamic biological systems," BMC Bioinformatics, vol. 13, 2012.
  3. J. K. Scott, M. D. Stuber and P. I. Barton, "Generalized McCormick Relaxations," Journal of Global Optimization, vol. 51, pp. 569-606, 2011.
  4. H. Hijazi, "Perspective Envelopes for Bilinear Functions"
  5. P. Castro, "Tightening piecewise McCormick relaxations for bilinear problems," Computers & Chemical Engineering, vol. 72, pp. 300-311, 2015.
  6. J. Kantor and P. Mousaw, "A class of bilinear models for the optimization of energy conversion networks," Chemical Engineering Science, vol. 67, pp. 131-138, 2012.
  7. Jeremy Lin; Fernando H. Magnago, "Optimal Power Flow," in Electricity Markets: Theories and Applications , IEEE, 2017, pp.147-171, doi: 10.1002/9781119179382.ch6.
  8. B. Michael, A. Castillo, J. Watson and C. Laird, "Strengthened SOCP Relaxations for ACOPF with McCormick Envelopes and Bounds Tightening," Computer Aided Chemical Engineering, vol. 44, pp. 1555-1560, 2018.