Revision as of 13:38, 10 December 2024

Authors: Kayleigh Calder (kjc263), Colton Jacobucci (cdj59), Carolyn Johnson (cj456), Caleb McKinney (cdm235), Olivia Thomas (oat9) (ChemE 6800 Fall 2024)
Stewards: Nathan Preuss, Wei-Han Chen, Tianqi Xiao, Guoqing Hu

Introduction

A quadratic program is an optimization problem that comprises a quadratic objective function subject to linear constraints.¹ Quadratic programming (QP) is a common type of nonlinear programming (NLP) concerned with solving such problems.

One of the earliest known theories for QP was documented in 1943 by Columbia University’s H. B. Mann,²,³ but many others are credited with early contributions to the field, such as E. W. Barankin and R. Dorfman for their naval research throughout the 1950s⁴ and Princeton University’s Wolfe and Frank for their research in 1956.⁵ The field has since made other prominent strides, such as when Harry Markowitz received the Nobel Prize in Economics in 1990 for his application of QP to balancing financial risk and reward in portfolio selection.⁶

QP is essential to the field of optimization for two reasons. First, quadratic models arise naturally in real-world applications because variance, the sum of squared deviations used to represent uncertainty, is quadratic.⁶ QP can therefore be applied to a wide variety of problems such as scheduling, planning, flow computations, engineering modeling, design and control, game theory, and economics.⁷ Second, QP is commonly used as a stepping stone in methods for more general optimization problems, such as sequential quadratic programming and augmented Lagrangian methods.¹

Algorithm Discussion

General Problem Formulation

Quadratic programming problems are typically formatted as minimization problems, and the general mathematical formulation is:

minimize $q(x)={\frac {1}{2}}x^{T}Qx+c^{T}x$

subject to:

$Ax\geq b$

$x\geq 0$

Where:

$x\in R^{n}$ is the decision variable vector.
$Q\in R^{n\times n}$ is a symmetric, positive semi-definite $n\times n$ matrix representing the quadratic coefficients.
$A\in R^{m\times n}$ is the inequality constraint matrix.
$b\in R^{m}$ is an $m$ dimensional vector representing a constraint boundary.
$c\in R^{n}$ is a linear coefficient vector.
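
As a minimal sketch of this formulation, the snippet below evaluates $q(x)$ and checks the constraints $Ax\geq b$, $x\geq 0$ for a candidate point. The helper names `qp_objective` and `is_feasible` and the two-variable data are hypothetical, chosen only for illustration; NumPy is assumed to be available.

```python
import numpy as np

def qp_objective(Q, c, x):
    """Evaluate q(x) = 1/2 x^T Q x + c^T x."""
    return 0.5 * x @ Q @ x + c @ x

def is_feasible(A, b, x, tol=1e-9):
    """Check A x >= b and x >= 0 up to a small tolerance."""
    return bool(np.all(A @ x >= b - tol) and np.all(x >= -tol))

# Hypothetical example data (not taken from the article).
Q = np.array([[2.0, 0.0], [0.0, 2.0]])
c = np.array([-2.0, -5.0])
A = np.array([[-1.0, -1.0]])   # x1 + x2 <= 3 written as -x1 - x2 >= -3
b = np.array([-3.0])

x = np.array([1.0, 2.0])
value = qp_objective(Q, c, x)      # objective at the candidate point
feasible = is_feasible(A, b, x)    # True when all constraints hold
```

Writing a "less than or equal" constraint with flipped signs, as in the row of `A` above, is how such constraints fit the $Ax\geq b$ convention used here.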

Dual Formulation

The general mathematical formulation for the QP dual is:

maximize $q(x,y)=b^{T}y-{\frac {1}{2}}x^{T}Qx$

subject to:

$A^{T}y-Qx+s=c$

$y,s\geq 0$

Where:

$y\in R^{m}$ is the $m$-dimensional vector of dual variables.
$x\in R^{n}$ is the primal decision variable vector.
$s\in R^{n}$ is the vector of slack variables.
$Q\in R^{n\times n}$ is a symmetric, positive semi-definite $n\times n$ matrix representing the quadratic coefficients.
$A\in R^{m\times n}$ is the inequality constraint matrix.
$b\in R^{m}$ is an $m$ dimensional vector representing a constraint boundary.
$c\in R^{n}$ is the linear coefficient vector of the primal objective, which appears as the right-hand side of the dual constraint.
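
A short sketch of where this dual comes from, using the standard Lagrangian argument. The sign conventions are an assumption, chosen so that the result matches the dual constraint stated above. Attach multipliers $y\geq 0$ to $Ax\geq b$ and $s\geq 0$ to $x\geq 0$:

$L(x,y,s)={\frac {1}{2}}x^{T}Qx+c^{T}x-y^{T}(Ax-b)-s^{T}x$

Setting the gradient with respect to $x$ to zero gives the dual constraint:

$\nabla _{x}L=Qx+c-A^{T}y-s=0\ \Rightarrow \ A^{T}y-Qx+s=c$

Substituting $c=A^{T}y-Qx+s$ back into $L$ cancels the $y^{T}Ax$ and $s^{T}x$ terms and leaves the dual objective:

$L=b^{T}y-{\frac {1}{2}}x^{T}Qx$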

Using QP for an optimization problem requires a quadratic objective function accompanied by linear constraints. For the problem to be convex, $Q$ in the formulation above must be positive semi-definite; otherwise the problem is non-convex and may have multiple local minima. As later sections will discuss, problem dimensions may vary in size, but certain QP algorithms are tailored to meet the resulting computational demand. Finally, the feasible region must be non-empty; otherwise the problem has no solution.
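
The convexity condition can be checked numerically. The sketch below tests whether the eigenvalues of $Q$ are non-negative; the function name `is_convex_qp` and the tolerance are assumptions made for illustration, and NumPy is assumed to be available.

```python
import numpy as np

def is_convex_qp(Q, tol=1e-10):
    """Return True if the symmetric part of Q is positive semi-definite."""
    Q_sym = 0.5 * (Q + Q.T)                  # symmetrize before the test
    eigenvalues = np.linalg.eigvalsh(Q_sym)  # real eigenvalues, ascending
    return bool(eigenvalues.min() >= -tol)

# Hypothetical matrices: the first is convex, the second is indefinite.
convex_Q = np.array([[2.0, 0.0], [0.0, 1.0]])
nonconvex_Q = np.array([[1.0, 0.0], [0.0, -1.0]])
```

The small negative tolerance guards against round-off when an eigenvalue is exactly zero in theory.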

Active Set Methods

The active set algorithm is a method used to solve quadratic programming problems by iteratively identifying and working with the set of constraints that are most likely to influence the solution, called the active set. More specifically, the algorithm maintains a subset of constraints that are treated as equalities at each step, solving a simplified quadratic programming problem for this subset. Constraints can be added or removed from the active set as the solution progresses until optimality conditions are satisfied.

When to Use

Active set methods are best suited to small and medium-sized quadratic programs, where they exploit the problem's structure and iteratively update an estimate of the active constraints. As a broader class they generalize the simplex method for linear programming: the same idea of moving between sets of active constraints extends to quadratic objectives and, in more general forms, to problems where nonlinearity or degeneracy complicates the constraint structure. For large-scale problems, or where simplex-type methods perform poorly because of their exponential worst-case complexity, alternative methods such as interior-point techniques may be more appropriate.

Implementation Steps

Start with an Initial Solution and Active Set:
- Begin with a feasible point that satisfies all constraints.
- Identify which constraints are active (equality constraints or inequality constraints that are tight).
Iterative Process:
- Solve a Reduced Problem: Fix the active constraints as equalities and solve the resulting smaller QP problem.
- Check Optimality: Verify if the current solution satisfies the Karush-Kuhn-Tucker conditions.
- Update the Active Set:
  - Add a blocking constraint to the active set when the full step would otherwise violate it.
  - Remove a constraint from the active set when its Lagrange multiplier is negative, which indicates it is no longer binding.
Repeat Until Convergence:
- Iterate through the process until the optimal solution is found, ensuring all constraints are satisfied.
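
The "Solve a Reduced Problem" step can be sketched as a single linear solve of the KKT system for the equality-constrained QP. The function name `solve_equality_qp`, the sign convention $Qx+c=A_{W}^{T}\lambda$, and the example data are assumptions for illustration; the sketch also assumes the KKT matrix is non-singular.

```python
import numpy as np

def solve_equality_qp(Q, c, A_w, b_w):
    """Solve min 1/2 x^T Q x + c^T x  s.t.  A_w x = b_w via the KKT system.

    Returns the minimizer x and the multipliers lam with Q x + c = A_w^T lam.
    """
    n, m = Q.shape[0], A_w.shape[0]
    # Stationarity (Q x - A_w^T lam = -c) stacked on feasibility (A_w x = b_w).
    K = np.block([[Q, -A_w.T], [A_w, np.zeros((m, m))]])
    rhs = np.concatenate([-c, b_w])
    sol = np.linalg.solve(K, rhs)
    return sol[:n], sol[n:]

# Hypothetical data: one constraint, -x1 - x2 >= -3, held as an equality.
Q = np.array([[2.0, 0.0], [0.0, 2.0]])
c = np.array([-2.0, -5.0])
A_w = np.array([[-1.0, -1.0]])
b_w = np.array([-3.0])

x_star, lam = solve_equality_qp(Q, c, A_w, b_w)
```

A non-negative multiplier for an inequality held in the active set suggests the constraint should stay there; a negative one marks it as a candidate for removal.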

Pseudocode

This is the active-set algorithm for convex QP, where:

$x\in R^{n}$ is the decision variable vector.
$G\in R^{n\times n}$ is a symmetric, positive semi-definite $n\times n$ matrix representing the quadratic coefficients (written $Q$ in the formulation above).
$c\in R^{n}$ is a linear coefficient vector.
$a_{i}^{T}$ and $b_{i}$ are the $i$-th row of $A$ and the $i$-th entry of $b$, so that constraint $i$ reads $a_{i}^{T}x\geq b_{i}$.
${\mathcal {I}}$ is the index set of the inequality constraints.
$W_{k}$ is the working set: the constraints treated as active (equalities) at iteration $k$.
$p_{k}$ is the step direction and $\alpha _{k}$ is the step length at iteration $k$.

Compute a feasible starting point $x_{0}$;

Set $W_{0}$ to be a subset of the active constraints at $x_{0}$;

for $k=0,1,2,\ldots$

  Solve $\min _{p}\ {\frac {1}{2}}p^{T}Gp+g_{k}^{T}p$ subject to $a_{i}^{T}p=0,\ i\in W_{k}$, with $g_{k}=Gx_{k}+c$, to find $p_{k}$;

  if $p_{k}=0$

    Compute Lagrange multipliers ${\hat {\lambda }}_{i}$ that satisfy $\sum _{i\in {\hat {W}}}a_{i}{\hat {\lambda }}_{i}=g=G{\hat {x}}+c,$

    with ${\hat {W}}=W_{k}$ and ${\hat {x}}=x_{k}$;

    if ${\hat {\lambda }}_{i}\geq 0$ for all $i\in W_{k}\cap {\mathcal {I}}$

      stop with solution $x^{*}=x_{k};$

    else

      $j\longleftarrow \arg \min _{j\in W_{k}\cap {\mathcal {I}}}{\hat {\lambda }}_{j};$

      $x_{k+1}\longleftarrow x_{k};\ W_{k+1}\longleftarrow W_{k}\setminus \{j\};$

  else $(p_{k}\neq 0)$

    Compute $\alpha _{k}=\min \left(1,\ \min _{i\notin W_{k},\ a_{i}^{T}p_{k}<0}{\frac {b_{i}-a_{i}^{T}x_{k}}{a_{i}^{T}p_{k}}}\right);$

    $x_{k+1}\longleftarrow x_{k}+\alpha _{k}p_{k};$

    if there are blocking constraints:

      obtain $W_{k+1}$ by adding one of the blocking constraints to $W_{k};$

    else

      $W_{k+1}\longleftarrow W_{k};$

end (for)
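
The pseudocode can be turned into a short Python sketch. This is an illustrative implementation under stated assumptions, not a production solver: all constraints are treated as inequalities $a_{i}^{T}x\geq b_{i}$ (bounds such as $x\geq 0$ are added as rows of `A`), $G$ is assumed positive definite, the starting point is assumed feasible, and the working-set rows are assumed linearly independent. The function name `active_set_qp` and the example data are hypothetical.

```python
import numpy as np

def active_set_qp(G, c, A, b, x0, W0=(), max_iter=100, tol=1e-9):
    """Primal active-set sketch for: min 1/2 x^T G x + c^T x  s.t.  A x >= b."""
    x = np.asarray(x0, dtype=float)
    W = list(W0)                       # indices of constraints held as equalities
    n = len(x)
    for _ in range(max_iter):
        g = G @ x + c
        A_w = A[np.array(W, dtype=int)]
        m = len(W)
        # KKT system of the subproblem: min 1/2 p^T G p + g^T p, A_w p = 0.
        K = np.block([[G, -A_w.T], [A_w, np.zeros((m, m))]])
        sol = np.linalg.solve(K, np.concatenate([-g, np.zeros(m)]))
        p, lam = sol[:n], sol[n:]
        if np.linalg.norm(p) <= tol:
            # Stationary on the working set: check multiplier signs.
            if m == 0 or lam.min() >= -tol:
                return x, W
            W.pop(int(np.argmin(lam)))   # drop the most negative multiplier
        else:
            # Longest step in [0, 1] that keeps inactive constraints satisfied.
            alpha, blocking = 1.0, None
            for i in range(A.shape[0]):
                slope = A[i] @ p
                if i not in W and slope < -tol:
                    step = (b[i] - A[i] @ x) / slope
                    if step < alpha:
                        alpha, blocking = step, i
            x = x + alpha * p
            if blocking is not None:
                W.append(blocking)
    raise RuntimeError("active-set sketch did not converge")

# Hypothetical example: x1 + x2 <= 3, x1 >= 0, x2 >= 0.
G = np.array([[2.0, 0.0], [0.0, 2.0]])
c = np.array([-2.0, -5.0])
A = np.array([[-1.0, -1.0], [1.0, 0.0], [0.0, 1.0]])
b = np.array([-3.0, 0.0, 0.0])

x_opt, W_opt = active_set_qp(G, c, A, b, x0=[0.0, 0.0], W0=[1, 2])
```

Starting at the origin with both bound constraints in the working set, the sketch drops them one at a time, then picks up the constraint $x_{1}+x_{2}\leq 3$ as a blocking constraint before stopping.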

Interior Point Methods

Interior-point methods, developed in the 1980s, are a class of algorithms for solving large-scale linear programs using concepts from nonlinear programming. Unlike active set methods, which move along the boundary of the feasible region from one set of active constraints to the next, interior-point methods approach the solution from within or near the interior and reach the boundary only in the limit. They were motivated by the simplex method's poor worst-case complexity and by the desire for algorithms with stronger theoretical guarantees, such as the ellipsoid method and Karmarkar's algorithm.
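
To illustrate the idea of approaching the solution from the interior, here is a minimal primal-dual path-following sketch for a QP with constraints $Ax\geq b$. It is one simple variant chosen for illustration, not necessarily the method this article goes on to describe. The function name `interior_point_qp`, the fixed centering parameter `sigma`, the 0.99 step-back factor, and the example data are all assumptions; the slack vector `t` stands for $Ax-b$.

```python
import numpy as np

def interior_point_qp(Q, c, A, b, max_iter=50, tol=1e-8, sigma=0.1):
    """Primal-dual sketch for: min 1/2 x^T Q x + c^T x  s.t.  A x >= b."""
    m, n = A.shape
    x = np.zeros(n)
    y = np.ones(m)   # multipliers, kept strictly positive
    t = np.ones(m)   # slacks for A x - b, kept strictly positive
    for _ in range(max_iter):
        r_dual = Q @ x + c - A.T @ y       # stationarity residual
        r_prim = A @ x - t - b             # primal feasibility residual
        mu = y @ t / m                     # average complementarity
        if max(np.abs(r_dual).max(), np.abs(r_prim).max(), mu) < tol:
            break
        # Newton system for the perturbed KKT conditions y_i * t_i = sigma * mu.
        J = np.block([
            [Q, -A.T, np.zeros((n, m))],
            [A, np.zeros((m, m)), -np.eye(m)],
            [np.zeros((m, n)), np.diag(t), np.diag(y)],
        ])
        rhs = np.concatenate([-r_dual, -r_prim, sigma * mu - y * t])
        d = np.linalg.solve(J, rhs)
        dx, dy, dt = d[:n], d[n:n + m], d[n + m:]
        # Shorten the step so y and t stay strictly inside the positive orthant.
        alpha = 1.0
        for v, dv in ((y, dy), (t, dt)):
            neg = dv < 0
            if neg.any():
                alpha = min(alpha, 0.99 * np.min(-v[neg] / dv[neg]))
        x, y, t = x + alpha * dx, y + alpha * dy, t + alpha * dt
    return x, y

# Hypothetical example: x1 + x2 <= 3, x1 >= 0, x2 >= 0.
Q = np.array([[2.0, 0.0], [0.0, 2.0]])
c = np.array([-2.0, -5.0])
A = np.array([[-1.0, -1.0], [1.0, 0.0], [0.0, 1.0]])
b = np.array([-3.0, 0.0, 0.0])

x_ip, y_ip = interior_point_qp(Q, c, A, b)
```

Every iterate keeps the slacks and multipliers strictly positive, so the boundary of the feasible region is approached only as the complementarity measure `mu` shrinks toward zero.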