Line search methods
Authors: Lihe Cao, Zhengyi Sui, Jiaqi Zhang, Yuqing Yan, and Yuhui Gu (6800 Fall 2021).
Steepest Descent Method
Given the intuition that the negative gradient can be an effective search direction, the steepest descent method follows this idea and establishes a systematic procedure for minimizing the objective function. Setting <math>p_k = -\nabla f(x_k)</math> as the search direction, steepest descent computes the step-length <math>\alpha_k</math> by minimizing a single-variable objective function, <math>\alpha_k = \arg\min_{\alpha \geq 0} f(x_k + \alpha p_k)</math>. More specifically, the steps of the steepest descent method are as follows.
1. Choose a starting point <math>x_0</math> and a tolerance <math>\epsilon > 0</math>; set <math>k = 0</math>.
2. While <math>\|\nabla f(x_k)\| > \epsilon</math>:
   a. Set the search direction <math>p_k = -\nabla f(x_k)</math>.
   b. Compute the step-length <math>\alpha_k = \arg\min_{\alpha \geq 0} f(x_k + \alpha p_k)</math>.
   c. Update <math>x_{k+1} = x_k + \alpha_k p_k</math> and set <math>k = k + 1</math>.
3. Return <math>x_k</math>.
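To make the procedure concrete, the following Python sketch (an illustration, not from the original article; function and variable names are assumptions) applies steepest descent to a convex quadratic <math>f(x) = \tfrac{1}{2}x^\top A x - b^\top x</math>. For this class of functions the exact line search has the closed-form solution <math>\alpha_k = \frac{g_k^\top g_k}{g_k^\top A g_k}</math>, where <math>g_k = \nabla f(x_k) = A x_k - b</math>, so no one-dimensional solver is needed.

<pre>
import numpy as np

def steepest_descent_quadratic(A, b, x0, tol=1e-8, max_iter=1000):
    """Steepest descent with exact line search for f(x) = 0.5*x'Ax - b'x,
    where A is symmetric positive definite. Illustrative sketch."""
    x = x0.astype(float)
    for k in range(max_iter):
        g = A @ x - b                  # gradient of f at x
        if np.linalg.norm(g) <= tol:   # convergence test on the gradient norm
            break
        p = -g                         # steepest-descent search direction
        # Exact step-length: minimizes f(x + alpha*p) in closed form
        alpha = (g @ g) / (g @ (A @ g))
        x = x + alpha * p
    return x, k

# Example: minimize f over R^2; the minimizer solves Ax = b
A = np.array([[3.0, 1.0], [1.0, 2.0]])
b = np.array([1.0, 1.0])
x_star, iters = steepest_descent_quadratic(A, b, np.zeros(2))
print(x_star, iters)  # x_star should agree with np.linalg.solve(A, b)
</pre>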
One advantage of the steepest descent method is that it has a nice convergence theory: it converges to a local minimum from any starting point.
Theorem: global convergence of steepest descent[1]

Let the gradient of <math>f \in C^1</math> be uniformly Lipschitz continuous on <math>R^n</math>. Then, for the iterates with steepest-descent search directions, one of the following situations occurs:

- <math>\nabla f(x_k) = 0</math> for some finite <math>k</math>
- <math>\lim_{k \to \infty} f(x_k) = -\infty</math>
- <math>\lim_{k \to \infty} \nabla f(x_k) = 0</math>
The steepest descent method is a special case of gradient descent in that the step-length <math>\alpha_k</math> is rigorously defined by the exact line search. Generalizations can be made regarding the choice of <math>\alpha_k</math>; for instance, inexact line searches accept any step satisfying a sufficient-decrease condition, as in the sketch below.
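As one illustration of such a generalization (a sketch under assumed names and default parameters, not part of the original article), the following Python function chooses <math>\alpha_k</math> by backtracking: it shrinks a trial step until the Armijo sufficient-decrease condition <math>f(x_k + \alpha p_k) \leq f(x_k) + c\,\alpha \nabla f(x_k)^\top p_k</math> holds.

<pre>
import numpy as np

def backtracking_step(f, grad_f, x, p, alpha0=1.0, rho=0.5, c=1e-4):
    """Backtracking line search: shrink alpha geometrically until the
    Armijo sufficient-decrease condition holds. Illustrative defaults."""
    alpha = alpha0
    fx = f(x)
    slope = grad_f(x) @ p          # directional derivative; negative for a descent direction
    while f(x + alpha * p) > fx + c * alpha * slope:
        alpha *= rho               # reduce the trial step
    return alpha

# Example: one gradient-descent step on the Rosenbrock function
f = lambda x: (1 - x[0])**2 + 100 * (x[1] - x[0]**2)**2
grad_f = lambda x: np.array([
    -2 * (1 - x[0]) - 400 * x[0] * (x[1] - x[0]**2),
    200 * (x[1] - x[0]**2),
])
x = np.array([-1.2, 1.0])
p = -grad_f(x)                     # steepest-descent direction
alpha = backtracking_step(f, grad_f, x, p)
x_next = x + alpha * p
</pre>

Unlike the exact search above, backtracking only requires function and gradient evaluations, which is why it is the more common choice when the one-dimensional minimization has no closed form.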