Nesterov's gradient acceleration

From Calculus
Revision as of 03:48, 9 September 2014 by Vipul (talk | contribs) (Created page with "==Definition== '''Nesterov's gradient acceleration''' refers to a general approach that can be used to modify a gradient descent-type method to improve its initial conver...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Definition

Nesterov's gradient acceleration refers to a general approach that can be used to modify a gradient descent-type method to improve its initial convergence.

The two-step iteration description

In this description, there are two intertwined sequences of iterates that constitute our guesses:

Explicitly, the sequences are intertwined as follows:

We use parenthesized superscripts to denote the iteration stage, because the subscripts are reserved for use by the coordinates.

For the initialization, we set:

The two-step iteration is as follows:

Note that the sign on the right side is not a typographical error: is a non-convex combination of and . The value is a member of a predetermined sequence of real numbers in the interval that is independent of the specific problem. A typical expression is (so ).