Nesterov's gradient acceleration
Definition
Nesterov's gradient acceleration refers to a general approach that can be used to modify a gradient descent-type method to improve its initial convergence.
The two-step iteration description
In this description, there are two intertwined sequences of iterates that constitute our guesses:
Explicitly, the sequences are intertwined as follows:
We use parenthesized superscripts to denote the iteration stage, because the subscripts are reserved for use by the coordinates.
For the initialization, we set:
The two-step iteration is as follows:
Note that the sign on the right side is not a typographical error: is a non-convex combination of and . The value is a member of a predetermined sequence of real numbers in the interval that is independent of the specific problem. A typical expression is (so ).