Definition
Nesterov's gradient acceleration refers to a general approach that can be used to modify a gradient descent-type method to improve its initial convergence.
The two-step iteration description
In this description, there are two intertwined sequences of iterates that constitute our guesses:
Explicitly, the sequences are intertwined as follows:
We use parenthesized superscripts to denote the iteration stage, because the subscripts are reserved for use by the coordinates.
For the initialization, we set:
The two-step iteration is as follows:


Note that the sign on the right side is not a typographical error:
is a non-convex combination of
and
. The value
is a member of a predetermined sequence of real numbers in the interval
that is independent of the specific problem. A typical expression is
(so
).
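The two-step iteration can be sketched in code. The sketch below uses one common formulation of Nesterov's acceleration, which is an assumption and not necessarily the exact formulas or weight sequence intended above: a gradient step from the current iterate of the first sequence, followed by an extrapolation that combines the two most recent iterates of the second sequence with weights (1 + mu) and -mu. The names `nesterov_descent`, `grad`, `step`, and `mu`, the schedule mu = k / (k + 3), and the quadratic test problem are all hypothetical choices made for illustration.

```python
import numpy as np


def nesterov_descent(grad, x0, step, num_iters):
    """Two-sequence accelerated gradient sketch (assumed formulation)."""
    x = np.asarray(x0, dtype=float)
    y = x.copy()  # initialization: both sequences start at the same point
    for k in range(num_iters):
        # Step 1: plain gradient step from the current x-iterate.
        y_next = x - step * grad(x)
        # Step 2: extrapolate past y_next, away from the previous y-iterate.
        # The weights (1 + mu) and -mu sum to 1, but one of them is negative,
        # so this is a non-convex combination of y_next and y.
        mu = k / (k + 3.0)  # assumed weight schedule, lies in [0, 1)
        x = (1.0 + mu) * y_next - mu * y
        y = y_next
    return y


# Hypothetical test problem: f(x) = 0.5 * x^T A x, minimized at the origin.
A = np.diag([1.0, 10.0])


def grad_f(x):
    return A @ x


result = nesterov_descent(grad_f, x0=[1.0, 1.0], step=0.1, num_iters=200)
print(result)
```

In this sketch the weight schedule depends only on the iteration count, not on the function being minimized, which matches the requirement that the sequence be predetermined and independent of the specific problem. The step size 0.1 is the reciprocal of the largest eigenvalue of `A`, a standard choice for this kind of test problem.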