L1-regularized quadratic function of multiple variables: Difference between revisions

From Calculus
Line 43: Line 43:


where <math>\overline{\operatorname{sgn}}</math> is the [[signum vector function]].
where <math>\overline{\operatorname{sgn}}</math> is the [[signum vector function]].
===Hessian matrix===
The Hessian matrix of the function, defined wherever all the coordinates are nonzero, is the matrix <math>2A</math>.

Revision as of 19:28, 11 May 2014

Definition

A -regularized quadratic function of the variables is a function of the form (satisfying the positive definiteness condition below):

In vector form, if we denote by the column vector with coordinates , then we can write the function as:

where is the matrix with entries and is the column vector with entries .

Note that the matrix is non-unique: if then we could replace by . Therefore, we could choose to replace by the matrix . We will thus assume that is a symmetric matrix.

We impose the further restriction that the matrix be a symmetric positive definite matrix.

Key data

Item Value
default domain the whole of

Differentiation

Partial derivatives and gradient vector

The partial derivative with respect to the variable , and therefore also the coordinate of the gradient vector (if it exists), is given as follows when :

By the symmetry assumption, this becomes:

The partial derivative is undefined when .

The gradient vector exists if and only if all the coordinates are nonzero.

In vector notation, the gradient vector is as follows for all with all coordinates nonzero:

where is the signum vector function.

Hessian matrix

The Hessian matrix of the function, defined wherever all the coordinates are nonzero, is the matrix .