L1-regularized quadratic function of multiple variables: Difference between revisions

From Calculus
Line 33: Line 33:


<math>\frac{\partial f}{\partial x_i} = \left(\sum_{j=1}^n 2a_{ij}x_j\right) + b_i + \lambda \operatorname{sgn}(x_i)</math>
<math>\frac{\partial f}{\partial x_i} = \left(\sum_{j=1}^n 2a_{ij}x_j\right) + b_i + \lambda \operatorname{sgn}(x_i)</math>
The partial derivative is undefined when <math>x_i = 0</math>.
The partial derivative is undefined when <math>x_i = 0</math>.



Revision as of 19:26, 11 May 2014

Definition

A -regularized quadratic function of the variables is a function of the form (satisfying the positive definiteness condition below):

In vector form, if we denote by the column vector with coordinates , then we can write the function as:

where is the matrix with entries and is the column vector with entries .

Note that the matrix is non-unique: if then we could replace by . Therefore, we could choose to replace by the matrix . We will thus assume that is a symmetric matrix.

We impose the further restriction that the matrix be a symmetric positive definite matrix.

Key data

Item Value
default domain the whole of

Differentiation

Partial derivatives and gradient vector

The partial derivative with respect to the variable , and therefore also the coordinate of the gradient vector (if it exists), is given as follows when :

By the symmetry assumption, this becomes:

The partial derivative is undefined when .

The gradient vector exists if and only if all the coordinates are nonzero.

In vector notation, the gradient vector is as follows for all with all coordinates nonzero:

where is the signum vector function.