L1-regularized quadratic function of multiple variables: Difference between revisions

Revision as of 19:22, 11 May 2014

Definition

An $L^{1}$-regularized quadratic function of the variables $x_{1}, x_{2}, \dots, x_{n}$ is a function of the form:

$f(x_{1}, x_{2}, \dots, x_{n}) := \left(\sum_{i = 1}^{n} \sum_{j = 1}^{n} a_{i j} x_{i} x_{j}\right) + \left(\sum_{i = 1}^{n} b_{i} x_{i}\right) + \lambda \sum_{i = 1}^{n} |x_{i}| + c$

In vector form, if we denote by $\vec{x}$ the column vector with coordinates $x_{1}, x_{2}, \dots, x_{n}$, then we can write the function as:

${\vec{x}}^{T} A \vec{x} + {\vec{b}}^{T} \vec{x} + \lambda \| \vec{x} \|_{1} + c$

where $A$ is the $n \times n$ matrix with entries $a_{i j}$, $\vec{b}$ is the column vector with entries $b_{i}$, the real number $\lambda$ is the regularization parameter, and $c$ is a real constant.
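As an illustration of the definition, the following is a minimal sketch that evaluates the function directly from its coordinate form. The function name `l1_regularized_quadratic`, the use of plain Python lists, and the sample values of $A$, $\vec{b}$, $\lambda$, and $c$ are assumptions made for this example only.

```python
def l1_regularized_quadratic(A, b, lam, c, x):
    """Evaluate x^T A x + b^T x + lam * ||x||_1 + c.

    A is a list of n rows (each a list of n numbers); b and x are lists
    of n numbers; lam and c are numbers. (Hypothetical helper, not part
    of any library.)
    """
    n = len(x)
    # Quadratic part: sum over all pairs (i, j) of a_ij * x_i * x_j.
    quadratic = sum(A[i][j] * x[i] * x[j] for i in range(n) for j in range(n))
    # Linear part: sum of b_i * x_i.
    linear = sum(b[i] * x[i] for i in range(n))
    # L^1 penalty: lam times the sum of absolute values of the coordinates.
    l1_penalty = lam * sum(abs(xi) for xi in x)
    return quadratic + linear + l1_penalty + c


# Sample data (assumed for illustration); A is deliberately not symmetric.
A = [[1.0, 2.0], [0.0, 3.0]]
b = [1.0, -1.0]
value = l1_regularized_quadratic(A, b, lam=0.5, c=2.0, x=[1.0, -2.0])
print(value)  # 15.5
```

Here the quadratic part contributes $9$, the linear part $3$, the $L^{1}$ penalty $0.5 \cdot 3 = 1.5$, and the constant $2$, giving $15.5$.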

Key data

Item	Value
default domain	the whole of $\mathbb{R}^{n}$

Differentiation

Partial derivatives and gradient vector

The partial derivative with respect to the variable $x_{i}$, and therefore also the $i^{\text{th}}$ coordinate of the gradient vector (if it exists), is given as follows when $x_{i} \neq 0$:

$\frac{\partial f}{\partial x_{i}} = \left(\sum_{j = 1}^{n} (a_{i j} + a_{j i}) x_{j}\right) + b_{i} + \lambda \operatorname{sgn}(x_{i})$

The partial derivative is undefined when $x_{i} = 0$ (assuming $\lambda \neq 0$), because $|x_{i}|$ is not differentiable at $0$.
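To see where the coefficient $a_{i j} + a_{j i}$ comes from, it may help to work through the case $n = 2$ (chosen here purely for illustration). The quadratic part of the function is:

$a_{1 1} x_{1}^{2} + (a_{1 2} + a_{2 1}) x_{1} x_{2} + a_{2 2} x_{2}^{2}$

Differentiating with respect to $x_{1}$ gives:

$2 a_{1 1} x_{1} + (a_{1 2} + a_{2 1}) x_{2} = (a_{1 1} + a_{1 1}) x_{1} + (a_{1 2} + a_{2 1}) x_{2} = \sum_{j = 1}^{2} (a_{1 j} + a_{j 1}) x_{j}$

The variable $x_{1}$ appears both as the first factor (through the entries $a_{1 j}$) and as the second factor (through the entries $a_{j 1}$), so both families of entries contribute. The two contributions coincide, giving $2 \sum_{j} a_{i j} x_{j}$, only when $A$ is symmetric.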

The gradient vector exists at a point $\vec{x}$ if and only if all the coordinates of $\vec{x}$ are nonzero.

In vector notation, the gradient vector is as follows for all $\vec{x}$ with all coordinates nonzero:

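$\nabla f(\vec{x}) = (A + A^{T}) \vec{x} + \vec{b} + \lambda \operatorname{sgn}(\vec{x})$

where $\operatorname{sgn}(\vec{x})$ denotes the signum vector function, which applies the signum function to each coordinate of $\vec{x}$.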
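The gradient can be checked numerically. The sketch below computes it from the coordinate formula, with coefficient $a_{i j} + a_{j i}$ on $x_{j}$, and compares it with a central finite-difference approximation. All function names, the step size `h`, and the sample data are assumptions made for this example, and $\lambda \neq 0$ is assumed, so the gradient is treated as undefined whenever a coordinate is zero.

```python
def f(A, b, lam, c, x):
    """Evaluate x^T A x + b^T x + lam * ||x||_1 + c (hypothetical helper)."""
    n = len(x)
    quadratic = sum(A[i][j] * x[i] * x[j] for i in range(n) for j in range(n))
    linear = sum(b[i] * x[i] for i in range(n))
    return quadratic + linear + lam * sum(abs(v) for v in x) + c


def sgn(t):
    """Signum of a real number: -1, 0, or 1."""
    return (t > 0) - (t < 0)


def gradient(A, b, lam, x):
    """Gradient from the coordinate formula; requires every x_i to be nonzero."""
    if any(v == 0 for v in x):
        raise ValueError("gradient is undefined when some coordinate is zero")
    n = len(x)
    return [
        sum((A[i][j] + A[j][i]) * x[j] for j in range(n)) + b[i] + lam * sgn(x[i])
        for i in range(n)
    ]


def numerical_gradient(A, b, lam, c, x, h=1e-6):
    """Central finite differences; h must be smaller than every |x_i|."""
    result = []
    for i in range(len(x)):
        x_plus, x_minus = list(x), list(x)
        x_plus[i] += h
        x_minus[i] -= h
        result.append((f(A, b, lam, c, x_plus) - f(A, b, lam, c, x_minus)) / (2 * h))
    return result


# Sample data (assumed for illustration); A is deliberately not symmetric.
A = [[1.0, 2.0], [0.0, 3.0]]
b = [1.0, -1.0]
x = [1.0, -2.0]
analytic = gradient(A, b, 0.5, x)
numeric = numerical_gradient(A, b, 0.5, 2.0, x)
print(analytic)
print(numeric)
```

For this data the formula gives the gradient $(-0.5, -11.5)$, and the finite-difference values agree up to rounding error. Using $A \vec{x}$ in place of $(A + A^{T}) \vec{x}$ would give $(-2.5, -7.5)$ instead, which does not match the numerical check.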

@@ Line 25: / Line 25: @@
 The partial derivative with respect to the variable <math>x_i</math>, and therefore also the <math>i^{th}</math> coordinate of the [[gradient vector]] (if it exists), is given as follows when <math>x_i \ne 0</math>:
-<math>\frac{\partial f}{\partial x_i} = \left(\sum_{j=1}^n a_{ij}x_j\right) + b_i + \lambda \operatorname{sgn}(x_i)</math>
+<math>\frac{\partial f}{\partial x_i} = \left(\sum_{j=1}^n (a_{ij} + a_{ji})x_j\right) + b_i + \lambda \operatorname{sgn}(x_i)</math>
 The partial derivative is undefined when <math>x_i = 0</math>.