Hessian matrix

From Calculus
Revision as of 23:34, 23 April 2012 by Vipul (talk | contribs) (Created page with "==Definition at a point== ===For a function of two variables at a point=== Suppose <math>f</math> is a real-valued function of two variables <math>x,y</math> and <math>(x_0,...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Definition at a point

For a function of two variables at a point

Suppose f is a real-valued function of two variables x,y and (x0,y0) is a point in the domain of f. Suppose all the four second-order partial derivatives exist at (x0,y0), i.e., the two pure second-order partials fxx(x0,y0),fyy(x0,y0) exist, and so do the two second-order mixed partial derivatives fxy(x0,y0 and fyx(x0,y0). Then, the Hessian matrix of f at (x0,y0), denoted H(f)(x0,y0), is a 2×2 matrix of real numbers defined as follows:

(fxx(x0,y0)fxy(x0,y0)fyx(x0,y0)fyy(x0,y0))

For a function of multiple variables at a point

Suppose f is a real-valued function of multiple variables (x1,x2,,xn). Suppose (a1,a2,,an) is a point in the domain of f. In other words, a1,a2,,an are real numbers and the point has coordinates x1=a1,x2=a2,,xn=an. Suppose, further, that all the second-order partials (pure and mixed) of f with respect to these variables exist at the point (a1,a2,,an). Then, the Hessian matrix of f at (a1,a2,,an), denoted H(f)(a1,a2,,an), is a n×n matrix of real numbers defined as follows:

The (ij)th entry (i.e., the entry in the ith row and jth column) is fxixj(a1,a2,,an). This is the same as 2fxjxif(a1,a2,,an). Note that in the two notations, the order in which we write the partials differs because the convention differs (left-to-right versus right-to-left).

The matrix looks like this:

Failed to parse (unknown function "\begin{pmatrix}"): {\displaystyle \begin{pmatrix} f_{x_1x_1}(a_1,a_2,\dots,a_n) & f_{x_1x_2}(a_1,a_2,\dots,a_n) & \dots & f_{x_2x_n}(a_1,a_2,\dots,a_n)\\ f_{x_2x_1}(a_1,a_2,\dots,a_n) & f_{x_2x_2}(a_1,a_2,\dots,a_n) & \dots & f_{x_2x_n}(a_1,a_2,\dots,a_n)\\ \dot & \dot & \dot & \dot\\ \dot & \dot & \dot & \dot\\ \dot & \dot & \dot & \dot\\ f_{x_nx_1}(a_1,a_2,\dots,a_n) & f_{x_nx_2}(a_1,a_2,\dots,a_n) & \dots & f_{x_nx_n}(a_1,a_2,\dots,a_n)\\\end{pmatrix}}

Definition as a function

For a function of two variables

Suppose f is a real-valued function of two variables x,y. The Hessian matrix of f, denoted H(f), is a 2×2 matrix-valued function that sends each point to the Hessian matrix at that point, if that matrix is defined. It is defined as:

(x0,y0)H(f)(x0,y0)=(fxx(x0,y0)fxy(x0,y0)fyx(x0,y0)fyy(x0,y0))

In the point-free notation, we can write this as:

H(f)=(fxxfxyfyxfyy)

Under continuity assumptions

If we assume that all the second-order partials of f are continuous functions everywhere, then the following happens:

Note that the second conclusion actually only requires the existence of the gradient vector, hence it holds even if the second-order partials are not continuous.