Jacobian matrix: Difference between revisions

From Calculus
No edit summary
No edit summary
Line 36: Line 36:
===Definition in terms of gradient vectors as row vectors===
===Definition in terms of gradient vectors as row vectors===


Suppose <math>f</matH> is a vector-valued function with <math>n</math>-dimensional inputs and <math>m</math>-dimensional outputs. Explicitly, suppose <math>f</math> is a function with inputs <math>x_1,x_2,\dots,x_n</math> and outputs <math>f_1(x_1,x_2,\dots,x_n), f_2(x_1,x_2,\dots,x_n),\dots,f_m(x_1,x_2,\dots,x_n)</math>. Then, the '''Jacobian matrix''' of <math>f</math>is a <math>m \times n</math> matrix of ''functions'' whose <math>i^{th}</math> row is given by the [[gradient vector]] of <matH>f_i</math>.  
Suppose <math>f</matH> is a vector-valued function with <math>n</math>-dimensional inputs and <math>m</math>-dimensional outputs. Explicitly, suppose <math>f</math> is a function with inputs <math>x_1,x_2,\dots,x_n</math> and outputs <math>f_1(x_1,x_2,\dots,x_n), f_2(x_1,x_2,\dots,x_n),\dots,f_m(x_1,x_2,\dots,x_n)</math>. Then, the '''Jacobian matrix''' of <math>f</math>is a <math>m \times n</math> matrix of ''functions'' whose <math>i^{th}</math> row is given by the [[gradient vector]] of <matH>f_i</math>. Explicitly, it looks like this:
 
<math>\begin{pmatrix} \nabla(f_1) \\ \nabla(f_2)\\ \cdot \\ \cdot \\ \cdot \\ \nabla(f_m) \\\end{pmatrix}</math>
 


Note that the domain of this function is the set of points at which all the <math>f_i</math>s individually are differentiable.
Note that the domain of this function is the set of points at which all the <math>f_i</math>s individually are differentiable.
Line 46: Line 49:
<math>\frac{\partial f_i}{\partial x_j}(x_1,x_2,\dots,x_n)</math>
<math>\frac{\partial f_i}{\partial x_j}(x_1,x_2,\dots,x_n)</math>


wherever all the <math>f_i</math>s individually are differentiable in the sense of the gradient vectors existing. If the gradient vectors do not exist but the partial derivatives do, a matrix can still be constructed using this recipe but it may not satisfy the nice behavior that the Jacobian matrix does.
wherever all the <math>f_i</math>s individually are differentiable in the sense of the gradient vectors existing. Here's how the matrix looks:
 
<math>\begin{pmatrix} \frac{\partial f_1}{\partial x_1} & \frac{\partial f_1}{\partial x_2}& \dots & \frac{\partial f_1}{\partial x_n}\\
\frac{\partial f_2}{\partial x_1} & \frac{\partial f_2}{\partial x_2} & \dots & \frac{\partial f_2}{\partial x_n}\\
\cdot & \cdot & \cdot & \cdot \\
\frac{\partial f_m}{\partial x_1} & \frac{\partial f_m}{\partial x_2} & \dots & \frac{\partial f_m}{\partial x_n}\\\end{pmatrix}</math>
 
If the gradient vectors do not exist but the partial derivatives do, a matrix can still be constructed using this recipe but it may not satisfy the nice behavior that the Jacobian matrix does.


==Particular cases==
==Particular cases==

Revision as of 15:49, 12 May 2012

This article describes an analogue for functions of multiple variables of the following term/fact/notion for functions of one variable: derivative

Importance

The Jacobian matrix is the appropriate notion of derivative for a function that has multiple inputs (or equivalently, vector-valued inputs) and multiple outputs (or equivalently, vector-valued outputs).

Definition at a point

Direct epsilon-delta definition

Definition at a point in terms of gradient vectors as row vectors

Suppose f is a vector-valued function with n-dimensional inputs and m-dimensional outputs. Explicitly, suppose f is a function with inputs x1,x2,,xn and outputs f1(x1,x2,,xn),f2(x1,x2,,xn),,fm(x1,x2,,xn). Suppose (a1,a2,,an) is a point in the domain of f such that fi is differentiable at (a1,a2,,an) for i{1,2,,m}. Then, the Jacobian matrix of f at (a1,a2,,an) is a m×n matrix of numbers whose ith row is given by the gradient vector of fi at (a1,a2,,an).

Explicitly, in terms of rows, it looks like:

((f1)(a1,a2,,an)(f2)(a1,a2,,an)(fm)(a1,a2,,an))

Definition at a point in terms of partial derivatives

Suppose f is a vector-valued function with n-dimensional inputs and m-dimensional outputs. Explicitly, suppose f is a function with inputs x1,x2,,xn and outputs f1(x1,x2,,xn),f2(x1,x2,,xn),,fm(x1,x2,,xn). Suppose (a1,a2,,an) is a point in the domain of f such that fi is differentiable at (a1,a2,,an) for i{1,2,,m}. Then, the Jacobian matrix of f at (a1,a2,,an) is a m×n matrix of numbers whose (ij)th entry is given by:

fixj(x1,x2,,xn)|(x1,x2,,xn)=(a1,a2,,an)

Here's how the matrix looks:

((x1,x2,,xn)|(x1,x2,,xn)=(a1,a2,,an)(x1,x2,,xn)|(x1,x2,,xn)=(a1,a2,,an)(x1,x2,,xn)|(x1,x2,,xn)=(a1,a2,,an)(x1,x2,,xn)|(x1,x2,,xn)=(a1,a2,,an)(x1,x2,,xn)|(x1,x2,,xn)=(a1,a2,,an)(x1,x2,,xn)|(x1,x2,,xn)=(a1,a2,,an)(x1,x2,,xn)|(x1,x2,,xn)=(a1,a2,,an)(x1,x2,,xn)|(x1,x2,,xn)=(a1,a2,,an)(x1,x2,,xn)|(x1,x2,,xn)=(a1,a2,,an))

Note that for this definition to be correct, it is still necessary that the gradient vectors exist. If the gradient vectors do not exist but the partial derivatives do, a matrix can still be constructed using this recipe but it may not satisfy the nice behavior that the Jacobian matrix does.

Definition as a function

Definition in terms of gradient vectors as row vectors

Suppose f is a vector-valued function with n-dimensional inputs and m-dimensional outputs. Explicitly, suppose f is a function with inputs x1,x2,,xn and outputs f1(x1,x2,,xn),f2(x1,x2,,xn),,fm(x1,x2,,xn). Then, the Jacobian matrix of fis a m×n matrix of functions whose ith row is given by the gradient vector of fi. Explicitly, it looks like this:

((f1)(f2)(fm))


Note that the domain of this function is the set of points at which all the fis individually are differentiable.

Definition in terms of partial derivatives

Suppose f is a vector-valued function with n-dimensional inputs and m-dimensional outputs. Explicitly, suppose f is a function with inputs x1,x2,,xn and outputs f1(x1,x2,,xn),f2(x1,x2,,xn),,fm(x1,x2,,xn). Then, the Jacobian matrix of f is a m×n matrix of functions whose (ij)th entry is given by:

fixj(x1,x2,,xn)

wherever all the fis individually are differentiable in the sense of the gradient vectors existing. Here's how the matrix looks:

()

If the gradient vectors do not exist but the partial derivatives do, a matrix can still be constructed using this recipe but it may not satisfy the nice behavior that the Jacobian matrix does.

Particular cases

Case What happens in that case?
m=n=1 f is a real-valued function of one variable. The Jacobian matrix is a 1×1 matrix whose entry is the ordinary derivative.
n=1, m>1 f is a vector-valued function of one variable. We can think of it as a parametric curve in Rm. The Jacobian matrix is a m×1 matrix which, read as a column vector, is the parametric derivative of the vector-valued function.
m=1, n>1 f is a real-valued function of multiple variables. The Jacobian matrix is a 1×n matrix which, read as a row vector, is the gradient vector function.
f is a linear or affine map. The Jacobian matrix is the same as the matrix describing f (or, if f is affine, the matrix describing the linear part of f).
m=n, and we are identifying the spaces of inputs and outputs of f. The Jacobian matrix can then be thought of as a linear self-map from the n-dimensional space to itself. In this context, we can consider the Jacobian determinant.