Summary table of multivariable derivatives

From Calculus
Revision as of 03:42, 3 November 2018 by IssaRice (talk | contribs) (moving from https://machinelearning.subwiki.org/wiki/Summary_table_of_multivariable_derivatives)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

This page is a summary table of multivariable derivatives.

  • TODO maybe good to have separate rows for evaluated and pre-evaluated versions, for things that are functions/can be applied

Single-variable real function

For comparison and completeness, we give a summary table of the single-variable derivative. Let f:RR be a single-variable real function.

Term Notation Type Definition Notes
Derivative of f f or dfdx RR f(x)=limh0f(x+h)f(x)h
Derivative of f at x0R f(x0) or dfdx(x0) or ddxf(x)|x=x0 R f'(x0)=limh0f(x0+h)f(x0)h=limxx0f(x)f(x0)xx0 In the most general multivariable case, f(x0) will become a linear transformation, so analogously we may wish to talk about the single-variable f(x0) as the function f(x0):RR defined by f(x0)(x)=f(x0)x, where on the left side "f(x0)" is a function and on the right side "f(x0)" is a number. If "f(x0)" is a function, we can evaluate it at 1 to recover the number: f(x0)(1). This is pretty confusing, and in practice everyone thinks of "f(x0)" in the single-variable case as a number, making the notation divergent; see Notational confusion of multivariable derivatives § The derivative as a linear transformation in the several variable case and a number in the single-variable case for more information.

Real-valued function of Rn

Let f:RnR be a real-valued function of Rn.

Term Notation Type Definition Notes
Partial derivative of f with respect to its jth variable jf or xjf or fxj or fxj or fj RnR jf(x)=limt0f(x+tej)f(x)t Here ej=(0,,1,,0) is the jth vector of the standard basis, i.e. the vector with all zeroes except a one in the jth spot. Therefore x+tej can also be written (x1,,xj+t,,xn) when broken down into components.
Gradient f RnRn f(x)=(1f(x),,nf(x))
Gradient at x0Rn f(x0) Rn or M1,n(R) (1f(x0),,nf(x0)) or the vector c such that limxx0|f(x)f(x0)c(xx0)||xx0|=0
Directional derivative in the direction of v Dvf or vf RnR Dvf(x)=limt0f(x+tv)f(x)t When v=ej, this reduces to the jth partial derivative.

I think in this case, since f(x0)(v) coincides with f(x0)v, people don't usually define the derivative separately. For example, Folland in Advanced Calculus defines differentiability but not the derivative! He just says that the vector that makes a function differentiable is the gradient.

TODO: answer questions like "Is the gradient the derivative?"

Vector-valued function of R

Let f:RRm be a vector-valued function of R. A parametric curve (or parametrized curve) is an example of this. Since the function is vector-valued, some authors use a boldface letter like f.

Term Notation Type Definition Notes
Velocity vector at t v(t) or Df(t) RRm (f1(t),,fn(t))

Note the absence for partial/directional derivatives. There is only one variable with respect to which we can differentiate, so there is no direction to choose from.

Vector-valued function of Rn

Let f:RnRm be a vector-valued function of Rn. Since the function is vector-valued, some authors use a boldface letter like f.

Term Notation Type Definition Notes
Partial derivative with respect to the jth variable jf or xjf or fxj or fxj or fj RnRm jf(x)=limt0f(x+tej)f(x)t
Directional derivative in the direction of v Dvf or vf RnRm Dvf(x)=limt0f(x+tv)f(x)t
Total or Fréchet derivative (sometimes just called the derivative) at point x0Rn f(x0) or (Df)x0 or dx0f RnRm The linear transformation L such that limxx0|f(x)f(x0)L(xx0)||xx0|=0 The derivative at a given point is a linear transformation. One might wonder then what the derivative (without giving a point) is, i.e. what meaning to assign to "f" as we can in the single-variable case. Its type would have to be RnRnRm or more specifically RnL(Rn,Rm). Also the notation f(x0) is slightly confusing: if the total derivative is a function, what happens if n=m=1? We see that f(x0):RR, so the single-variable derivative isn't actually a number! To get the actual slope of the tangent line, we must evaluate the function at 1: f(x0)(1)R. Some authors avoid this by using different notation in the general multivariable case. Others accept this type error and ignore it.
Derivative matrix, differential matrix, Jacobian matrix at point x0Rn Df(x0) or M(f(x0)) Mm,n(R) (1f1(x0)nf1(x0)1fn(x0)nfn(x0)) Since the total derivative is a linear transformation, and since linear transformations from Rn to Rm have a one-to-one correspondence with real-valued m by n matrices, the behavior of the total derivative can be summarized in a matrix; that summary is the derivative matrix. Some authors say that the total derivative is the matrix. TODO: talk about gradient vectors as rows.

Note the absence of the gradient in the above table. The generalization of the gradient to the RnRm case is the derivative matrix.

See also

References

  • Tao, Terence. Analysis II. 2nd ed. Hindustan Book Agency. 2009.
  • Folland, Gerald B. Advanced Calculus. Pearson. 2002.
  • Pugh, Charles Chapman. Real Mathematical Analysis. Springer. 2010.

External links