Notational confusion of multivariable derivatives: Difference between revisions
No edit summary |
No edit summary |
||
Line 5: | Line 5: | ||
* The ambiguity of expressions like <math>\nabla f(Ax)</math> | * The ambiguity of expressions like <math>\nabla f(Ax)</math> | ||
* dual basis stuff -- see Tao's explanation of this in p. 225 of [https://terrytao.files.wordpress.com/2011/06/blog-book.pdf] | * dual basis stuff -- see Tao's explanation of this in p. 225 of [https://terrytao.files.wordpress.com/2011/06/blog-book.pdf] | ||
Working off the example from Tao above, let <math>f(x,y) = (x^2,y^2)</math>. What does <math>\frac{d}{dx} f(x,x)</math> mean? Here are three possibilities: | |||
# It's <math>f'</math> (i.e., the total derivative of f) evaluated at the point (x,x). | |||
# It's <math>\frac{df}{dx}</math> (i.e., the total derivative of f with respect to x, in which we treat the variable y as a function of x, and use the chain rule <math>\frac{df}{dx} = \frac{\partial f}{\partial x} + \frac{\partial f}{\partial y} \frac{dy}{dx}</math> to compute) evaluated at the point (x,x). | |||
# We first compute f(x,x) to get an expression involving only x, which implicitly defines a function <math>\mathbf R \to \mathbf R^2</math>. We now differentiate this function. | |||
==The derivative as a linear transformation in the several variable case and a number in the single-variable case== | ==The derivative as a linear transformation in the several variable case and a number in the single-variable case== |
Revision as of 08:55, 30 May 2020
I think there's several different confusions that arise from multivariable derivative notation:
- The thing where can mean two different things on LHS and RHS when is used as both an initial and intermediate variable. (See Folland for details.)
- The thing where if then feels like it might be even though it's actually . (Example from Tao.) See also [1]
- The ambiguity of expressions like
- dual basis stuff -- see Tao's explanation of this in p. 225 of [2]
Working off the example from Tao above, let . What does mean? Here are three possibilities:
- It's (i.e., the total derivative of f) evaluated at the point (x,x).
- It's (i.e., the total derivative of f with respect to x, in which we treat the variable y as a function of x, and use the chain rule to compute) evaluated at the point (x,x).
- We first compute f(x,x) to get an expression involving only x, which implicitly defines a function . We now differentiate this function.
The derivative as a linear transformation in the several variable case and a number in the single-variable case
- The thing where the total derivative for "should" be a function but people treat it as a number. Refer to "Appendix A: Perorations of Dieudonne" (p. 337) in Pugh's Real Mathematical Analysis.
Total derivative versus derivative matrix
Technically the total derivative at a point is a linear transformation, whereas the derivative matrix is a matrix so an array of numbers arranged in a certain order. However, there is a one-to-one correspondence between linear transformations and by matrices, so many books call the total derivative a matrix or equate the two.
A similar confusion exists in the teaching of linear algebra, where sometimes only matrices are mentioned.