Directional derivative

From Calculus
Revision as of 04:27, 27 January 2013 by Vipul (talk | contribs) (→‎Graphical interpretation)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

This article describes an analogue for functions of multiple variables of the following term/fact/notion for functions of one variable: derivative

Definition at a point

Generic definition

Suppose f is a function of many variables. Consider the domain of f as a subset of Euclidean space. Fix a direction in this space and a point in the domain. Then, the directional derivative at the point in the direction is the derivative of the function with respect to movement of the point along that direction, at the specific point.

For a function of two variables

Suppose f is a function of two variables x,y. Suppose u,v is a unit vector (i.e., we have u2+v2=1). Suppose (x0,y0) is a point in the domain of f We define the directional derivative of f at (x0,y0) in the direction of u,v as follows.

Item Value
Notation Du,v(f)(x0,y0) or u,v(f)(x0,y0)
Definition as a limit limh0f(x0+uh,y0+vh)f(x0,y0)h
Definition as an ordinary derivative ddh[f(x0+uh,y0+vh)]|h=0
{{#widget:YouTube|id=FhrHn_mA-mc}}

For a function of multiple variables

Suppose f is a function of variables x1,x2,,xn. Suppose u1,u2,,un is a unit vector (i.e., we have u12+u22++un2=1). Suppose (a1,a2,,an) is a point in the domain of f. The directional derivative of f at (a1,a2,,an) in the direction of u1,u2,,un is defined as follows.

Item Value
Notation Du1,u2,,un(f)(a1,a2,,an) or u1,u2,,un(f)(a1,a2,,an)
Definition as a limit limh0f(a1+u1h,a2+u2h,,an+unh)f(a1,a2,,an)h
Definition as an ordinary derivative ddh[f(a1+u1h,a2+u2h,,an+unh)]|h=0

For a function of multiple variables in vector notation

Suppose f is a function of a vector variable x¯=x1,x2,,xn. Suppose u¯ is a unit vector and a¯ is a point in the domain of f. The directional derivative of f at a¯ in the direction of u¯ is denoted and defined as below.

Item Value
Notation Du¯(f)(a¯) or u¯(f)(a¯)
Definition as a limit limh0f(a¯+hu¯)f(a¯)h
Definition as an ordinary derivative ddh[f(a¯+hu¯)]|h=0
{{#widget:YouTube|id=3Xu9660Occs}}

Definition as a function

Generic definition

Suppose f is a function of many variables. Consider the domain of f as a subset of Euclidean space. Fix a direction in this space. Then, the directional derivative in the direction is the function sending a point in the domain of f to the derivative of the function with respect to movement of the point along that direction.

For a function of two variables

Suppose f is a function of two variables x,y, with domain a subset of R2. Suppose u,v is a unit vector (i.e., we have u2+v2=1). Then, the directional derivative in the direction of u,v is a function with domain a subset of the domain of f, defined as the function that sends any point in the domain of f to the directional derivative of f in the direction of u,v at the point.

Item Value
Notation Du,v(f)(x,y) or u,v(f)(x,y)
Definition as a limit limh0f(x+uh,y+vh)f(x,y)h
Definition as a partial derivative h[f(x+uh,y+vh)]|h=0. Note that we need to use a partial derivative because x,y are now variable as we are not doing this at a single point.
{{#widget:YouTube|id=EIot8Fdi0EE}}

For a function of multiple variables

Suppose f is a function of variables x1,x2,,xn. Suppose u1,u2,,un is a unit vector (i.e., we have u12+u22++un2=1). We define and denote the directional derivative as below.

Item Value
Notation Du1,u2,,un(f)(x1,x2,,xn) or u1,u2,,un(f)(x1,x2,,xn)
Definition as a limit limh0f(x1+u1h,x2+u2h,,xn+unh)f(x1,x2,,xn)h
Definition as an ordinary derivative h[f(x1+u1h,x2+u2h,,xn+unh)]|h=0. Note that we need to use a partial derivative because x1,x2,,xn are now variable as we are not doing this at a single point.

For a function of multiple variables in vector notation

Suppose f is a function of a vector variable x¯=x1,x2,,xn. Suppose u¯ is a unit vector. We define and denote the directional derivative of f in the direction of u below.

Item Value
Notation Du¯(f)(x¯) or u¯(f)(x¯)
Definition as a limit limh0f(x¯+hu¯)f(x¯)h
Definition as an ordinary derivative h[f(x¯+hu¯)]|h=0. Note that we need to take the partial derivative instead of the ordinary derivative because the coordinates of x¯ themselves are not fixed, as we are doing this at a generic rather than a fixed point.

Graphical interpretation

For a function of two variables

For further information, refer: graph of a function of two variables

The directional derivative Du,vf(x0,y0) in the direction of a unit vector u,v at a point (x0,y0) can be determined as follows: first, intersect the graph of the function with the plane v(xx0)=u(yy0). This plane is perpendicular to the xy-plane and its intersection with the xy-plane is the line through (x0,y0) in the direction of the unit vector u,v.

This intersection can be thought of as the graph of a function of one variable, where the point (x0,y0,0) is treated as the origin, the direction (u,v,0) is the independent variable axis, and the z-axis direction is the dependent variable axis. Now, the directional derivative is the slope of this graph for dependent variable value of 0.

{{#widget:YouTube|id=sICGShQ7CHE}}{{#widget:YouTube|id=hX5KhF_Xog0}}

Relation with gradient vector

Version type Statement
at a point, in vector notation (multiple variables) Suppose f is a function of a vector variable x¯=x1,x2,,xn. Suppose u¯ is a unit vector and a¯ is a point in the domain of f. Suppose that the gradient vector of f at a¯ exists. We denote this gradient vector by f(a¯). Then, we have the following relationship:
Du¯(f)(a¯)=u¯(f(a¯))
The right side here is the dot product of vectors.
generic point, in vector notation (multiple variables) Suppose f is a function of a vector variable x¯=x1,x2,,xn. Suppose u¯ is a unit vector. We then have:
Du¯(f)(x¯)=u¯(f(x¯))
The right side here is a dot product of vectors. The equality holds whenever the right side makes sense.
generic point, point-free notation (multiple variables) Suppose f is a function of a vector variable x¯=x1,x2,,xn. Suppose u¯ is a unit vector. We then have:
Du¯(f)=u¯(f)
The right side here is a dot product of vector-valued functions (the constant function u¯ and the gradient vector of f). The equality holds whenever the right side makes sense.
{{#widget:YouTube|id=TAeK0MVuzJU}}
{{#widget:YouTube|id=T8WGcg_T-DM}}

Relation with partial derivatives

Partial derivatives as directional derivatives

Version type Statement
Generic For a function f of many variables x1,x2,,xn, the partial derivative fxi(x1,x2,,xn) can be thought of as the directional derivative in the direction of the unit vector u1,u2,,un where ui=1 and all other coordinates are zero.

Directional derivative in terms of partial derivatives

If the gradient vector at a point exists, then it is a vector whose coordinates are the corresponding partial derivatives of the function. Thus, conditional to the existence of the gradient vector, we have that:

Du1,u2,,un(f)(x1,x2,,xn)=i=1nuifxi(x1,x2,,xn)=u1fx1(x1,x2,,xn)+u2fx2(x1,x2,,xn)++unfxn(x1,x2,,xn)

{{#widget:YouTube|id=84hww4kkoq8}}

In particular, for a function f(x,y) of two variables and a unit vector u,v, we have the following, assuming that the gradient vector exists (i.e., that the function is differentiable):

Du,v(f)(x,y)=ufx(x,y)+vfy(x,y)