# Chain rule for differentiation

View other differentiation rules

## Statement for two functions

The chain rule is stated in many versions:

Version type Statement
specific point, named functions Suppose $f$ and $g$ are functions such that $g$ is differentiable at a point $x = x_0$, and $f$ is differentiable at $g(x_0)$. Then the composite $f \circ g$ is differentiable at $x_0$, and we have:
$\! \frac{d}{dx}[f(g(x))]|_{x = x_0} = f'(g(x_0))g'(x_0)$
generic point, named functions, point notation Suppose $f$ and $g$ are functions of one variable. Then, we have
$\! \frac{d}{dx}[f(g(x))] = f'(g(x))g'(x)$ wherever the right side expression makes sense.
generic point, named functions, point-free notation Suppose $f$ and $g$ are functions of one variable. Then,
$\! (f \circ g)' = (f' \circ g) \cdot g'$ where the right side expression makes sense, where $\cdot$ denotes the pointwise product of functions.
pure Leibniz notation Suppose $u = g(x)$ is a function of $x$ and $v = f(u)$ is a function of $u$. Then,
$\frac{dv}{dx} = \frac{dv}{du}\frac{du}{dx}$
MORE ON THE WAY THIS DEFINITION OR FACT IS PRESENTED: We first present the version that deals with a specific point (typically with a $\{ \}_0$ subscript) in the domain of the relevant functions, and then discuss the version that deals with a point that is free to move in the domain, by dropping the subscript. Why do we do this?
The purpose of the specific point version is to emphasize that the point is fixed for the duration of the definition, i.e., it does not move around while we are defining the construct or applying the fact. However, the definition or fact applies not just for a single point but for all points satisfying certain criteria, and thus we can get further interesting perspectives on it by varying the point we are considering. This is the purpose of the second, generic point version.

### One-sided version

A one-sided version of sorts holds, but we need to be careful, since we want the direction of differentiability of $f$ to be the same as the direction of approach of $g(x)$ to $g(x_0)$. The following are true:

• We require $g$, the function composed first, to be left differentiable at $x_0$ and require $f$ to be differentiable at $g(x_0)$. Then, the left hand derivative of $f \circ g$ at $x_0$ is $f'(g(x_0))$ times the left hand derivative of $g$ at $x_0$.
• We require $g$, the function composed first, to be right differentiable at $x_0$ and require $f$ to be differentiable at $g(x_0)$. Then, the right hand derivative of $f \circ g$ at $x_0$ is $f'(g(x_0))$ times the right hand derivative of $g$ at $x_0$.
• We require $g$ to be left differentiable at $x_0$, $f$ to be left differentiable at $g(x_0)$, and we require $g$ to be an increasing function. Then, $f \circ g$ is left differentiable at $x_0$ and the left hand derivative is the left hand derivative of $f$ at $g(x_0)$ times the left hand derivative of $g$ at $x_0$.
• We require $g$ to be right differentiable at $x_0$, $f$ to be right differentiable at $g(x_0)$, and we require $g$ to be an increasing function. Then, $f \circ g$ is right differentiable at $x_0$ and the right hand derivative is the right hand derivative of $f$ at $g(x_0)$ times the left hand derivative of $g$ at $x_0$.
• We require $g$ to be left differentiable at $x_0$, $f$ to be right differentiable at $g(x_0)$, and we require $g$ to be a decreasing function. Then, $f \circ g$ is left differentiable at $x_0$ and the left hand derivative is the right hand derivative of $f$ at $g(x_0)$ times the left hand derivative of $g$ at $x_0$.
• We require $g$ to be right differentiable at $x_0$, $f$ to be left differentiable at $g(x_0)$, and we require $g$ to be a decreasing function. Then, $f \circ g$ is right differentiable at $x_0$ and the right hand derivative is the left hand derivative of $f$ at $g(x_0)$ times the left hand derivative of $g$ at $x_0$.

## Statement for multiple functions

Suppose $f_1,f_2,\dots,f_n$ are functions. Then, the following is true wherever the right side makes sense:

$(f_1 \circ f_2 \circ f_3 \dots \circ f_n)' = (f_1' \circ f_2 \circ \dots \circ f_n) \cdot (f_2' \circ \dots \circ f_n) \cdot \dots \cdot (f_{n-1}' \circ f_n) \cdot f_n'$

For instance, in the case $n = 3$, we get:

$(f_1 \circ f_2 \circ f_3)' = (f_1' \circ f_2 \circ f_3) \cdot (f_2' \circ f_3) \cdot f_3'$

In point notation, this is:

$\! \frac{d}{dx}[f_1(f_2(f_3(x)))] = f_1'(f_2(f_3(x))f_2'(f_3(x))f_3'(x)$

## Significance

### Qualitative and existential significance

Each of the versions has its own qualitative significance:

Version type Significance
specific point, named functions This tells us that if $g$ is differentiable at a point $x_0$ and $f$ is differentiable at $g(x_0)$, then $f \circ g$ is differentiable at $x_0$.
generic point, named functions, point notation If $g$ is a differentiable function and $f$ is a differentiable function on the intersection of its domain with the range of $g$, then $f \circ g$ is a differentiable function.
generic point, named functions, point-free notation We can deduce properties of $(f \circ g)'$ based on properties of $f',g',f,g$. In particular, if $f'$ and $g'$ are both continuous functions, so is $(f \circ g)'$. Another way of putting this is that if $f$ and $g$ are both continuously differentiable functions, so is $f \circ g$.

### Computational feasibility significance

Each of the versions has its own computational feasibility significance:

Version type Significance
specific point, named functions If we know the values (in the sense of numerical values) $g'(x_0)$ and $f'(g(x_0))$, we can use these to compute $(f \circ g)'(x_0)$.
generic point, named functions This tells us that knowledge of the general expressions for the derivatives of $f$ and $g$ (along with expressions for the functions themselves) allows us to compute the general expression for the derivative of $f \circ g$.

### Computational results significance

Shorthand Significance
significance of derivative being zero If $g'(x_0) = 0$, and $f$ is differentiable at $g(x_0)$, then $(f \circ g)'(x_0) = 0$. Note that the conclusion need not follow if $f$ is not differentiable at $g(x_0)$.
Also, if $f'(g(x_0)) = 0$ and $g$ is differentiable at $x_0$, then $(f \circ g)'(x_0) = 0$.
significance of sign of derivative The product of the signs of $f'(g(x_0))$ and $g'(x_0)$ gives the sign of $(f \circ g)'(x_0)$. In particular, if both have the same sign, then $(f \circ g)'$ is positive. If both have opposite signs, then $(f \circ g)'$ is negative. This is related to the idea that a composite of increasing functions is increasing, and similar ideas.
significance of uniform bounds on derivatives If $f'$ and $g'$ are uniformly bounded, then so is $(f \circ g)'$, with a possible uniform bound being the product of the uniform bounds for $f'$ and $g'$.