Proof of chain rule for differentiation

From Calculus

This article describes a proof of the chain rule for differentiation.

Specific point, named functions version

Suppose f and g are functions such that g is differentiable at a point x=x0, and f is differentiable at g(x0). Then the composite fg is differentiable at x0, and we have:
ddx[f(g(x))]|x=x0=f(g(x0))g(x0)

Pure Leibniz notation version

Suppose u=g(x) is a function of x and v=f(u) is a function of u. Then,
dvdx=dvdududx

Proof

Intuitive proof using the pure Leibniz notation version

The following intuitive proof is not rigorous, but captures the underlying idea:

  • Start with the expression dvdududx.
  • Cancel the du between the denominator and the numerator.
  • We are left with dvdx.

First attempt at formalizing the intuition

This again is not a complete proof, but it gets closer:

Note the following product relation between difference quotients when measured between any pair of distinct points x=x1 and x=x2:

ΔvΔx=ΔvΔuΔuΔx

In the functional notation, this would read as saying the obvious thing that:

f(g(x2))f(g(x1))x2x1=f(g(x2))f(g(x1))g(x2)g(x1)g(x2)g(x1)x2x1

Now, set x1=x0 and let x2=x be some number close enough to x0. We get:

f(g(x))f(g(x0))xx0=f(g(x))f(g(x0))g(x)g(x0)g(x)g(x0)xx0

Now, take the limit on both sides as xx0, and we get:

limxx0f(g(x))f(g(x0))xx0=limxx0f(g(x))f(g(x0))g(x)g(x0)limxx0g(x)g(x0)xx0

Since g is differentiable at x0, g is continuous at x0 as well (using the fact that differentiable implies continuous) and thus the first limit on the right side can be taken as:

limxx0f(g(x))f(g(x0))xx0=limg(x)g(x0)f(g(x))f(g(x0))g(x)g(x0)limxx0g(x)g(x0)xx0

The three limits are now definitionally derivatives, so we get:

(fg)(x0)=f(g(x0))g(x0)

In the Δ-notation, we are simply taking the appropriate limits on

ΔvΔx=DeltavΔuDeltauΔx

and getting that:

dvdx=dvdududx

Fixing the bug in the proof

The proof above is correct in essentials, but has one bug -- namely, the issue that Δu=g(x)g(x0) may be equal to zero for xx0, making the difference quotient Δv/Δu undefined at these points. This is not an issue if this happens only at finitely many points, because we can take the limit close enough. It does become an issue, however, if g(x)=g(x0) at points x arbitrarily close to x0.

The solution to those problem is to replace the expression f(g(x))f(g(x0))g(x)g(x0) by the expression:

{f'(g(x0)),g(x)=g(x0),g(x)g(x0)

In other words, we fill in the removable discontinuity before plugging it into the product expression. If we use this expression instead, the proof becomes rigorous.

Here is the full rigorous proof. Fill this in later