Practical:Chain rule for differentiation: Difference between revisions
Revision as of 06:05, 5 December 2023
This article considers practical aspects of the chain rule for differentiation: how is this rule used in actual computations?
Statement to remember
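The statement of the chain rule for differentiation that we will be using is:
<math>\frac{d}{dx}[f(g(x))] = f'(g(x))g'(x)</math>
where <math>f'(u) = \frac{d}{du}[f(u)]</math> and <math>g'(x) = \frac{d}{dx}[g(x)]</math>.
NOTE: As a matter of convention, and to reduce confusion, we use a different variable (<math>u</math> in this case) for the generic input to <math>f</math> compared to the variable (<math>x</math> in this case) that we use for the generic input to <math>g</math>.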
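The statement can be sanity-checked numerically. The sketch below is only an illustration: the example functions (outer function sin, inner function x squared), the evaluation point, and the helper names `chain_rule_derivative` and `central_difference` are all assumptions chosen here, not part of the original article.

```python
import math

def chain_rule_derivative(f_prime, g, g_prime, x):
    """Evaluate f'(g(x)) * g'(x), the right-hand side of the chain rule."""
    return f_prime(g(x)) * g_prime(x)

def central_difference(h, x, step=1e-6):
    """Approximate h'(x) numerically with a symmetric difference quotient."""
    return (h(x + step) - h(x - step)) / (2 * step)

# Assumed example (hypothetical, for illustration): f(u) = sin(u), g(x) = x**2.
f, f_prime = math.sin, math.cos
g = lambda x: x ** 2
g_prime = lambda x: 2 * x

x0 = 1.5  # arbitrary evaluation point
exact = chain_rule_derivative(f_prime, g, g_prime, x0)
approx = central_difference(lambda x: f(g(x)), x0)

# The two values should agree up to the error of the finite-difference estimate.
print(exact, approx)
```

Note that the outer derivative is evaluated at `g(x)`, not at `x`, which mirrors the use of a separate variable for the input to the outer function.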
Procedure to apply the chain rule for differentiation
The chain rule for differentiation is useful as a technique for differentiating functions that are expressed in the form of composites of simpler functions.
Most explicit procedure
The explicit procedure is outlined below:
- Identify the two functions whose composite is the given function. In other words, explicitly decompose the function as a composite of two functions. We will here call the functions <math>f</math> and <math>g</math>, though you may choose to give them different names.
- Calculate the derivatives of <math>f</math> and <math>g</math> separately, on the side.
- Plug into the chain rule formula the expressions for the functions and their derivatives.
- Simplify the expression thus obtained (this is optional in general, though it may be required in some contexts).
For instance, consider the problem:
Differentiate the function
The procedure is as follows:
- Identify the two functions: The two functions are and (note: per the note included with the formulation of the chain rule, we use different variable names for the generic variable for the two functions, to reduce confusion regarding which one to apply on what).
- Calculate the derivatives: and .
- Plug into the chain rule formula: We get .
- Simplify the expression thus obtained: There isn't really anything to simplify, but we can rearrange the terms to the more conventional order where the algebraic part is before the trigonometric part, obtaining the final answer .
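As an illustration of how the four steps lay out on paper, here is a worked computation for an assumed example function, sin(x^2). The function and the labels h, f, g, and u are assumptions chosen for this sketch, with a trigonometric outer function and an algebraic inner function.

```latex
% Assumed example function (chosen for illustration): h(x) = sin(x^2)
\begin{align*}
\text{Step 1:}\quad & h(x) = \sin(x^2) = f(g(x)), \qquad f(u) = \sin u, \quad g(x) = x^2 \\
\text{Step 2:}\quad & f'(u) = \cos u, \qquad g'(x) = 2x \\
\text{Step 3:}\quad & h'(x) = f'(g(x))\,g'(x) = \cos(x^2) \cdot 2x \\
\text{Step 4:}\quad & h'(x) = 2x\cos(x^2)
\end{align*}
```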
More inline procedure using Leibniz notation
Although the explicit procedure above is fairly clear, Step (2) can be slow, since the derivatives are computed separately on the side before they are plugged into the formula. If you are more experienced with differentiating quickly, you can combine Steps (2) and (3) by calculating the derivatives while plugging into the formula. Further, we do not need to name the functions explicitly if we use the Leibniz notation to compute the derivatives inline.
The shorter procedure is outlined below:
- Identify the two functions being composed (but you don't have to give them names).
- Plug into the formula for the chain rule, using the Leibniz notation for derivatives that have not yet been computed.
- Compute derivatives and simplify
For instance, consider the problem:
Differentiate the function
- Identify the two functions being composed: The functions are (the outer/later function) and (the inner/earlier function)
- Plug into the formula for the chain rule: We get (here basically , though we don't have to say this explicitly)
- Compute derivatives and simplify: We get
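A sketch of the Leibniz-notation layout, again for the assumed example function sin(x^2) (an illustrative choice, not necessarily the article's own): the inner expression x^2 plays the role of the intermediate variable, and neither function is given a name.

```latex
% Assumed example function: sin(x^2); inner function x^2, outer function sin.
\begin{align*}
\frac{d}{dx}\left[\sin(x^2)\right]
  &= \frac{d(\sin(x^2))}{d(x^2)} \cdot \frac{d(x^2)}{dx} \\
  &= \cos(x^2) \cdot 2x \\
  &= 2x\cos(x^2)
\end{align*}
```

The first line is written before any derivative is computed. The second line evaluates the two Leibniz quotients.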
Shortest inline procedure
If you are really experienced with doing derivatives in your head, you can shorten the procedure even further by combining Steps (2) and (3) in the previous procedure. The procedure has two steps:
- Identify the two functions being composed (but you don't have to give them names).
- Use the formula for the chain rule, computing the derivatives of the functions while plugging them into the formula
For instance, consider the problem:
Differentiate the function
- Identify the two functions: The functions are (the outer/later function) and (the inner/earlier function)
- Use the formula for the chain rule, computing the derivatives of the functions while plugging them into the formula: We get
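For comparison, the shortest procedure collapses everything into a single line. The sketch below uses the assumed example function sin(x^2); the braces only label the two factors and would not normally be written.

```latex
% Assumed example function: sin(x^2).
\[
\frac{d}{dx}\left[\sin(x^2)\right]
  = \underbrace{\cos(x^2)}_{\text{outer derivative at inner function}}
    \cdot \underbrace{2x}_{\text{inner derivative}}
  = 2x\cos(x^2)
\]
```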
Choosing between procedures
The procedures are not fundamentally different, but they differ in the degree of explicitness of the steps. Generally speaking, the following are recommended:
- If the functions being composed are fairly easy to differentiate mentally, use the shortest inline procedure -- this is fast and reliable.
- If the functions being composed are somewhat more difficult to differentiate, then choose between the other two more explicit procedures, based on whether you are more comfortable with writing large inline expressions or with doing separate work on the side.
Error types
Incorrect formula
A common mistake in differentiating a composite of functions is the use of an incorrect formula, such as <math>(f \circ g)'(x) = f'(g'(x))</math> or <math>(f \circ g)'(x) = f'(x)g'(x)</math>. See the section "Why more naive chain rules don't make sense" of Chain rule for differentiation for more background on why these formulas are incorrect.
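A quick numerical comparison shows that the naive formulas give different values from the correct one. This is a minimal sketch under assumed choices (outer function sin, inner function x squared, evaluation point 1.5). The variable names are hypothetical.

```python
import math

# Assumed example: f(u) = sin(u), g(x) = x**2, evaluated at x = 1.5.
x = 1.5
g_x, g_prime_x = x ** 2, 2 * x

correct = math.cos(g_x) * g_prime_x        # f'(g(x)) * g'(x)
naive_nested = math.cos(g_prime_x)         # f'(g'(x)), incorrect
naive_product = math.cos(x) * g_prime_x    # f'(x) * g'(x), incorrect

# Independent central-difference estimate of the derivative of sin(x**2).
step = 1e-6
numeric = (math.sin((x + step) ** 2) - math.sin((x - step) ** 2)) / (2 * step)

for name, value in [("correct", correct),
                    ("naive_nested", naive_nested),
                    ("naive_product", naive_product)]:
    print(f"{name}: {value:.4f}, differs from numeric by {abs(value - numeric):.4f}")
```

A single evaluation point can hide the error when the inner function happens to satisfy g(x) = x or g'(x) = g(x) there, so it is worth trying more than one point.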
Writing only one piece of the chain rule
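This is an error of the incomplete task form and is harder to avoid. What happens here is that you forget to write one of the two pieces being multiplied, so perhaps you end up doing:
<math>\frac{d}{dx}[f(g(x))] = f'(g(x)) \qquad \mbox{WRONG! Forgot the second factor}</math>
<math>\frac{d}{dx}[f(g(x))] = g'(x) \qquad \mbox{WRONG! Forgot the first factor}</math>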
Why this error occurs: This error is most common when you use the shortest inline procedure (differentiating the functions and applying the chain rule simultaneously) and one of the functions is itself tricky to differentiate, for instance because it requires the product rule or a further application of the chain rule.
How to avoid this error:
- When the functions being differentiated are tricky to differentiate, use either the fully explicit procedure or the inline procedure with Leibniz notation. Do not try to simultaneously differentiate the pieces and use the chain rule.
- After finishing a chain rule problem, ask the following sanity check question: did I get a product of two distinct terms as originally anticipated?
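To make the sanity check concrete, consider an assumed example in which the inner function itself needs the product rule: outer function sin, inner function x e^x. The example is hypothetical and chosen only to illustrate the failure mode.

```latex
% Assumed example: outer function sin, inner function x e^x (needs the product rule).
\begin{align*}
\frac{d}{dx}\left[\sin(x e^x)\right] &= \cos(x e^x) && \text{WRONG: inner derivative missing} \\
\frac{d}{dx}\left[\sin(x e^x)\right] &= e^x + x e^x && \text{WRONG: outer derivative missing} \\
\frac{d}{dx}\left[\sin(x e^x)\right] &= \cos(x e^x)\,(e^x + x e^x) && \text{correct: product of two factors}
\end{align*}
```

Only the last line passes the check: it is a product of the outer derivative evaluated at the inner function and the inner derivative.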