Skip to main content
Logo image

APEX Calculus for PROTEUS

Section 1.2 The Chain Rule

We have covered almost all of the derivative rules that deal with combinations of two (or more) functions. The operations of addition, subtraction, multiplication (including by a constant) and division led to the sum, difference, constant multiple, power, product, and quotient rules. To complete the list of differentiation rules, we look at the last way two (or more) functions can be combined: the process of composition (i.e. one function “inside” another).
Figure 1.2.1. Video introduction to Section 1.2
One example of a composition of functions is \(f(x) = \cos(x^2)\text{.}\) We currently do not know how to compute this derivative. If forced to guess, one might guess \(\fp(x) = -\sin(2x)\text{,}\) where we recognize \(-\sin(x)\) as the derivative of \(\cos(x)\) and \(2x\) as the derivative of \(x^2\text{.}\) However, this is not the case; \(\fp(x)\neq -\sin(2x)\text{.}\) One way to see this is to examine the graph of \(y=\cos\mathopen{}\left(x^2\right)\mathclose{}\) in Figure 1.2.2 and its tangent line at \(x=\pi/2\text{.}\) Clearly the slope of the tangent line there is nonzero, but \(-2\sin(2\cdot\pi/2)=0\text{.}\) So it can’t be correct to say that \(y'=-\sin(2x)\text{.}\)
A cosine wave with increasing frequency with a negatively sloped tangent line.
A cosine wave with increasing frequency. The distance between the peaks decreases as \(x\) increases. There is a point drawn on the curve at \(x= \frac{\pi}{2}\text{,}\) roughly at \(y=-0.8\text{.}\) A tangent line to the point is drawn, sloping downward.
Figure 1.2.2. A graph of \(y=\cos(x^2)\) and a tangent line at \(\pi/2\)
In Example 1.2.12 we’ll see the correct way to compute the derivative of \(\sin\mathopen{}\left(x^2\right)\mathclose{}\text{,}\) which employs the new rule this section introduces, the Chain Rule.
Before we define this new rule, recall the notation for composition of functions. We write \((f \circ g)(x)\) or \(f(g(x))\text{,}\) read as “\(f\) of \(g\) of \(x\text{,}\)” to denote composing \(f\) with \(g\text{.}\) In shorthand, we simply write \(f \circ g\) or \(f(g)\) and read it as “\(f\) of \(g\text{.}\)” Before giving the corresponding differentiation rule, we note that the rule extends to multiple compositions like \(f(g(h(x)))\) or \(f(g(h(j(x))))\text{,}\) etc.
To motivate the rule, let’s look at three derivatives we can already compute.

Example 1.2.3. Exploring similar derivatives.

Find the derivatives of \(F_1(x) = (1-x)^2\text{,}\) \(F_2(x) = (1-x)^3\text{,}\) and \(F_3(x) = (1-x)^4\text{.}\) (We’ll see later why we are using subscripts for different functions and an uppercase \(F\text{.}\))
Solution.
In order to use the rules we already have, we must first expand each function as
\begin{align*} F_1(x)\amp = 1 - 2x + x^2\\ F_2(x)\amp = 1 - 3x + 3x^2 - x^3\\ F_3(x)\amp = 1 - 4x + 6x^2 - 4x^3 + x^4 \end{align*}
It is not hard to see that:
\begin{align*} F_1'(x)\amp = -2 + 2x\\ F_2'(x)\amp = -3 + 6x - 3x^2\\ F_3'(x)\amp = -4 + 12x - 12x^2 + 4x^3 \end{align*}
An interesting fact is that these can be rewritten as:
\begin{align*} F_1'(x)\amp = -2(1-x)\\ F_2'(x)\amp = -3(1-x)^2\\ F_3'(x)\amp = -4(1-x)^3 \end{align*}
A pattern might jump out at you; note how the we end up multiplying by the old power and the new power is reduced by 1. We also always multiply by \((-1)\text{.}\)
Recognize that each of these functions is a composition, letting \(g(x) = 1-x\text{:}\)
\begin{align*} F_1(x)\amp = f_1(g(x)),\amp\amp \text{ where } f_1(x) = x^2,\\ F_2(x)\amp = f_2(g(x)),\amp\amp \text{ where } f_2(x) = x^3,\\ F_3(x)\amp = f_3(g(x)),\amp\amp \text{ where } f_3(x) = x^4\text{.} \end{align*}
We’ll come back to this example after giving the formal statements of the Chain Rule; for now, we are just illustrating a pattern.
Figure 1.2.5. Video presentation of Theorem 1.2.4
Here is the Chain Rule in words:
The derivative of the outside function, evaluated at the inside function, multiplied by the derivative of the inside function.

Exercise 1.2.6.

For the function \(s(x) = 3(7x^2+11x)^3\text{,}\) find functions \(f(x)\) and \(g(x)\) such that \(f(x)\) is the outside function and \(g(x)\) is the inside function; that is, such that \(s(x)=f(g(x))\text{.}\) Then find \(\fp(x)\) and \(g'(x)\text{,}\) and finally, \(s'(x)\) using the chain rule.

Exercise 1.2.7.

Imagine you are tracking the temperature inside a greenhouse between 7 am and 3 pm. The temperature inside the greenhouse, \(T\text{,}\) depends on the temperature outside, \(S\text{,}\) which itself depends on the time of day, \(t\text{.}\) The figure below illustrates this relationship with the outside temperature as a common axis.
Graphs of the two parts of a composite function, arranged in three dimensions with the intermediate function as a common axis.
Two graphs are arranged in three dimensions to show the interdependence of the parts of a composite function. The graphs are aligned so that the graph of the outside temperature, \(S\text{,}\) lies on the “floor” or a room, while the graph of the inside temperature \(T\text{,}\) lies on one wall of the room.
The bottom of the graph of the outside temperature \(S(t)\) is labeled with times from 7 am until 3 pm. There is a grid line for each hour in this range. The left side of the outside temperature graph is labeled with temperaturs in degrees Fahrenheit, ranging from \(64^\circ\) F to \(76^\circ\) F, with a grid line for each degree. The graph itself is in the shape of a parabola opening downward, and appears to pass near to the points \((9 am,74)\) and \((1 pm, 70)\text{.}\)
The temperature axis which was the left side of the outside temperature graph forms the bottom of the inside temperature graph. This second graph is the graph of \(T(S)\text{,}\) which shows the inside temperature as a function of the outside temperature. Markings for the inside temperature appear on the left side of the second graph, and range from \(88^\circ\) F to \(100^\circ\) F, with grid markings for each degree. The graph rises from bottom-left to upper-right, and has a slight downward curve. It appears to pass near the points \((70,96)\) and \((71,97)\text{.}\)

(a)

Use image above to approximate (within \(1^\circ\) F) the following values:
  1. The temperature outside at 9 am.
  2. The temperature inside the greenhouse when the temperature outside is \(71^\circ\) F.
  3. The temperature inside the greenhouse at 1 pm.

(b)

Explain why the temperature inside the greenhouse is also a function of time.

(c)

Notice that the graph of \(T(S)\) shows a strictly increasing trend. This suggests that, throughout the day, the greenhouse temperature never decreases, even though the outside temperature does. Explain what is incorrect about this statement in the context of the chain rule.

Exercise 1.2.8.

You are moving straight upward in a hot air balloon at a rate of 2 mph. Let \(A\) be your distance from the ground. The air temperature, \(H\text{,}\) is changing as a function of altitude, and decreasing at a rate of \(16^\circ\) F per mile.

(a)

Explain in your own words, how the air temperature you feel changes with time, and find that rate.
To help understand the Chain Rule, we return to Example 1.2.3.

Example 1.2.9. Using the Chain Rule.

Use the Chain Rule to find the derivatives of the functions \(F_1(x)\text{,}\) \(F_2(x)\text{,}\) and \(F_3(x)\text{,}\) as given in Example 1.2.3.
Solution.
Example 1.2.3 ended with the recognition that each of the given functions was actually a composition of functions. To avoid confusion, we ignore most of the subscripts here.
\(F_1(x) = (1-x)^2\)
We found that
\begin{equation*} y=(1-x)^2 = f(g(x))\text{,} \end{equation*}
where \(f(x) = x^2\) and \(g(x) = 1-x\text{.}\) To find \(y'\text{,}\) we apply the The Chain Rule. We need to note that \(\fp(x)=2x\) and \(g'(x)=-1\text{.}\)
Part of the The Chain Rule uses \(\fp(g(x))\text{.}\) This means substitute \(g(x)\) for \(x\) in the equation for \(\fp(x)\text{.}\) That is, \(\fp(x) = 2(1-x)\text{.}\) Finishing out the The Chain Rule we have
\begin{align*} y' \amp = \fp(g(x))\cdot g'(x)\\ \amp = 2(1-x)\cdot (-1)\\ \amp = -2(1-x)\\ \amp = 2x-2\text{.} \end{align*}
\(F_2(x) = (1-x)^3\)
Let \(y = (1-x)^3 = f(g(x))\text{,}\) where \(f(x) = x^3\) and \(g(x) = (1-x)\text{.}\) We have \(\fp(x) = 3x^2\text{,}\) so \(\fp(g(x)) = 3(1-x)^2\text{.}\) The The Chain Rule then states
\begin{align*} y' \amp = \fp(g(x))\cdot g'(x)\\ \amp = 3(1-x)^2\cdot (-1)\\ \amp = -3(1-x)^2\text{.} \end{align*}
\(F_3(x) = (1-x)^4\)
Finally, when \(y = (1-x)^4\text{,}\) we have \(f(x)= x^4\) and \(g(x) = (1-x)\text{.}\) Thus \(\fp(x) = 4x^3\) and \(\fp(g(x)) = 4(1-x)^3\text{.}\) Thus
\begin{align*} y' \amp = \fp(g(x))\cdot g'(x)\\ \amp = 4(1-x)^3\cdot (-1)\\ \amp = -4(1-x)^3\text{.} \end{align*}
Example 1.2.9 demonstrated a particular pattern: when \(f(x)=x^n\text{,}\) then \(y' =n\cdot (g(x))^{n-1}\cdot g'(x)\text{.}\) This is called the Generalized Power Rule.
This allows us to quickly find the derivative of functions like \(y = (3x^2-5x+7+\sin(x) )^{20}\text{.}\) While it may look intimidating, the Generalized Power Rule states that
\begin{equation*} y' = 20(3x^2-5x+7+\sin(x) )^{19}\cdot (6x-5+\cos(x) )\text{.} \end{equation*}
Treat the derivative-taking process step-by-step. In the example just given, first multiply by \(20\text{,}\) then rewrite the inside of the parentheses, raising it all to the \(19\)th power. Then think about the derivative of the expression inside the parentheses, and multiply by that.
We now consider more examples that employ the The Chain Rule.

Example 1.2.11. Using the Chain Rule.

Find the derivatives of the following functions:
  1. \(y = \sin(2x)\text{.}\)
  2. \(y= \ln(4x^3-2x^2)\text{.}\)
  3. \(y = e^{-x^2}\text{.}\)
Solution 1.
  1. Consider \(y = \sin(2x)\text{.}\) Recognize that this is a composition of functions, where \(f(x) = \sin(x)\) and \(g(x) = 2x\text{.}\) Thus
    \begin{align*} y' \amp = \fp(g(x))\cdot g'(x)\\ \amp = \cos(2x)\cdot \lzoo{x}{2x}\\ \amp = \cos(2x)\cdot 2\\ \amp = 2\cos(2x)\text{.} \end{align*}
  2. Recognize that \(y = \ln\mathopen{}\left(4x^3-2x^2\right)\mathclose{}\) is the composition of \(f(x) = \ln(x)\) and \(g(x) = 4x^3-2x^2\text{.}\) Also, recall that
    \begin{equation*} \lzoo{x}{\ln(x)} = \frac{1}{x}\text{.} \end{equation*}
    This leads us to:
    \begin{align*} y' \amp = \frac{1}{4x^3-2x^2} \cdot \lzoo{x}{4x^3-2x^2}\\ \amp = \frac{1}{4x^3-2x^2} \cdot \left(12x^2-4x\right)\\ \amp = \frac{12x^2-4x}{4x^3-2x^2}\\ \amp = \frac{4x(3x-1)}{2x(2x^2-x)}\\ \amp = \frac{2(3x-1)}{2x^2-x}\text{.} \end{align*}
    Note that \(\ln\mathopen{}\left(4x^3-2x^2\right)\mathclose{}=\ln\mathopen{}\left(4x^2(x-1/2)\right)\mathclose{}\) was only defined for \(x \gt 1/2\text{,}\) so the result of \(y'=\frac{2(3x-1)}{2x^2-x}\) is only valid for \(x \gt 1/2\) as well.
  3. Recognize that \(y = e^{-x^2}\) is the composition of \(f(x) = e^x\) and \(g(x) = -x^2\text{.}\) Remembering that \(\fp(x) = e^x\text{,}\) we have
    \begin{align*} y' \amp = e^{-x^2}\cdot \lzoo{x}{-x^2}\\ \amp = e^{-x^2}\cdot (-2x)\\ \amp = -2xe^{-x^2}\text{.} \end{align*}
Solution 2. Video solution

Example 1.2.12. Using the Chain Rule to find a tangent line.

Let \(f(x) = \cos(x^2)\text{.}\) Find the equation of the line tangent to the graph of \(f\) at \(x=1\text{.}\)
Solution.
The tangent line goes through the point \((1,f(1)) \approx (1,0.54)\) with slope \(\fp(1)\text{.}\) To find \(\fp\text{,}\) we need the The Chain Rule.
\(\fp(x) = -\sin(x^2) \cdot(2x) = -2x\sin(x^2)\text{.}\) Evaluated at \(x=1\text{,}\) we have \(\fp(1) = -2\sin(1) \approx -1.68\text{.}\) Thus the equation of the tangent line is approximated by
\begin{equation*} y \approx -1.68(x-1)+0.54\text{.} \end{equation*}
The tangent line is sketched along with \(f\) in Figure 1.2.13.
A wave with increasing frequency with a negatively sloped tangent line at x=1.
A cosine wave with increasing frequency as the graph moves further from the \(y\)-axis. To the left, there is one peak and valley before the graph moves to the \(y\)-axis, where the graph becomes gradually horizontal. To the right, the graph has another valley and peak, symmetrical to the left side. At the point \(x=1\text{,}\) there is a tangent line drawn. This line has a clear negative slope.
Figure 1.2.13. \(f(x) = \cos(x^2)\) sketched along with its tangent line at \(x=1\)
The The Chain Rule is used often in taking derivatives. Because of this, one can become familiar with the basic process and learn patterns that facilitate finding derivatives quickly. For instance,
\begin{equation*} \lzoo{x}{\ln(\text{anything})} = \frac{1}{\text{anything}}\cdot\lzoo{x}{\text{anything}} = \frac{\lzoo{x}{\text{anything}}}{\text{anything}}\text{.} \end{equation*}
A concrete example of this is
\begin{equation*} \lzoo{x}{\ln(3x^{15}-\cos(x) +e^x)} = \frac{45x^{14}+\sin(x) +e^x}{3x^{15}-\cos(x) +e^x}\text{.} \end{equation*}
While the derivative may look intimidating at first, look for the pattern. The denominator is the same as what was inside the natural log function; the numerator is simply its derivative.
This pattern recognition process can be applied to lots of functions. In general, instead of writing “anything”, we use \(u\) as a generic function of \(x\text{.}\) We then say
\begin{equation*} \lzoo{x}{\ln(u)} = \frac{u'}{u}\text{.} \end{equation*}
The following is a short list of how the The Chain Rule can be quickly applied to familiar functions.
  1. \(\lzoo{x}{u^n} = n\cdot u^{n-1}\cdot u'\text{.}\)
  2. \(\lzoo{x}{e^u} = e^u \cdot u'\text{.}\)
  3. \(\lzoo{x}{\sin(u)} = \cos(u) \cdot u'\text{.}\)
  4. \(\lzoo{x}{\cos(u)} = -\sin(u)\cdot u'\text{.}\)
  5. \(\lzoo{x}{\tan(u)} = \sec^2(u) \cdot u'\text{.}\)
Of course, the The Chain Rule can be applied in conjunction with any of the other rules we have already learned. We practice this next.

Example 1.2.14. Using the Product, Quotient and Chain Rules.

Find the derivatives of the following functions.
  1. \(f(x) = x^5 \sin(2x^3)\text{.}\)
  2. \(f(x) = \dfrac{5x^3}{e^{-x^2}}\text{.}\)
Solution 1.
  1. We must use the product rule and The Chain Rule. Do not think that you must be able to “see” the whole answer immediately; rather, just proceed step-by-step.
    \begin{align*} \fp(x) \amp = x^5 \cdot \lzoo{x}{\sin\mathopen{}\left(2x^3\right)\mathclose{}} + \sin\mathopen{}\left(2x^3\right)\mathclose{}\cdot \lzoo{x}{x^5}\\ \amp = x^5\left(\cos\mathopen{}\left(2x^3\right)\mathclose{}\cdot \lzoo{x}{2x^3} \right) + 5x^4\left(\sin\mathopen{}\left(2x^3\right)\mathclose{} \right)\\ \amp = x^5\big(6x^2\cos\left(2x^3\right) \big) + 5x^4\big(\sin\left(2x^3\right) \big)\\ \amp = 6x^7\cos\mathopen{}\left(2x^3\right)\mathclose{}+5x^4\sin\mathopen{}\left(2x^3\right)\mathclose{}\text{.} \end{align*}
  2. We must employ the quotient rule along with the The Chain Rule. Again, proceed step-by-step.
    \begin{align*} \fp(x) \amp= \frac{e^{-x^2}\cdot \lzoo{x}{5x^3} - 5x^3\cdot \lzoo{x}{e^{-x^2}}}{\left(e^{-x^2}\right)^2}\\ \amp= \frac{e^{-x^2}\cdot 15x^2 - 5x^3\cdot e^{-x^2}\cdot \lzoo{x}{-x^2}}{\left(e^{-x^2}\right)^2}\\ \amp= \frac{e^{-x^2}\left(15x^2\right) - 5x^3\left((-2x)e^{-x^2}\right)}{\left(e^{-x^2}\right)^2}\\ \amp =\frac{e^{-x^2}\left(10x^4+15x^2\right)}{e^{-2x^2}}\\ \amp = e^{x^2}\left(10x^4+15x^2\right)\text{.} \end{align*}
Solution 2. Video solution
We can also employ the The Chain Rule itself several times, as shown in the next example.

Example 1.2.15. Using the Chain Rule multiple times.

Find the derivative of \(y = \tan^5(6x^3-7x)\text{.}\)
Solution 1.
Recognize that we have the \(g(x)=\tan\mathopen{}\left(6x^3-7x\right)\mathclose{}\) function “inside” the \(f(x)=x^5\) function; that is, we have \(y = \left(\tan\mathopen{}\left(6x^3-7x\right)\mathclose{}\right)^5\text{.}\) We begin using the Generalized Power Rule; in this first step, we do not fully compute the derivative. Rather, we are approaching this step-by-step.
\begin{equation*} y' = 5\left(\tan\mathopen{}\left(6x^3-7x\right)\mathclose{}\right)^4\cdot g'(x)\text{.} \end{equation*}
We now find \(g'(x)\text{.}\) We again need the The Chain Rule;
\begin{align*} g'(x) \amp = \sec^2\mathopen{}\left(6x^3-7x\right)\mathclose{}\cdot \lzoo{x}{6x^3-7x}.\\ \amp = \sec^2\mathopen{}\left(6x^3-7x\right)\mathclose{}\cdot\left(18x^2-7\right)\text{.} \end{align*}
Combine this with what we found above to give
\begin{align*} y' \amp = 5\left(\tan\mathopen{}\left(6x^3-7x\right)\mathclose{}\right)^4\cdot\sec^2\mathopen{}\left(6x^3-7x\right)\mathclose{}\cdot\left(18x^2-7\right)\\ \amp = \left(90x^2-35\right)\sec^2\mathopen{}\left(6x^3-7x\right)\mathclose{}\tan^4\mathopen{}\left(6x^3-7x\right)\mathclose{}\text{.} \end{align*}
This function is frankly a ridiculous function, possessing no real practical value. It is very difficult to graph, as the tangent function has many vertical asymptotes and \(6x^3-7x\) grows so very fast. The important thing to learn from this is that the derivative can be found. In fact, it is not “hard”; one can take several simple steps and should be careful to keep track of how to apply each of these steps.
Solution 2. Video solution
It is a traditional mathematical exercise to find the derivatives of arbitrarily complicated functions just to demonstrate that it can be done. Just break everything down into smaller pieces.

Example 1.2.16. Using the Product, Quotient and Chain Rules.

Find the derivative of \(f(x) = \frac{x\cos(x^{-2})-\sin^2(e^{4x})}{\ln(x^2+5x^4)}\text{.}\)
Solution.
This function likely has no practical use outside of demonstrating derivative skills. The answer is given below without simplification.
\begin{align*} \amp\fp(x)\\ \amp=\Bigg(\ln\mathopen{}\left(x^2+5x^4\right)\mathclose{}\cdot\bigg[\Big(x\cdot\left(-\sin\mathopen{}\left(x^{-2}\right)\mathclose{}\right)\cdot\left(-2x^{-3}\right)\\ \amp\qquad{}+1\cdot \cos\mathopen{}\left(x^{-2}\right)\Big)-2\sin\mathopen{}\left(e^{4x}\right)\mathclose{}\cdot\cos\mathopen{}\left(e^{4x}\right)\mathclose{}\cdot\left(4e^{4x}\right)\bigg]\\ \amp\qquad\qquad\qquad{}-\left(x\cos\mathopen{}\left(x^{-2}\right)\mathclose{}-\sin^2\mathopen{}\left(e^{4x}\right)\mathclose{}\right)\cdot\frac{2x+20x^3}{x^2+5x^4}\Bigg)\\ \amp\phantom{{}={}}\left/\left(\ln\mathopen{}\left(x^2+5x^4\right)\mathclose{}\right)^2\right.\text{.} \end{align*}
The The Chain Rule also has theoretic value. That is, it can be used to find the derivatives of functions that we have not yet learned as we do in the following example.

Example 1.2.17. The Chain Rule and exponential functions.

Use the Chain Rule to find the derivative of \(y= 2^x\text{.}\)
Solution.
We only know how to find the derivative of one exponential function, \(y = e^x\text{.}\) We can accomplish our goal by rewriting \(2\) in terms of \(e\text{.}\) Recalling that \(e^x\) and \(\ln(x)\) are inverse functions, we can write \(2=e^{\ln 2}\) and so
\begin{equation*} y=2^x = \left(e^{\ln 2}\right)^x = e^{x(\ln(2))}\text{,} \end{equation*}
using the “power to a power” property of exponents.
The function is now the composition \(y=f(g(x))\text{,}\) with \(f(x) = e^x\) and \(g(x) = x(\ln(2))\text{.}\) Since \(\fp(x) = e^x\) and \(g'(x) = \ln(2)\text{,}\) the The Chain Rule gives
\begin{equation*} y' = e^{x (\ln(2))} \cdot \ln 2\text{.} \end{equation*}
Recall that the \(e^{x(\ln(2))}\) term on the right hand side is just \(2^x\text{,}\) our original function. Thus, the derivative contains the original function itself. We have
\begin{equation*} y' = y \cdot \ln(2) = 2^x\cdot \ln(2)\text{.} \end{equation*}
We can extend this process to use any base \(a\text{,}\) where \(a \gt 0\) and \(a\neq 1\text{.}\) All we need to do is replace each “2” in our work with “\(a\text{.}\)” The Chain Rule, coupled with the derivative rule of \(e^x\text{,}\) allows us to find the derivatives of all exponential functions.
The comment at the end of previous example is important and is restated formally as a theorem.
Figure 1.2.19. Derivatives of exponential and general power functions

Alternate Chain Rule Notation.

It is instructive to understand what the The Chain Rule “looks like” using “\(\lz{y}{x}\)” notation instead of \(y'\) notation. Suppose that \(y=f(u)\) is a function of \(u\text{,}\) where \(u=g(x)\) is a function of \(x\text{,}\) as stated in Theorem 1.2.4. Then, through the composition \(f \circ g\text{,}\) we can think of \(y\) as a function of \(x\text{,}\) as \(y=f(g(x))\text{.}\) Thus the derivative of \(y\) with respect to \(x\) makes sense; we can talk about \(\lz{y}{x}\text{.}\) This leads to an interesting progression of notation:
\begin{align*} y' \amp = \fp(g(x))\cdot g'(x)\\ \lz{y}{x} \amp = y'(u) \cdot u'(x)\amp\amp \text{ since } y=f(u) \text{ and }u=g(x)\\ \lz{y}{x} \amp = \lz{y}{u} \cdot \lz{u}{x}\amp\amp \text{(using “fractional notation” for the derivative)} \end{align*}
Here the “fractional” aspect of the derivative notation stands out. On the right hand side, it seems as though the “\(du\)” terms cancel out, leaving
\begin{equation*} \frac{dy}{dx} = \frac{dy}{dx}\text{.} \end{equation*}
It is important to realize that we are not canceling these terms; the derivative notation of \(\lz{y}{u}\) is one symbol. It is equally important to realize that this notation was chosen precisely because of this behavior. It makes applying the The Chain Rule easy with multiple variables. For instance,
\begin{equation*} \lz{y}{t} = \lz{y}{\bigcirc} \cdot \lz{\bigcirc}{\triangle} \cdot \lz{\triangle}{t}\text{.} \end{equation*}
where \(\bigcirc\) and \(\triangle\) are any variables you’d like to use.
One of the most common ways of “visualizing” the The Chain Rule is to consider a set of gears, as shown in Figure 1.2.20. The gears have \(36\text{,}\) \(18\text{,}\) and \(6\) teeth, respectively. That means for every revolution of the \(x\) gear, the \(u\) gear revolves twice. That is, the rate at which the \(u\) gear makes a revolution is twice as fast as the rate at which the \(x\) gear makes a revolution.
Using the terminology of calculus, the rate of \(u\)-change, with respect to \(x\text{,}\) is \(\lz{u}{x} = 2\text{.}\)
Likewise, every revolution of \(u\) causes \(3\) revolutions of \(y\text{:}\) \(\lz{y}{u} = 3\text{.}\) How does \(y\) change with respect to \(x\text{?}\) For each revolution of \(x\text{,}\) \(y\) revolves \(6\) times; that is,
\begin{equation*} \frac{dy}{dx} = \frac{dy}{du}\cdot \frac{du}{dx} = 2\cdot 3 = 6\text{.} \end{equation*}
We can then extend the The Chain Rule with more variables by adding more gears to the picture.
3 gears of various sizes demonstrating the chain rule.
Three gears, connected in the order \(x,u,y\text{.}\) \(x\) is the largest gear, having 36 teeth. It is rotating counter-clockwise. \(u\) is connected to \(x\text{,}\) and it has 18 teeth. To the left of the connection is \(\frac{du}{dx} = 2\text{.}\) \(y\) is connected to \(u\text{,}\) and it has 6 teeth. Below the connection is \(\frac{dy}{du}=3\text{.}\) To the right of the gears is the expression \(\frac{dy}{dx} = 6\text{.}\)
Figure 1.2.20. A series of gears to demonstrate the Chain Rule. Note how \(\lz{y}{x} = \lz{y}{u}\cdot\lz{u}{x}\)
It is difficult to overstate the importance of the The Chain Rule. So often the functions that we deal with are compositions of two or more functions, requiring us to use this rule to compute derivatives. It is also often used in real life when actual functions are unknown. Through measurement, we can calculate (or, approximate) \(\lz{y}{u}\) and \(\lz{u}{x}\text{.}\) With our knowledge of the The Chain Rule, we can find \(\lz{y}{x}\text{.}\)
You have attempted of activities on this page.