Integration and Mean Value Theorems

The problem posed in the basic study of integration is to find for the graph of a particular function f(x), the area under the curve of f between two points. As seen here:

The process is to take a set of intervals (x_{i} - x_{i-1}) = \Delta x along a segment of the x-axis, and treat each such interval as the base of a rectangle, with the value for f(x_{i}) as the height of our rectangle, then summing each rectangle allows us to calculate the approximate area under the curve. An increase in exactitude can be achieved as we reduce the base of each rectangle so that “intuitively” no rectangle exceeds or “dips” under the height of the curve .

As can be seen here the rectangles can extend above and below the x-axis, but the result is the same the smaller we set the base of our rectangles, the more accurately we approximate the space between the axis and the curve. So at first glance this technique seems merely about calculating an area A. Formally we denote this as follows:

Let a, b be two points on the x-axis and f, then we divide the segment as follows:

a = x_{0} < x_{1} < x_{2} < x_{3} .... x_{n-1} < x_{n} = b

and we set x_{i} - x_{i-1} = \Delta x as before. Now to calculate the area of those rectangles we need to define the summation of the products:

A = \sum\limits_{i}^{n} f(x_{i}) \Delta x

But this will be inexact unless we shrink the base of these rectangles i.e. \Delta x as described above. To achieve this last specification we simply take the limit of A as \Delta x approaches zero. This process is called integration. Formally:

\int\limits_{a}^{b}f(x_{i}) \Delta x = \lim_{\Delta \to 0} \sum\limits_{i}^{n} f(x_{i}) \Delta x

More specifically this is a definite integral since we have specified the bound a, b but we need not. If instead we treat the bounds as variable then we still have a function known as the indefinite integral.

The Summation/Difference Property:

We want to show that:

\int\limits_{a}^{b} [f(x) + g(x)] \Delta x = \int\limits_{a}^{b} f(x) \Delta x + \int\limits_{a}^{b} g(x) \Delta x

 but this is almost self evident as we can see:

\lim_{\Delta \to 0} [ \sum\limits_{i}^{n} f(x_{i}) \Delta x] + \lim_{\Delta \to 0} [ \sum\limits_{i}^{n} g(x_{i}) \Delta x ]

is by the distributive properties of limits equivalent to:

\lim_{\Delta \to 0} [ \sum\limits_{i}^{n} f(x_{i}) \Delta x] + [ \sum\limits_{i}^{n} g(x_{i}) \Delta x]

but by distribution and rearranging we get:

\lim_{\Delta \to 0} \sum\limits_{i}^{n} [f(x_{i}) + g(x_{i})] \Delta x

which is exactly:

\int\limits_{a}^{b} [f(x) + g(x)] \Delta x

as desired. An exactly analogous proof applies to show that \int distributes over the difference of two functions.

The Mean Value Theorem(s):

First note that for any continuous function there is a maximum and minimum point on the interval i.e. there exists a point at the lowest or highest depth of the curve. Now suppose f is defined in the closed interval [a. b] and is differentiable in the open interval (a, b), we will show that if f(a) = f(b) = k, then there is a point c \in (a, b) such that f'(c) = 0. This result is known as Rolle’s Theorem.

 

There are three cases (i) where f(x) = k is defined as a constant function with value k and the limit of a constant function collapses to 0 at all points, so this would prove the theorem.

If on the other hand (i) m< k or (ii) k > M then there is slightly more work to be done. There are two sub-cases but both are analogous so we prove (ii): Suppose k > M, then pick a point c such that f(c) = M. This satisfies the hypothesis so it remains to show that f'(c) = 0. We do this by reductio, but first we prove a small lemma.

Lemma: If f'(p) is positive (negative) then f is increasing(decreasing) at a neighborhood of p.

Assume positive (the negative case is analogous), then we know by work here that the the limit of the quotient minus the derivative is equivalent to 0, so rewriting in terms of \epsilon, \delta can take any \epsilon > 0 and see that:

-\epsilon < | \dfrac{f(p+ \Delta x) - f(p)}{\Delta x} - f'(p) | < \epsilon

When |\Delta x| < \delta. Let f'(p) = \epsilon, then add f'(p) to all sides to get:

0 < \dfrac{f(p + \Delta x) - f(p)}{\Delta x} < 2f'(p)

 which ensures that the difference quotient is positive, and both numerator and denominator must have the same sign. It falls out that: f(p - \Delta x) < f(p) < f(p + \Delta x). Hence f is an increasing function in the neighborhood around p. This completes the proof.

Rolle’s Theorem

To complete Rolle’s theorem we need only consider the case that f'(c) \neq 0, either f'(c) is positive or negative, and then by the above lemma, f is either decreasing or increasing in the continuous p-neighborhood. In either case M would not be the maximum or extreme point on the interval. This is a contradiction. Hence f'(c) = 0 as desired. An analogous strategy works for the case (i). Geometrically this theorem states that the slope of the curve vanishes at some point in the interval between f(a) and f(b), which means the slope becomes briefly horizontal as can be seen in the picture above.

The Mean Value Theorem (Differentiation)

Let f be continuous in the closed interval [a, b ] and differentiable, with the derivative f' in the open interval (a, b). Then there is a point c in (a, b) such that:

f'(c) = \dfrac{f(b) - f(a)}{b -a } = f(b) - f(a) = f'(c)(b - a)

This can be most intuitively thought of as saying that if f tracks movement of an object across a plane between two points a and b, then there is a point on it’s trajectory of the function’s equivalent to the average rate of ascent. If you travel between A and B 100 miles apart and arrive in an hour, then at some point on your journey you were traveling at one hundred miles an hour. Here the claim is that the instantaneous rate of change at point c is the same as the average rate of change between the two points a and b. Geometrically it means that the slope of the curve at some point must match the slope of the secant line.

The Proof: 

Define a function g(x) = f(x) - kx, where k is a constant so that g(a) = g(b). Hence,  g(a) = f(a) - ka = f(b) - kb = g(b). We can solve for k. Take

f(a) - ka = f(b) - kb

adding kb to each side we get:

f(a) - ka + kb = f(b)

subtracting -f(a) becomes:

-ka + kb = f(b) - f(a)

from which it follows:

k(-a + b) = f(b) - f(a)

which is equivalent to:

k(b-a) = f(b) - f(a)

then dividing by (b - a) and canceling we get:

k = \dfrac{f(b) - f(a)}{b - a}

ensuring that k is equivalent to the slope of the secant line. We must show that there is a the tangent line on the curve of f equivalent to k, if we are to prove the mean value theorem. That is we must find a point c \in (a b) such that f'(c) = k

But then note that since g satisfies the hypothesis of Rolle’s theorem, so we have a point g'(c) = 0 Differentiating we see that

g'(c) = [f'(c) - (k c')] = 0

but the derivative c is 1 so this collapses into

g'(c) = [f'(c) - k(1)] = 0

from which we can infer that

f'(c) = k = \dfrac{f(b) -f(a)}{b -a}

so we have shown that an appropriate point c \in (a, b) exists as desired.

The Mean Value Theorem: Integration

We want to prove that if f is continuous in the closed interval [a, b], then there is a point c \in [a, b] such that

\int_{a}^{b} f(x) \Delta x = f(c)(b - a)

We begin the proof by letting m and M be defined as the minimum and maximum points on the curve as in the last proof. We are looking for the average value of our integrand function defined on the the interval [a, b ]. First let:

\tau = \sum_{i = 1}^{n} f(x_{i}) \Delta x

then since

m \Delta x \leq f(x_{i}) \Delta x \leq M \Delta x

 it follows:

\sum\limits_{i}^{n} m \Delta x \leq \tau \leq \sum\limits_{i}^{n} M \Delta x

which by definition and distribution is just:

m(b -a) \leq \tau \leq M(b - a)

then taking the limit as \Delta x approaches zero adds nothing, so:

m(b - a) \leq \int_{a}^{b} f(x) \Delta x \leq M(b-a)

then dividing by (b-a) we get:

m \leq \dfrac{\int_{a}^{b} f(x) \Delta x}{b-a} \leq M

which is equivalent to:

m \leq \dfrac{1}{b-a} \int_{a}^{b} f(x) \Delta x \leq M

Let’s call this point f(c) \in [m, M]. Then by the intermediate value theorem (proof easy and omitted), there is a point c \in [a, b]. Geometrically the mean value theorem for integrals can be understood as stating that for any area under the curve of the line there is a rectangle defined at the average height f(c) with respect to [a, b] such that the area of it is equivalent to the area under the total curve defined by f. In a picture:

The area in excess of the height f(c) is “compensated” for by inclusion of the novel sections defined by our rectangle. To see this precisely: note that [ \dfrac{1}{b-c} \int_{a}^{b} f(c) \Delta c]\dfrac{b-c}{1} cancels to give us \int_{a}^{b} f(c) \Delta c as desired. This concludes the section; in the next post we will prove the first fundamental theorem of calculus.

One thought on “Integration and Mean Value Theorems

  1. Pingback: Some Facts about Famous Functions | Aspiring PI

Leave a comment