The take-home exam of the course is attached. You do not have to solve all the questions in order to pass the exam, but try to solve as many as possible (it is also worth writing down ideas that you did not manage to turn into a precise proof). Questions about the exam can be asked here, or by email to me or to Sergei. Good luck to everyone!
Monday, February 8, 2010
Lecture 13 (Feb 2, 2010)
Integral: antiderivative and area
If $f$ is a function, can we find another differentiable function $F$ such that the derivative of $F$ is $f$? If yes, then how many such functions exist? If we have a complete record of the car speedometer, can we trace the route? Compute the end point of the travel?
“Uniqueness” of the solution is easy: if there are two solutions $F_1,F_2$ such that $F_1'=F_2'=f$, then the derivative of their difference $G=F_1-F_2$ is identically zero. By the finite difference lemma, for any point $x$ there exists an intermediate point $c$ such that $G(x)-G(a)=G'(c)\,(x-a)=0$, thus $G$ is a constant.
Clearly, if $F$ is a solution, then $F+c$, $c\in\mathbb R$, is also a solution. In particular, we can choose this constant so that $F$ takes any specified value at a given point.
Example. If $f(x)=2x$, then $F(x)=x^2+c$, as one can check by direct differentiation.
Example. If $f$ can be found in the right-hand side of any table of derivatives (possibly with a constant coefficient), then $F$ can be found in the same table. E.g., if $f(x)=n\,x^{n-1}$, then $F(x)=x^n+c$. Denoting $m=n-1$ and dividing both sides by $n=m+1$, we conclude that if $f(x)=x^m$, $m\ne-1$, then $F(x)=\frac{x^{m+1}}{m+1}+c$. The case of $m=-1$ occurs on another line in the table: if $f(x)=\frac1x$, then $F(x)=\ln|x|+c$.
What to do if $F$ cannot be so easily found? A good idea is to try approximating $f$ by some simple functions and see what happens.
Example. Let $n\in\mathbb N$ and assume that the function $f$ takes the same constant value $c_k$ on the (semi-)interval $\left[\frac{k-1}{n},\frac{k}{n}\right)\subseteq[0,1]$, $k=1,\dots,n$. Then one can easily check that the following function,
$$F(x)=\bigl(c_1+\cdots+c_{k-1}\bigr)\cdot\frac1n+c_k\Bigl(x-\frac{k-1}{n}\Bigr)\quad\text{for }x\in\Bigl[\frac{k-1}{n},\frac{k}{n}\Bigr),\qquad(1)$$
satisfies the following conditions:
- $F$ is continuous on the entire segment $[0,1]$,
- $F$ is differentiable everywhere except perhaps the division points $\frac kn$, $k=1,\dots,n-1$,
- at every point of differentiability the derivative of $F$ coincides with $f$,
- the graph of $F$ is a broken line (קו שבור).
Instead of the cumbersome (מורכות) formula (1), one can use the equivalent verbal description:

$F(x)$ = the area under the graph of $f$ between the vertical segments at $0$ and at $x$.  (2)
Indeed, if $h$ is small enough, then both $x$ and $x+h$ belong to the same interval $\left[\frac{k-1}{n},\frac{k}{n}\right)$ and hence $F(x+h)-F(x)=c_k\,h$, thus the derivative $F'(x)=c_k=f(x)$.
Clearly, an arbitrary interval $[a,b]$ can be subdivided into $n$ equal subintervals: the formula (1) will change, yet its meaning (2) will obviously remain the same.
This observation suggests that the formula (2) gives the answer also in the general case, provided that the expression “the area under the graph” makes sense. Clearly, this is the case when $f$ is piecewise constant (a “step function”, פונקציית מדרגה, as above) and even when $f$ is piecewise linear (קו שבור). Note that we allow only step functions having finitely many intervals of continuity, to avoid summing infinitely many areas of rectangles!
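To make formula (2) concrete, here is a small numerical sketch (added to these notes as an illustration, not part of the lecture): for a step function on $[0,1]$ with $n$ equal subintervals, the area function is obtained by summing the complete rectangles to the left of $x$ plus the partial rectangle containing $x$, exactly as in formula (1). The helper name `area_under_step` and the sample values in `c` are just choices for this sketch.

```python
# Illustrative sketch: the area function F(x) of formula (2) for a step function
# taking the constant value c[k] on [k/n, (k+1)/n), k = 0, ..., n-1.
def area_under_step(c, x):
    n = len(c)
    width = 1.0 / n
    k = min(int(x * n), n - 1)           # index of the subinterval containing x
    full = sum(c[:k]) * width            # complete rectangles to the left of x
    partial = c[k] * (x - k * width)     # partial rectangle containing x
    return full + partial                # this is exactly formula (1)

c = [2.0, -1.0, 3.0, 0.5]                # arbitrary sample values on 4 subintervals
for x in [0.1, 0.25, 0.5, 0.9, 1.0]:
    print(x, area_under_step(c, x))      # F is piecewise linear in x
```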
To define the area in the general case, we appeal to the following intuitively clear observation: if $A\subseteq B$ are two nested subsets of the plane, then their areas, if they are well defined, should satisfy the inequality $S(A)\le S(B)$. For polygons this is an elementary theorem.
Let $f$ be a bounded function on a finite interval $[a,b]$. For any step function $g$ which is everywhere less than or equal to $f$, $g(x)\le f(x)$ for all $x\in[a,b]$, the area $s$ under the graph of $g$ is called a lower sum for $f$ on $[a,b]$. In a similar way, an upper sum for $f$ is the area $S$ under the graph of any step function $h$ such that $h\ge f$ on $[a,b]$. Clearly, for any two such functions the inequality $g\le h$ holds, thus any lower sum is less than or equal to any upper sum.
Definition. A function $f$ is called integrable on the segment $[a,b]$, if there exists a single number $I$ which separates the lower and the upper sums, so that $s\le I\le S$ for any pair of lower/upper sums $s,S$. This number is called the integral (האינטגרל המסוים) of the function $f$ on the interval $[a,b]$ and is denoted by $\int_a^b f(x)\,dx$. ♦
An equivalent definition of integrability requires that for any positive $\varepsilon$ there exist an upper and a lower sum which differ by no more than $\varepsilon$: $S-s\le\varepsilon$.
Exercise. Prove that the two definitions are indeed equivalent.
Exercise. Prove that a linear function $f(x)=cx+d$ is integrable (in the above sense) on any interval and its integral is equal to the (geometrically calculated) area of the corresponding trapezoid (טרפז).
Proposition. Any monotone (bounded) function is integrable.
Proof. Consider the partition of $[a,b]$ into $n$ equal parts by the points $x_k=a+\frac{k}{n}(b-a)$, $k=0,1,\dots,n$, and denote by $y_k=f(x_k)$ the values of $f$ at these points. Then (say, for a non-decreasing $f$) the two step functions, equal to $y_{k-1}$ and to $y_k$ respectively on each interval $[x_{k-1},x_k)$, squeeze $f$ between them. The lower sum is $s=\frac{b-a}{n}(y_0+\cdots+y_{n-1})$ and the upper sum is $S=\frac{b-a}{n}(y_1+\cdots+y_n)$. Their difference is less than or equal to $\frac{(b-a)\,|f(b)-f(a)|}{n}$ and it becomes less than any positive number $\varepsilon$ when $n$ is large enough. ♦
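A quick numerical illustration of this proof (a sketch added to the notes; the increasing function $f(x)=x^2$ on $[0,1]$ is an arbitrary choice): the lower and upper sums for the equal partition differ by $(b-a)\,(f(b)-f(a))/n$, and both converge to the integral.

```python
# Illustration: lower and upper sums of an increasing function on [a, b]
# for the partition into n equal parts.
def lower_upper_sums(f, a, b, n):
    width = (b - a) / n
    xs = [a + k * width for k in range(n + 1)]
    lower = sum(f(xs[k]) * width for k in range(n))       # values at left endpoints
    upper = sum(f(xs[k + 1]) * width for k in range(n))   # values at right endpoints
    return lower, upper

f = lambda x: x * x
for n in [10, 100, 1000]:
    s, S = lower_upper_sums(f, 0.0, 1.0, n)
    print(n, s, S, S - s)   # S - s = (b - a)(f(b) - f(a))/n = 1/n; both tend to 1/3
```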
Theorem. Any continuous function (on the closed interval) is integrable.
Proof. The idea is to construct two step functions with a common set of jump points such that on each interval between consecutive jump points these functions squeeze $f$ between themselves and differ by less than a given $\varepsilon>0$. The jump points should be close enough to each other so that the continuity of $f$ implies the required proximity of the corresponding values.
More precisely, for any point $x\in[a,b]$ there exists a small interval $(x-\delta,x+\delta)$ containing $x$ such that on this interval the function is squeezed between $f(x)-\varepsilon$ and $f(x)+\varepsilon$. The union of all these small intervals covers $[a,b]$, which is compact. Hence there exists a finite covering of $[a,b]$ by these intervals. The endpoints of these intervals subdivide $[a,b]$ into finitely many intervals in such a way that on each interval the function $f$ is squeezed between two constants $m_k\le M_k$ such that $M_k-m_k\le2\varepsilon$. The corresponding upper and lower sums differ by no more than $2\varepsilon(b-a)$. This difference can be as small as required if $\varepsilon$ is chosen small enough. ♦
Clearly, continuity is a sufficient but not necessary assumption for integrability: step functions are integrable by definition, though they are discontinuous. An accurate description of all integrable functions goes beyond the scope of these notes, yet there are certainly many non-integrable functions.
Example. The Dirichlet function (equal to $1$ at rational points and $0$ at irrational points) is non-integrable on $[0,1]$. Indeed, any upper sum must be at least 1, and any lower sum at most 0, hence the gap between these values cannot be bridged (complete all the details of this proof!).
Problems.
- Prove that any function on $[a,b]$ which differs from the identical zero at only finitely many points is integrable and its integral is zero, no matter what the values at these points are.
- Prove that a function that differs from the identical zero at countably many points $x_1,x_2,x_3,\dots$ is integrable and its integral is zero if $\lim_{n\to\infty}f(x_n)=0$. Can one drop the limit assumption?
- Prove that the Riemann function, equal to $\frac1q$ at the rational points $x=\frac pq$ (assuming $p,q$ mutually prime and $q>0$) and zero at the irrational points, is integrable on $[0,1]$ and its integral is zero.
Problem. Is the function $f(x)=\sin\frac1x$ integrable on $[0,1]$? (Do not try to compute the integral :-))

(Figure: the graph of $\sin(1/x)$ for $x$ near the origin.)
The Newton-Leibniz formula
Quite obviously, if $f$ is integrable on $[a,b]$, then it is also integrable on any sub-interval $[c,d]$, $a\le c\le d\le b$. The formula (2) relating the area (definite integral) to the antiderivative $F$ of an integrable function $f$ then takes the form
$$\int_a^b f(x)\,dx=F(b)-F(a).$$
This allows one to express the area under the graph of a function in terms of its antiderivative (if the latter exists and is known).
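The following numerical check (an added sketch; the function $f(x)=\cos x$ and its antiderivative $F(x)=\sin x$ are chosen only for illustration) confirms that a Riemann-type sum approximates $F(b)-F(a)$:

```python
import math

# Illustration of the Newton-Leibniz formula for f(x) = cos(x), F(x) = sin(x).
def riemann_sum(f, a, b, n):
    width = (b - a) / n
    # midpoint values of f times the widths of the subintervals
    return sum(f(a + (k + 0.5) * width) * width for k in range(n))

a, b = 0.0, math.pi / 2
print(riemann_sum(math.cos, a, b, 10000))   # ~ 1.0
print(math.sin(b) - math.sin(a))            # F(b) - F(a) = 1.0
```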
However, integrability is a weaker condition than the existence of an antiderivative: if $f$ is a step function with the partition points $x_1<\cdots<x_{n-1}$, then the corresponding area function $F$ is continuous, but differentiable only outside these points.
Theorem. If $f$ is continuous at a point $x$, then the area function $F(x)=\int_a^x f(t)\,dt$ is differentiable at $x$ and $F'(x)=f(x)$. ♦
Change of the independent variable in the integrals
If $f$ is a function defined on the interval $[a,b]$, and $t$ is a new variable which is obtained from $x$ by a monotone differentiable transformation $x=\varphi(t)$, then this transformation maps bijectively (in a 1-1 way) an interval $[\alpha,\beta]$ onto the interval $[a,b]$. The function $f(x)$ becomes after such a change a new function $g(t)$ of the new variable: $g(t)=f(x)$ if and only if $x=\varphi(t)$, i.e., the two functions take equal values at the two points “connected” by the transformation $\varphi$.
The “formal” relationship between these functions is easier written “in the opposite direction”, expressing $g$ via $f$:
$$g(t)=f(\varphi(t)).$$
The graphs of the functions of $x$ and of $t$ are obtained from each other by a “non-uniform stretch along the horizontal axis” which keeps the vertical direction. In particular, any step function with the partition points $x_1<\cdots<x_{n-1}$ will be transformed into the step function with the partition points $t_k$, $\varphi(t_k)=x_k$, with the same values. Moreover, if $f$ is squeezed between two step functions, $g_-\le f\le g_+$, then its transform is squeezed between the transforms of these two step functions.
Thus it is sufficient to study how the change of variables affects areas under step functions, which are equal to finite sums $\sum_k c_k\,(x_k-x_{k-1})$, and their transforms. The heights $c_k$ are unchanged, and the widths $x_k-x_{k-1}$, by the finite difference lemma, are equal to the widths $t_k-t_{k-1}$ multiplied by the derivative $\varphi'$ computed at some intermediate points $\tau_k\in(t_{k-1},t_k)$. The result is as if instead of the step function $g(t)$ we would integrate the function $g(t)\,\varphi'(t)$. Passing to the limit, we conclude that
$$\int_a^b f(x)\,dx=\int_\alpha^\beta f(\varphi(t))\,\varphi'(t)\,dt.$$
Of course, this is equivalent to the chain rule of differentiation for the primitive functions: if $F'=f$ and $G(t)=F(\varphi(t))$, then $G'(t)=f(\varphi(t))\,\varphi'(t)$ and $G$ is a primitive of the transformed integrand.
The formula for the change of variables in integrals can be easily memorized using the existing notation: one has to transform not just the integrand $f(x)$ by substituting $x=\varphi(t)$, but also the differential $dx$ should be transformed using the formula $dx=\varphi'(t)\,dt$.
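As an added sanity check (not from the original lecture), one can compare both sides of the change-of-variables formula numerically, here for $f(x)=\cos x$ and the monotone transformation $x=\varphi(t)=t^2$ mapping $[0,1]$ onto $[0,1]$:

```python
import math

# Both integrals should equal sin(1): the left side in the variable x,
# the right side in the variable t with dx replaced by phi'(t) dt = 2t dt.
def riemann_sum(f, a, b, n=100000):
    width = (b - a) / n
    return sum(f(a + (k + 0.5) * width) * width for k in range(n))

lhs = riemann_sum(math.cos, 0.0, 1.0)                            # integral of f(x) dx
rhs = riemann_sum(lambda t: math.cos(t ** 2) * 2 * t, 0.0, 1.0)  # integral of f(phi(t)) phi'(t) dt
print(lhs, rhs, math.sin(1.0))                                   # all three ~ 0.84147
```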
Monday, February 1, 2010
Lecture 12 (Jan 26, 2010)
Higher derivatives
In this lecture we return to functions of one variable, defined on an open or closed interval on the real axis.
Definition. A function $f$ is called twice (two times) differentiable, if it is differentiable at all points of the segment $[a,b]$, and the derivative $f'$, considered as a scalar (numeric) function on this segment, is also differentiable.
Iterating this construction, we say that a function is $n$ times differentiable, if it is differentiable and its derivative is $n-1$ times differentiable. This is an inductive definition.
Variations. Sometimes it is convenient to say that a function is $0$ times differentiable, if it is continuous on the segment. If this agreement is used as the base of the induction, then it defines slightly more restricted classes of $n$ times differentiable functions, usually denoted by $C^n$.
Exercise.
- Give a non-polynomial example of a function which is infinitely many times differentiable.
- Give an example of a function that has exactly 7 derivatives, but not 8, on the segment $[a,b]$.
As was already established, the existence of the first derivative allows one to construct a linear approximation for a given function. If this approximation is not degenerate (i.e., the derivative is non-vanishing), it allows one to study the extremal properties of the function, in particular, to
- Guarantee a certain type of extremum at the endpoints $a,b$ of the segment;
- Guarantee the absence of an extremum at the interior points of the interval $(a,b)$.
It turns out that higher order derivatives allow one to construct approximations of functions by polynomials of higher degree, and this approximation sometimes guarantees the presence/type or absence of an extremum. In the cases where it does not, one needs more of the same.
Theorem. If a function $f$ is $n$ times differentiable on a segment $[a,b]$ containing the origin $x=0$, then there exists a unique polynomial $p(x)=c_0+c_1x+\cdots+c_nx^n$ of degree $\le n$ which gives an approximation of $f$ at the origin with the accuracy $o(x^n)$,
$$f(x)=p(x)+o(x^n)\qquad\text{as } x\to0.$$
The coefficients of this polynomial are proportional to the respective higher derivatives of $f$ at the origin,
$$c_k=\frac{f^{(k)}(0)}{k!},\qquad k=0,1,\dots,n.$$
Remark. There is nothing mysterious in these formulas: if $f$ is a polynomial itself, then the higher order derivatives can be easily computed for each monomial separately:
$$\left.\frac{d^k}{dx^k}\,x^m\right|_{x=0}=\begin{cases}k!,& m=k,\\[2pt] 0,& m\ne k.\end{cases}$$
This proves the formulas for the coefficients via the derivatives of the polynomial.
For the proof we need the following lemma, which is very much like the intermediate value theorem.
Lemma (on finite differences). For a function $f$ differentiable on the interval $[a,b]$, the normalized finite difference $\frac{f(b)-f(a)}{b-a}$ coincides with the derivative $f'(c)$ at some intermediate point $c\in(a,b)$.
Proof of the Lemma. Consider the auxiliary function $g(x)=f(x)-\frac{f(b)-f(a)}{b-a}\,(x-a)$. Then $g(a)=g(b)$ and the same finite difference for this function is equal to zero. Since the function takes equal values at the endpoints, either its maximum or its minimum is achieved at some point $c$ inside the interval $(a,b)$. By the Fermat rule, $g'(c)=0$. By construction, $f'(c)=\frac{f(b)-f(a)}{b-a}$. ♦
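A small numerical illustration of the lemma (added as a sketch; $f(x)=x^3$ on $[0,2]$ is an arbitrary choice for which $f'$ happens to be increasing, so the intermediate point can be located by bisection):

```python
# Locating an intermediate point c with f'(c) = (f(b) - f(a)) / (b - a)
# for f(x) = x**3 on [a, b] = [0, 2], where f'(x) = 3*x**2.
f = lambda x: x ** 3
fprime = lambda x: 3 * x ** 2

a, b = 0.0, 2.0
slope = (f(b) - f(a)) / (b - a)          # the normalized finite difference, here 4

lo, hi = a, b                            # f' - slope changes sign on (a, b)
for _ in range(60):
    mid = (lo + hi) / 2
    if fprime(mid) < slope:
        lo = mid
    else:
        hi = mid
c = (lo + hi) / 2
print(c, fprime(c), slope)               # c ~ 1.1547 and f'(c) = slope = 4
```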
Proof of the Theorem. The formulas for the coefficients imply that the difference $g(x)=f(x)-p(x)$ is a function which is $n$ times differentiable and all its derivatives up to order $n$ vanish at the origin. We have to prove that $g(x)=o(x^n)$.
For $n=1$ this is obvious: by the definition of differentiability, for a function with zero derivative at the origin we have $g(x)=g(0)+g'(0)\,x+o(x)=o(x)$, since $g(0)=g'(0)=0$. Reasoning by induction, consider the function $g$ and its derivative $g'$. By the inductive assumption, $|g'(x)|\le\varepsilon|x|^{n-1}$ for an arbitrary $\varepsilon>0$ on a sufficiently small interval around the origin.
By the Lemma, $g(x)=g(x)-g(0)=g'(c)\,x$ for some intermediate point $c$ between $0$ and $x$. Using the inequality $|g'(c)|\le\varepsilon|c|^{n-1}\le\varepsilon|x|^{n-1}$, we conclude that $|g(x)|\le\varepsilon|x|^n$. Since $\varepsilon$ can be arbitrarily small, we conclude that the limit of $|g(x)|/|x|^n$ as $x\to0$ is smaller than any positive number, i.e., must be zero. ♦
Definition. The polynomial $p$ above is called the Taylor polynomial of order $n$ for the function $f$ at the origin. One can easily modify this to obtain the definition of the Taylor polynomial at an arbitrary point $a$.
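To see the accuracy claim $f(x)=p(x)+o(x^n)$ numerically, here is an added sketch for $f(x)=\cos x$ and its Taylor polynomial of degree $4$ at the origin, $p(x)=1-\frac{x^2}{2}+\frac{x^4}{24}$: the error divided by $x^4$ tends to zero.

```python
import math

# Taylor polynomial of degree 4 of cos(x) at the origin.
def p(x):
    return 1 - x ** 2 / 2 + x ** 4 / 24

for x in [0.5, 0.1, 0.01]:
    err = abs(math.cos(x) - p(x))
    print(x, err, err / x ** 4)   # the ratio err / x**4 tends to 0 as x -> 0
```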
Application to the investigation of functions
Any $n$ times differentiable function can be written (near the origin, but this is not a restriction!) as its Taylor polynomial of degree $n$ plus an error term which is fast decreasing (faster than the highest degree term of the polynomial). Hence this term cannot affect the extremal properties of the polynomial. In particular, if $n=2$ and the Taylor polynomial has a strict minimum (maximum) at the origin, so does the function itself. For quadratic polynomials the horns of the parabola go up (resp., down) if the principal coefficient is positive (resp., negative). This immediately proves the following result.
Second-order sufficient condition. If $f$ is a twice differentiable function and $a$ is a critical point, $f'(a)=0$, then the following holds:
- If $f''(a)>0$, then $a$ is a local minimum,
- If $f''(a)<0$, then $a$ is a local maximum,
- If $f''(a)=0$, everything depends on the Taylor polynomial of degree 3 (see the sketch below).
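A minimal numerical sketch of this test (added for illustration; the function $f(x)=x^2-x^4$ and the finite-difference approximations of $f'$ and $f''$ are not part of the lecture):

```python
# Second-order test at the critical point a = 0 of f(x) = x**2 - x**4,
# with f' and f'' approximated by central differences.
def fprime(f, x, h=1e-5):
    return (f(x + h) - f(x - h)) / (2 * h)

def fsecond(f, x, h=1e-4):
    return (f(x + h) - 2 * f(x) + f(x - h)) / h ** 2

f = lambda x: x ** 2 - x ** 4
a = 0.0
print(fprime(f, a))    # ~ 0: a is a critical point
print(fsecond(f, a))   # ~ 2 > 0: a is a local minimum
```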
Problem. Find conditions for a degree 3 polynomial to have a local maximum/minimum on (a) an interval $(a,b)$, (b) a semi-interval $[a,b)$. Formulate the third order necessary/sufficient conditions for an extremum at an interior point (resp., left/right endpoint) of the domain of a non-polynomial function $f$.
Lecture 11 (Jan 19, 2010)
Functions of two variables and their differentiation. Application to the study of planar curves.
The idea of linear approximation (and differentiability) can be easily adapted for functions of more than one variable. Apart from the usual (scalar) functions, we will consider maps $t\mapsto(x(t),y(t))$ (parametrized curves) and functions $f(x,y)$ of two variables.
A curve can be considered in kinematic (mechanical) terms as the description of a travel in the plane with the coordinates $(x,y)$, parametrized by the time $t$. The notions of limit, continuity and differentiability are defined as the corresponding properties of the coordinate functions $x(t)$ and $y(t)$.
Examples.
- The graph of any continuous (differentiable) function $f$ of one variable is a continuous (resp., differentiable) curve $t\mapsto(t,f(t))$.
- The circle $x^2+y^2=1$ is the image of the curve $t\mapsto(\cos t,\sin t)$. Note that this image is “covered” infinitely many times: the preimage of any point on the circle is an entire arithmetical progression with the difference $2\pi$.
- The cusp (חוֹד) $y^2=x^3$ is the image of the differentiable curve $t\mapsto(t^2,t^3)$. Yet the cusp itself is not “smooth-looking”:
The “linear approximation” at a point $t_0$ takes the unit “positive” vector of the $t$-axis, attached to the point $t_0$, to the vector, attached to the plane at the point $(x(t_0),y(t_0))$, with the coordinates $(\dot x(t_0),\dot y(t_0))$. This vector is called the velocity vector, or the tangent vector to the curve.
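A short added sketch comparing the velocity vectors of the two curves above (the circle and the cusp), computed by central difference quotients: the circle has a nonzero velocity at every point, while the cusp parametrization has zero velocity exactly at the cusp point $t=0$, which hints at the smoothness discussion below.

```python
import math

# Velocity vector (dx/dt, dy/dt) of a parametrized curve, via central differences.
def velocity(x, y, t, h=1e-6):
    return ((x(t + h) - x(t - h)) / (2 * h), (y(t + h) - y(t - h)) / (2 * h))

# Circle (cos t, sin t): the velocity (-sin t, cos t) never vanishes.
print(velocity(math.cos, math.sin, math.pi / 4))

# Cusp (t**2, t**3): the velocity (2t, 3t**2) vanishes at t = 0.
print(velocity(lambda t: t ** 2, lambda t: t ** 3, 0.0))
```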
When is a differentiable parametrized curve smooth? And what is smoothness?
Definition. A planar curve is smooth at a point $(x_0,y_0)$, if inside some sufficiently small square centered at this point the curve can be represented as the graph of a function $y=g(x)$ or $x=h(y)$, differentiable at the point $x_0$ (resp., $y_0$).
Proposition. A differentiable (parametrized) curve with nonzero velocity vector is smooth at the corresponding point.
Proof. Assume that $\dot x(t_0)\ne0$; then the function $t\mapsto x(t)$ has nonzero derivative and hence is locally invertible, so that $t$ can be expressed as a differentiable function of $x$, $t=\tau(x)$. Then the curve is the graph of the function $y=y(\tau(x))$. ♦
Problem. Give an example of a differentiable (parametrized) curve with zero velocity at some point, which is nonetheless smooth.
Differentiability of functions of two variables
While the definitions of the limit and continuity for a function of two variables are rather straightforward (open intervals in the one-variable definition should be replaced by open squares, with the rest staying the same), the notion of differentiability cannot be so easily reduced to the one-dimensional case.
We want the function $f$ of two variables to be approximable by a linear map near the point $(x_0,y_0)$. This means that there exist two constants $a,b$ (coordinates of the approximating map) such that the difference is fast going to zero,
$$f(x_0+h,y_0+k)-f(x_0,y_0)=ah+bk+o(|h|+|k|).$$
Denote by $dx$ (resp., $dy$) the linear function which takes the value $h$ (resp., $k$) on the vector with the coordinates $(h,k)$. Then the linear map $df$, approximating the function $f$, can be written as $df=a\,dx+b\,dy$. To compute the coefficients $a,b$, consider the functions $x\mapsto f(x,y_0)$ and $y\mapsto f(x_0,y)$ of one argument each. Then $a=\frac{\partial f}{\partial x}(x_0,y_0)$ and $b=\frac{\partial f}{\partial y}(x_0,y_0)$ are the partial derivatives of $f$ with respect to the variables $x,y$ at the point $(x_0,y_0)$.
Definition. For a function $f$ of two variables the partial derivative with respect to the variable $x$ at a point $(x_0,y_0)$ is the limit (if it exists)
$$\frac{\partial f}{\partial x}(x_0,y_0)=\lim_{h\to0}\frac{f(x_0+h,y_0)-f(x_0,y_0)}{h}.$$
Example. Consider the function $f(x,y)=x^2+y^2$. Its differential is equal to $df=2x\,dx+2y\,dy$, and at a point $(x_0,y_0)$ on the unit circle it vanishes on the vector with the coordinates $(-y_0,x_0)$, tangent to the level curve $\{x^2+y^2=1\}$ (the circle) passing through the point.
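The partial derivatives in this example can be checked directly from the difference quotients of the definition; the following small sketch (added for illustration) does this at the point $(0.6,0.8)$ on the unit circle:

```python
# Numerical partial derivatives of f(x, y) = x**2 + y**2 from the definition.
def partial_x(f, x0, y0, h=1e-6):
    return (f(x0 + h, y0) - f(x0, y0)) / h

def partial_y(f, x0, y0, h=1e-6):
    return (f(x0, y0 + h) - f(x0, y0)) / h

f = lambda x, y: x ** 2 + y ** 2
x0, y0 = 0.6, 0.8                      # a point on the unit circle
print(partial_x(f, x0, y0), 2 * x0)   # both ~ 1.2
print(partial_y(f, x0, y0), 2 * y0)   # both ~ 1.6
```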
Functions of two variables are nicely represented by their level curves (קווי גובה). Typically these look like smooth curves, but occasionally one can observe “singularities”.
Examples. Draw the level curves of the functions $f(x,y)=xy$ and $g(x,y)=x^2-y^2$. In the second case you may help yourself by changing the variables from $(x,y)$ to $(u,v)$ as follows, $u=x+y$, $v=x-y$, and look at the picture below.
Definition. A point inside the domain of a differentiable function $f$ is called critical, if the linear approximation (the differential) at this point is identically zero, i.e., if both partial derivatives vanish. Otherwise the point is called regular.
If we replace a nonlinear function $f$ by its affine approximation at a regular point $(x_0,y_0)$,
$$f(x,y)\approx f(x_0,y_0)+a\,(x-x_0)+b\,(y-y_0),$$
then the level curves of this approximation form a family of parallel lines. It turns out that for regular points these lines give a linear approximation for the (nonlinear) level curves of the initial nonlinear function. In other words, the regularity condition (a form of nondegeneracy) is the condition which guarantees that the linear approximation works nicely also when studying the level curves.
Theorem. If a point $(x_0,y_0)$ is regular for a differentiable function $f$, then there exists a small square around $(x_0,y_0)$ such that the piece of the level curve of $f$ passing through this point is a smooth curve tangent to the level line of the affine approximation.

(Figure: the zero level curve of a nonlinear function and the zero level line of its affine approximation.)
This claim is often called the Implicit Function Theorem (משפט הפונקציות הסתומות); however, the traditional formulation of the Implicit Function Theorem looks quite different!
Theorem. If at a point $(x_0,y_0)$ the partial derivative $\frac{\partial f}{\partial y}$ of a differentiable function $f$ is nonzero, then the equation $f(x,y)=0$, $f(x_0,y_0)=0$, can be locally resolved with respect to $y$: there exists a differentiable function $y=g(x)$, defined on a sufficiently small interval around $x_0$, such that $g(x_0)=y_0$ and $f(x,g(x))\equiv0$. The derivative $g'(x)$ of this function is given by the ratio,
$$g'(x)=-\,\frac{\partial f/\partial x}{\partial f/\partial y}\,\bigg|_{y=g(x)}.$$
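The ratio formula can be verified numerically (an added sketch, using $f(x,y)=x^2+y^2-1$, whose zero level set near the point $(0.6,0.8)$ is the graph of $g(x)=\sqrt{1-x^2}$):

```python
import math

# Implicit function theorem check for f(x, y) = x**2 + y**2 - 1 near (0.6, 0.8).
x0 = 0.6
y0 = math.sqrt(1 - x0 ** 2)                    # = 0.8, so f(x0, y0) = 0

g = lambda x: math.sqrt(1 - x ** 2)            # the locally resolved function y = g(x)
h = 1e-6
g_prime = (g(x0 + h) - g(x0 - h)) / (2 * h)    # direct numerical derivative of g

fx, fy = 2 * x0, 2 * y0                        # partial derivatives of f at (x0, y0)
print(g_prime, -fx / fy)                       # both ~ -0.75
```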
Exercise. Prove that the two formulations of the Implicit Function Theorem are in fact equivalent.