From: kovarik@mcmail.cis.McMaster.CA (Zdislav V. Kovarik)
Subject: Re: ordinary and partial diff eq. and the meaning of dt.
Date: 5 Aug 1999 04:06:53 -0400
Newsgroups: sci.math

In article <37A76BCA.13A80BBE@i.am>, wis  <wis.poutou@i.am> wrote:
>First, excuse my poor english and my poor ascii art.
>Then remove .poutou in my email to reply.
>
 This may be of interest to others, so no e-mail.

[...]

>What is dy/dt exactly, for me it is JUST a notation for
>      y(t+h)-y(t)
> lim  ----------.
> h->0     h
>
>   1. so why can we multiply both part by dt ?
>   2. why can we put both integral sign ?

The definition of the derivative of a real-valued function of a single
real variable is correct (an assumption about t being an interior point
of the domain of f needs to be included).

Bad news: As stated, the definition is useless or exceedingly clumsy to
extend to many variables. Why?

      You can't divide by vectors in any way useful for calculus.

So, a reform was needed; done by Frechet or someone before him, but the
news did not reach the chambers of most educators, or most textbook
authors.

 What should the derivative be, according to Frechet?

A linear transformation (call it L for the time being and remember that it
depends on x) acting on the increment h in the "short Taylor's expansion
with remainder"

  f(x+h) = f(x) + L(h) + R(x;h)

where R(x;h)/norm(h)  goes to 0 as norm(h) goes to 0. (Norm: usually the
Euclidean length.)

Example:  let  f(x) = 1/x  where x is not 0. Then, subtracting one term at
a time and simplifying, we get (check it out!)

  1/(x+h) = 1/x - h/x^2 + h^2/(x^2 * (x+h))

Here L(h) is defined by  L(h) = h/x^2, and it is linear in h indeed, while
the term  h^2/(x^2 * (x+h))  can be declared the remainder term (divide by
abs(h), and it will still go to 0 as h goes to 0).

Who has difficulties with general x, might start by replace x with 2 and
write:

  1/(2+h) = 1/2 - h/4 + h^2/(4*(4+h))

so the linear transformation is L(h) = (-1/4) * h.

*** Here is the point that clouds the understanding of the derivative in
*** one variable:
*** A linear transformation in one variable is simply multiplication by a
*** constant. This constant is traditionally (and simplistically) 
*** presented as the derivative. Instead of the concept of a
*** proportionality relation, one is shortchanged by giving just that
*** constant of proportionality.

And back to dx and dy:  In the context of the relation   y = f(x) ,

one writes the "derivative relation" using the linear transformation L, as

   dy = L(dx) = f'(x) * fx

A fine point of domain of function: The variable h in Taylor's expansion
is often restricted, so that (x+h) is still in the domain of f.

No such restriction is needed for the variable dx: it can run through all
real numbers.

The punchline: If a (user-supplied) value of dx is not 0 then we can
divide by dx, and reconcile the L-notation with the prime notation:

                   dy / dx = f'(x)

keeping in mind that a specific x is considered, but otherwise with no
further fuss. Division as we know it.

Let's go for 2 variables, and let f(x, y) = x * y^2 .

Then the polynomial routine gives (the increment of the vector (x,y) is
called by me (h,k):

 f(x+h, y+k) = f(x,y)

             + y^2 * h + 2*x*y * k

             + h*k*(2*y + k)

Immediately, the second line expression

             + y^2 * h + 2*x*y * k

is a linear transformation of a vector (h,k) into a number, and the third
line term can be shown to go to 0 even after being divided by

 norm((h,k)) = sqrt(h^2 + k^2) .

So, L defined by  

  L(h,k) = (y^2) * h + (2*x*y) * k

is the derivative.

(As a piece de resistance, I like to show the students how concise the
derivative of the matrix inversion function (matrix-valued function of a
matrix variable!) looks when you present it as a linear transformation: 

 Let Inv be defined as Inv(X) = X^(-1) for all invertible matrices. Then

  d Inv(X) (H) = - Inv(X) * H * Inv(X)

and writing it down in terms of cofactors is pure torture.)

With the extension as before, replace h by dx and k by dy, and from the
relation

                    z = f(x,y)

we obtain

             dz = L(dx, dy) = (y^2) * dx + (2*x*y) * dy

One should recognize the coefficients of the linear transformation as the
"partial derivatives", and prove familiar theorems about them.

So, in case of differentiable f,

df(x,y) = 

(partial d)f/(partial d)x * dx + (partial d)f/(partial d)y * dy

The dx and dy are, respectively, the increments of x and y, extended for
the purposes of linear transformation to all pairs of real numbers.

> for me
> /
> | f dt
> /is only a NOTATION for the primitive or the RIEMANN integral if
>borned, the dt has no sense.

 [ 3D example skipped because 2 variables already illustrate the point ]

The integral question is a case of fusing two concepts into one (Riemann's
integral and "Newton's integral") because they seem to lead to the same
formula manipulation to obtain results:

On a straight line segment, the function of two variables x and dx,
linear in dx, defined by

             f(x) * dx

is the same as what is called in differential geometry a "differential
form". It is this differential form that is being integrated by "boundary
evaluation of the primitive form": finding F such that (the notation is
deliberately redundant) 

       F'(x) * dx = f(x) * dx

and then int[a to b] f(x) * dx = F(b) - F(a)

Observe: No mention of Riemann approach, and the problem of the existence
of F is postponed. We can still prove and use the sum formula,
substitution rule, and integration by parts, before we learn about
Riemann.

It is a fundamental achievement of early Calculus developers, showing that
this (Newton's, if you need a name) integral of f(x)*dx comes out the same
as Riemann's integral of f (and the question of existence is resolved).

Remark: To emphasize the distinction, some textbooks write (replace int by
that tall S and edit graphically)

 int[a to b] f , or even int_J f  where  J = [a, b]

for Riemann's integral, to indicate by the absence of dx (or d(whatever))
that Riemann's or related ideas are followed here. 

(In advanced integration theory, dx is short for d(mu)_1 where (mu)_1 is
one-dimensional Lebesgue measure (restricted if required). Tradition,
rather than notational consistency, prevailed.)

Now the manipulations which treat dy/dx as a true quotient become
legitimate (as long as dx is not 0).

Hope some of it helped,
ZVK(Slavek).