Higher-Order Differentials

Introduction

Higher-order differentials are necessary skills for calculus. When it comes to the multivariate higher-order differentials, people will often feel confused about the definition and the mathematical expressions.

In this blog post, I would like to discuss and derive the univariate and multivariate higher-order differentials.

Prerequisites

Derivative Definition

The derivative of a function y=f(x) measures the rate of change of y with respect to x. The derivative of the function y=f(x) at the point x is defined as the limit of the ratio ΔyΔx as Δx0.

y=f(x)=dydx=df(x)dx=limΔx0ΔyΔx=limΔx0f(x+Δx)f(x)Δx

Multi-Index Notation

There are some notations in order to express the multivariate Taylor theorem conveniently.

Suppose αNn, α={α1,α2,,αn}, and xRn, x={x1,x2,,xn}, we have the following notations.

|α|=α1+α2++αn

α!=α1!α2!αn!

(nα)=n!α!=n!α1!α2!αn!

xα=x1α1x2α2xnαn

Given a constant natural number k, how many different α are there such that |α|=k?

Let’s look at an example. Assume n=3 and k=2, the α such that |α|=k are

α=(2,0,0)α=(0,2,0)α=(0,0,2)α=(1,1,0)α=(1,0,1)α=(0,1,1)

Therefore, we have 6 different α for n=3 and k=2.

The formula for the general case is actually not hard to derive. This combination problem is actually equivalent to the combination problem that we have k+n identical balls, and n different boxes, how many unique ways to put these balls into the boxes such that each box has at least one ball. To solve it, we would need to put all the k+n balls as a sequence, and insert n1 barriers between the balls, such that each two adjacent balls could have at most one barrier. The number of combinations for this problem is, obviously, (k+n1n1).

Let’s verify if the formula we derived is valid for the example we have seen above.

(k+n1n1)=(2+3131)=(42)=6

Univariate Higher-Order Differentials

First-Order Differential

Suppose a univariate function f:RR in an interval I=(a,b), y=f(x), the first-order differential of the function at the point xI is defined as

dy=df(x)=f(x)dx

Note that the definition of the first-order differential is consistent with the definition of the derivative.

There are many useful properties of the first-order differential, which could be proved by the definition of derivatives mentioned in the prerequisite section. The properties that will be used for deriving the higher-order differentials are the linearity rule and the product rule.

d(af(x)+bg(x))=adf(x)+bg(x)=af(x)dx+bg(x)dx

d(f(x)g(x))=g(x)df(x)+f(x)dg(x)=g(x)f(x)dx+f(x)g(x)dx

We will skip the proofs for these properties since the proofs are somewhat trivial.

Higher-Order Differential

Because x is an independent variable, dx is treated as a constant, and only dependent variable has the first-order differential, so we have the second-order differential of y, d2y, as follows

d2y=d(dy)=d(f(x)dx)=d(f(x))dx=(f(x)dx)dx=f(x)(dx)2=f(x)dx2

Note that (dx)2 is commonly denoted as dx2.

d2y=f(x)dx2

In general, the n-th order differential dny is

dny=fn(x)dxn

This also leads to the n-order derivative for univariate function.

fn(x)=dnydxn

Sometimes, it is written as

fn(x)=dnf(x)dxn

Higher-Order Differential of a Composite Function

Suppose y=f(x) and x is also dependent on an auxillary or latent variable u, i.e., x=g(u),

y=f(x)=f(g(u))

By applying the first-order differential properties, we have

d2y=d(dy)=d(f(x)dx)

Note that now x is not an independent variable and dx is no longer a constant, so we have

d2y=d(f(x)dx)=(d(f(x))dx+f(x)d(dx))=f(x)(dx)2+f(x)d2x=f(x)dx2+f(x)d2x

Using the same approach, computing d3y, d4y, , dny, , becomes relatively simpler. For example,

d3y=d(d2y)=d(f(x)dx2+f(x)d2x)=d(f(x)dx2)+d(f(x)d2x)=d(f(x))dx2+f(x)d(dx2)+d(f(x))d2x+f(x)d(d2x)=f(x)dxdx2+f(x)d((dx)(dx))+f(x)dxd2x+f(x)d3x=f(x)dx3+f(x)(d(dx)dx+dxd(dx))+f(x)dxd2x+f(x)d3x=f(x)dx3+f(x)(d2xdx+dxd2x)+f(x)dxd2x+f(x)d3x=f(x)dx3+3f(x)dxd2x+f(x)d3x

Multivariate Higher-Order Differentials

First-Order Differential

Suppose a univariate function f:RnR in a region, y=f(x)=f(x1,x2,,xn), the first-order partial differential of the function at the point x={x1,x2,,xn} with respect to the variable xi is defined as

yxidxi

The first-order total differential of the function evaluated at x is defined as

dy=yx1dx1+yx2dx2++yxndxn=i=1nyxidxi=i=1nf(x)xidxi

Higher-Order Differential

Similar to the higher-order differential for univariate functions, we could derive the higher-order differential for multivariate functions based on definitions.

The second-order total differential of the function evaluated at x is

d2y=d(dy)=d(i=1nyxidxi)=i=1nd(yxidxi)=i=1nd(yxi)dxi=i=1n(j=1n(yxi)xjdxj)dxi=i=1n(j=1n2yxixjdxj)dxi=i=1n(j=1n2yxixjdxidxj)=|α|=2(|α|α)(|α|yx1α1x2α2xnαndx1α1dx2α2dxnαn)=|α|=2(|α|α)(|α|f(x)x1α1x2α2xnαndx1α1dx2α2dxnαn)

where αNn and (|α|α) is the multinomial coefficient represented using multi-index notation.

In general, the k-th order total differential of the function evaluated at x is

dky=dkf(x)=|α|=k(kα)(|α|f(x)x1α1x2α2xnαndx1α1dx2α2dxnαn)

If we denote

dx=dx1,dx2,,dxn

and the k-th order partial derivative

Dαf=|α|fx1α1x2α2xnαn

We could simplify the k-th order total differential of the function evaluated at x,

dky=dkf(x)=|α|=k(|α|α)Dαf(x)(dx)α

As has been discussed in the prerequisite section, the number of terms in the summation is (k+n1n1).

Higher-Order Differential of a Composite Function

The higher-order differential for multivariate functions that consists of composite function(s) are much more complicated and should be discussed case by case. We will just skip the discussion about it here.

References

Author

Lei Mao

Posted on

06-22-2021

Updated on

06-22-2021

Licensed under


Comments