Total derivative (Fréchet derivative in ℝ^k)

The linear map Df(a) giving the best first-order approximation f(a+h)=f(a)+Df(a)h+o(‖h‖).
Total derivative (Fréchet derivative in ℝ^k)

Let URkU\subseteq\mathbb{R}^k be and let f:URmf:U\to\mathbb{R}^m. The function ff is (Fréchet) differentiable at aUa\in U if there exists a A:RkRmA:\mathbb{R}^k\to\mathbb{R}^m such that

limh0f(a+h)f(a)Ah2h2=0. \lim_{h\to 0}\frac{\|f(a+h)-f(a)-A h\|_2}{\|h\|_2}=0.

The map AA is uniquely determined when it exists and is called the (total) derivative of ff at aa, denoted Df(a)Df(a).

This definition captures the best linear approximation of ff near aa. In coordinates, Df(a)Df(a) is represented by the Jf(a)J_f(a) when ff has continuous .

Examples:

  • If f(x)=Ax+bf(x)=Ax+b is affine (with AA an m×km\times k matrix), then Df(a)=ADf(a)=A for all aa.
  • If f:R2Rf:\mathbb{R}^2\to\mathbb{R}, f(x,y)=x2+y2f(x,y)=x^2+y^2, then Df(a)Df(a) is the linear map h2a,hh\mapsto 2\langle a,h\rangle (equivalently, gradient dot hh).
  • Existence of all partial derivatives at aa does not necessarily imply existence of Df(a)Df(a) (Fréchet differentiability).