Second derivative tests

Using second derivatives (or the Hessian) to classify critical points as local minima or maxima
Second derivative tests

One-variable second derivative test

Let f:IRf:I\to\mathbb{R} be twice and let aIa\in I^\circ satisfy f(a)=0f'(a)=0.

Proposition (one variable):

  • If f(a)>0f''(a)>0, then aa is a strict of ff.
  • If f(a)<0f''(a)<0, then aa is a strict local maximum of ff.
  • If f(a)=0f''(a)=0, no conclusion follows in general.

Multivariable Hessian test

Let URnU\subseteq\mathbb{R}^n be and let f:URf:U\to\mathbb{R} be . Let aUa\in U be a , i.e. HAHAHUGOSHORTCODE779s5HBHB=0 =0, and let H=HAHAHUGOSHORTCODE779s6HBHBH= be the Hessian matrix at aa.

Proposition (multivariable):

  • If HH is positive definite (i.e., vTHv>0v^{\mathsf T}Hv>0 for all v0v\neq 0), then aa is a strict local minimum.
  • If HH is negative definite (i.e., vTHv<0v^{\mathsf T}Hv<0 for all v0v\neq 0), then aa is a strict local maximum.
  • If HH is indefinite (i.e., takes both positive and negative values on nonzero vv), then aa is a saddle point (neither local min nor local max).
  • If HH is only semidefinite, the test is inconclusive.

These tests are consequences of : near a critical point, the quadratic term 12hTHh\frac12 h^{\mathsf T}Hh controls the leading behavior of f(a+h)f(a)f(a+h)-f(a).

Proof sketch: One-variable: Taylor’s theorem gives f(a+h)=f(a)+12f(ξ)h2f(a+h)=f(a)+\tfrac12 f''(\xi)h^2 for some ξ\xi between aa and a+ha+h; the sign of f(a)f''(a) controls the sign for hh small. Multivariable: Taylor’s theorem yields f(a+h)=f(a)+12hTHh+o(h2). f(a+h)=f(a)+\frac12 h^{\mathsf T}Hh+o(\|h\|^2). If HH is positive definite, then hTHhch2h^{\mathsf T}Hh\ge c\|h\|^2 for some c>0c>0, which dominates the o(h2)o(\|h\|^2) remainder for small hh, giving f(a+h)>f(a)f(a+h)>f(a) for h0h\neq 0 small.