Gradient Descent
The basic idea of gradient descent is to follow the gradient “downhill” until it’s zero (local minimum)
Higher-order Descent
General idea: apply a coordinate transformation so that the local energy landscape looks more like a “round bowl” so gradient now points directly toward nearby minimizer.
Most basic higher-order descent strategy is the Newton’s method