Newton’s Method

Learn how to use second-order information, such as the Hessian, to improve on gradient descent.

Second-order optimization algorithms

Newton’s methods are a class of optimization algorithms that leverage second-order information, such as the Hessian, to achieve faster and more efficient convergence. In contrast, gradient descent algorithms, such as Nesterov momentum, rely solely on first-order gradient information.
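To make this contrast concrete, here is a minimal sketch (with a hypothetical one-dimensional objective $f(x) = (x - 3)^2 + 1$ and an assumed learning rate of $0.1$) comparing a single gradient-descent step with a single Newton step; the Newton step divides the gradient by the curvature, as derived from the Taylor expansion below.

```python
# Minimal sketch (hypothetical 1-D objective):
# minimize f(x) = (x - 3)^2 + 1, whose gradient is 2(x - 3) and whose
# second derivative (the 1-D "Hessian") is the constant 2.

def f(x):
    return (x - 3.0) ** 2 + 1.0

def grad(x):
    return 2.0 * (x - 3.0)

def hess(x):
    return 2.0

x0 = 10.0   # shared starting point
lr = 0.1    # assumed learning rate for the first-order method

# First-order update: step against the gradient, scaled by a learning rate.
x_gd = x0 - lr * grad(x0)            # 10 - 0.1 * 14 = 8.6

# Second-order (Newton) update: scale the step by the inverse curvature.
x_newton = x0 - grad(x0) / hess(x0)  # 10 - 14 / 2 = 3.0 (the exact minimizer)

print(f"gradient descent step: {x_gd:.4f}")
print(f"Newton step:           {x_newton:.4f}")
```

Because the objective here is quadratic, a single Newton step lands exactly on the minimizer, while gradient descent only moves part of the way and needs many more iterations.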

The idea of Newton’s method is to use the curvature information captured by the Hessian to build a more accurate local approximation of the function near the optimum.

Recall the second-order Taylor series expansion of our objective $f(x)$ around a point $x_t$ (at step $t$).
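In standard form, with $\nabla f(x_t)$ denoting the gradient and $H(x_t) = \nabla^2 f(x_t)$ the Hessian of $f$ at $x_t$, it reads:

$$
f(x) \;\approx\; f(x_t) \;+\; \nabla f(x_t)^\top (x - x_t) \;+\; \frac{1}{2}\,(x - x_t)^\top H(x_t)\,(x - x_t).
$$

Minimizing this quadratic approximation with respect to $x$, by setting its gradient $\nabla f(x_t) + H(x_t)(x - x_t)$ to zero, yields the Newton update $x_{t+1} = x_t - H(x_t)^{-1}\,\nabla f(x_t)$, the curvature-scaled step used in the sketch above.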
