Coordinate Descent

Learn about the coordinate descent algorithm and how it minimizes the objective along one coordinate direction at a time.

The coordinate descent algorithm

Consider a multivariate function f(x)=f(x1,x2,...,xn)f(x) = f(x_1, x_2, ..., x_n) that we want to optimize. Using the gradient descent algorithm, the updates will happen in all directions at once.

Coordinate descent is a variation of gradient descent that tries to find the minimum of a function by minimizing it (i.e., performing gradient descent) along one coordinate direction at a time. Starting from an initial point x0x^0, coordinate descent defines xtx^t at the time t>0t>0 by solving nn single variable optimization problems one by one as follows:

Get hands-on with 1400+ tech skills courses.