The figure shows gradient descent on a convex cost function. The parameter θ starts near the bottom of the curve, but each update jumps from one side of the valley to the other and lands farther from the minimum than the step before. What is the best interpretation of this behavior?
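The behavior described above can be reproduced numerically. The sketch below is only an illustration under stated assumptions: the cost J(θ) = θ², the starting point 0.1, and the step sizes 1.5 and 0.1 are hypothetical choices, not values taken from the figure.

```python
def gradient_descent(theta0, alpha, steps):
    """Run gradient descent on the assumed cost J(theta) = theta**2.

    Returns the list of theta values visited, starting with theta0.
    """
    thetas = [theta0]
    theta = theta0
    for _ in range(steps):
        grad = 2.0 * theta            # dJ/dtheta for J(theta) = theta^2
        theta = theta - alpha * grad  # standard gradient descent update
        thetas.append(theta)
    return thetas


# Assumed step sizes, chosen only to contrast the two regimes.
diverging = gradient_descent(theta0=0.1, alpha=1.5, steps=5)
converging = gradient_descent(theta0=0.1, alpha=0.1, steps=5)

print("alpha = 1.5:", diverging)
print("alpha = 0.1:", converging)
```

For this particular cost, each update multiplies θ by (1 - 2α). With α = 1.5 that factor is -2, so θ flips sign and roughly doubles in magnitude at every step, which matches the side-to-side jumps in the question. With α = 0.1 the factor is 0.8 and θ shrinks steadily toward the minimum at 0.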