In linear regression, we need to find the parameters that minimize the sum of squared residuals.
When we reach the minimum point, the slope =0, then we can have the equation and then solve the required parameter.
Then why do we need to use the gradient descent which needs a lot of steps to find the solution and is not the exact minimum point.
Thank you!
When we reach the minimum point, the slope =0, then we can have the equation and then solve the required parameter.
Then why do we need to use the gradient descent which needs a lot of steps to find the solution and is not the exact minimum point.
Thank you!