Suppose that you are training a network with parameters [4.5, 2.5, 1.2, 0.6], a learning rate of 0.3, and a gradient of [-1, 9, 2, 5]. After one update step of gradient descent, what would your network’s parameters be equal to?
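
One way to check the arithmetic is the short sketch below. It assumes the plain gradient descent rule, new_params = params - learning_rate * gradient, applied element by element with no momentum or weight decay. The helper name `gradient_descent_step` is hypothetical and chosen only for illustration.

```python
def gradient_descent_step(params, grads, learning_rate):
    """Apply one plain gradient descent update to each parameter.

    Assumes the vanilla rule: p_new = p - learning_rate * g.
    """
    return [p - learning_rate * g for p, g in zip(params, grads)]


params = [4.5, 2.5, 1.2, 0.6]   # values from the question
grads = [-1, 9, 2, 5]           # gradient from the question
learning_rate = 0.3

updated = gradient_descent_step(params, grads, learning_rate)

# Round only for display, to hide floating-point noise.
print([round(p, 4) for p in updated])  # [4.8, -0.2, 0.6, -0.9]
```

Under that assumption, each parameter moves against the sign of its gradient: the first gradient entry is negative, so the first parameter increases (4.5 + 0.3 = 4.8), while the others decrease (2.5 - 2.7 = -0.2, 1.2 - 0.6 = 0.6, 0.6 - 1.5 = -0.9).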