An ML engineer has trained a neural network by using stochastic gradient descent (SGD). The neural network performs poorly on the test set. The values for training loss and validation loss remain high and show an oscillating pattern. The values decrease for a few epochs and then increase for a few epochs before repeating the same cycle.
What should the ML engineer do to improve the training process?
ninomfr64
1 month, 3 weeks agogulf1324
1 month, 4 weeks agofeelgoodfactor
2 months, 2 weeks agomotk123
2 months, 3 weeks agoGiorgioGss
3 months ago