Advice/Tricks and Issues to Train a Neural Network

Adaptive Learning Rate

The idea is to start with a higher learning rate and decrease it as time progresses.

$\Delta w_i^t = -\eta \frac{\partial E^t}{\partial w_i} + \alpha \Delta w_i^{t-1}$

(t is time). $\alpha$ can be default or computed.

Too much training could cause overfitting. Stop training when the validation error starts to increase.

Last updated 5 years ago

Was this helpful?