Mean Normalization
In addition to scaling the features, some may also consider Mean Normalization.
In this, we replace by so as to make the features have approximately 0 mean.
(Note: This is not applied for which has a fixed value 1).
In general, we can use the following formula to scale the features using mean normalization:
where is the feature, is its mean and is its range (i.e. max-min).
If this leads to being in the range [-0.5, 0.5] approximately, the gradient descent will work quickly.
Last updated
Was this helpful?