[Repost Q] Should I scale values before using them to train autoencoder?

ShadowAether · edit-2 2 years ago

[Repost Q] Should I scale values before using them to train autoencoder?

ShadowAether · 2 years ago

Original answer:

Scaling the dataset before passing it to the autoencoder is usually how I do it, you don’t need to rescale after if you are only using the encoder portion (for example for dimensionality reduction). If you don’t do it linearly (aka (x-min(x))/(max(x)-min(x) ) and use exp or log to do it then be mindful that it would likely have an impact with respect to loss/optimization behaviour.

Make sure to take the max and min values from the training data then apply it to the testing (in the case of values out of bounds, set them to the boundary value but this shouldn’t have a big impact if your training dataset is large enough with enough variance).