mlprep / ML Breadth · medium · 10 min

What is regularization and how do the different techniques compare? When would you pick one over another?

formulate your answer, then —

You said L1 drives weights to exactly zero but L2 doesn't. Intuitively, why does the math produce that difference?

formulate your answer, then —

tldr

Regularization constrains effective model capacity to reduce overfitting. L2 shrinks each weight in proportion to its magnitude, so weights become small but never exactly zero. L1 shrinks each weight by a constant amount, so small weights are pushed to exactly zero (implicit feature selection). Dropout randomly silences neurons during training, forcing redundant representations. Batch norm stabilizes layer input distributions and adds a mild regularizing effect through the noise in batch statistics. In practice: weight decay (L2 / AdamW) by default, L1 when you need sparsity or feature selection, dropout in large fully connected layers.
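One way to see why L1 zeroes weights and L2 does not, sketched for plain (sub)gradient descent on a loss $L(w) + \lambda R(w)$ with step size $\eta$:

L2 penalty, $R(w) = \tfrac{1}{2}\lVert w \rVert_2^2$:

$$w \leftarrow w - \eta\,\nabla L(w) - \eta\lambda\,w = (1 - \eta\lambda)\,w - \eta\,\nabla L(w)$$

The pull toward zero is proportional to the weight itself, so it decays geometrically and never lands exactly on zero.

L1 penalty, $R(w) = \lVert w \rVert_1$, in its proximal (soft-thresholding) form:

$$w \leftarrow \operatorname{sign}(z)\,\max(\lvert z \rvert - \eta\lambda,\ 0), \qquad z = w - \eta\,\nabla L(w)$$

The pull toward zero has constant size $\eta\lambda$ regardless of the weight's magnitude, so any weight the data gradient cannot keep above that threshold snaps to exactly zero. That is where the sparsity comes from.

A minimal sketch of the practical difference, assuming scikit-learn and a synthetic dataset (the alpha values are arbitrary, chosen only for illustration):

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.linear_model import Lasso, Ridge

# Synthetic regression: 50 features, only 5 actually informative.
X, y = make_regression(n_samples=200, n_features=50, n_informative=5,
                       noise=10.0, random_state=0)

# Same data, L1 (Lasso) vs. L2 (Ridge) penalty.
lasso = Lasso(alpha=1.0).fit(X, y)
ridge = Ridge(alpha=1.0).fit(X, y)

# L1 zeroes out most of the uninformative coefficients;
# L2 leaves them small but nonzero.
print("Lasso nonzero coefficients:", np.count_nonzero(lasso.coef_), "of 50")
print("Ridge nonzero coefficients:", np.count_nonzero(ridge.coef_), "of 50")
```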

follow-up

  • How does early stopping relate to L2 regularization, and in what sense are they equivalent?
  • Why does dropout not work well with batch normalization, and how does layer normalization compare?
  • How would you diagnose whether a model is underfitting vs. overfitting and choose the right regularization response?