You train a model daily on logged features. At serving time, the same features are computed in real time. What can go wrong, and how do you catch it before it hurts users?
formulate your answer, then —
tldr
Train-serve skew: feature values differ between training and serving because of duplicated computation logic, time-window misalignment, late-arriving events, or different join semantics, not because the world changed. Detection: log the features actually used at serving time and compare their distributions to the training features. Prevention: a feature store with a shared code path for training retrieval and serving lookup; point-in-time correctness prevents future information leaking into training. Shadow validation plus distribution monitoring catch regressions before users are affected.
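Detection in practice is usually a batch job that joins the features logged at serving time back to the training set and compares per-feature distributions. Below is a minimal sketch, assuming both sides are available as pandas DataFrames with matching column names; the PSI threshold of 0.2, the KS significance cutoff, and the function names are illustrative assumptions, not a standard API.

```python
import numpy as np
import pandas as pd
from scipy.stats import ks_2samp

PSI_ALERT = 0.2    # common rule-of-thumb threshold; tune per feature
KS_P_ALERT = 0.01  # flag features whose distributions differ at this significance

def psi(train: pd.Series, serve: pd.Series, bins: int = 10) -> float:
    """Population Stability Index of serving values relative to training values."""
    train, serve = train.dropna(), serve.dropna()
    # Bin edges come from training quantiles, so PSI measures how serving
    # traffic has shifted relative to what the model was trained on.
    edges = np.unique(np.quantile(train, np.linspace(0, 1, bins + 1)))
    if len(edges) < 2:
        return 0.0  # degenerate (near-constant) training feature
    serve = serve.clip(edges[0], edges[-1])  # keep out-of-range serving values in the end bins
    train_pct = np.histogram(train, bins=edges)[0] / max(len(train), 1)
    serve_pct = np.histogram(serve, bins=edges)[0] / max(len(serve), 1)
    # Clip to avoid log(0) in empty bins.
    train_pct = np.clip(train_pct, 1e-6, None)
    serve_pct = np.clip(serve_pct, 1e-6, None)
    return float(np.sum((serve_pct - train_pct) * np.log(serve_pct / train_pct)))

def skew_report(train_df: pd.DataFrame, serve_df: pd.DataFrame) -> pd.DataFrame:
    """Per-feature skew metrics between training features and logged serving features."""
    rows = []
    for col in train_df.columns.intersection(serve_df.columns):
        if not pd.api.types.is_numeric_dtype(train_df[col]):
            continue  # categorical features need a frequency-based test (e.g. chi-squared) instead
        stat, p_value = ks_2samp(train_df[col].dropna(), serve_df[col].dropna())
        score = psi(train_df[col], serve_df[col])
        rows.append({
            "feature": col,
            "ks_stat": stat,
            "ks_p_value": p_value,
            "psi": score,
            "alert": score > PSI_ALERT or p_value < KS_P_ALERT,
        })
    return pd.DataFrame(rows).sort_values("psi", ascending=False)
```

Run it on a schedule against a rolling window of serving logs and alert on flagged features; the same comparison in shadow mode before launch catches skew before any user sees a prediction.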
follow-up
- What is point-in-time correctness in a feature store and why is it essential for preventing data leakage?
- How would you detect train-serve skew automatically in a production ML system?
- Your model's performance degrades two weeks after launch, but training metrics still look fine. How do you distinguish train-serve skew from concept drift?