How would you deploy a new model version safely?

Question

Accepted Answer

Walk me through how you'd deploy a new model version to production safely. What strategies exist and how do you choose between them? Think about: the risk of rolling out a model that performs worse. How you'd limit blast radius. The difference between rollout strategies. What "safe" means for an ML deployment vs. a code deployment. Model deployment is riskier than code deployment because you can't fully verify correctness offline — a model that looks great on your eval set can still fail in ways