Walk me through precision, recall, F1, AUC-ROC, and AUC-PR. When would you use each, and what does each actually measure?
formulate your answer first, then check the tldr below.
tldr
Precision = of everything flagged positive, the fraction that truly is; it drops as FP cost you. Recall = of all actual positives, the fraction you caught; it drops as FN cost you. F1 = harmonic mean of the two; it stays low unless both are high, so a lopsided precision/recall trade-off is punished. AUC-ROC measures ranking quality (the probability a random positive outscores a random negative); random = 0.5, but it reads misleadingly high under class imbalance because FPR barely moves when negatives dominate. AUC-PR is the better summary for imbalanced problems; its random baseline is the positive-class prevalence, not 0.5. Always pick the operating threshold from business costs, not the default 0.5.
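To make the tldr concrete, here is a minimal numpy sketch on synthetic imbalanced data (the score distributions, 1% prevalence, and the 1.0 cutoff are all invented for illustration). It computes the threshold metrics by hand, AUC-ROC via its rank (Mann-Whitney) formulation, and AUC-PR as average precision; on the same scores, AUC-ROC comes out high while AUC-PR stays far lower.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 20_000
prevalence = 0.01                         # heavy class imbalance: ~1% positives
y = (rng.random(n) < prevalence).astype(int)
# hypothetical model: positives score ~1.5 sigma above negatives
scores = rng.normal(loc=1.5 * y, scale=1.0)

# threshold metrics at an arbitrary cutoff of 1.0
pred = (scores > 1.0).astype(int)
tp = np.sum((pred == 1) & (y == 1))
fp = np.sum((pred == 1) & (y == 0))
fn = np.sum((pred == 0) & (y == 1))
precision = tp / (tp + fp)
recall = tp / (tp + fn)
f1 = 2 * precision * recall / (precision + recall)

# AUC-ROC = P(random positive outscores random negative)
pos, neg = scores[y == 1], scores[y == 0]
auc_roc = np.mean(pos[:, None] > neg[None, :])

# AUC-PR as average precision: mean of precision@k over the positives,
# scanning the ranking from highest score down
order = np.argsort(-scores)
y_sorted = y[order]
cum_tp = np.cumsum(y_sorted)
prec_at_k = cum_tp / np.arange(1, n + 1)
auc_pr = np.sum(prec_at_k * y_sorted) / y_sorted.sum()

print(f"precision={precision:.3f} recall={recall:.3f} f1={f1:.3f}")
print(f"AUC-ROC={auc_roc:.3f}  AUC-PR={auc_pr:.3f}  prevalence={prevalence}")
```

The gap between the two AUCs is the tldr's point: the FPR denominator is the huge negative class, so AUC-ROC looks flattering, while AUC-PR is anchored to the prevalence baseline and exposes the weak precision.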
follow-up
- Your medical screening model has 95% recall but only 10% precision. Is this acceptable? How do you explain the trade-off to a non-technical stakeholder?
- How do you compare two models when one has higher AUC-ROC but the other has higher AUC-PR?
- What is the Matthews Correlation Coefficient (MCC) and when is it preferred over F1?
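As a warm-up for the MCC follow-up, a pure-stdlib sketch (the confusion-matrix counts are made up for illustration): MCC uses all four cells of the confusion matrix, so it is invariant to swapping which class you call "positive", while F1 ignores TN and can change drastically under the same swap.

```python
import math

def confusion_metrics(tp: int, fp: int, fn: int, tn: int):
    """Return (F1, MCC) from the four confusion-matrix cells."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    mcc = (tp * tn - fp * fn) / math.sqrt(
        (tp + fp) * (tp + fn) * (tn + fp) * (tn + fn)
    )
    return f1, mcc

# hypothetical imbalanced run: 100 actual positives, 9900 actual negatives
f1_a, mcc_a = confusion_metrics(tp=90, fp=910, fn=10, tn=8990)
# the same predictions with the class labels swapped (tp<->tn, fp<->fn)
f1_b, mcc_b = confusion_metrics(tp=8990, fp=10, fn=910, tn=90)

print(f"minority as positive: F1={f1_a:.3f} MCC={mcc_a:.3f}")
print(f"majority as positive: F1={f1_b:.3f} MCC={mcc_b:.3f}")
```

F1 jumps from roughly 0.16 to roughly 0.95 depending on the labeling choice; MCC is identical in both cases, which is why it is often preferred on imbalanced data where "which class is positive" is arbitrary.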