When is reaching for a large foundation model the wrong choice? Walk me through how you'd decide between a logistic regression, a gradient boosted tree, a fine-tuned BERT, and GPT-4 for a new ML task.
formulate your answer, then —
tldr
Reach for simpler models first: GBDT for tabular data, fine-tuned BERT for NLP classification, logistic regression for latency-critical scoring. Foundation models are justified when the task is open-ended or generative, labeled data is scarce, you need world knowledge or complex reasoning, and request volume is low enough to absorb the cost. At 10M requests/day, GPT-4 costs on the order of 1000× more than a small fine-tuned model. Distillation bridges the gap: label with the large model, serve the small one. Finally, weigh whether the organization can actually maintain whatever you deploy.
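One way to frame that triage is as code. A heuristic sketch only: the `task` fields, the 10ms latency cutoff, the 1k-label threshold, and the 100k/day volume cutoff are all assumptions, not rules from the answer above.

```python
def pick_model(task):
    # Heuristic triage mirroring the tldr; every threshold here is an
    # assumed placeholder to make the priorities concrete.
    if task.is_tabular:
        return "gradient-boosted trees"
    if task.latency_budget_ms < 10:
        return "logistic regression"  # cheap, fast, auditable scorer
    foundation_worthwhile = (
        (task.is_generative
         or task.needs_world_knowledge
         or task.labeled_examples < 1_000)
        and task.requests_per_day < 100_000  # volume low enough to absorb cost
    )
    if foundation_worthwhile:
        return "GPT-4 (consider distilling once volume grows)"
    return "fine-tuned BERT"  # closed-set NLP task with labels available
```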
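The back-of-envelope math behind the 1000× claim. The per-token prices and 500 tokens/request below are illustrative assumptions, not current quotes; plug in real rates before using this in anger.

```python
# Rough cost comparison at 10M requests/day.
REQUESTS_PER_DAY = 10_000_000
TOKENS_PER_REQUEST = 500  # assumed prompt + completion

# Assumed blended prices per 1M tokens (placeholders).
GPT4_PER_1M_TOKENS = 30.00        # large hosted model
SMALL_PER_1M_TOKENS = 0.03        # self-hosted fine-tuned BERT-class model

def daily_cost(price_per_1m_tokens: float) -> float:
    # Total tokens/day, in millions, times the per-million price.
    return REQUESTS_PER_DAY * TOKENS_PER_REQUEST / 1e6 * price_per_1m_tokens

gpt4 = daily_cost(GPT4_PER_1M_TOKENS)    # -> $150,000/day
small = daily_cost(SMALL_PER_1M_TOKENS)  # -> $150/day
print(f"GPT-4: ${gpt4:,.0f}/day  small: ${small:,.0f}/day  ratio: {gpt4/small:,.0f}x")
```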
follow-up
- How would you use knowledge distillation to compress a GPT-4-level capability into a BERT-scale model for a classification task? (a sketch follows this list)
- Your team wants to use RAG for a customer support bot. How do you decide whether RAG is a better fit than fine-tuning, and how would you evaluate either?
- A PM asks why you're not just using GPT-4 for everything. Walk me through the cost and reliability argument without dismissing the capability.
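A minimal sketch for the distillation follow-up, assuming the large model has already labeled an unlabeled corpus offline. The example texts, the teacher labels, the epoch count, and the `distilbert-base-uncased` student are all placeholder assumptions.

```python
# Hard-label distillation: the large model labels the corpus offline,
# then a small student is fine-tuned on those labels and served instead.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

texts = ["refund not processed", "love the new update"]  # unlabeled corpus
teacher_labels = [0, 1]  # produced offline by prompting the large model

tok = AutoTokenizer.from_pretrained("distilbert-base-uncased")
student = AutoModelForSequenceClassification.from_pretrained(
    "distilbert-base-uncased", num_labels=2)

enc = tok(texts, padding=True, truncation=True, return_tensors="pt")
labels = torch.tensor(teacher_labels)

opt = torch.optim.AdamW(student.parameters(), lr=2e-5)
student.train()
for _ in range(3):  # a few epochs over the teacher-labeled set
    out = student(**enc, labels=labels)  # cross-entropy on teacher labels
    out.loss.backward()
    opt.step()
    opt.zero_grad()
```

If the teacher exposes token logprobs, this upgrades to classic soft-target distillation (KL divergence against the teacher's distribution); with hard labels only, plain cross-entropy as above is the common form.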