You have a ranking task with 500 tabular features and 200M training examples. When would you pick GBDT over a deep neural network, and what signals would make you switch?
Formulate your own answer first, then read on.
tldr
GBDT wins on heterogeneous tabular data: it handles missing values natively, discovers feature interactions automatically, trains fast, and stays relatively interpretable. Switch to a DNN when you have high-cardinality ID features (which need embeddings), sequences, or content signals (text/images). At 200M examples, start with LightGBM for speed and iteration velocity, then add a neural component once ID embeddings or content features become necessary. Production systems often use both (Wide & Deep, GBDT + embedding stacking).
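
A minimal sketch of the "start with LightGBM" baseline, using its LambdaMART-style ranker. The data here is synthetic and the query-group layout (100 queries x 100 docs) is an illustrative assumption, not part of the original question; at 200M real examples you would stream data and tune parameters properly.

```python
# Minimal LightGBM ranking baseline (LambdaMART via LGBMRanker).
# Synthetic data stands in for the 500 tabular features; NaNs are left
# in place because LightGBM handles missing values natively.
import numpy as np
import lightgbm as lgb

rng = np.random.default_rng(0)

n_rows, n_features = 10_000, 500
X = rng.normal(size=(n_rows, n_features))
X[rng.random(X.shape) < 0.05] = np.nan      # simulate missing values
y = rng.integers(0, 4, size=n_rows)         # graded relevance labels 0-3

# group = number of documents per query, in row order (assumed layout:
# 100 queries with 100 candidate docs each).
group = np.full(100, 100)

ranker = lgb.LGBMRanker(
    objective="lambdarank",
    n_estimators=200,
    learning_rate=0.1,
    num_leaves=255,      # larger leaf budget is typical at this data scale
    n_jobs=-1,
)
ranker.fit(X, y, group=group)

scores = ranker.predict(X[:100])            # scores for one query's candidates
print(scores[:5])
```

At production scale the same pattern holds: train the GBDT on the dense tabular features first, then feed its scores (or leaf indices) alongside learned embeddings into a neural layer if the stacked setup is warranted.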
follow-up
- What is the Wide & Deep architecture and what problem does each component solve?
- How would you combine GBDT and DNN predictions — ensemble, stacking, or something else?
- LightGBM trains faster than XGBoost on large datasets. What algorithmic difference causes this?