mlprep
ML Breadth · medium · 10 min

How does a decision tree decide where to split? What's the difference between Gini impurity and information gain, and which should you use?

formulate your answer, then read on

tldr

Decision trees split greedily: at each node, try every feature/threshold pair and keep the split with the largest impurity reduction. Gini impurity = 1 − Σ_c p_c², the probability of mislabeling a sample drawn and labeled at random from the node's class distribution. Information gain = reduction in entropy, where entropy = −Σ_c p_c log₂ p_c. The two criteria produce nearly identical trees in practice; Gini is slightly faster because it avoids the log. Regression trees minimize variance (i.e., MSE) instead. Prevent overfitting via max_depth, min_samples_leaf, or cost-complexity pruning. A single tree is a high-variance weak learner; the real power comes from ensembling (bagging, boosting).
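A minimal from-scratch sketch of the greedy split search described above: compute the parent node's impurity, scan candidate thresholds on one numeric feature, and keep the split with the largest weighted impurity reduction. The names gini, entropy, and best_split are illustrative, and this handles a single feature; real implementations (e.g. scikit-learn's Cython splitter) scan all features at every node.

```python
import numpy as np

def gini(y):
    """Gini impurity: 1 - sum_c p_c^2."""
    _, counts = np.unique(y, return_counts=True)
    p = counts / counts.sum()
    return 1.0 - np.sum(p ** 2)

def entropy(y):
    """Shannon entropy: -sum_c p_c log2(p_c)."""
    _, counts = np.unique(y, return_counts=True)
    p = counts / counts.sum()
    return -np.sum(p * np.log2(p))

def best_split(x, y, impurity=gini):
    """Scan midpoints between consecutive sorted values of one numeric
    feature; return (threshold, impurity_reduction) for the best split."""
    parent = impurity(y)
    best = (None, 0.0)
    vals = np.unique(x)
    for t in (vals[:-1] + vals[1:]) / 2:  # candidate midpoints
        left, right = y[x <= t], y[x > t]
        w_left, w_right = len(left) / len(y), len(right) / len(y)
        gain = parent - (w_left * impurity(left) + w_right * impurity(right))
        if gain > best[1]:
            best = (t, gain)
    return best

x = np.array([2.0, 3.0, 10.0, 11.0, 12.0])
y = np.array([0, 0, 1, 1, 1])
print(best_split(x, y, gini))     # (6.5, 0.48): a perfect split here
print(best_split(x, y, entropy))  # same threshold; gain measured in bits
```

Note that both criteria pick the same threshold on this toy data, which is the usual story: the rankings of candidate splits rarely differ enough to change the tree.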

follow-up

  • What is cost-complexity pruning and how does it differ from setting max_depth? (See the sketch after this list.)
  • How does a decision tree handle numeric vs categorical features differently?
  • Why do decision trees partition the feature space into axis-aligned rectangles? What does this mean for modeling circular or diagonal decision boundaries?
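For the first follow-up, a sketch contrasting the two regularizers using scikit-learn's ccp_alpha and cost_complexity_pruning_path: max_depth stops the tree from growing in the first place, while cost-complexity pruning grows the full tree and then collapses subtrees whose impurity improvement doesn't justify their leaf count under the penalty α. The dataset choice and the alpha subsampling below are illustrative; in practice you would pick α by cross-validation.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# max_depth regularizes a priori: the tree simply never grows past the limit.
shallow = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X_tr, y_tr)
print(f"max_depth=3: leaves={shallow.get_n_leaves()}  "
      f"test acc={shallow.score(X_te, y_te):.3f}")

# Cost-complexity pruning works post hoc: grow the full tree, then compute the
# sequence of subtrees that are optimal for increasing values of alpha.
path = DecisionTreeClassifier(random_state=0).cost_complexity_pruning_path(X_tr, y_tr)
for alpha in path.ccp_alphas[::5]:  # sample a few alphas along the path
    pruned = DecisionTreeClassifier(ccp_alpha=alpha, random_state=0).fit(X_tr, y_tr)
    print(f"alpha={alpha:.4f}: leaves={pruned.get_n_leaves()}  "
          f"test acc={pruned.score(X_te, y_te):.3f}")
```

The key talking point: max_depth is a blunt, uniform cap, while pruning removes only the branches that fail to pay for their complexity, so it can keep one deep, genuinely useful branch while cutting many shallow, noisy ones.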