Walk me through backpropagation. How does a neural network actually learn from a mistake?
formulate your answer, then read on
You mentioned the activation derivative — what happens to gradients in very deep networks, and how do residual connections address it?
formulate your answer, then read on
tldr
Backprop applies the chain rule backward through the network, computing ∂L/∂w at each layer from the cached forward-pass activations and the gradient arriving from the layer above. Vanishing gradients occur when small activation derivatives get multiplied across many layers; ReLU eased this because its derivative is 1 for positive inputs. Residual connections compute y = x + f(x), so the local Jacobian is I + ∂f/∂x: the identity term gives the gradient a path that is never scaled down by the activation derivatives, which is what makes very deep networks trainable.
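A minimal sketch of both points, assuming a toy 2-layer ReLU network with squared-error loss (the shapes, names, and constants below are illustrative, not part of the answer above): the backward pass reuses the cached forward activations at each chain-rule step, and the scalar tail shows why an identity term in each layer's derivative keeps the gradient product from collapsing.

```python
import numpy as np

# Hedged sketch, not code from the answer above: a manual backward pass for a
# tiny 2-layer ReLU network, y_hat = W2 @ relu(W1 @ x), with squared-error loss.
# All shapes, names, and toy values are illustrative assumptions.

rng = np.random.default_rng(0)
x  = rng.normal(size=(4, 1))          # input
y  = rng.normal(size=(2, 1))          # target
W1 = 0.5 * rng.normal(size=(3, 4))    # layer-1 weights
W2 = 0.5 * rng.normal(size=(2, 3))    # layer-2 weights

# Forward pass: cache every intermediate the backward pass will need.
z1 = W1 @ x                    # pre-activation
a1 = np.maximum(z1, 0.0)       # ReLU activation (cached)
y_hat = W2 @ a1
loss = 0.5 * np.sum((y_hat - y) ** 2)

# Backward pass: chain rule, one layer at a time, using the cached values.
d_yhat = y_hat - y             # dL/dy_hat
dW2 = d_yhat @ a1.T            # dL/dW2
d_a1 = W2.T @ d_yhat           # gradient flowing into layer 1's output
d_z1 = d_a1 * (z1 > 0)         # ReLU derivative: 1 where z1 > 0, else 0
dW1 = d_z1 @ x.T               # dL/dW1

# Sanity-check one entry of dW1 against a finite difference.
eps = 1e-6
W1p = W1.copy()
W1p[0, 0] += eps
loss_p = 0.5 * np.sum((W2 @ np.maximum(W1p @ x, 0.0) - y) ** 2)
print("analytic:", dW1[0, 0], "numeric:", (loss_p - loss) / eps)

# Scalar picture of the vanishing-gradient point: a plain chain multiplies the
# per-layer derivative (0.25 here) 20 times; a residual block y = x + f(x) has
# local derivative 1 + f'(x), so the identity term keeps the product from
# collapsing toward zero.
plain    = 0.25 ** 20          # ~9e-13: gradient has effectively vanished
residual = 1.25 ** 20          # stays well away from zero
print("plain chain:", plain, "residual chain:", residual)
```

Note how the backward pass consumes nothing but the cached forward values (x, z1, a1) and the upstream gradient, which is exactly the "cached forward-pass activations" point in the tl;dr.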
follow-up
- How does batch normalization interact with backpropagation, and why does it help with training stability?
- What's the difference between gradient checkpointing and standard backprop, and when would you use it?
- How would you debug a network where training loss isn't decreasing at all from epoch 1?