mlprep
ML Coding · medium · 20 min

Implement k-nearest neighbors classification from scratch in NumPy. Focus on making the distance computation vectorized and efficient — no for loops over data points. Then discuss: what's the computational bottleneck and how would you address it at scale?

formulate your answer, then —
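One shape the first answer might take, a minimal NumPy sketch (knn_predict is an illustrative name, not a reference solution; note the vote is still a Python loop):

import numpy as np

def knn_predict(X_train, y_train, X_test, k=5):
    # ||a - b||^2 = ||a||^2 + ||b||^2 - 2 a.b, computed for all pairs
    # at once via a single matrix multiply -- no loop over data points.
    train_sq = np.sum(X_train ** 2, axis=1)        # (n_train,)
    test_sq = np.sum(X_test ** 2, axis=1)          # (n_test,)
    cross = X_test @ X_train.T                     # (n_test, n_train)
    dists = test_sq[:, None] + train_sq[None, :] - 2 * cross

    # argpartition is O(n) per row; we only need the k smallest, unordered.
    nearest = np.argpartition(dists, k, axis=1)[:, :k]   # (n_test, k)

    # Majority vote -- still a Python loop over test points.
    preds = np.empty(len(X_test), dtype=y_train.dtype)
    for i, idx in enumerate(nearest):
        labels, counts = np.unique(y_train[idx], return_counts=True)
        preds[i] = labels[np.argmax(counts)]
    return preds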

Good. Now: your argpartition finds the k nearest neighbors, but then you do a Python loop for majority voting. Can you vectorize the majority vote too?

formulate your answer, then —
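One way to vectorize the vote, assuming integer class labels in [0, n_classes): one-hot encode the neighbor labels, then sum over the k axis so that votes[i, c] counts the neighbors of test point i with class c.

import numpy as np

def majority_vote(neighbor_labels, n_classes):
    # neighbor_labels: (n_test, k) integer class ids.
    one_hot = np.eye(n_classes, dtype=np.int64)[neighbor_labels]  # (n_test, k, n_classes)
    votes = one_hot.sum(axis=1)                                   # (n_test, n_classes)
    return votes.argmax(axis=1)

Plugged into the earlier sketch, the loop becomes preds = majority_vote(y_train[nearest], n_classes); ties break toward the lowest class id.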

tldr

Vectorized KNN uses the identity ||a-b||² = ||a||² + ||b||² - 2a·b to compute all pairwise distances via a single matrix multiplication. Use argpartition (O(n) per query) instead of a full sort (O(n log n)) to find the k nearest. At scale, brute-force KNN breaks down: every query costs O(nd) distance computations against the full training set, so switch to approximate nearest neighbor indexes (FAISS, ScaNN) for sub-linear query time. Vectorize the majority vote with one-hot encoding and a sum over the k axis.
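For the scale discussion, a sketch of the approximate route using FAISS, assuming the faiss-cpu package is installed (dimensions, cell count, and nprobe here are illustrative, not tuned):

import numpy as np
import faiss  # pip install faiss-cpu

d = 128
xb = np.random.rand(1_000_000, d).astype('float32')  # training vectors
xq = np.random.rand(10, d).astype('float32')         # query vectors

# IVF index: cluster the data, then search only the nprobe nearest
# clusters, trading a little recall for far fewer distance computations.
quantizer = faiss.IndexFlatL2(d)
index = faiss.IndexIVFFlat(quantizer, d, 1024)  # 1024 Voronoi cells
index.train(xb)
index.add(xb)
index.nprobe = 16
D, I = index.search(xq, 5)  # distances and ids of the 5 nearest per query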

follow-up

  • How would you adapt this implementation to handle weighted features — where some dimensions matter more than others?
  • Explain the curse of dimensionality for KNN — why does KNN break down in high dimensions, and at what dimension count does this become a problem in practice?
  • How would you implement an online KNN that can add new training points without recomputing all distances?