What are embeddings and how are they learned? Why do similar things end up close together in embedding space?
You mentioned the skip-gram objective — what is the model actually optimizing, and what's the practical challenge with the vocabulary-sized softmax?
You've deployed an embedding model to production powering a recommendation system. What breaks over time, and how do you manage the full lifecycle of embeddings in production?
tldr
Embeddings map discrete objects to dense vectors by training on a distributional objective: similar objects appear in similar contexts, so they end up with similar representations. Word2vec's skip-gram predicts the surrounding context words from a center word; the embedding matrix is a side effect of solving that prediction task. A full-vocabulary softmax is too slow at scale, so negative sampling replaces it with binary classification of true (center, context) pairs against a few random noise words (sketch below).

In production: embeddings drift as the data distribution shifts, so monitor pairwise similarity on a fixed probe set and ANN retrieval quality. Retraining invalidates all existing embeddings (old and new vectors are not comparable), so the catalog must be re-embedded and the index swapped atomically, with a rollback window. HNSW indexes carry memory and rebuild costs that dominate at 100M+ items. Collapse (all embeddings nearly identical) is detected by tracking probe-pair cosine similarity and fixed with hard negative mining and temperature tuning.
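A minimal sketch of the skip-gram negative-sampling objective in PyTorch (the class name, dimensions, and batch shapes here are illustrative assumptions, not from the source). Each step scores one observed (center, context) pair against K words drawn from a noise distribution as a binary classification, so the per-step cost is O(K·d) rather than the O(|V|·d) of a full softmax:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SkipGramNS(nn.Module):
    """Skip-gram with negative sampling: score real (center, context) pairs
    against random noise words instead of normalizing over the whole vocabulary."""

    def __init__(self, vocab_size: int, dim: int = 128):
        super().__init__()
        self.in_embed = nn.Embedding(vocab_size, dim)   # center-word vectors (the embeddings you keep)
        self.out_embed = nn.Embedding(vocab_size, dim)  # context-word vectors

    def forward(self, center, context, negatives):
        # center: (B,) ids, context: (B,) ids, negatives: (B, K) ids from the noise distribution
        v = self.in_embed(center)                                   # (B, d)
        u_pos = self.out_embed(context)                             # (B, d)
        u_neg = self.out_embed(negatives)                           # (B, K, d)

        pos_score = (v * u_pos).sum(dim=-1)                         # (B,)
        neg_score = torch.bmm(u_neg, v.unsqueeze(-1)).squeeze(-1)   # (B, K)

        # maximize log sigma(v·u_pos) + sum_k log sigma(-v·u_neg_k)
        loss = -(F.logsigmoid(pos_score) + F.logsigmoid(-neg_score).sum(dim=-1))
        return loss.mean()
```

In word2vec the noise words are drawn from the unigram distribution raised to the 3/4 power, which upweights rare words relative to raw frequency.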
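A drift/collapse probe, assuming embeddings arrive as a NumPy matrix of shape (n, d); the pair count and alert threshold are illustrative choices, not values from the source. A healthy space keeps the mean cosine over random probe pairs well below 1.0; a steady climb toward 1.0 is the collapse signature:

```python
import numpy as np

def mean_probe_cosine(embeddings: np.ndarray, n_pairs: int = 10_000, seed: int = 0) -> float:
    """Mean cosine similarity over random probe pairs; track this per training
    run and per refresh to catch collapse or drift early."""
    rng = np.random.default_rng(seed)
    normed = embeddings / (np.linalg.norm(embeddings, axis=1, keepdims=True) + 1e-12)
    i = rng.integers(0, len(normed), n_pairs)
    j = rng.integers(0, len(normed), n_pairs)
    return float((normed[i] * normed[j]).sum(axis=1).mean())

# Illustrative alert rule: flag the run if the probe statistic jumps well above
# its historical baseline (the 0.9 threshold is an assumption, tune on your data).
if mean_probe_cosine(np.random.randn(5_000, 128)) > 0.9:
    raise RuntimeError("embedding space may be collapsing")
```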
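A sketch of the atomic index swap with a rollback window, using hnswlib as one concrete HNSW implementation; the wrapper class, overlap check, and all parameters (M, ef_construction, min_overlap) are assumptions for illustration. Because retraining changes the whole space, the old and new indexes are each queried with probe items embedded by their own model, and only the returned IDs are compared:

```python
import hnswlib
import numpy as np

class ServingIndex:
    """Blue/green ANN serving: queries always hit `live`; a rebuilt index replaces
    it only after a retrieval-overlap check, and the old index is kept for rollback."""

    def __init__(self, dim: int):
        self.dim = dim
        self.live = None       # currently serving HNSW index
        self.previous = None   # rollback target from the last swap

    def _build(self, vectors: np.ndarray, ids: np.ndarray) -> hnswlib.Index:
        index = hnswlib.Index(space="cosine", dim=self.dim)
        index.init_index(max_elements=len(vectors), ef_construction=200, M=16)
        index.add_items(vectors, ids)
        index.set_ef(64)
        return index

    def swap(self, vectors, ids, probe_old, probe_new, k=10, min_overlap=0.6):
        """probe_old / probe_new: the same probe items embedded by the old and new
        models respectively; vectors from different trainings are not comparable."""
        candidate = self._build(vectors, ids)
        if self.live is not None:
            old_ids, _ = self.live.knn_query(probe_old, k=k)
            new_ids, _ = candidate.knn_query(probe_new, k=k)
            overlap = np.mean([len(set(a) & set(b)) / k
                               for a, b in zip(old_ids, new_ids)])
            # Some churn after retraining is expected; near-zero overlap usually
            # means a training or id-mapping bug, so refuse to promote the index.
            if overlap < min_overlap:
                raise RuntimeError(f"retrieval overlap {overlap:.2f} < {min_overlap}")
        self.previous, self.live = self.live, candidate   # single-reference swap
        return candidate

    def rollback(self):
        # Restore the last serving index within the rollback window.
        if self.previous is None:
            raise RuntimeError("no previous index to roll back to")
        self.live, self.previous = self.previous, self.live
```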
follow-up
- How does the transformer's embedding layer differ from word2vec, and why do contextual embeddings (BERT) outperform static ones?
- How would you design an embedding system for a cold-start problem, where new items have no interaction history?
- What's the difference between collaborative filtering embeddings and content-based embeddings, and when would you combine them?
- How do you detect and measure embedding collapse during training, and what does the uniformity-alignment framework tell you about embedding quality?