When do you use a cross-encoder reranker instead of a bi-encoder?

Question

Accepted Answer

Compare bi-encoders and cross-encoders for search or recommendation. Why not use the more accurate cross-encoder everywhere? Think about: independent embeddings, query-item interactions, precomputation, latency, candidate generation, and multi-stage ranking. **Bi-encoder** A bi-encoder encodes query and item independently: Document embeddings can be precomputed and stored in an ANN index. This makes bi-encoders suitable for retrieval over millions or billions of items. The limitation: query and