Your training data comes from user clicks on ranked results. What's wrong with training directly on this data, and how do you correct for position bias?
formulate your answer, then —
tldr
Position bias: users click highly ranked items more often regardless of relevance, so raw clicks conflate relevance with position. Training directly on clicks teaches the model to predict where items were shown, not how good they are, and redeploying that model creates a self-reinforcing feedback loop. Corrections: Inverse Propensity Scoring (IPS), which re-weights each click by the inverse of its estimated examination probability at that position; the position-as-feature trick, which includes position as an input during training and fixes it to a constant at serving time; or randomized exploration to collect unbiased data directly. IPS is the principled estimator; position-as-feature is cheap and widely used in practice.
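The IPS correction in the tl;dr can be sketched in a few lines. This is a minimal illustration, not production code: the per-rank propensities below are made-up numbers (in practice you estimate them, e.g. via result randomization), and `ips_weights` is a hypothetical helper name.

```python
import numpy as np

# Hypothetical examination propensities per rank (rank 0 is the top slot).
# In practice these are estimated, e.g. from a randomization experiment.
propensities = np.array([1.0, 0.6, 0.4, 0.25, 0.15])

def ips_weights(positions, clicks, clip=10.0):
    """Inverse-propensity weights for logged impressions.

    A click at rank k is up-weighted by 1/p_k, so clicks earned despite
    low exposure count more. Clipping bounds the variance that large
    1/p_k weights would otherwise introduce.
    """
    w = clicks / propensities[positions]
    return np.minimum(w, clip)

positions = np.array([0, 4, 1])   # ranks at which items were shown
clicks = np.array([1, 1, 0])      # click indicators

print(ips_weights(positions, clicks))  # rank-4 click gets weight 1/0.15 ≈ 6.67
```

These weights are then passed as per-example weights to any standard pointwise or pairwise ranking loss; the clipped estimator trades a little bias for much lower variance.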
follow-up
- What is the position-as-feature trick in detail, and what assumptions does it make about how position affects clicks?
- How would you estimate click propensity without running a full randomization experiment?
- Beyond position, what other types of exposure bias exist in recommendation systems?
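As a starting point for the first follow-up, the position-as-feature trick can be sketched on synthetic data. Everything here is assumed for illustration: clicks are simulated with a 1/(1+rank) examination model, the classifier is a hand-rolled logistic regression, and `score` is a hypothetical serving-time helper that fixes the position feature to a constant.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy logged data: one relevance feature x, the rank each item was shown
# at, and a click label biased toward high positions (examination falls
# off as 1/(1+rank) in this simulation).
n = 5000
x = rng.normal(size=n)
pos = rng.integers(0, 5, size=n)
examined = rng.random(n) < 1.0 / (1.0 + pos)
clicked_if_seen = rng.random(n) < 1.0 / (1.0 + np.exp(-x))
click = (examined & clicked_if_seen).astype(float)

# Train a logistic regression on [x, position, bias] with plain
# gradient descent, letting the position coefficient absorb the bias.
X = np.column_stack([x, pos, np.ones(n)])
w = np.zeros(3)
for _ in range(500):
    p = 1.0 / (1.0 + np.exp(-X @ w))
    w -= 0.1 * (X.T @ (p - click)) / n

def score(x_new, serve_pos=0.0):
    """Serving-time score: hold position fixed (here rank 0) so that
    candidates are compared on relevance alone."""
    return x_new * w[0] + serve_pos * w[1] + w[2]
```

The trained `w[1]` comes out negative (later ranks get fewer clicks), and zeroing the position input at serving removes that learned effect from the ranking score. The key assumption this trick makes is additivity: position shifts click log-odds by the same amount for every item, with no position-relevance interaction.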