Towards Disentangling Relevance and Bias in Unbiased Learning to Rank

Yunan Zhang

Le Yan

Zhen Qin

Honglei Zhuang

Jiaming Shen

Xuanhui Wang

Mike Bendersky

Marc Najork

29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) (2023)

Abstract

Unbiased learning to rank (ULTR) studies the problem of mitigating various biases in implicit user feedback data such as clicks, and has received considerable attention recently. A popular ULTR approach for real-world applications uses a two-tower architecture, in which click modeling is factorized into a relevance tower with regular input features and a bias tower with bias-relevant inputs such as the position of a document. A successful factorization allows the relevance tower to be free of biases. In this work, we identify a critical issue that existing ULTR methods ignore: the bias tower can be confounded with the relevance tower via the underlying true relevance. In particular, the positions were determined by the logging policy, i.e., the previous production model, which itself possesses relevance information. We give both theoretical analysis and empirical results to show the negative effects of this correlation on the relevance tower. We then propose two methods to mitigate the negative confounding effects by better disentangling relevance and bias. Offline empirical results on both controlled public datasets and a large-scale industry dataset show the effectiveness of the proposed approaches. We conduct a live experiment on a popular web store for four weeks and find a significant improvement in user clicks over the baseline, which ignores the negative confounding effect.
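
To make the two-tower factorization concrete, here is a minimal sketch assuming an additive model in which the click logit is the sum of a relevance-tower logit and a bias-tower logit. This is an illustration under stated assumptions, not the paper's implementation: the linear relevance tower, the per-position bias table, and every name and number below (`two_tower_click_prob`, `w_rel`, `pos_bias`, the random features) are hypothetical. The sketch also shows how a logging policy that ranks by relevance makes position carry relevance information, which is the confounding the abstract describes.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def two_tower_click_prob(features, positions, w_rel, pos_bias):
    """Additive two-tower click model (illustrative assumption)."""
    # Relevance tower: sees only the regular input features.
    relevance_logit = features @ w_rel
    # Bias tower: sees only the bias-relevant input, here the position.
    bias_logit = pos_bias[positions]
    return sigmoid(relevance_logit + bias_logit)


rng = np.random.default_rng(0)
features = rng.normal(size=(5, 3))                  # 5 documents, 3 hypothetical features
w_rel = np.array([0.5, -0.2, 0.1])                  # hypothetical relevance-tower weights
pos_bias = np.array([0.0, -0.5, -1.0, -1.5, -2.0])  # hypothetical per-position bias logits

# Hypothetical logging policy: rank documents by their relevance score,
# so the most relevant document is shown at position 0.
true_relevance = features @ w_rel
order = np.argsort(-true_relevance)   # order[i] = index of the document shown at rank i
positions = np.empty(5, dtype=int)
positions[order] = np.arange(5)       # positions[d] = rank at which document d was shown

click_prob = two_tower_click_prob(features, positions, w_rel, pos_bias)

# Position and relevance are correlated through the logging policy.
position_relevance_corr = np.corrcoef(positions, true_relevance)[0, 1]
```

In this toy setup `position_relevance_corr` is negative because better documents sit at lower position indices, so a bias tower fed only the position can absorb part of the relevance signal. In the two-tower approach the relevance tower alone is the part intended to be free of bias, which is why this entanglement matters.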

Research Areas

Information Retrieval and the Web
