针对协变量转移的高效Bradley-Terry模型推断方法
Efficient Inference for Covariate-adjusted Bradley-Terry Model with Covariate Shift
摘要 Abstract
我们提出了一种通用框架,用于在成对比较中对玩家整体实力进行统计推断,允许协变量分布存在潜在变化。这些协变量捕获可能影响每个玩家获胜概率的重要上下文信息。我们通过目标分布的Kullback-Leibler投影来衡量玩家在目标分布下的整体实力,投影到一类调整协变量的Bradley-Terry模型类中。因此,我们的估计量在不施加严格模型假设的情况下仍然定义明确。我们开发了半参数有效的估计量及其对应的推断程序,允许灵活估计非关键函数。当条件Bradley-Terry假设成立时,我们还提出了不需要观察所有成对比较的额外估计量。我们在模拟研究中展示了所提出方法的表现,并将其应用于评估大型语言模型在现实应用中与人类偏好的一致性。
We propose a general framework for statistical inference on the overall strengths of players in pairwise comparisons, allowing for potential shifts in the covariate distribution. These covariates capture important contextual information that may impact the winning probability of each player. We measure the overall strengths of players under a target distribution through its Kullback-Leibler projection onto a class of covariate-adjusted Bradley-Terry model. Consequently, our estimands remain well-defined without requiring stringent model assumptions. We develop semiparametric efficient estimators and corresponding inferential procedures that allow for flexible estimation of the nuisance functions. When the conditional Bradley-Terry assumption holds, we propose additional estimators that do not require observing all pairwise comparisons. We demonstrate the performance of our proposed method in simulation studies and apply it to assess the alignment of large language models with human preferences in real-world applications.