摘要 Abstract
实践中模型常常出现误设,因此模型批评成为贝叶斯分析的关键部分。不仅需要判断模型是否错误,还需要明确哪些方面存在错误,并以计算方便且统计严谨的方式进行检测。本文提出了一种基于如下事实的新模型批评方法:如果参数从先验分布中抽取,数据集按照假设的似然函数生成,则后验样本将遵循先验分布。因此,可以通过检验后验样本是否可能由先验分布生成来检测假设的似然函数或先验分布的偏差。在此基础上,我们建议将似然函数和先验分布的所有随机元素重新参数化为独立的均匀随机变量(u值)。这使得可以聚合数据点和参数的任意子集的u值,利用经典的依赖性或非均匀性假设检验方法来测试模型偏差。我们通过多个示例实证展示了这种均匀参数化检验(UPCs)方法在模型批评中的有效性,并发展了相关的理论结果。
Models are often misspecified in practice, making model criticism a key part of Bayesian analysis. It is important to detect not only when a model is wrong, but which aspects are wrong, and to do so in a computationally convenient and statistically rigorous way. We introduce a novel method for model criticism based on the fact that if the parameters are drawn from the prior, and the dataset is generated according to the assumed likelihood, then a sample from the posterior will be distributed according to the prior. Thus, departures from the assumed likelihood or prior can be detected by testing whether a posterior sample could plausibly have been generated by the prior. Building upon this idea, we propose to reparametrize all random elements of the likelihood and prior in terms of independent uniform random variables, or u-values. This makes it possible to aggregate across arbitrary subsets of the u-values for data points and parameters to test for model departures using classical hypothesis tests for dependence or non-uniformity. We demonstrate empirically how this method of uniform parametrization checks (UPCs) facilitates model criticism in several examples, and we develop supporting theoretical results.