Hilbert空间上线性Gaussian反问题的最优低秩近似,第二部分:后验均值近似
Optimal low-rank approximations for linear Gaussian inverse problems on Hilbert spaces, Part II: posterior mean approximation
摘要 Abstract
在这项工作中,我们构建了线性Gaussian反问题中高斯后验分布的最优低秩近似。参数空间为可能具有无限维的可分Hilbert空间,数据空间假设为有限维。我们考虑了后验分布的各种近似族。首先,我们考虑近似后验,其中均值在数据的结构保持或忽略的低秩变换类中变化,而后验协方差保持固定。我们给出了这些近似后验与精确后验同时对所有可能的数据实现都等价的必要且充分条件。对于这些近似,我们使用Kullback-Leibler散度、Rényi散度($\alpha \in (0,1)$)以及Amari $\alpha$-散度和Hellinger距离来衡量平均意义上的近似误差,并对数据分布进行平均。在这些损失函数下,我们找到了最优的近似并提出了其唯一性的等价条件,扩展了Spantini等人在有限维情况下(SIAM J. Sci. Comput. 2015)的工作。然后,我们通过后验协方差也变化到第一部分中考虑的低秩更新形式,研究了均值和协方差的同时近似。对于反向Kullback-Leibler散度,我们证明了分别优化的均值和协方差的近似可以组合成均值和协方差的联合最优近似。此外,我们将均值为最优结构忽略形式的联合近似解释为参数空间中的一个最优投影器。
In this work, we construct optimal low-rank approximations for the Gaussian posterior distribution in linear Gaussian inverse problems. The parameter space is a separable Hilbert space of possibly infinite dimension, and the data space is assumed to be finite-dimensional. We consider various types of approximation families for the posterior. We first consider approximate posteriors in which the means vary among a class of either structure-preserving or structure-ignoring low-rank transformations of the data, and in which the posterior covariance is kept fixed. We give necessary and sufficient conditions for these approximating posteriors to be equivalent to the exact posterior, for all possible realisations of the data simultaneously. For such approximations, we measure approximation error with the Kullback-Leibler, R\'enyi and Amari $\alpha$-divergences for $\alpha\in(0,1)$, and with the Hellinger distance, all averaged over the data distribution. With these losses, we find the optimal approximations and formulate an equivalent condition for their uniqueness, extending the work in finite dimensions of Spantini et al. (SIAM J. Sci. Comput. 2015). We then consider joint approximation of the mean and covariance, by also varying the posterior covariance over the low-rank updates considered in Part I of this work. For the reverse Kullback-Leibler divergence, we show that the separate optimal approximations of the mean and of the covariance can be combined to yield an optimal joint approximation of the mean and covariance. In addition, we interpret the joint approximation with the optimal structure-ignoring approximate mean in terms of an optimal projector in parameter space.