Bayesian Covariate-Dependent Graph Learning with a Dual Group Spike-and-Slab Prior
Abstract
Covariate-dependent graph learning has attracted increasing interest in the graphical modeling literature as a tool for analyzing heterogeneous data. The task, however, poses challenges for modeling, computational efficiency, and interpretability. The parameter of interest can be naturally represented as a three-dimensional array whose elements can be grouped along two directions, corresponding to the node level and the covariate level, respectively. In this article, we propose a novel dual group spike-and-slab prior that enables multi-level selection, inducing sparsity at the covariate level, the node level, and the individual (local) level. We introduce a nested strategy with specific choices to address the distinct challenges posed by the different grouping directions. For posterior inference, we develop a tuning-free Gibbs sampler for all parameters, which mitigates the parameter-tuning difficulties often encountered in high-dimensional graphical models and facilitates routine implementation. Through simulation studies, we demonstrate that the proposed model outperforms existing methods in the accuracy of graph recovery. We show the practical utility of our model through an application to microbiome data, where we seek to better understand the interactions among microbes and how these interactions are affected by relevant covariates.
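To make the three-dimensional structure concrete, the following is a minimal illustrative sketch, not the prior or sampler proposed in this article. It assumes a hypothetical coefficient array `B` of shape `(p, p, q)` for `p` nodes and `q` covariates, and hypothetical binary inclusion indicators at the covariate, node, and local levels. The indexing of the node-level indicators by (node, covariate) pairs, the inclusion probability of 0.5, and the standard normal slab are all assumptions made only for illustration. The sketch shows how nesting the indicators lets an element be nonzero only when every level that contains it is selected.

```python
import numpy as np

rng = np.random.default_rng(0)
p, q = 5, 3  # hypothetical sizes: p nodes, q covariates

# Hypothetical inclusion indicators at three levels (0.5 is an arbitrary choice).
gamma_cov = rng.random(q) < 0.5            # covariate level: one per covariate
gamma_node = rng.random((p, q)) < 0.5      # node level: one per (node, covariate), an assumption
gamma_local = rng.random((p, p, q)) < 0.5  # local level: one per array element

# Nested selection: an element is included only if its covariate group,
# its node group, and the element itself are all selected.
include = gamma_cov[None, None, :] & gamma_node[:, None, :] & gamma_local

# Spike-and-slab style draw: exact zero (spike) or a normal value (slab).
slab = rng.normal(size=(p, p, q))
B = np.where(include, slab, 0.0)

# Remove self-edges for every covariate.
idx = np.arange(p)
B[idx, idx, :] = 0.0
```

Under these assumptions, the slice `B[:, :, l]` is entirely zero whenever `gamma_cov[l]` is false, and the row `B[j, :, l]` is entirely zero whenever `gamma_node[j, l]` is false, which is the sense in which sparsity operates along the two grouping directions.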