arXiv

GraphGrad: Efficient Estimation of Sparse Polynomial Representations for General State-Space Models

Abstract

State-space models (SSMs) are a powerful statistical tool for modelling time-varying systems via a latent state. In these models, the latent state is never directly observed. Instead, a sequence of observations related to the state is available. The state-space model is defined by the state dynamics and the observation model, both of which are described by parametric distributions. Estimating the parameters of these distributions is a challenging but essential task for performing inference and prediction. Furthermore, it is typical that not all states of the system interact. We can therefore encode the interaction of the states via a graph, which is usually not fully connected. However, most parameter estimation methods do not take advantage of this feature. In this work, we propose GraphGrad, a fully automatic approach for obtaining sparse estimates of the state interactions of a non-linear state-space model via a polynomial approximation. This novel methodology unveils the latent structure of the data-generating process, allowing us to infer both the structure and the values of a rich and efficient parameterisation of a general state-space model. Our method utilises a differentiable particle filter to optimise a Monte Carlo likelihood estimator. It also promotes sparsity in the estimated system through the use of suitable proximal updates, which are known to be more efficient and stable than subgradient methods. As shown in our paper, a number of well-known dynamical systems can be accurately represented and recovered by our method, providing a basis for application to real-world scenarios.
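To make the contrast between proximal (proximity) updates and subgradient updates concrete, the following is a minimal sketch and not the GraphGrad implementation. It assumes an L1 penalty on the polynomial coefficients, which the abstract does not specify, and it replaces the Monte Carlo likelihood estimate from the differentiable particle filter with a toy quadratic loss. All names (`soft_threshold`, `proximal_step`, `subgradient_step`, `target`) and the step size and penalty weight are hypothetical.

```python
def soft_threshold(x, tau):
    """Proximal operator of tau * |x| for a scalar x (assumed L1 penalty)."""
    if x > tau:
        return x - tau
    if x < -tau:
        return x + tau
    return 0.0


def proximal_step(coeffs, grads, step, lam):
    """Gradient step on the smooth loss, then soft-thresholding at step * lam."""
    return [soft_threshold(c - step * g, step * lam)
            for c, g in zip(coeffs, grads)]


def subgradient_step(coeffs, grads, step, lam):
    """Plain step along the gradient plus a subgradient of lam * |c|."""
    def sign(c):
        return (c > 0) - (c < 0)
    return [c - step * (g + lam * sign(c)) for c, g in zip(coeffs, grads)]


# Toy stand-in for the negative log-likelihood: 0.5 * sum((c - target)^2),
# whose gradient with respect to each coefficient is simply c - target.
target = [2.0, 0.05, -1.0, 0.0]
step, lam = 0.1, 0.2

prox = [0.5] * len(target)
sub = [0.5] * len(target)
for _ in range(200):
    prox = proximal_step(prox, [c - t for c, t in zip(prox, target)], step, lam)
    sub = subgradient_step(sub, [c - t for c, t in zip(sub, target)], step, lam)

print("proximal:   ", prox)
print("subgradient:", sub)
```

In this toy problem the proximal iterates set the second and fourth coefficients to exactly zero, which is what allows a sparse interaction graph to be read off the estimate. The subgradient iterates generally hover near zero for those coefficients without settling on it, so a separate thresholding rule would be needed to decide which interactions are absent.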