耦合语义-几何模型与语义安全意识的在线混合信念POMDP

Research

arXiv

Online Hybrid-Belief POMDP with Coupled Semantic-Geometric Models and Semantic Safety Awareness

摘要 Abstract

在复杂且未知环境中工作的机器人通常需要对环境进行几何-语义表征，以便安全地完成任务。在推断环境的同时，它们必须考虑多种可能的情景以规划未来的动作。由于物体类别为离散变量，而机器人的自身姿态及物体的姿态为连续变量，因此可以通过混合离散-连续信念来表征环境，并根据模型和输入数据对其进行更新。环境的先验概率和观测模型可以利用深度学习算法从数据中学习得到。这些模型通常耦合了环境的语义和几何属性，导致语义变量相互关联，使得语义状态空间维度呈指数级增长。本文研究了在部分可观察马尔可夫决策过程（POMDP）下基于混合语义-几何信念的不确定性规划问题。所考虑的模型和先验考虑了语义变量与几何变量之间的耦合关系。在POMDP框架内，我们引入了语义感知的安全性概念。获取用于估计值函数的理论混合信念代表性样本是非常具有挑战性的。作为关键贡献，我们提出了一种新的混合信念形式并利用其抽取代表性样本。我们证明，在某些条件下，可以通过对所有可能的语义映射进行显式期望，高效计算值函数和安全性概率。我们的仿真结果表明，与在完整语义状态空间上对理论混合信念采样后进行穷尽估计的评估器相比，我们所提出的客观函数和安全性概率的估计达到了相似的准确性水平。然而，我们的估计器的复杂度是多项式而非指数级。

Robots operating in complex and unknown environments frequently require geometric-semantic representations of the environment to safely perform their tasks. While inferring the environment, they must account for many possible scenarios when planning future actions. Since objects' class types are discrete and the robot's self-pose and the objects' poses are continuous, the environment can be represented by a hybrid discrete-continuous belief which is updated according to models and incoming data. Prior probabilities and observation models representing the environment can be learned from data using deep learning algorithms. Such models often couple environmental semantic and geometric properties. As a result, semantic variables are interconnected, causing semantic state space dimensionality to increase exponentially. In this paper, we consider planning under uncertainty using partially observable Markov decision processes (POMDPs) with hybrid semantic-geometric beliefs. The models and priors consider the coupling between semantic and geometric variables. Within POMDP, we introduce the concept of semantically aware safety. Obtaining representative samples of the theoretical hybrid belief, required for estimating the value function, is very challenging. As a key contribution, we develop a novel form of the hybrid belief and leverage it to sample representative samples. We show that under certain conditions, the value function and probability of safety can be calculated efficiently with an explicit expectation over all possible semantic mappings. Our simulations show that our estimates of the objective function and probability of safety achieve similar levels of accuracy compared to estimators that run exhaustively on the entire semantic state-space using samples from the theoretical hybrid belief. Nevertheless, the complexity of our estimators is polynomial rather than exponential.