基于无数据动作屏蔽的物理感知组合装配序列规划

Physics-Aware Combinatorial Assembly Sequence Planning using Data-free Action Masking

摘要 Abstract

组合装配利用标准化的单元基元构建满足用户规格的对象。本文研究了物理组合装配的装配序列规划(ASP)。给定目标对象的形状,目标是找到放置单元基元的一系列操作以构建目标对象。特别地,我们旨在确保计划的装配序列在物理上可执行。然而,由于组合装配的组合性质,ASP 尤其具有挑战性。为了解决这一挑战,我们采用深度强化学习来学习一种构造策略,用于按顺序放置单元基元以构建所需对象。具体而言,我们设计了一种在线的物理感知动作屏蔽,以过滤掉无效操作,这有效地指导了策略学习并确保了无违规部署。最后,我们将所提出的方法应用于超过250个3D结构的乐高积木装配任务。实验结果表明,所提出的方法能够规划出物理有效的装配序列以构建所有结构,成功率达到100%,而最佳的可比基线方法有超过40个结构失败。我们的实现代码可在 \url{https://github.com/intelligent-control-lab/PhysicsAwareCombinatorialASP} 获取。

Combinatorial assembly uses standardized unit primitives to build objects that satisfy user specifications. This paper studies assembly sequence planning (ASP) for physical combinatorial assembly. Given the shape of the desired object, the goal is to find a sequence of actions for placing unit primitives to build the target object. In particular, we aim to ensure the planned assembly sequence is physically executable. However, ASP for combinatorial assembly is particularly challenging due to its combinatorial nature. To address the challenge, we employ deep reinforcement learning to learn a construction policy for placing unit primitives sequentially to build the desired object. Specifically, we design an online physics-aware action mask that filters out invalid actions, which effectively guides policy learning and ensures violation-free deployment. In the end, we apply the proposed method to Lego assembly with more than 250 3D structures. The experiment results demonstrate that the proposed method plans physically valid assembly sequences to build all structures, achieving a $100\%$ success rate, whereas the best comparable baseline fails more than $40$ structures. Our implementation is available at \url{https://github.com/intelligent-control-lab/PhysicsAwareCombinatorialASP}.