DexForce: Extracting Force-informed Actions from Kinesthetic Demonstrations for Dexterous Manipulation

Abstract

Imitation learning requires high-quality demonstrations consisting of sequences of state-action pairs. For contact-rich dexterous manipulation tasks, the actions in these state-action pairs must produce the right forces. Widely used methods for collecting dexterous manipulation demonstrations are difficult to apply to contact-rich tasks because human-to-robot motion retargeting is unintuitive and the demonstrator receives no direct haptic feedback. Motivated by these concerns, we propose DexForce. DexForce leverages contact forces, measured during kinesthetic demonstrations, to compute force-informed actions for policy learning. We collect demonstrations for six tasks and show that policies trained on our force-informed actions achieve an average success rate of 76% across all tasks. In contrast, policies trained directly on actions that do not account for contact forces have near-zero success rates. We also conduct a study ablating the inclusion of force data in policy observations. We find that while using force data never hurts policy performance, it helps most for tasks that require high levels of precision and coordination, such as opening an AirPods case and unscrewing a nut.
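To make the idea of a force-informed action concrete, the following is a minimal illustrative sketch, not the method as specified in the paper. It assumes a fingertip driven by a spring-like impedance controller with a scalar stiffness, so that commanding a target offset from the recorded position by force/stiffness would reproduce the demonstrated force. The function name `force_informed_targets`, its parameters, and the sign convention (forces are those the fingertip applies to the object) are all hypothetical.

```python
import numpy as np


def force_informed_targets(positions, applied_forces, stiffness):
    """Offset recorded fingertip positions so a spring-like controller
    would reproduce the demonstrated forces (illustrative assumption).

    positions:      (T, 3) fingertip positions recorded during the demo, in m.
    applied_forces: (T, 3) forces the fingertip applied to the object, in N.
    stiffness:      scalar stiffness of the assumed impedance controller, in N/m.
    """
    positions = np.asarray(positions, dtype=float)
    applied_forces = np.asarray(applied_forces, dtype=float)
    if positions.shape != applied_forces.shape:
        raise ValueError("positions and applied_forces must have the same shape")
    if stiffness <= 0:
        raise ValueError("stiffness must be positive")
    # Under f = k * (target - position), the target that yields force f
    # at the recorded position is position + f / k.
    return positions + applied_forces / stiffness


# Example: two timesteps, contact only at the second one.
demo_positions = [[0.0, 0.0, 0.0], [0.1, 0.0, 0.0]]
demo_forces = [[0.0, 0.0, 0.0], [2.0, 0.0, 0.0]]
targets = force_informed_targets(demo_positions, demo_forces, stiffness=100.0)
```

Under these assumptions, a timestep with zero measured force leaves the target equal to the recorded position, whereas a timestep with contact pushes the target past the recorded position in the direction of the applied force. Replaying only the recorded positions would command no contact force under this model.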
