基于可见性图的模型预测控制在人形机器人路径规划与对抗对手追踪中的应用

Research

arXiv

Model Predictive Control with Visibility Graphs for Humanoid Path Planning and Tracking Against Adversarial Opponents

Ruochen Hou ,

Gabriel I. Fernandez ,

Mingzhang Zhu ,

Dennis W. Hong

论文信息在线阅读PDF

摘要 Abstract

本文详细介绍了我们在2024年RoboCup成人组自主人形足球联赛中获胜所采用的避障、路径规划和轨迹跟踪方法。我们的团队在所有坐席比赛中保持不败，并在6场比赛中打入45球，决赛以6比1获胜。在竞赛期间，双足运动带来的测量噪声以及有限视野（FOV）是碰撞规避的主要挑战。此外，障碍物会间歇性地出现在我们计划的轨迹中。有时，我们的估计器会将机器人置于硬约束内。因此，任何参赛者都需要具备足够的计算效率，以便实时重新规划并作出反应。这促使我们开发了轨迹生成和跟踪的方法。在许多情况下，需要长期和短期规划。为了高效找到一条避开所有障碍物的总体路径，我们开发了DAVG（动态增强可见性图）。DAVG通过基于障碍物和期望目标姿态激活某些区域来进行重点路径规划。通过在图中增加状态，考虑转弯角度，这对大型足球机器人至关重要，因为转弯可能代价更高。轨迹由DAVG生成的离散点线性插值得到。然后使用修改后的模型预测控制（MPC），称为cf-MPC（无碰撞MPC），来跟踪该轨迹，以确保短期规划。cf-MPC无需切换公式即可同时考虑机器人动力学和无碰撞约束。在噪声导致机器人处于约束边界内时，由于没有硬切换，控制输入可以平滑过渡。非线性公式大约以120 Hz运行，而二次版本则达到约400 Hz。

In this paper we detail the methods used for obstacle avoidance, path planning, and trajectory tracking that helped us win the adult-sized, autonomous humanoid soccer league in RoboCup 2024. Our team was undefeated for all seated matches and scored 45 goals over 6 games, winning the championship game 6 to 1. During the competition, a major challenge for collision avoidance was the measurement noise coming from bipedal locomotion and a limited field of view (FOV). Furthermore, obstacles would sporadically jump in and out of our planned trajectory. At times our estimator would place our robot inside a hard constraint. Any planner in this competition must also be be computationally efficient enough to re-plan and react in real time. This motivated our approach to trajectory generation and tracking. In many scenarios long-term and short-term planning is needed. To efficiently find a long-term general path that avoids all obstacles we developed DAVG (Dynamic Augmented Visibility Graphs). DAVG focuses on essential path planning by setting certain regions to be active based on obstacles and the desired goal pose. By augmenting the states in the graph, turning angles are considered, which is crucial for a large soccer playing robot as turning may be more costly. A trajectory is formed by linearly interpolating between discrete points generated by DAVG. A modified version of model predictive control (MPC) is used to then track this trajectory called cf-MPC (Collision-Free MPC). This ensures short-term planning. Without having to switch formulations cf-MPC takes into account the robot dynamics and collision free constraints. Without a hard switch the control input can smoothly transition in cases where the noise places our robot inside a constraint boundary. The nonlinear formulation runs at approximately 120 Hz, while the quadratic version achieves around 400 Hz.