解锁HyDRa:混合融合、深度一致性与雷达的统一3D感知

Unleashing HyDRa: Hybrid Fusion, Depth Consistency and Radar for Unified 3D Perception

摘要 Abstract

近年来,以低成本视觉为中心的自动驾驶3D感知系统取得了显著进展,缩小了与昂贵的基于LiDAR方法之间的差距。然而,成为完全可靠的替代方案的主要挑战在于具备鲁棒的深度预测能力,因为基于摄像头的系统在长检测范围以及不良光照和天气条件下表现不佳。在这项工作中,我们引入了HyDRa,这是一种用于多样化3D感知任务的新颖摄像头-雷达融合架构。HyDRa基于密集BEV(鸟瞰图)架构的原则,提出了一种混合融合方法,结合互补摄像头和雷达特征在两种不同表征空间中的优势。我们的Height Association Transformer模块利用透视视图中的雷达特征,生成更鲁棒和准确的深度预测。在BEV中,通过Radar加权的深度一致性对初始稀疏表示进行优化。HyDRa在公开的nuScenes数据集上实现了新的最先进性能,其64.2 NDS(+1.8)和58.4 AMOTA(+1.5)的融合结果达到了前所未有的高度。此外,我们的新语义丰富且空间精确的BEV特征可以直接转换为强大的占用表示,在Occ3D基准测试中超越所有先前基于摄像头的方法,提高了3.7 mIoU。代码和模型可在https://github.com/phi-wol/hydra获取。

Low-cost, vision-centric 3D perception systems for autonomous driving have made significant progress in recent years, narrowing the gap to expensive LiDAR-based methods. The primary challenge in becoming a fully reliable alternative lies in robust depth prediction capabilities, as camera-based systems struggle with long detection ranges and adverse lighting and weather conditions. In this work, we introduce HyDRa, a novel camera-radar fusion architecture for diverse 3D perception tasks. Building upon the principles of dense BEV (Bird's Eye View)-based architectures, HyDRa introduces a hybrid fusion approach to combine the strengths of complementary camera and radar features in two distinct representation spaces. Our Height Association Transformer module leverages radar features already in the perspective view to produce more robust and accurate depth predictions. In the BEV, we refine the initial sparse representation by a Radar-weighted Depth Consistency. HyDRa achieves a new state-of-the-art for camera-radar fusion of 64.2 NDS (+1.8) and 58.4 AMOTA (+1.5) on the public nuScenes dataset. Moreover, our new semantically rich and spatially accurate BEV features can be directly converted into a powerful occupancy representation, beating all previous camera-based methods on the Occ3D benchmark by an impressive 3.7 mIoU. Code and models are available at https://github.com/phi-wol/hydra.