MGSO: Monocular Real-time Photometric SLAM with Efficient 3D Gaussian Splatting

Abstract

Real-time SLAM with dense 3D mapping is computationally challenging, especially on resource-limited devices. The recent development of 3D Gaussian Splatting (3DGS) offers a promising approach to real-time dense 3D reconstruction. However, existing 3DGS-based SLAM systems struggle to balance hardware simplicity, speed, and map quality: most excel in one or two of these aspects but rarely in all three. A key issue is the difficulty of initializing 3D Gaussians while concurrently performing SLAM. To address these challenges, we present Monocular GSO (MGSO), a novel real-time SLAM system that integrates photometric SLAM with 3DGS. Photometric SLAM provides dense, structured point clouds for 3DGS initialization, which accelerates optimization and produces more efficient maps with fewer Gaussians. Experiments show that our system produces reconstructions with a balance of quality, memory efficiency, and speed that outperforms the state of the art. Furthermore, our system achieves all results using only RGB inputs. We evaluate our system on the Replica, TUM-RGBD, and EuRoC datasets against current real-time dense reconstruction systems. Beyond surpassing contemporary systems, our experiments show that performance is maintained on laptop hardware, making MGSO a practical solution for robotics, augmented reality (AR), and other real-time applications.