地球视觉统一Copernicus基础模型的研究

Research

arXiv

Towards a Unified Copernicus Foundation Model for Earth Vision

Yi Wang ,

Nikolaos Ioannis Bountos ,

摘要 Abstract

地球观测（EO）基础模型的进步解锁了大规模卫星数据从太空学习通用表征的潜力，从而造福于众多对地球至关重要的下游应用。然而，大多数现有工作仍局限于固定光谱传感器，仅关注地球表面，并忽略了图像之外的有价值元数据。在本文中，我们迈向下一代EO基础模型，提出了三个关键组成部分：1）Copernicus-Pretrain，一个大规模预训练数据集，整合了来自所有主要Copernicus哨兵任务的1870万对齐图像，覆盖了从地球表面到大气层的范围；2）Copernicus-FM，一种能够处理任何光谱或非光谱传感器模态的统一基础模型，采用扩展动态超网络和灵活的元数据编码方式；3）Copernicus-Bench，一个包含15个分层下游任务的系统评估基准，这些任务涵盖了从每颗哨兵任务的预处理到专门应用的各个阶段。我们的数据集、模型和基准极大地提升了EO基础模型的可扩展性、多功能性和多模态适应能力，同时为连接EO、天气和气候研究创造了新的机遇。代码、数据集和模型可在https://github.com/zhu-xlab/Copernicus-FM获取。

Advances in Earth observation (EO) foundation models have unlocked the potential of big satellite data to learn generic representations from space, benefiting a wide range of downstream applications crucial to our planet. However, most existing efforts remain limited to fixed spectral sensors, focus solely on the Earth's surface, and overlook valuable metadata beyond imagery. In this work, we take a step towards next-generation EO foundation models with three key components: 1) Copernicus-Pretrain, a massive-scale pretraining dataset that integrates 18.7M aligned images from all major Copernicus Sentinel missions, spanning from the Earth's surface to its atmosphere; 2) Copernicus-FM, a unified foundation model capable of processing any spectral or non-spectral sensor modality using extended dynamic hypernetworks and flexible metadata encoding; and 3) Copernicus-Bench, a systematic evaluation benchmark with 15 hierarchical downstream tasks ranging from preprocessing to specialized applications for each Sentinel mission. Our dataset, model, and benchmark greatly improve the scalability, versatility, and multimodal adaptability of EO foundation models, while also creating new opportunities to connect EO, weather, and climate research. Codes, datasets and models are available at https://github.com/zhu-xlab/Copernicus-FM.