针对极端热浪预测中准确度与可解释性权衡的机器学习模型层级研究

Research

arXiv

Tackling the Accuracy-Interpretability Trade-off in a Hierarchy of Machine Learning Models for the Prediction of Extreme Heatwaves

摘要 Abstract

在利用机器学习（ML）进行预测时，我们主要关注性能和可解释性，这自然形成了一个权衡，即复杂模型通常具有更高的技能，但更难解释和信任。在气候科学界，可解释性尤为重要，因为我们旨在深入了解潜在现象。当预测涉及对社会影响巨大的极端天气事件时，这一点显得尤为重要。本文通过法国地区极端热浪的概率预测，使用了越来越复杂的机器学习模型层级，以找到准确度与可解释性之间的最佳平衡点。具体而言，我们使用的模型从全局高斯近似（GA）到深度卷积神经网络（CNN），中间还包括简单的内在可解释神经网络（IINN）以及使用散射变换（ScatNet）的模型。我们的研究表明，CNN虽然提供了更高的准确性，但其黑箱特性严重限制了可解释性，即使使用最先进的可解释人工智能（XAI）工具亦是如此。相比之下，ScatNet在实现与CNN相似的性能的同时，提供了更大的透明度，能够识别驱动预测的关键尺度和数据模式。这项研究强调了可解释性在气候科学中的潜力，表明较简单的模型可以媲美更复杂的模型性能，同时更容易理解。这种获得的可解释性对于建立对模型预测的信任并揭示新的科学见解至关重要，最终有助于深化我们对极端天气事件的理解和管理。

When performing predictions that use Machine Learning (ML), we are mainly interested in performance and interpretability. This generates a natural trade-off, where complex models generally have higher skills but are harder to explain and thus trust. Interpretability is particularly important in the climate community, where we aim at gaining a physical understanding of the underlying phenomena. Even more so when the prediction concerns extreme weather events with high impact on society. In this paper, we perform probabilistic forecasts of extreme heatwaves over France, using a hierarchy of increasingly complex ML models, which allows us to find the best compromise between accuracy and interpretability. More precisely, we use models that range from a global Gaussian Approximation (GA) to deep Convolutional Neural Networks (CNNs), with the intermediate steps of a simple Intrinsically Interpretable Neural Network (IINN) and a model using the Scattering Transform (ScatNet). Our findings reveal that CNNs provide higher accuracy, but their black-box nature severely limits interpretability, even when using state-of-the-art Explainable Artificial Intelligence (XAI) tools. In contrast, ScatNet achieves similar performance to CNNs while providing greater transparency, identifying key scales and patterns in the data that drive predictions. This study underscores the potential of interpretability in ML models for climate science, demonstrating that simpler models can rival the performance of their more complex counterparts, all the while being much easier to understand. This gained interpretability is crucial for building trust in model predictions and uncovering new scientific insights, ultimately advancing our understanding and management of extreme weather events.