可学习的Cut流方法

Research

arXiv

可学习的Cut流方法

Learnable cut flow

摘要 Abstract

神经网络已成为高能物理任务中的强大工具，然而其不透明的训练过程使其成为“黑箱”。相比之下，传统的Cut流方法简单且具有可解释性，但需要人工努力来确定最优边界。为融合两种方法的优势，我们提出了可学习的Cut流（Learnable Cut Flow, LCF），这是一种神经网络，将传统的Cut选择转化为完全可微、数据驱动的过程。LCF实现了两种Cut策略——并行策略，其中可观测量分布被独立处理；顺序策略，其中先前的Cut影响后续的Cut——以灵活地确定最优边界。在此基础上，我们引入了可学习的重要性，一种量化特征重要性的度量，并相应调整其对损失的贡献，提供了不同于随意指标的模型驱动洞察。为了确保可微性，修改后的损失函数用掩码操作替代硬Cut，从而在整个训练过程中保持数据形状。LCF在六个不同的模拟数据集和一个真实的双玻色子与QCD数据集上进行了测试。结果表明，LCF（1）在并行和顺序策略下准确地学习了典型特征分布的Cut边界，（2）为区分性强且重叠少的特征赋予更高的重要性，（3）稳健地处理冗余或相关的特征，（4）在真实场景中表现有效。在双玻色子数据集中，当使用所有可观测量时，LCF初始性能低于Boosted Decision Trees和Multi-Layer Perceptron。然而，通过根据学习到的重要性修剪不太重要的特征，其性能提升至与这些基准相当甚至超越。LCF弥合了传统Cut流方法与现代黑盒神经网络之间的差距，为训练过程和特征重要性提供了可操作的洞见。

Neural networks have emerged as a powerful paradigm for tasks in high energy physics, yet their opaque training process renders them as a black box. In contrast, the traditional cut flow method offers simplicity and interpretability but demands human effort to identify optimal boundaries. To merge the strengths of both approaches, we propose the Learnable Cut Flow (LCF), a neural network that transforms the traditional cut selection into a fully differentiable, data-driven process. LCF implements two cut strategies-parallel, where observable distributions are treated independently, and sequential, where prior cuts shape subsequent ones-to flexibly determine optimal boundaries. Building on this, we introduce the Learnable Importance, a metric that quantifies feature importance and adjusts their contributions to the loss accordingly, offering model-driven insights unlike ad-hoc metrics. To ensure differentiability, a modified loss function replaces hard cuts with mask operations, preserving data shape throughout the training process. LCF is tested on six varied mock datasets and a realistic diboson vs. QCD dataset. Results demonstrate that LCF (1) accurately learns cut boundaries across typical feature distributions in both parallel and sequential strategies, (2) assigns higher importance to discriminative features with minimal overlap, (3) handles redundant or correlated features robustly, and (4) performs effectively in real-world scenarios. In diboson dataset, LCF initially underperforms boosted decision trees and multiplayer perceptrons when using all observables. However, pruning less critical features-guided by learned importance-boosts its performance to match or exceed these baselines. LCF bridges the gap between traditional cut flow method and modern black-box neural networks, delivering actionable insights into the training process and feature importance.