Optimal Invariant Bases for Atomistic Machine Learning

arXiv

Abstract

Representing atomic configurations for machine learning models has led to the development of numerous descriptors, most often of the local environment of each atom. However, many of these representations are incomplete, functionally dependent, or both. Incomplete descriptor sets cannot represent all meaningful changes in the atomic environment. Complete constructions of atomic environment descriptors, on the other hand, often suffer from a high degree of functional dependence, in which some descriptors can be written as functions of the others. These redundant descriptors add no power to discriminate between different atomic environments, yet they increase the computational burden. By applying techniques from the pattern recognition literature to existing atomistic representations, we remove descriptors that are functions of other descriptors and produce the smallest possible set that satisfies completeness. We apply this in two ways. First, we refine an existing description, the Atomistic Cluster Expansion, and show that this yields a more efficient subset of descriptors. Second, we augment an incomplete construction based on a scalar neural network, yielding a new message-passing network architecture that can recognize up to 5-body patterns in each neuron by taking advantage of an optimal set of Cartesian tensor invariants. This architecture shows strong accuracy on state-of-the-art benchmarks while retaining low computational cost. Our results not only yield improved models but also point the way to classes of invariant bases that minimize cost while maximizing expressivity for a host of applications.
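
To make the notion of functional dependence concrete, the following is a minimal sketch and not the procedure used in the paper. The abstract does not spell out the pattern-recognition technique, so the sketch substitutes a standard numerical test: a set of smooth descriptors is locally functionally independent at a generic configuration when the rows of its Jacobian are linearly independent. The toy descriptors `d0` to `d3`, the helper names, and the tolerances are all assumptions chosen for illustration; `d3` is deliberately built as a function of `d0` so that it should be flagged as redundant.

```python
import numpy as np


def descriptors(flat_positions):
    """Toy rotation-invariant descriptors of one atomic environment.

    flat_positions: flattened (n_neighbors * 3,) array of neighbor
    displacement vectors. These descriptors are illustrative only.
    """
    positions = np.asarray(flat_positions, dtype=float).reshape(-1, 3)
    gram = positions @ positions.T   # pairwise dot products r_i . r_j
    r2 = np.diag(gram)               # squared neighbor distances |r_i|^2
    d0 = r2.sum()                    # sum_i |r_i|^2
    d1 = (r2 ** 2).sum()             # sum_i |r_i|^4
    d2 = (gram ** 2).sum()           # sum_ij (r_i . r_j)^2
    d3 = d0 ** 2                     # redundant by construction: a function of d0
    return np.array([d0, d1, d2, d3])


def numerical_jacobian(func, x, eps=1e-6):
    """Central-difference Jacobian of func at x, shape (n_outputs, n_inputs)."""
    x = np.asarray(x, dtype=float)
    n_out = func(x).size
    jac = np.empty((n_out, x.size))
    for k in range(x.size):
        step = np.zeros_like(x)
        step[k] = eps
        jac[:, k] = (func(x + step) - func(x - step)) / (2.0 * eps)
    return jac


def numerical_rank(matrix, rel_tol=1e-6):
    """Count singular values above rel_tol times the largest one."""
    s = np.linalg.svd(matrix, compute_uv=False)
    return int((s > rel_tol * s.max()).sum())


def independent_subset(func, x, rel_tol=1e-6):
    """Greedily keep descriptors whose gradients raise the Jacobian rank."""
    jac = numerical_jacobian(func, x)
    kept = []
    for i in range(jac.shape[0]):
        trial = kept + [i]
        if numerical_rank(jac[trial], rel_tol) == len(trial):
            kept.append(i)
    return kept


# A random environment with four neighbors stands in for a generic configuration.
rng = np.random.default_rng(0)
environment = rng.normal(size=(4, 3)).ravel()
kept = independent_subset(descriptors, environment)
print("indices of descriptors kept:", kept)
```

The greedy pass keeps a descriptor only if its gradient is not a linear combination of the gradients already kept, which is why `d3` (whose gradient is a multiple of the gradient of `d0`) is discarded. A rank test at a single point only certifies local independence; in practice one would presumably repeat it over several sampled configurations, and it says nothing about completeness, which the paper treats separately.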