Filtering with Time-frequency Analysis: An Adaptive and Lightweight Model for Sequential Recommender Systems Based on Discrete Wavelet Transform
Abstract
Sequential Recommender Systems (SRS) model users' sequential behaviors to capture their interests, which usually evolve over time. Transformer-based SRS have recently achieved notable success. However, studies reveal that the self-attention mechanism in Transformer-based models is essentially a low-pass filter and ignores high-frequency information that may contain meaningful user interest patterns. This motivates us to seek better filtering techniques for SRS, and we find that the Discrete Wavelet Transform (DWT), a well-known time-frequency analysis technique from digital signal processing, can effectively process both low-frequency and high-frequency information. We design an adaptive time-frequency filter based on DWT, which decomposes user interests into multiple signals at different frequencies and times and automatically learns the weights of these signals. Furthermore, we develop DWTRec, a sequential recommendation model built entirely on the adaptive time-frequency filter. Thanks to the fast DWT, DWTRec has lower theoretical time and space complexity and is proficient at modeling long sequences. Experiments show that our model outperforms state-of-the-art baseline models on datasets with different domains, sparsity levels, and average sequence lengths. In particular, our model shows a large performance gain over previous models as sequences grow longer, which demonstrates a further advantage of our model.
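To make the idea of an adaptive time-frequency filter concrete, the following is a minimal NumPy sketch, not the DWTRec implementation. It assumes a Haar wavelet and one weight per sub-band; the abstract specifies neither the wavelet family nor how the weights are parameterized, and the function names (`haar_dwt`, `haar_idwt`, `adaptive_filter`) are hypothetical. In a trained model the weights would be learnable parameters rather than fixed numbers.

```python
import numpy as np


def haar_dwt(x):
    """One-level Haar DWT along axis 0 (the sequence axis); length must be even."""
    even, odd = x[0::2], x[1::2]
    approx = (even + odd) / np.sqrt(2.0)  # low-frequency (smooth) band
    detail = (even - odd) / np.sqrt(2.0)  # high-frequency (change) band
    return approx, detail


def haar_idwt(approx, detail):
    """Inverse of haar_dwt: interleave the reconstructed even and odd samples."""
    even = (approx + detail) / np.sqrt(2.0)
    odd = (approx - detail) / np.sqrt(2.0)
    out = np.empty((2 * approx.shape[0],) + approx.shape[1:], dtype=even.dtype)
    out[0::2], out[1::2] = even, odd
    return out


def adaptive_filter(x, weights):
    """Decompose x into sub-bands, rescale each band, and reconstruct.

    weights = [w_detail_1, ..., w_detail_J, w_approx] (assumed layout), where
    each entry is a scalar or an array broadcastable to its band. The sequence
    length must be divisible by 2**J.
    """
    levels = len(weights) - 1
    approx, details = x, []
    for _ in range(levels):
        approx, detail = haar_dwt(approx)
        details.append(detail)
    approx = approx * weights[-1]
    for level in reversed(range(levels)):
        approx = haar_idwt(approx, details[level] * weights[level])
    return approx


# Toy "user sequence": 8 time steps, 4 embedding dimensions.
rng = np.random.default_rng(0)
x = rng.normal(size=(8, 4))

identity = adaptive_filter(x, [1.0, 1.0, 1.0])  # all bands kept unchanged
low_pass = adaptive_filter(x, [0.0, 1.0])       # finest detail band removed
```

With every weight set to 1 the filter reproduces its input exactly, and zeroing the detail weight turns it into a low-pass filter that averages adjacent items. Intermediate weights let high-frequency and low-frequency components be emphasized to different degrees, which is the behavior the abstract attributes to learned weights.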