HybEA: 实体对齐的混合模型

Research

arXiv

HybEA: 实体对齐的混合模型

HybEA: Hybrid Models for Entity Alignment

Nikolaos Fanourakis ,

Fatia Lekbour ,

Guillaume Renton ,

Vasilis Efthymiou ,

Vassilis Christophides

论文信息在线阅读PDF

摘要 Abstract

实体对齐（EA）旨在检测不同知识图谱（KG）中描述相同现实世界实体的内容。一些嵌入方法被提出，用于根据两个KG在嵌入空间中的相似性来排名潜在匹配的实体。然而，现有EA嵌入方法面临来自真实世界KG的多种结构异构性（即邻域实体）和语义异构性（如实体名称和字面属性值）的挑战，尤其是在跨越多个领域（如DBpedia、Wikidata）时。现有方法要么根据上下文专注于两种异构性之一（单语言与多语言）。为了解决这一局限性，我们提出了一个灵活的框架HybEA，它结合了两种模型：一种新的基于注意力的事实模型，与最先进的结构模型共同训练。我们的实验结果表明，HybEA在Hits@1指标上比最先进的EA系统平均提高了16%，在5个单语言数据集上的提升范围从3.6%到40%，其中一些数据集现在可以被视为已解决。我们还表明，HybEA在3个多语言数据集以及2个放弃不现实但广泛采用的一对一假设的数据集上优于最先进的方法。总体而言，HybEA在所有（10个）评估的数据集和所有（3个）指标上都显著优于所有（11个）基线方法，且差异具有统计学意义。

Entity Alignment (EA) aims to detect descriptions of the same real-world entities among different Knowledge Graphs (KG). Several embedding methods have been proposed to rank potentially matching entities of two KGs according to their similarity in the embedding space. However, existing EA embedding methods are challenged by the diverse levels of structural (i.e., neighborhood entities) and semantic (e.g., entity names and literal property values) heterogeneity exhibited by real-world KGs, especially when they are spanning several domains (DBpedia, Wikidata). Existing methods either focus on one of the two heterogeneity kinds depending on the context (mono- vs multi-lingual). To address this limitation, we propose a flexible framework called HybEA, that is a hybrid of two models, a novel attention-based factual model, co-trained with a state-of-the-art structural model. Our experimental results demonstrate that HybEA outperforms the state-of-the-art EA systems, achieving a 16% average relative improvement of Hits@1, ranging from 3.6% up to 40% in 5 monolingual datasets, with some datasets that can now be considered as solved. We also show that HybEA outperforms state-of-the-art methods in 3 multi-lingual datasets, as well as on 2 datasets that drop the unrealistic, yet widely adopted, one-to-one assumption. Overall, HybEA outperforms all (11) baseline methods in all (3) measures and in all (10) datasets evaluated, with a statistically significant difference.