Subasa——针对低资源环境下的僧伽罗语冒犯性语言检测适配语言模型
Subasa -- Adapting Language Models for Low-resourced Offensive Language Detection in Sinhala
摘要 Abstract
准确检测冒犯性语言对于社交媒体安全相关的多种应用至关重要。在这一任务中,低资源语言与高资源语言之间存在显著的性能差异。本文探索了之前未曾在僧伽罗语中用于冒犯性语言检测下游任务的微调策略,并由此引入了四种模型:"Subasa-XLM-R",它通过采用掩码释义预测的中间预微调步骤来增强性能;"Subasa-Llama" 和 "Subasa-Mistral" 的两个变体分别是基于 Llama(3.2 版)和 Mistral(v0.3 版)的特定任务策略微调版本。我们使用 SOLD 数据集对这些模型进行了评估,结果显示所有模型均优于现有基线模型。在零样本设置下,Subasa-XLM-R 在相同的 SOLD 数据集上的 Macro F1 得分达到 0.84,超过了包括 GPT-4o 在内的最先进大型语言模型。相关模型和代码已公开发布。
Accurate detection of offensive language is essential for a number of applications related to social media safety. There is a sharp contrast in performance in this task between low and high-resource languages. In this paper, we adapt fine-tuning strategies that have not been previously explored for Sinhala in the downstream task of offensive language detection. Using this approach, we introduce four models: "Subasa-XLM-R", which incorporates an intermediate Pre-Finetuning step using Masked Rationale Prediction. Two variants of "Subasa-Llama" and "Subasa-Mistral", are fine-tuned versions of Llama (3.2) and Mistral (v0.3), respectively, with a task-specific strategy. We evaluate our models on the SOLD benchmark dataset for Sinhala offensive language detection. All our models outperform existing baselines. Subasa-XLM-R achieves the highest Macro F1 score (0.84) surpassing state-of-the-art large language models like GPT-4o when evaluated on the same SOLD benchmark dataset under zero-shot settings. The models and code are publicly available.