基于内容定义分块的文件备份服务分块攻击

Research

arXiv

基于内容定义分块的文件备份服务分块攻击

Chunking Attacks on File Backup Services using Content-Defined Chunking

摘要 Abstract

文件备份服务等系统常使用基于内容定义分块（CDC）算法，特别是滚动哈希技术，将文件分割为可实现数据去重的块。这些分块算法通常依赖于每个用户的参数，试图避免泄露存储数据的信息。我们提出了一种提取这些分块参数的攻击方法，并讨论了在参数被攻破时协议无关的安全损失（包括当这些参数未设置时的情况，这通常是可选的）。我们的参数提取攻击本身是协议特定的，但其思想可以推广到许多潜在的CDC方案。

Systems such as file backup services often use content-defined chunking (CDC) algorithms, especially those based on rolling hash techniques, to split files into chunks in a way that allows for data deduplication. These chunking algorithms often depend on per-user parameters in an attempt to avoid leaking information about the data being stored. We present attacks to extract these chunking parameters and discuss protocol-agnostic attacks and loss of security once the parameters are breached (including when these parameters are not setup at all, which is often available as an option). Our parameter-extraction attacks themselves are protocol-specific but their ideas are generalizable to many potential CDC schemes.