概率模型个性化联邦学习：基于PAC-Bayes的方法

Research

arXiv

Personalized Federated Learning of Probabilistic Models: A PAC-Bayesian Approach

Mahrokh Ghoddousi Boroujeni ,

Andreas Krause ,

Giancarlo Ferrari Trecate

论文信息在线阅读PDF

摘要 Abstract

联邦学习（FL）旨在从多个客户端存储的私有且分散的数据中推断共享模型。个性化联邦学习（PFL）通过调整全局模型以适应每个客户端，从而提升模型对各客户端的适配性。对于高度异构的客户端，需要较高的个性化水平，但在数据量较小的情况下实现这一点可能具有挑战性。为了解决这一问题，我们引入了PAC-PFL框架用于概率模型的个性化联邦学习。PAC-PFL推断一个共享的超后验，并将每个客户端的后验推理视为个性化步骤。与之前的PFL算法不同，PAC-PFL不会将所有个性化模型正则化为单一共享模型，从而极大地增强了其个性化灵活性。通过建立并最小化客户端平均真实损失的PAC-Bayes泛化界，PAC-PFL即使在数据匮乏的情境下也能有效缓解过拟合。此外，PAC-PFL还为后期加入的新客户端提供了泛化界。实验结果表明，PAC-PFL能够实现准确且校准良好的预测。

Federated Learning (FL) aims to infer a shared model from private and decentralized data stored by multiple clients. Personalized FL (PFL) enhances the model's fit for each client by adapting the global model to the clients. A significant level of personalization is required for highly heterogeneous clients but can be challenging to achieve, especially when clients' datasets are small. To address this issue, we introduce the PAC-PFL framework for PFL of probabilistic models. PAC-PFL infers a shared hyper-posterior and treats each client's posterior inference as the personalization step. Unlike previous PFL algorithms, PAC-PFL does not regularize all personalized models towards a single shared model, thereby greatly enhancing its personalization flexibility. By establishing and minimizing a PAC-Bayesian generalization bound on the average true loss of clients, PAC-PFL effectively mitigates overfitting even in data-poor scenarios. Additionally, PAC-PFL provides generalization bounds for new clients joining later. PAC-PFL achieves accurate and well-calibrated predictions, as supported by our experiments.