FEDDE: Federated Data Deduplication

Jinming Hu, Armaan Nanji, Zixiu Meng, Hao Wang, Jingxian Wang, Wentao Wu, Qizhen Zhang

February 2026

Abstract

This paper introduces FEDDE, a general and efficient framework that removes redundant data across clients to make federated learning (FL) more effective. At its core, FEDDE adopts a hierarchical deduplication architecture: each client first deduplicates its own data locally, in a centralized fashion, and then sends the server only the minimal records needed for redundancy detection, which the server uses for global deduplication. To enable flexible trade-offs between FL training efficiency and the accuracy of the training outcomes, FEDDE proposes two-round approximate deduplication protocols. A set of system optimizations further reduces deduplication overhead.
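The hierarchical flow described in the abstract can be illustrated with a minimal sketch. It is not FEDDE's actual protocol: it assumes exact-match deduplication on a SHA-256 fingerprint of normalized text, whereas the paper describes two-round approximate protocols that are not reproduced here. The names `fingerprint`, `local_dedup`, and `global_dedup`, and the "first client to report a fingerprint keeps it" rule, are hypothetical choices made only for illustration.

```python
import hashlib
from collections import defaultdict


def fingerprint(record: str) -> str:
    """Hypothetical fingerprint: SHA-256 of case- and whitespace-normalized text."""
    normalized = " ".join(record.lower().split())
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def local_dedup(records):
    """Client side: keep the first record seen for each fingerprint."""
    kept = {}
    for rec in records:
        kept.setdefault(fingerprint(rec), rec)
    return kept  # maps fingerprint -> record


def global_dedup(client_fingerprints):
    """Server side: sees fingerprints only, never raw records.

    Assumed tie-break: the first client (in sorted ID order) to report a
    fingerprint keeps the corresponding record; later clients drop it.
    """
    owner = {}
    for client_id in sorted(client_fingerprints):
        for fp in client_fingerprints[client_id]:
            owner.setdefault(fp, client_id)
    keep = defaultdict(set)
    for fp, client_id in owner.items():
        keep[client_id].add(fp)
    return {client_id: keep[client_id] for client_id in client_fingerprints}


# Toy data: client "a" has a local duplicate, and "foo" appears on both clients.
clients = {
    "a": ["Hello world", "hello   WORLD", "foo"],
    "b": ["foo", "bar"],
}

# Step 1: each client deduplicates locally.
local = {cid: local_dedup(recs) for cid, recs in clients.items()}

# Step 2: clients send only fingerprints; the server decides what each keeps.
keep = global_dedup({cid: set(kept) for cid, kept in local.items()})

# Step 3: each client filters its local data by the server's decision.
final = {
    cid: [rec for fp, rec in local[cid].items() if fp in keep[cid]]
    for cid in clients
}
print(final)
```

In this sketch the server receives one hash per locally unique record, which stands in for the "minimal records" the abstract mentions. An approximate scheme would replace `fingerprint` with a similarity-preserving signature and trade some accuracy for lower cost.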

Type

Preprint

