FEDDE: Federated Data Deduplication

摘要

This paper introduces FEDDE, a general and efficient framework that addresses data redundancy across clients to facilitate effective federated learning (FL). At its core, FEDDE adopts a hierarchical deduplication architecture where clients first perform local, centralized deduplication and then send minimal records that are only meaningful for redundancy detection to the server for global deduplication. To enable flexible trade-offs between FL training efficiency and the accuracy of the training outcomes, FEDDE proposes two-round approximate deduplication protocols. A set of system optimizations is further applied to reduce deduplication overhead.

类型
Avatar
Jinming Hu
创始人兼首席科学家

我的研究兴趣包括机器学习、数据挖掘、深度学习、计算机视觉、操作系统和数据库。

相关