论文标题
Privaterec:差异化私人培训和为联邦新闻推荐服务
PrivateRec: Differentially Private Training and Serving for Federated News Recommendation
论文作者
论文摘要
对敏感个人数据的收集和培训在个性化建议系统中引起了严重的隐私问题,并且联合学习可以通过培训模型而不是分散的用户数据来减轻问题。但是,在联合建议的培训和服务阶段中,一种理论上是私人解决方案,在联合建议的培训阶段和服务阶段都是必不可少的。 between privacy and utility due to the high-dimensional characteristics of model gradients and hidden representations.In this work, we propose a federated news recommendation method for achieving a better utility in model training and online serving under a DP guarantee.We first clarify the DP definition over behavior data for each round in the life-circle of federated recommendation systems.Next, we propose a privacy-preserving online serving mechanism under this definition based on the idea of decomposing用户嵌入具有公共基本向量的用户嵌入,并扰动较低维的组合系数。我们采用随机行为填充机制来降低所需的噪声强度以提高效用。此外,我们设计了一种联合推荐模型培训方法,该方法可以在为培训参与者提供DP时生成有效的公共基本向量。我们避免通过标签置换和差异私人注意模块的大型模型依赖尺寸依赖性噪声。现实世界新闻推荐数据集的实验验证了我们的方法在培训和服务联盟新闻建议的培训和服务中都可以在DP保证下实现卓越的实用性。
Collecting and training over sensitive personal data raise severe privacy concerns in personalized recommendation systems, and federated learning can potentially alleviate the problem by training models over decentralized user data.However, a theoretically private solution in both the training and serving stages of federated recommendation is essential but still lacking.Furthermore, naively applying differential privacy (DP) to the two stages in federated recommendation would fail to achieve a satisfactory trade-off between privacy and utility due to the high-dimensional characteristics of model gradients and hidden representations.In this work, we propose a federated news recommendation method for achieving a better utility in model training and online serving under a DP guarantee.We first clarify the DP definition over behavior data for each round in the life-circle of federated recommendation systems.Next, we propose a privacy-preserving online serving mechanism under this definition based on the idea of decomposing user embeddings with public basic vectors and perturbing the lower-dimensional combination coefficients. We apply a random behavior padding mechanism to reduce the required noise intensity for better utility. Besides, we design a federated recommendation model training method, which can generate effective and public basic vectors for serving while providing DP for training participants. We avoid the dimension-dependent noise for large models via label permutation and differentially private attention modules. Experiments on real-world news recommendation datasets validate that our method achieves superior utility under a DP guarantee in both training and serving of federated news recommendations.