论文标题

基于深厚的增强学习的互动搜索

Interactive Search Based on Deep Reinforcement Learning

论文作者

Yu, Yang, Gu, Zhenhao, Tao, Rong, Ge, Jingtian, Chang, Kenglun

论文摘要

随着机器学习技术的持续开发,主要的电子商务平台已经启动了基于IT的推荐系统,以更有效地为有不同需求的大量客户服务。与传统的监督学习相比,强化学习可以更好地捕获用户在决策过程中的状态过渡,并考虑一系列用户操作,而不仅仅是某个时刻用户的静态特征。从理论上讲,它将具有长期的观点,从而产生更有效的建议。加强学习的特殊要求使其需要依靠离线虚拟系统进行培训。我们的项目主要建立一个虚拟用户环境,用于离线培训。同时,我们试图改善基于双聚类的增强学习算法,以扩大建议代理的动作空间和建议的路径空间。

With the continuous development of machine learning technology, major e-commerce platforms have launched recommendation systems based on it to serve a large number of customers with different needs more efficiently. Compared with traditional supervised learning, reinforcement learning can better capture the user's state transition in the decision-making process, and consider a series of user actions, not just the static characteristics of the user at a certain moment. In theory, it will have a long-term perspective, producing a more effective recommendation. The special requirements of reinforcement learning for data make it need to rely on an offline virtual system for training. Our project mainly establishes a virtual user environment for offline training. At the same time, we tried to improve a reinforcement learning algorithm based on bi-clustering to expand the action space and recommended path space of the recommendation agent.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源