基于深厚的增强学习的互动搜索

论文标题

基于深厚的增强学习的互动搜索

Interactive Search Based on Deep Reinforcement Learning

论文作者

Yu, Yang, Gu, Zhenhao, Tao, Rong, Ge, Jingtian, Chang, Kenglun

论文摘要

随着机器学习技术的持续开发，主要的电子商务平台已经启动了基于IT的推荐系统，以更有效地为有不同需求的大量客户服务。与传统的监督学习相比，强化学习可以更好地捕获用户在决策过程中的状态过渡，并考虑一系列用户操作，而不仅仅是某个时刻用户的静态特征。从理论上讲，它将具有长期的观点，从而产生更有效的建议。加强学习的特殊要求使其需要依靠离线虚拟系统进行培训。我们的项目主要建立一个虚拟用户环境，用于离线培训。同时，我们试图改善基于双聚类的增强学习算法，以扩大建议代理的动作空间和建议的路径空间。

With the continuous development of machine learning technology, major e-commerce platforms have launched recommendation systems based on it to serve a large number of customers with different needs more efficiently. Compared with traditional supervised learning, reinforcement learning can better capture the user's state transition in the decision-making process, and consider a series of user actions, not just the static characteristics of the user at a certain moment. In theory, it will have a long-term perspective, producing a more effective recommendation. The special requirements of reinforcement learning for data make it need to rely on an offline virtual system for training. Our project mainly establishes a virtual user environment for offline training. At the same time, we tried to improve a reinforcement learning algorithm based on bi-clustering to expand the action space and recommended path space of the recommendation agent.

下载PDF全文

下载文献需遵守相关版权规定

论文标题