论文标题
基于深厚的增强学习的互动搜索
Interactive Search Based on Deep Reinforcement Learning
论文作者
论文摘要
随着机器学习技术的持续开发,主要的电子商务平台已经启动了基于IT的推荐系统,以更有效地为有不同需求的大量客户服务。与传统的监督学习相比,强化学习可以更好地捕获用户在决策过程中的状态过渡,并考虑一系列用户操作,而不仅仅是某个时刻用户的静态特征。从理论上讲,它将具有长期的观点,从而产生更有效的建议。加强学习的特殊要求使其需要依靠离线虚拟系统进行培训。我们的项目主要建立一个虚拟用户环境,用于离线培训。同时,我们试图改善基于双聚类的增强学习算法,以扩大建议代理的动作空间和建议的路径空间。
With the continuous development of machine learning technology, major e-commerce platforms have launched recommendation systems based on it to serve a large number of customers with different needs more efficiently. Compared with traditional supervised learning, reinforcement learning can better capture the user's state transition in the decision-making process, and consider a series of user actions, not just the static characteristics of the user at a certain moment. In theory, it will have a long-term perspective, producing a more effective recommendation. The special requirements of reinforcement learning for data make it need to rely on an offline virtual system for training. Our project mainly establishes a virtual user environment for offline training. At the same time, we tried to improve a reinforcement learning algorithm based on bi-clustering to expand the action space and recommended path space of the recommendation agent.