论文标题
AutoFas:预先级别系统的自动功能和体系结构选择
AutoFAS: Automatic Feature and Architecture Selection for Pre-Ranking System
论文作者
论文摘要
工业搜索和推荐系统主要遵循经典的多阶段信息检索范式:匹配,预先排名,排名和重新排列阶段。为了说明系统效率,基于简单的基于矢量产物的模型通常在前级阶段部署。最近的作品考虑将大型排名模型的高知识提炼成小小的预级模型,以提高有效性。但是,在前级系统中仍存在两个主要挑战:(i)在没有明确建模性能增益与计算成本的情况下,预定级阶段的预定延迟约束不可避免地会导致次优的解决方案; (ii)将排名教师的知识转移到具有预定的手工体系结构的预先排名的学生中,仍然遭受模型性能的丧失。在这项工作中,提出了一种新型的框架自动汇款,该框架共同优化了预级模型的效率和有效性:(i)首次使用神经体系结构搜索(NAS)技术同时选择最有价值的功能和网络架构; (ii)在NAS过程中配备了排名模型指导奖励,AutoFas可以为给定排名教师选择最佳的预级架构,而无需任何计算开销。现实世界搜索系统中的实验结果表明,AutoFas始终以较低的计算成本优于先前的最新方法(SOTA)方法。值得注意的是,我们的模型已在Meituan搜索系统的前级模块中采用,从而带来了重大改进。
Industrial search and recommendation systems mostly follow the classic multi-stage information retrieval paradigm: matching, pre-ranking, ranking, and re-ranking stages. To account for system efficiency, simple vector-product based models are commonly deployed in the pre-ranking stage. Recent works consider distilling the high knowledge of large ranking models to small pre-ranking models for better effectiveness. However, two major challenges in pre-ranking system still exist: (i) without explicitly modeling the performance gain versus computation cost, the predefined latency constraint in the pre-ranking stage inevitably leads to suboptimal solutions; (ii) transferring the ranking teacher's knowledge to a pre-ranking student with a predetermined handcrafted architecture still suffers from the loss of model performance. In this work, a novel framework AutoFAS is proposed which jointly optimizes the efficiency and effectiveness of the pre-ranking model: (i) AutoFAS for the first time simultaneously selects the most valuable features and network architectures using Neural Architecture Search (NAS) technique; (ii) equipped with ranking model guided reward during NAS procedure, AutoFAS can select the best pre-ranking architecture for a given ranking teacher without any computation overhead. Experimental results in our real world search system show AutoFAS consistently outperforms the previous state-of-the-art (SOTA) approaches at a lower computing cost. Notably, our model has been adopted in the pre-ranking module in the search system of Meituan, bringing significant improvements.