Paper Title
MineRL Diamond 2021 Competition: Overview, Results, and Lessons Learned
Paper Authors
Paper Abstract
Reinforcement learning competitions advance the field by providing appropriate scope and support to develop solutions toward a specific problem. To promote the development of more broadly applicable methods, organizers need to enforce the use of general techniques, the use of sample-efficient methods, and the reproducibility of the results. While beneficial for the research community, these restrictions come at the cost of increased difficulty. If the barrier to entry is too high, many potential participants are demoralized. With this in mind, we hosted the third edition of the MineRL ObtainDiamond competition, MineRL Diamond 2021, with a separate track in which we permitted any solution, to promote the participation of newcomers. With this track and more extensive tutorials and support, we saw an increased number of submissions. The participants of this easier track were able to obtain a diamond, and the participants of the harder track advanced generalizable solutions to the same task.