论文标题
无监督的机器学习中的下一件大事:婴儿学习的五个教训
The Next Big Thing(s) in Unsupervised Machine Learning: Five Lessons from Infant Learning
论文作者
论文摘要
在受到监督深度学习的普及程度激增之后,渴望减少对精选,标记的数据集的依赖,并利用最近引发的大量未标记数据引发了对无监督学习算法的重新兴趣。尽管由于方法的识别,对比度的学习和聚类优化,但由于方法的识别而显着提高了性能,但无监督的机器学习的性能仍然没有其假设的潜力。机器学习以前已经从神经科学和认知科学中汲取了灵感,并取得了巨大的成功。但是,这主要是基于具有访问标签和大量先验知识的成年学习者。为了推动无监督的机器学习前进,我们认为婴儿认知的发展科学可能是解锁下一代无监督学习方法的关键。从概念上讲,人类婴儿的学习是最接近人造无监督学习的生物学,因为婴儿也必须从未标记的数据中学习有用的表示。与机器学习相反,这些新表示是从相对较少的示例中迅速学习的。此外,婴儿学习强大的表示,可以在许多不同的任务和上下文中灵活有效地使用。我们确定了五个至关重要的因素,可以使婴儿的学习质量和学习速度评估在机器学习中已经利用这些因素的程度,并提出这些因素如何进一步采用这些因素如何导致以前看不见的绩效水平。
After a surge in popularity of supervised Deep Learning, the desire to reduce the dependence on curated, labelled data sets and to leverage the vast quantities of unlabelled data available recently triggered renewed interest in unsupervised learning algorithms. Despite a significantly improved performance due to approaches such as the identification of disentangled latent representations, contrastive learning, and clustering optimisations, the performance of unsupervised machine learning still falls short of its hypothesised potential. Machine learning has previously taken inspiration from neuroscience and cognitive science with great success. However, this has mostly been based on adult learners with access to labels and a vast amount of prior knowledge. In order to push unsupervised machine learning forward, we argue that developmental science of infant cognition might hold the key to unlocking the next generation of unsupervised learning approaches. Conceptually, human infant learning is the closest biological parallel to artificial unsupervised learning, as infants too must learn useful representations from unlabelled data. In contrast to machine learning, these new representations are learned rapidly and from relatively few examples. Moreover, infants learn robust representations that can be used flexibly and efficiently in a number of different tasks and contexts. We identify five crucial factors enabling infants' quality and speed of learning, assess the extent to which these have already been exploited in machine learning, and propose how further adoption of these factors can give rise to previously unseen performance levels in unsupervised learning.