论文标题

脱机手写Amharic角色识别使用很少的学习

Offline Handwritten Amharic Character Recognition Using Few-shot Learning

论文作者

Samuel, Mesay, Schmidt-Thieme, Lars, Sharma, DP, Sinamo, Abiot, Bruck, Abey

论文摘要

很少有射击学习是一个重要但具有挑战性的机器学习问题,旨在仅从更少的标记培训示例中学习。由于深度学习需要大量的标记数据集,因此它已成为一个积极的研究领域,在现实世界中这是不可行的。从一些例子中学习也是像人类一样学习的重要尝试。在机器学习应用的不同领域,尤其是在图像分类中,很少有学习的学习已证明是一个很好的希望。由于它是最近的技术,因此大多数研究人员只专注于Mini-ImageNet和Omniglot,专注于理解和解决与其概念相关的问题。很少有学习的学习也为解决Amharic等低资源语言提供了机会。在这项研究中,解决了几乎没有射击学习的离线手写Amharic角色识别。特别是,原型网络是流行的,更简单的少量学习,以基准实现。提出了提出一种增强训练情节的新型方法,利用具有行列和列的相似性的Amharic Alphabet性质所探索的机会。实验结果表明,该方法的表现优于基线方法。这项研究首次对阿姆哈拉语角色实施了很少的学习。更重要的是,该研究的发现开辟了新的方法来研究训练情节在几次学习中的影响,这是需要探索的重要问题之一。用于本研究的数据集使用本研究的一部分开发的Android应用程序从本地的Amharic语言作者那里收集。

Few-shot learning is an important, but challenging problem of machine learning aimed at learning from only fewer labeled training examples. It has become an active area of research due to deep learning requiring huge amounts of labeled dataset, which is not feasible in the real world. Learning from a few examples is also an important attempt towards learning like humans. Few-shot learning has proven a very good promise in different areas of machine learning applications, particularly in image classification. As it is a recent technique, most researchers focus on understanding and solving the issues related to its concept by focusing only on common image datasets like Mini-ImageNet and Omniglot. Few-shot learning also opens an opportunity to address low resource languages like Amharic. In this study, offline handwritten Amharic character recognition using few-shot learning is addressed. Particularly, prototypical networks, the popular and simpler type of few-shot learning, is implemented as a baseline. Using the opportunities explored in the nature of Amharic alphabet having row-wise and column-wise similarities, a novel way of augmenting the training episodes is proposed. The experimental results show that the proposed method outperformed the baseline method. This study has implemented few-shot learning for Amharic characters for the first time. More importantly, the findings of the study open new ways of examining the influence of training episodes in few-shot learning, which is one of the important issues that needs exploration. The datasets used for this study are collected from native Amharic language writers using an Android App developed as a part of this study.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源