论文标题
Mangoleafbd:一个综合图像数据集,用于对患病和健康的芒果叶分类
MangoLeafBD: A Comprehensive Image Dataset to Classify Diseased and Healthy Mango Leaves
论文作者
论文摘要
农业是剩下的少数几个尚未得到机器学习社区的关注的部门之一。数据集在机器学习学科中的重要性不能过分强调。缺乏与农业有关的标准和公开可用的数据集,阻碍了该学科的从业者,以利用这些强大的计算预测工具和技术的全部好处。为了改善这种情况,我们据我们所知,是有史以来的第一个标准,现成的,即芒果叶子的公开数据集。这些图像是从孟加拉国的四个芒果果园中收集的,孟加拉国是世界上顶级芒果发展的国家之一。该数据集包含4000张大约1800个不同叶子的图像,涵盖了7种疾病。尽管数据集仅使用孟加拉国的芒果叶子开发,但由于我们处理许多国家常见的疾病,因此该数据集也可能适用于其他国家 /地区的芒果疾病,从而提高芒果产量。预计该数据集将引起自动化农业领域的机器学习研究人员和从业人员的广泛关注。
Agriculture is of one of the few remaining sectors that is yet to receive proper attention from the machine learning community. The importance of datasets in the machine learning discipline cannot be overemphasized. The lack of standard and publicly available datasets related to agriculture impedes practitioners of this discipline to harness the full benefit of these powerful computational predictive tools and techniques. To improve this scenario, we develop, to the best of our knowledge, the first-ever standard, ready-to-use, and publicly available dataset of mango leaves. The images are collected from four mango orchards of Bangladesh, one of the top mango-growing countries of the world. The dataset contains 4000 images of about 1800 distinct leaves covering seven diseases. Although the dataset is developed using mango leaves of Bangladesh only, since we deal with diseases that are common across many countries, this dataset is likely to be applicable to identify mango diseases in other countries as well, thereby boosting mango yield. This dataset is expected to draw wide attention from machine learning researchers and practitioners in the field of automated agriculture.