PP-OR：实用的超轻量OCR系统

论文标题

PP-OR：实用的超轻量OCR系统

PP-OCR: A Practical Ultra Lightweight OCR System

论文作者

Du, Yuning, Li, Chenxia, Guo, Ruoyu, Yin, Xiaoting, Liu, Weiwei, Zhou, Jun, Bai, Yifan, Yu, Zilin, Yang, Yehua, Dang, Qingqing, Wang, Haoshuang

论文摘要

光学特征识别（OCR）系统已在各种应用程序场景中广泛使用，例如Office Automation（OA）系统，工厂自动化，在线教育，地图制作等。但是，由于文本外观和计算效率的需求，OCR仍然是一项艰巨的任务。在本文中，我们提出了一个实用的超轻型OCR系统，即PP-OCR。识别6622个汉字和280万次识别63个字母数字符号的总体模型大小仅为350万。我们引入了一袋策略，以提高模型能力或降低模型大小。还提供了具有实际数据的相应消融实验。同时，释放了几种用于中文和英语识别的预训练模型，包括文本检测器（使用了97K图像），方向分类器（使用600K图像）以及文本识别器（使用了1790万张图像）。此外，在其他几项语言识别任务中还验证了拟议的PP-OR，包括法语，韩语，日语和德语。上述所有上述模型都是开源的，并且代码可在GitHub存储库中可用，即https：//github.com/paddlepaddle/paddleocr。

The Optical Character Recognition (OCR) systems have been widely used in various of application scenarios, such as office automation (OA) systems, factory automations, online educations, map productions etc. However, OCR is still a challenging task due to the various of text appearances and the demand of computational efficiency. In this paper, we propose a practical ultra lightweight OCR system, i.e., PP-OCR. The overall model size of the PP-OCR is only 3.5M for recognizing 6622 Chinese characters and 2.8M for recognizing 63 alphanumeric symbols, respectively. We introduce a bag of strategies to either enhance the model ability or reduce the model size. The corresponding ablation experiments with the real data are also provided. Meanwhile, several pre-trained models for the Chinese and English recognition are released, including a text detector (97K images are used), a direction classifier (600K images are used) as well as a text recognizer (17.9M images are used). Besides, the proposed PP-OCR are also verified in several other language recognition tasks, including French, Korean, Japanese and German. All of the above mentioned models are open-sourced and the codes are available in the GitHub repository, i.e., https://github.com/PaddlePaddle/PaddleOCR.

下载PDF全文

下载文献需遵守相关版权规定

论文标题