当前位置: X-MOL 学术 › Journal of Computational Methods in Sciences and Engineering › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
A fast restoring method for arbitrarily warped images of Chinese document
Journal of Computational Methods in Sciences and Engineering Pub Date : 2015-06-03 , DOI: 10.3233/jcm-150529
Fanfeng Zeng , Zhengdong Guo , Jingzhong Wang , Lei He

With the rapid development of information technology, converting paper books into electronic documents will be more widely used. The document images of thick book captured by imaging device have a certain degree of distortion, which has some damage on OCR Recognition effect. To solve this problem, this paper proposes a fast restoring method for Chinese document images with arbitrarily warping. Firstly, the Chinese characters are extracted step by step using the Characters and Text lines Locate Alternately (CTLA), and the text lines are identified based on the nearest aggregation method. Then, the vertical positions of the extracted characters are corrected according to every text line, and the reconstructed texts are saved in a new image. The experiment of nearly 200 document images shows the average recognition rate can be significantly improved to 95% with rapid fast speed.

中文翻译:

中文文档任意变形图像的快速恢复方法

随着信息技术的飞速发展,将纸质书转换为电子文件将得到更广泛的应用。成像设备拍摄的厚书文件图像有一定程度的畸变,对OCR识别效果有一定的损害。为了解决这个问题,本文提出了一种任意变形的中文文档图像快速恢复方法。首先,使用“字符和文本行交替定位”(CTLA)逐步提取汉字,并根据最近的聚合方法识别文本行。然后,根据每个文本行校正提取的字符的垂直位置,并将重构的文本保存在新图像中。
更新日期:2015-06-03
down
wechat
bug