Effective Binarization of Document Images with Uneven Shading

  • Xiaohua Zhang Hiroshima Institute of Technology
  • Ning Xie Tongji University
  • Heming Huang Qinghai Normal University
  • Yuelan Xin Qinghai Normal University

Abstract

With rapid popularization of mobile camera, capturing a document and storing it become easy. However, when the document is illuminated under poor conditions, the document image may appear uneven shading. In that case, it is difficult to restore texts for character recognition and document analysis etc. In this paper, we propose an effective and simple approach to remove uneven shading from a document image. First, a local uneven shading is estimated using a constant time weighted median filter. Then, reflectance is computed by removing the uneven shading based on retinex theory, followed by a bandpass thresholding to partially binarize the clean document image. Finally, the unbinarized pixels are classified into text and background by using contrast map weighted graphcut approach. A host of experimental results demonstrate that the proposed method runs effectively and has pleasurable performance.

References

N. Otsu, “A Thresholding Selection Method from Gray-level Histograms”, IEEE Trans. Syst. Man Cybern., Vol.9, No.1, pp.62-66, 1979.

W. Niblack, “An Introduction to Digital Image Processing”, Englewood Cliffs, NJ: Prentice-Hall, 1986.

J. Sauvola and M. Pietikainen, “Adaptive document image Binarization”, Pattern Recognition, Vol.33, No.2, pp.225-236, 2000.

G. Bradley and G. Roth, “Adaptive Thresholding Using Integral Image”, Journal of Graphics Tools, Vol.12, No.2 pp.13-21, 2007.

F. Shafait, D. Keysers, and T. M. Breuel, “Efficient Implementation of Local Adaptive Thresholding Techniques Using Integral Images”, SPIE Document Recognition and Retrieval XV, DRR’08, 2008.

B. Gatos, I. Pratikakis, and S. Perantonis: “Adaptive Degraded Document Image Binarization”, Pattern Recognition, 39, 3, pp. 317-327, 2006.

J. Wen, S. Li, and J. Sun: “A New Binarization Method for Non-uniform Illuminated Document Images”, Pattern Recognition, 46, 6, pp. 1670-1690, 2013.

S. Lu, and C. T. Tan: “Binarization of Badly Illuminated Document Images through Shading Estimation and Compensation”, International Conference on Document Analysis and Recognition, pp. 23-26, 2007.

X. Zhang, Y. Xin, N. Xie, W. Jiang, and Y. Zhao, “Shading Surface Estimation Using Piecewise Polynomials for Binarizing Unevenly Illuminated Document Images”, International Journal of Intelligent Engineering & Systems, 7, 2, pp. 1-10, 2014.

X. Zhang, Y. Xin, H. Huang, and N. Xie, “Using Gaussian Kernels to Remove Uneven Shading from a Document Image”, ITE Transactions on Media Technology and Applications, Vol.3, No.3, pp. 194-205, 2015.

B. Su, S. Lu, and C. L. Tan, “Combination of Document Image Binarization Techniques”, 11th International Conference on Document Analysis and Recognition, ICDAR 2011.

A. B. Petro, C. Sbert ,and J. M. Morel, “Multiscale Retinex”, Image Processing On Line, pp. 71-88, 2014.

S. Perreault, and P. Hebert, “Median Filtering in Constant Time”, IEEE Trans. on Image Processing, Vol. 16, No. 9, 2007.

R. F. Moghaddam, and M. Cheriet: “A Multi-scale framework for Adaptive Binarization of Degraded Document Images”, Pattern Recognition, Vol.43, No.6, pp.2186-2198, 2010.

K. He, J. Sun, and X. Tang, “Guided Image Filtering”, IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol.35, No.6, pp.1397-1409, 2013.

K. He, and J. Sun: “Fast Guided Filter”, Technical Report, arXiv:1505.00996v1, 2015.

Y. Boykov, and G. Funka-lea, “Graph Cuts and Efficient N-D Image Segmentation”, International Journal of Computer Vision, Vol. 70, No. 2, pp.109-131, 2006.

Y. Boykov, and V. Kolmogorov, “An Experimental Comparison of Min-Cut/Max-Flow Algorithms for Energy Minimiza- tion in Vision”, IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 26, no.9, pp. 1124-1137, 2004.

ABBYY: http://finereader.add-soft.jp/

Published
2016-01-25
How to Cite
Zhang, X., Xie, N., Huang, H., & Xin, Y. (2016). Effective Binarization of Document Images with Uneven Shading. Journal of the Institute of Industrial Applications Engineers, 4(1), 1. https://doi.org/10.12792/jiiae.4.1
Section
Articles