Palm Leaf Manu Script Document Enhancement by Combined Binarization and Normalization Method

DOI : 10.17577/IJERTV2IS1329

Download Full-Text PDF Cite this Publication

Text Only Version

Palm Leaf Manu Script Document Enhancement by Combined Binarization and Normalization Method

Rajeev Medithi#1

N.V.G.Prasad*2

N.Venkata Rao#3

M.Tech Student

Associate Professor

Professor

#Dept of Electronics and Communication Engineering,

Sasi Institute Of Technology & Engineering, affiliated to JNTU Kakinada, India.

Abstract

In this paper we propose a set of transform based methods (combined normalization and binarization methods) for enhancing digital images of palm leaf manuscripts. The methods first approximate the background of a gray scale image using one of two models piece-wise linear or non linear models. The background approximations are designed to overcome unevenness of document background. Then the background normalization algorithms are applied to the component channel images of a color palm leaf image. We also propose two local adaptive normalization algorithms for extracting enhanced gray scale images from color palm leaf images. Finally binarization is performed by using contrast adaptive binarization method inorder to extract useful text information from low quality document images. The performance of the algorithm is demonstrated on by palm leaf manuscripts/color documents distorted with show through effects, uneven background color and localized spot and preliminary results show significant improvement in readability. The techniques can also be used to enhance images of ancient, historical, degraded papyrus and paper documents.

Keywords – Contrast Adaptive Binarization, Document Pre-Processing, Document Recognition, Image Enhancement, Image Processing.

  1. Introduction

    Palm leaves have been a popular writing medium for over two thousand years in south and south-east Asia. Use of palm leaves for recording literary and scientific texts have been reported from about the fifth centuary B.C.,with the oldest existing documents dating from about the second century

    A.D. palm leaf manuscripts are produced from two main types of palms: palmyra and talipot.

    The manuscripts are typically created by using a metallic stylus to etch letters into the dried leaf and enhancing the contrast and legibility of the script by applying lampblack or turmeric mixed with aromatic oils chosen for their insect repellent qualities. A survey by the institute of Asian studies, Chennai, India indicates that there are still about a hundred thousand palm leaf manuscripts surviving in south India repositories alone with many more scattered across India, Nepal, Myanmar, laos Thailand, Cambodia and other south east Asian countries. These manuscripts contain religious texts and treatises on a host of subjects such as astronomy, astrology, architecture, law, medicine and music.

    In the past, Indian kings, temple authorities, and other concerned individuals ensured that the oldest manuscripts were ritually disposed only after they had been copied onto new palm- leaves. When this age old cycle was broken in the 19th century, the remaining corpus of palm-leaf manuscripts and the knowledge contained in them

    began a long slide into obscurity and destruction. Most of these palm leaves are nearing the end of their natural lifetime or are facing destruction from elements such as dampness, fungus, ants and cockroaches. This has spurred many new preservation projects to protect these valuable historical documents. Efforts, funded by many foundations, universities and other institutions, are now underway for recovering and preserving these valuable palm leaves. Besides the many programs for preserving the manuscripts in their physical form, scanning and digital photograph imaging have been used to preserve their content and current appearance for future studies.

    Despite the availability of advanced photography and scanning equipment, natural aging and deterioration have rendered many palm leaf images unreadable. The original leaves are aged, leading to deterioration of the writing media, with seepage of ink and smearing along cracks, damage to the leaf due to the holes used for binding the manuscripts leaves and dirt and other discoloration. The process of capturing a digital image of the leaves also presents some difficulties. In order to best preserve the fragile originals, the digital images are sometimes captured by using digital cameras instead of platen scanners. Leaf manuscripts cannot be forced flat and the light source for digital cameras is usually uneven. These factors lead to a very poor contrast between the foreground text and the background.

    Digital image processing techniques are necessary to improve the legibility of the manuscripts. Previous image enhancement algorithms for historical documents have been designed primarily for segmentation of the textual content from the background of the images.

    An overview of the traditional thresholding algorithms for text segmentation are given in [Leedham et al., 2002] which compares three popular methods, namely Otsus thresholding technique[N.Otsu, 1979], entropy techniques proposed by kapur et al [J.N.Kapur and A.K.C.Wong, 1985] and the minimal error technique by Kittler and Illingworth[J.Kittler and J.Illingworth, 1986].another entropy-based method specifically designed for historical document segmentation [C.A.B.Mello and R.D.Lins, 2000] deals with the noise inherent in the paper especially in documents written on both sides. Tan et al. presented methods to separate text from

    background noise and bleed through text (from the backside of the paper) using direct image matching [Wang and Tan, 2001] and directional wavelets [Wang et al., 2003]. These techniques are designed mainly as preparation stages for subsequent OCR processing. Other methods for historical image enhancement are driven by the goal of improving human readability while maintaining the original look and feel of the documents [C.A.B.Mello and R.D.Lins, 2002]. These methods do not produce satisfactory results in processing these palm leaf manuscripts since the contrast between the foreground and background is typically low and the color intensity of the background varies throughout the image.

    In this paper we propose a transform based method (combined normalization and binarization method) for enhancing digital photograph images of palm leaf manuscripts. The method uses a dynamically selected pivoting background color in a linear transform to enhance the legibility of the foreground text. Then a combination of two other image processing techniques for histogram normalization and background normalization are applied to the transformed image. The algorithms can be mathematically combined into one or two transformations for computational efficiency. The method was tested using a set of palm leaf manuscript images from publicly available sources and the preliminary result shows significant improvement in readability.

    Figure 1: Sample Palm Leaf Manuscripts.

    In section 2 we present related work, in section 3 our enhancement algorithms (proposed method).we

    present experimental results in section 4 and conclusion in section 5.

  2. Related Work

    Binarization, which scans gray-scale text images into two levels, is usually the first stage in document image understanding systems. This is because the use of bi-level information greatly reduces the computational load and the analysis algorithm complexity. Moreover, binarization is also the most critical stage, since any error in this phase will pass down to the following ones. One of the poplar binarization approaches is gray value thresholding. And the corresponding techniques can be further classified into global and local thresholding.

    1. Global Thresholding

      Global thresholding uses only one threshold value, which is estimated based on statistics or heuristics on global image attributes, to classify image pixels into foreground or background. The major drawback of global thresholding techniques is that it cannot differentiate those pixels which share the same gray level but do not belong to the same group. Otsus method [3] is one of the best global thresholding methods. It works well with dearly scanned images, but it performs unsatisfactorily for those poor quality images that have low contrast and non-uniform illumination.

    2. Local Thresholding

      In local thresholding, the threshold values are spatially varied and determined based on the local contents of the & get image. In comparison with global techniques, local thresholding techniques have better performance against noise and error, especially when dealing with information near text and objects. According to Triers survey [4]. Yan,in.it7-llrucksteini method [5] and Niblacks method [6] are two of the best performing local thresholding methods. Yanuwilz-Brucksteini method is extraordinary complicated and thus enquires very large computational power. This makes it infeasible and too expensive for real time system applications.the concept of Niblacks

      algorithm is to build a threshold surface, based on the local mean, m and local standard deviation computed in a small neighbourhood of each pixel in the form of T=m+k.s.

  3. Proposed Techniques For Image Enhancement

    The color of the treated palm leaves is light brown when the leaves are ready for writing. Damage to and deterioration of palm leaves are usually the result of staining, mechanical damage, splitting and cleavage, and insect and rodent activity. Palm leaf is susceptible to desiccation, losing its flexibility and becoming brittle. In many cases this dryness is treated by reapplying oil, which has a darkening effect if done too often. The lignified cells are particularly susceptible to degradation and discoloration.

    These processes often result in uneven background coloration across the image and the darkening of the background which reduces the contrast between the foreground text and the background color of the palm leaf. We first target the problem of low contrast between the foreground and the background. We have designed a transform to bleach out the background colors of the leaf image to the extent possible. At this stage, the text color level is still close to the bleached background. To further enhance the contrast, we apply the histogram normalization algorithm on the transformed image to elevate the text color away from the background.

    Finally, to solve the uneven background problem, we apply a background normalization algorithm which smoothes out the background. The background normalization enhances the image, making it more legible to the eye as well as facilitating segmentation of the text from the non- text background.

    1. Pivoting Color Bleaching Transform

      The first step in the proposed process is to identify a base color for the background inorder to wash out the background colors.since the back ground colors on different leaf manuscripts are different due to difference in age or material, the background color has to be dynamically determined for each individual leaf.the simplest way of determining the background color is by calculating a

      color histogram. Our assumption is that the most dominant colors from a leaf are from the background. We first locate a range for the most frequently occurring colors on a leaf, and then take the mean of the colors in the range as our background base color.

      After calculating the base background color as (r0; g0; b0), we design a linear model in terms of the following transform:

      L = R/r0 + G/g0+ B/b0 (1)

      This transforms the background colors to a range around number 3. Re-scaling by a factor of 88, the background colors are transposed to a range near 255. This moves all the other darker colors to lower levels and colors lighter than the background are transformed to have values greater than 255.

      A transformed grey-scale image is constructed by truncating the above values aat corresponding pixel positions to the normal range of 0 to 255.the transform(1) is constructed by putting the contrast contributionsfrom all the R, G and B color component channels together. In each channel, the component color of the background mapped to 1 and other darker colors will have mapped values smaller than 1.

      The above scaling and truncating operations are equivalent to a washing-out process for removing the predominant background color and lighter colors

      i.e. colors which we would want mapped to white in abinarization process. Although the transform works very well in the case of the palm leaf manuscript image, care should be taken in implementingthe transformfor any other application. In the extreme case when one of the RGB components of a background base color ha s a value 0, the transform will have undesirable effects on the image. However, in this extreme case, the component channel which creates the problem doesnot have any meaningful contribution to the contrast (background is so dark, that there would be no text visible in the foreground).

      In the extreme case, the corresponding term in (1) for that component channel can simply be omitted.

      Figure 2: Result Of The Color-Bleaching Transform Applied To The Palm Leaf Image

      In Figure 1.

    2. Histogram Normalization

      A grey-scale bleached or brightened image is created from the original color image using the bleached described in 2.1. the original background colors are mapped into a color range very close to white and all other colors darker than the estimated original background colors map to darker gery levels. Since the original background color in many of the aged leaf-manuscripts are so dark that the foreground text colors are very close to the background, the transformed grey-scale images while brighter than the original dark image need further enhancement to increase the contrast between the text and the background. A histogram normalization algorithm is applied to the transformed grey scale images to effect this contrast enhancement.

      The algorithm applied is as follows.the distribution of the grey levels is computed and a small percent of valus at both ends of the grey spectrum (black and white) are cut off. The cut off pixel values are folded to the nearest cut-off levels. The pixel values are rescaled to stretch the grey levels to the range 0 to 255. The resultant image shows appreciable contrast enhancement (see figure 3).

      Figure 3: Histogram Normalization Further Enhances The Contrast.

    3. Background Normalization

      The problem of uneven background color intensity across the image is often seen in historical document images. We propose a new background normalization method building on our earlier work [Shi and Govindaraju, 2004] using a nonlinear approximation. We approximate the uneven background by a non linear curve that best fits the background color values. For efficiency, we compute a non-linear approximation for the image background color along each scan line and intensity along a selected sccanline and intensity along a selected scanline is shown above the grey-scale document. The horizontal line is the calculated average level. The curve is the approximation of the background.

      Consider the histogram of foreground pixel color intensity. The histogram exhibits taller peaks with higher variations at text locations. The non-text locations in the histogram, on the other hand, appear as a lower and less variant distribution. Another fact to be noted is that the number of background pixels in the document image is significantly larger than the number of freground pixels for text. Based on the above observations, we first compute the mean or average level of the histogram. Then we use the mean level as reference guideline to set a background level at each pixel position along the scanline. We scan the scanline from left to right. If the pixel level at the current position is less than the mean, then we take the value of the level for the next computation of our approximation and update a variable previous low with the value of the current

      level. If the current level is higher than the mean, we retain the value in previous low as the background level at the current location for the following computation of our approximation.

      Thus far, we have a set of approximate background level for each pixel position on the scan line. This rough background is not very accurate for two reasons. First, at the foreground pixel location, the foreground level is set using a previously remembered background level which may be used multiple times for a consecutive run of foreground pixels. Second, due to the low image quality even the real background pixels may be locally very distant from the desired globally dominant document background level. We therefore propose to use this roughly selected and estimated background (SEB) to obtain a bettr approximation of the normalized background level. Using the selected and estimated background(SEB) pixel levels on the scanline, the approximation of the normalized document background level can be achievd in two ways. One approach is to use a moving window paradigm. At each pixel position, the approximated background level is computed from an average of the SEB values in the local neighbourhood of the pixel position.

      A better approximation is computed using a best fitting straight line in each of the above neighbourhood. At each position, we use all the SEB values in its neighbourhoods to find a best fitting line using least squares. The approximation value for the pixel position is calculated from the straight line corresponding to the position. If the final approximation of the background is a curve, the line segments going through each point on the curve at the corresponding pixel position form an envelope of the approximation curve. Continuing the processing of the palm leaf manuscript image after the histogram normalization, the gray scale image can be further normalized using the non linear approximation described above. For any pixel at location (x; y) with pixel value zorig, the normalized pixel value is then computed as

      znew = zorig ¡ z + c (2)

      where z is the corresponding pixel value on the approximated background; c is a constant fixed at some number close to the white color value 255. The resultant normalized image is shown in Figure 4.

      Figure 4: Background Normalization Evens Out The Media Background For

      Easier Binarization.

    4. Adaptive Binarization Method

      Contrast adaptive thresholding method [1] is used to perform binarization based on image local gray value contrast in order to extract useful text information from low quality document images. The local threshold value is computed based on local mean, local standard deviation and minimum gray value in aprimary local window and standard deviation in a larger local window ( secondary local window).

      The procedure for calculating threshold value is given below.

      • Compute the mean value at each pixel from an average of the value in the local neighbourhood (e.g. 7×7 neighbourhood) of the pixel position in a primary local window.

  4. Experimental Results

    Several historical palm leaf manuscript images were downloaded from many online repositories.

    A large number of the images have obvious uneven background problems and low contrast. Visual inspection of the enhanced images produced by the proposed techniques show a marked improvement in image quality of human reading.

    The techniques described in this paper were also used to process images of other historical documents such as papyrus manuscripts and aged, stained or otherwise discolored paper documents and were found to generate binarized images of very high quality with very little text degradation. One of the goals of automatic document segmentation is to facilitate document OCR, and we propose to test the performance of intensity house OCR systems on papyrus and aged paper documents containing Roman character text processed using the segmentation techniques proposed in this work. A longer term objective is to also process palm leaf manuscripts in Indic scripts using Indic OCR systems currently under development.

    The figure 6 and 7 are the images obtained by using Nilblacks and Otsu methods and the figure

    8 shows the image that is obtained by using the combined normalization and binarization method and it is better when compared with the previous algorithms.

    Where N is the total number of pixels and Ii is the pixel value at position of i.

    • Compute standard deviaton (say s) in aprimary local window by using the equation

      ( ) ( )

    • Calculate the minimum grey level value M in a primary local window.

    • Calculate the dynamic range of gray value standard deviation Rs in a larger local window (secondary local window).

    Figure 5: Original Image

    Figure 6: Binarized Image Using Ni Blacks Algorithm

    Figure 7: Binarized Image Using Otsu Method.

    Figure 8: Binarization Using Combined Normalization And

    Binarization Method (Proposed Method)

  5. Conclusion

    In this paper we present image enhancement techniques for historical palm leaf manuscript document images. The algorithm first converts the color image into grey-scale image. Binary image is obtained using a simple global thresholding of the image. A linear transform to brighten the text foreground by removing most of the background colors for better contrast. Then a non linear model to approximate the flatness of the background is applied after histogram

    normalization. The transformed leaf manuscript image is normalized by adjusting the pixel values relative to the background approximation.

    From our experiments and visual evaluation, the algorithm has been found to work successfully in improving readability of document images and produce high quality binarized images and produce suitable for OCR, on not only palm leaf manuscripts but also on other aged and degraded documents such as papyrus and historical paper documents.

  6. Acknowledgements

The author would like to thank N.V.G.Prasad (Head of the ECE Department), N.Venkata Rao (Dean of ECE) of Sasi Institute Of Technology And Engineering and V.Venkateswara Rao of Vasavi Engineering College And Reviewers.

References

[1]. Sridhar cherala ,priti rege, palm leaf manuscript/color document image enhancement by Using improved adaptive binarization method,

Sixth Indian Conference on Computer Vision, Graphics & Image Processing,2008, IEEE.

  1. Meng-Ling Feng and YapPeng Tan, Adaptive Binarization Method for Document Image Analysis, School of Electrical and Electronic Engineering Nanyang Technological University, Singapore, 2004.

  2. Sauvola,J., Pietikainen, M, Adaptive Document Image Binarization, Pattern Recognition, pp. 225-236, 33, 2000.

  3. Sititsak. Rodtook, Yuttapong Rangsanseri, Adaptive Thresholding of Document images based on Laplacian sign,

    Information Technology: Coding and Computing, Proceedings, International Conference, pages 501 505, 2001.

  4. A.B.Mello and R.D.Lins, 2000] C.A.B.Mello and R.D.Lins. Image segmentation of historical documents. In Visual2000, Mexico City, Mexico, September 2000.

  5. C.A.B.Mello and R.D.Lins, 2002] C.A.B.Mello and R.D.Lins. Generation of images of historical documents by composition. In ACM Symposium on Document Engineering, McLean, VA, USA, 2002.

  6. .Kittler and J.Illingworth, 1986] .Kittler and J.Illingworth. Minimum error thresholding. Pattern Recognition, 19(1):4147, 1986.

  7. N.Kapur and A.K.C.Wong, 1985] P.K.Sahoo J.N.Kapur and A.K.C.Wong. A new method for gray-level picture thresholding using the entropy of the histogram. Computer Vision, Graphics,and Image Processing, 29:273285, 1985.

  8. Leedham et al., 2002] G. Leedham, S. Varma, A. Patankar, and V. Govindaraju. Separating text and background in degraded document images – a comparison of global thresholding techniques for multi-stage thresholding. In Proceedings Eighth International Workshop on Frontiers of Handwriting Recognition, September 2002.

[10]. Otsu, 1979] N.Otsu. A threshold selection method from gray level histogram. IEEE Transactions in Systems, Man, and Cybernetics, 9:6266, 1979.

  1. Shi and Govindaraju, 2004] Z. Shi and V. Govindaraju. Historical document image enhancement using background light intensity normalization. 17th International Conference on Pattern Recognition, Cambridge, UnitedKingdon, 23-26 August 2004.

  2. Wang and Tan, 2001] Q. Wang and C.L. Tan. Matching of double-sided document images to remove interference. In IEEE Conference on Computer Vision and Pattern Recognition, Hawaii, USA, 2001.

[13]. Wang et al., 2003] W. Wang, T. Xia, L. Li, and C.L. Tan. Document image enhancement using directional wavelet. In Proceedings of the 2003 IEEE Conference on Computer Vision and Pattern Recognition, Madison, Wisconsin, USA, June 2003.

.

Leave a Reply