Handwritten Character Recognition -A Review

DOI : 10.17577/IJERTV1IS8038

Download Full-Text PDF Cite this Publication

Text Only Version

Handwritten Character Recognition -A Review

1MR.NARENDRASING.B RAJPUT, 2PROF.S.M RAJPUT, 3PROF.S.M.BADAVE

1ME(COMMUNICATION) , MIT, Aurangabad

2Associate Professor, SSVPS.B.S.D.C.O.E.Dhule

3Associate Professor, MIT, Aurangabad,

ABSTRACT – Character Recognition (CR) has been studied in details in the last few decades and approached to a level, of producing technology driven applications. The fast growing computational power equips the implementation of the current CR methodologies and also creates an increasing demand on numerous emerging application domains, which require advanced methodologies. This material helps as a guide lining document and upgrade for the readers, working in the Character Recognition area. An overview of CR systems and their evolution over time is presented followed by the available CR techniques with their superiorities and limitations are reviewed. Finally, the current status of CR is discussed focusing on the off-line handwriting recognition.

Index Terms – Character Recognition, Off-line Handwriting Recognition, Segmentation, Feature Extraction, Training and Recognition.

1. INTRODUCTION

Machine simulation of human functions has been a very challenging research field since the advent of digital computers. In some areas, which require certain amount of intelligence, such as number crunching or chess playing tremendous improvements are achieved. On the other hand, humans still outperform even the most powerful computers in the relatively routine functions such as vision. Machine simulation of human reading is one of these areas, which has been the subject of intensive research for the last three decades, yet it is still far from the final frontier. In this overview, Character Recognition (CR) is used as an umbrella term, which covers all types of machine recognition of characters in various application domains. The overview serves as an update for the state of the art in the CR field, emphasizing the methodologies required for the increasing needs in newly emerging areas, such as development of electronic libraries, multimedia databases and systems which require handwriting

data entry. The study investigates the direction of the CR research, analyzing the limitations of methodologies for the systems, which can be classified based upon two major criteria: the data acquisition process (on-line or off-line) and the text type (machine-printed or hand-written). No matter which class the problem belongs, in general there are five major stages in the CR problem:

  1. Pre-processing,

  2. Segmentation,

  3. Representation,

  4. Training and recognition,

  5. Post processing.

The paper is arranged to review the CR methodologies with respect to the stages of the CR systems, rather than surveying the complete solutions. Although the off-line and on-line character recognition techniques have different approaches, they share a lot of common problems and solutions. Since it is relatively more complex and requires more research compared to on-line and machine-printed recognition, off-line handwritten character

recognition is selected as a focus of attention in this article. However, the article also reviews some of the methodologies for on-line character recognition, as it intersects with the off- line case. The comprehensive survey on off-line and on-line handwriting recognition in [1], the survey in [2], dedicated to off-line cursive script recognition and the book in [3] which covers the Optical Character Recognition methodologies, can be taken as good starting points to reach the recent studies in various types and applications of the CR problem.

  1. PREVIOUS WORK IN CR SYSTEM

    Writing, which has been the most natural mode of collecting, storing and transmitting the information through the centuries, now serves not only for the communication among humans, but also, serves for the communication of humans and machines. The intensive research effort on the field of CR was not only because of its challenge on simulation of human reading, but also, because it provides efficient applications such as the automatic processing of bulk amount of papers, transferring data into machines and web interface to paper documents. Historically, CR systems have evolved in three ages:

    1900-1980 Early ages– The history of character recognition can be traced as early as 1900, when the Russian Scientist Trying attempted to develop an aid for visually handicapped. The first character recognizers appeared in the middle of the 1940s with the development of the digital computers. The early work on the automatic recognition of characters has been concentrated either upon machine printed text or upon small set of well- distinguished handwritten text or symbols. Machine-printed CR systems in this period generally used template matching in which an image is compared to a library of images. For handwritten text, low level image processing techniques have been used on the binary image to extract feature

    vectors, which are then fed to statistical classifiers. Successful, but constrained algorithms have been implemented mostly for Latin characters and numerals. However, some studies on Japanese, Chinese, Hebrew, Indian, Cyrillic, Greek and Arabic characters and numerals in both machine-printed and handwritten cases were also initiated [4].

    The commercial character recognizers were available in 1950s, when electronic tablets capturing the x-y coordinate data of pen-tip movement was first introduced. This innovation enabled the researchers to work on the on-line handwriting recognition problem. A good source of references for on-line recognition until 1980 can be found in [5].

    1980-1990 Developments– The studies until 1980 suffered from the lack of powerful computer hardware and data acquisition devices. With the explosion on the information technology, the previously developed methodologies found a very fertile environment for rapid growth in many application areas, as well as CR system development [6]. Structural approaches were initiated in many systems in addition to the statistical methods. These systems broke the character image into a set of pattern primitives such as lines and curves. The rules were then determined which character most likely matched the extracted primitives. However, the CR research was focused on basically the shape recognition techniques without using any semantic information. This led to an upper limit in the recognition rate, which was not sufficient in many practical applications. Historical review of CR research and development during this period can be found in [7] and

    [8] for off-line and on-line case, respectively.

    After 1990 Advancements– The real progress on CR systems is achieved during this period, using the new development tools and methodologies, which are empowered by the continuously growing information technologies. In the early

    nineties, Image Processing and Pattern Recognition techniques are efficiently combined with the Artificial Intelligence methodologies. Researchers developed complex CR algorithms, which receive high-resolution input data and require extensive number crunching in the implementation phase. Nowadays, in addition to the more powerful computers and more accurate electronic equipments such as scanners, cameras and electronic tablets, we have efficient, modern use of methodologies such as Neural Networks, Hidden Markov Models, Fuzzy Set Reasoning and Natural Language Processing. The recent systems for the machine-printed off-line [9] and limited vocabulary, user dependent on-line handwritten characters [10] are quite satisfactory for restricted applications. However, there is still a long way to go in order to reach the ultimate goal of machine simulation of fluent human reading, especially for unconstrained on-line and off-line handwriting.

  2. CHARACTER RECOGNITION (CR) SYSTEM

    In this section, we classify the available CR systems according to the data acquisition techniques and the text type as follows:

    1. Systems Classified According to the Data Acquisition Techniques

      The progress in CR methodologies evolved in two categories according to the mode of data acquisition, as online and off-line character recognition systems. The problem of recognizing handwriting, recorded with a digitizer, as a time sequence of pen coordinates is known as on-line character recognition. The on-line handwriting recognition problem has a number of distinguishing features like

      1. It is a real time process.

      2. It is adaptive in real time.

      3. It captures the temporal and dynamic information of

        the pen trajectory.

      4. Very little pre-processing is required.

      5. Segmentation is easy.

      On the other hand, the disadvantages of the on-line character recognition are as follows:

      1. The writer requires special equipment, which is not as comfortable as pen and paper.

      2. It cannot be applied to documents printed or written on papers.

      3. Punching is much faster and easier than handwriting for small size alphabet such as English.

      4. The available systems are slow and recognition rates are low for handwriting that is not neat.

      Applications of on-line character recognition systems include small hand- held devices like Pen based computers, educational software for teaching handwriting and signature verifiers are the examples of popular tools utilizing the on- line character recognition techniques. Off- line character recognition is known as Optical Character Recognition (OCR), because the image of writing is converted into bit pattern by an optically digitizing device such as optical scanner or camera. The recognition is done on this bit pattern data for machine-printed or hand-written text. The research and development is well progressed for the recognition of the machine-printed documents. In recent years, the focus of attention is shifted towards the recognition of hand-written script. The major advantage of the off-line recognizers is to allow the previously written and printed texts to be processed and recognized. Some applications of the off-line recognition are large-scale data processing such as postal address reading , check sorting , office automation for text entry automatic inspection and identification [11]. Off-line character recognition is a very important tool for creation of the electronic libraries.

    2. Systems Classified According to the Text Type

      Considering the text type, hand-written and machine-printed character recognition systems are two main areas of interest in

      the CR field. Machine-printed text includes the materials such as books, newspapers, magazines, documents and various writing units in the video or still image. The problems for fixed-font, multifont and Omni-font character recognition is relatively well understood and solved with little constraint [12]. When the documents are generated on a high quality paper with modern printing technologies, the available systems yield as good as 99% recognition accuracy. However, the recognition rates of the commercially available products are very much dependent on the age of the documents, quality of paper and ink, which may result in significant data acquisition noise.

  3. METHODOLOGIES OF CR SYSTEM

    In this section, we focus on the methodologies of CR systems, emphasizing the off-line handwriting recognition problem. A hierarchical approach for most of the systems would be from pixel to text, as follows:

    Pixels Features Characters Sub- words Words Meaningful Text. This bottom up approach varies a great deal, depending upon the type of the CR system and the methodology used. The literature review in the field of CR indicates that the above hierarchical tasks are grouped in the stages of the CR for pre-rocessing,segmentation,representation, training and recognition, post processing.

    1. Pre-processing

      The raw data, depending on the data acquisition type, is subjected to a number of preliminary processing steps to make it usable in the descriptive stages of character analysis. Preprocessing aims to produce data that are easy for the CR systems to operate accurately. The main objectives of preprocessing are:

      1. Noise reduction,

      2. Normalization of the data,

    2. Segmentation

      Segmentation is an important stage, because the extent one can reach in

      separation of words, lines or characters directly affects the recognition rate of the script. The two types of segmentation are

      -External Segmentation

      -Internal Segmentation

      -Explicit Segmentation

      -Implicit Segmentation

    3. Representation

Image representation plays one of the most important roles in a recognition system. In the simplest case, gray-level or binary images are fed to a recognizer. However, in most of the recognition systems, in order to avoid extra complexity and to increase the accuracy of the algorithms, a more compact and characteristic representation is required. A good survey on feature extraction methods for character recognition can be found in [13]. Hundreds of document image representation methods are categorized in three major groups as:

-Global Transformation and Series Expansion

– Statistical Representation

-Geometrical and Topological Representation

    1. Training and Recognition Techniques

      CR systems extensively use the methodologies of pattern recognition, which assigns an unknown sample into a predefined class. Numerous techniques for CR can be investigated in four general approaches of Pattern Recognition, as suggested as

      -Template Matching,

      -Statistical Techniques,

      -Structural Techniques,

      -Neural Networks.

      1. Template Matching

        The simplest way of character recognition is based on matching the stored prototypes against the character or word to be recognized (group of pixels, shapes, curvature etc.). Matching techniques can be studied in three classes:

        -Direct Matching

        -Deformable Templates and Elastic Matching

        -Relaxation Matching

      2. Statistical Techniques

        The major statistical approaches, applied in the CR field are the followings:

        -Non-parametric Recognition

        -Parametric Recognition

        -Clustering Analysis

        -Hidden Markov Modeling (HMM) Hidden Markov Models are the most widely and successfully used technique for handwritten character recognition problem. There are two basic approaches to CR systems using HMM:

        -Model Discriminant HMM

        -Path Discriminant HMM

      3. Structural Techniques

        These patterns are used to describe and classify the characters in the CR systems. The characters are represented as the union of the structural primitives. The following structural methods are applied to the CR problems:

        -Grammatical Methods

        -Graphical Methods

      4. Neural Networks (NN)

        A neural network is defined as a computing architecture that consists of massively parallel interconnection of adaptive 'neural' processors. Because of its parallel nature, it can perform computations at a higher rate compared to the classical techniques. Because of its adaptive nature, it can adapt to changes in the data and learn the characteristics of input signal. A neural network contains many nodes. Neural network architectures can be classified into two major groups, namely, feed-forward and feedback (recurrent) networks. The most common neural networks used in the CR systems are the multilayer perception of the feed forward networks. Some of the connections are variable and can be modified by learning.

    2. Post Processing:

      It is well known that humans read by context up to 60% for careless handwriting. While preprocessing tries to clean the document in a certain sense, it may remove important information, since

      the context information is not available at this stage. The review of the recent CR research indicates minor improvements, when only shape recognition of the character is considered. Therefore, the incorporation of context and shape information in all the stages of CR systems is necessary for meaningful improvements in recognition rates. This is done in the post processing stage with a feedback to the early stages of CR. The simplest way of incorporating the context information is the utilization of a dictionary for correcting the minor mistakes of the CR systems.

  1. DISCUSSIONS

In this study, we have overviewed the basic approaches in the CR domain attempting to bring out the present status of CR research. Although each of the methods mentioned have their own advantages and limitations, the success level is achieved by each method. However, taking in to account various databases, constraints and sample spaces it is difficult to comment about the success of recognition methods, especially in terms of recognition rates. An improved approach in almost all stages of CR research is needed in case of texts which are handwritten under poor conditions or for free style handwriting. Number digit or limited vocabulary form (bank checks, envelopes and forms designed for specific applications) recognition is a popular application area.

References-

1] Nafiz Arica, Fatos T.Yarman Vural, Overview of Character Recognition Focused on offline handwriting,IEEE Transactions on systems, man, and Cybernetics part c: applications and Reviews, vol. 31, no. 2, may 2001

2] R. Plamondon, Special Issue on Hand-Written Processing and Recognition, Pattern Recognition, vol. 26, no.3, 1993.

3] T. Steinherz, E. Rivlin, N. Intrator, Off-line Cursive script Word

Recognition – A survey, Int. Journal Document Analysis and Recognition, vol.2, no.2, pp.90- 110, 1999.

4] S. Mori, H. Nishida, H. Yamada, Optical Character Recognition, Wiley, 1999.

5] S. Mori, K. Yamamoto, M. Yasuda, Research on Machine Recognition Of Hand printed Characters, IEEE Trans. Pattern Analysis, Machine Intelligence, vol.6, no.4, pp.386-404, 1984.

6] C. Y. Suen, C. C. Tappert, T. Wakahara, The State of the Art in on-Line Handwriting Recognition, IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 12, no. 8, Pp.787-808, 1990.

7] V. K. Govindan, A. P. Shivaprasad, Character ecognition- A review, Pattern Recognition vol.23, no.7, Pp.671- 683, 1990.

8] T. Steinherz, E. Rivlin, N. Intrator, Off-line Cursive script Word Recognition – A survey, Int. Journal Document Analysis and Recognition, vol.2, no.2, pp.90- 110, 1999

9] C. Y. Suen, C. C. Tappert, T. Wakahara, The State of the Art in On-Line Handwriting Recognition, IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 12, no. 8, pp.787-

808, 1990.

10] I. Bazzi, R. Schwartz, J. Makhoul, An Omnifont Open Vocabulary OCR System for English and Arabic, IEEE Trans. Pattern Analysis and Machine Intelligence, vol.21, no.6, pp.495-504, 1999.

11] J. Hu, S. G. Lim, M.K. Brown,

Writer Independent On-line

Handwriting Recognition Using an HMM Approach, Pattern Recognition, vol.33, no.1, pp.133- 147, 2000.

12] H. E. S. Said, T. N. Tan, K. D.

Baker, Personal Identification Based on Handwriting, Pattern Recognition, vol.33, no.1, pp.149- 160, 2000.

13] Q. Tian, P. Zhang, T. Alexander,

Y. Kim. Survey: Omni font Printed Character Recognition, Visual Communication and Image Processing 91: Image Processing, pp. 260 – 268, 1991.

.

Leave a Reply