Sadri, Javad (2007) Automatic segmentation and recognition of unconstrained handwritten numeral strings. PhD thesis, Concordia University.
Preview |
Text (application/pdf)
7MBNR31139.pdf - Accepted Version |
Abstract
Segmentation and recognition of handwritten numeral strings is a very interesting and challenging problem in pattern recognition. It also has a lot of important applications such as: postal code recognition, bank check processing; tax form reading, etc. In this thesis, a new system for the segmentation and recognition of unconstrained handwritten numeral strings is proposed. The system uses a combination of foreground and background features for the segmentation of touching numerals in strings. The method introduces new algorithms for the traversal of top and bottom foreground and background skeletons, and top and bottom contours of numerals. Then; it tries to locate all feature points on these skeletons and contours and alternatively match feature points from top to bottom (or bottom to top) of the images to build all possible candidate segmentation paths (so-called segmentation hypotheses). A novel genetic representation scheme is utilized in order to represent the space of all possible segmentation hypotheses. In order to improve searching and evolution of segmentation hypotheses and facilitate finding the ones with the highest confidence values of segmentation and recognition, this genetic framework utilizes contextual knowledge extracted from string images. A novel evaluation scheme based on segmentation and recognition scores is introduced in order to improve the evaluation of segmentation hypotheses and to enhance the outlier resistance of the system. In order to improve stability and plasticity of our system in the learning and recognition of numerals, a new algorithm for clustering of handwritten digits based on their shapes is proposed. Also, in order to improve the searching power of our system and its convergence, a new evolutionary algorithm based on genetic particle swarm optimization (GBPSO) is proposed. Numerous experiments using images from well known databases of handwritten numeral strings such as CENPARMI, NIST NSTRING SD19, and our newly created databases of Farsi/Arabic numerals have been conducted in order to evaluate the performance of the proposed method. Experiments have shown that proper use of contextual knowledge in segmentation; evaluation and search greatly improves the overall performance of the system. This system shows superior results compared with those reported in the literature.
Divisions: | Concordia University > Gina Cody School of Engineering and Computer Science > Computer Science and Software Engineering |
---|---|
Item Type: | Thesis (PhD) |
Authors: | Sadri, Javad |
Pagination: | xxviii, 192 leaves : ill. ; 29 cm. |
Institution: | Concordia University |
Degree Name: | Ph. D. |
Program: | Computer Science and Software Engineering |
Date: | 2007 |
Thesis Supervisor(s): | Suen, Ching Y and Bui, Tien D |
Identification Number: | LE 3 C66C67P 2007 S23 |
ID Code: | 975345 |
Deposited By: | Concordia University Library |
Deposited On: | 22 Jan 2013 16:06 |
Last Modified: | 13 Jul 2020 20:07 |
Related URLs: |
Repository Staff Only: item control page