Fu, Nenghong (2004) A hybrid 2-D HMM and MLP OCR system for processing multi-font and low-quality English documents. Masters thesis, Concordia University.
Preview |
Text (application/pdf)
3MBMQ91032.pdf - Accepted Version |
Abstract
This thesis presents a Hybrid 2- Direction (D) Hidden Markov Model (2-D HMM) and Multi-Layer Perceptron (MLP) OCR system for the recognition of Multi-font printed documents of varying qualities. It emphasizes on new methods proposed. First, a statistical analysis of the frequency of touching characters has been conducted, and some statistics of touching characters have been generated from real documents. Based on these statistical results which could be the first formal statistics on touching characters, a new classifier has been designed to recognize some frequent touching characters without segmentation. Second, a new hierarchical character classifier is presented to enhance character recognition accuracy. We group all characters into several categories according to character layout contextual information (Ascender, Descender and Center). Consequently we implement several independent classifiers to recognize the characters in each group. In addition, a 2-D HMM is included in the hierarchical classifier to improve the character recognition rate, and an automatic builder of special touching character HMM is also described in this thesis. (Abstract shortened by UMI.)
Divisions: | Concordia University > Gina Cody School of Engineering and Computer Science > Computer Science and Software Engineering |
---|---|
Item Type: | Thesis (Masters) |
Authors: | Fu, Nenghong |
Pagination: | xi, 85 leaves : ill. ; 29 cm. |
Institution: | Concordia University |
Degree Name: | M. Comp. Sc. |
Program: | Computer Science |
Date: | 2004 |
Thesis Supervisor(s): | Suen, C. Y |
Identification Number: | TK 7895 O6F8 2004 |
ID Code: | 7922 |
Deposited By: | Concordia University Library |
Deposited On: | 18 Aug 2011 18:10 |
Last Modified: | 13 Jul 2020 20:02 |
Related URLs: |
Repository Staff Only: item control page