Login | Register

A hybrid 2-D HMM and MLP OCR system for processing multi-font and low-quality English documents

Title:

A hybrid 2-D HMM and MLP OCR system for processing multi-font and low-quality English documents

Fu, Nenghong (2004) A hybrid 2-D HMM and MLP OCR system for processing multi-font and low-quality English documents. Masters thesis, Concordia University.

[thumbnail of MQ91032.pdf]
Preview
Text (application/pdf)
MQ91032.pdf - Accepted Version
3MB

Abstract

This thesis presents a Hybrid 2- Direction (D) Hidden Markov Model (2-D HMM) and Multi-Layer Perceptron (MLP) OCR system for the recognition of Multi-font printed documents of varying qualities. It emphasizes on new methods proposed. First, a statistical analysis of the frequency of touching characters has been conducted, and some statistics of touching characters have been generated from real documents. Based on these statistical results which could be the first formal statistics on touching characters, a new classifier has been designed to recognize some frequent touching characters without segmentation. Second, a new hierarchical character classifier is presented to enhance character recognition accuracy. We group all characters into several categories according to character layout contextual information (Ascender, Descender and Center). Consequently we implement several independent classifiers to recognize the characters in each group. In addition, a 2-D HMM is included in the hierarchical classifier to improve the character recognition rate, and an automatic builder of special touching character HMM is also described in this thesis. (Abstract shortened by UMI.)

Divisions:Concordia University > Gina Cody School of Engineering and Computer Science > Computer Science and Software Engineering
Item Type:Thesis (Masters)
Authors:Fu, Nenghong
Pagination:xi, 85 leaves : ill. ; 29 cm.
Institution:Concordia University
Degree Name:M. Comp. Sc.
Program:Computer Science
Date:2004
Thesis Supervisor(s):Suen, C. Y
Identification Number:TK 7895 O6F8 2004
ID Code:7922
Deposited By: Concordia University Library
Deposited On:18 Aug 2011 18:10
Last Modified:13 Jul 2020 20:02
Related URLs:
All items in Spectrum are protected by copyright, with all rights reserved. The use of items is governed by Spectrum's terms of access.

Repository Staff Only: item control page

Downloads per month over past year

Research related to the current document (at the CORE website)
- Research related to the current document (at the CORE website)
Back to top Back to top