Jifroodian-Haghighi, Puntis (2010) A discrete hidden Markov model for the recognition of handwritten Farsi words. Masters thesis, Concordia University.
Text (application/pdf), 4MB
MR67130.pdf - Accepted Version
Abstract
Handwriting recognition systems (HRS) have been researched for more than 50 years. Designing a system to recognize specific words in a clean handwritten document is still a difficult task, and the challenge is to achieve a high recognition rate. Previously, most of the research in the handwriting recognition domain was conducted on Chinese and Latin languages, while recently more people have shown an interest in Indo-Iranian script recognition systems. In this thesis, we present an automatic handwriting recognition system for Farsi words. The system was trained, validated and tested on the CENPARMI Farsi Dataset, which was gathered during this research. CENPARMI's Farsi Dataset is unique in terms of its huge number of images (432,357 combined grayscale and binary), inclusion of all possible handwriting types (Dates, Words, Isolated Characters, Isolated Digits, Numeral Strings, Special Symbols, Documents), the variety of cursive styles, the number of writers (400) and the exclusive participation of native Farsi speakers in the gathering of data. The words were first preprocessed. Concavity and Distribution features were extracted, and the codebook was calculated by the vector quantization method. A Discrete Hidden Markov Model was chosen as the classifier because of the cursive nature of the Farsi script. Finally, encouraging recognition rates of 98.76% and 96.02% have been obtained for the Training and Testing sets, respectively.
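The abstract names the stages of the recognizer (feature extraction, a vector-quantization codebook, and one discrete HMM per word) without giving their details. The following is only a minimal sketch of how such a pipeline is commonly wired together, not the thesis's implementation: the two-entry codebook, the two-state left-to-right models, the labels `word_a` and `word_b`, and all function names are hypothetical values chosen for illustration, and the actual features, codebook size and HMM topology used in the thesis are not stated in the abstract.

```python
import math

def quantize(frames, codebook):
    """Map each feature vector to the index of its nearest codeword
    (squared Euclidean distance). The codebook is assumed to be given."""
    def dist2(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return [min(range(len(codebook)), key=lambda k: dist2(f, codebook[k]))
            for f in frames]

def forward_log_likelihood(symbols, start, trans, emit):
    """Scaled forward algorithm: log P(symbols | model) for a discrete HMM."""
    n = len(start)
    alpha = [start[i] * emit[i][symbols[0]] for i in range(n)]
    log_lik = 0.0
    for t in range(len(symbols)):
        if t > 0:
            # Propagate through the transitions, then weight by the emission.
            alpha = [sum(alpha[i] * trans[i][j] for i in range(n))
                     * emit[j][symbols[t]] for j in range(n)]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")  # sequence is impossible under this model
        log_lik += math.log(scale)
        alpha = [a / scale for a in alpha]  # rescale to avoid underflow
    return log_lik

def classify(symbols, models):
    """Return the label of the word model with the highest likelihood."""
    return max(models, key=lambda w: forward_log_likelihood(symbols, *models[w]))

# Hypothetical toy data: a 2-codeword codebook over 2-D feature vectors.
CODEBOOK = [[0.0, 0.0], [1.0, 1.0]]
# Hypothetical 2-state left-to-right models: (start, transitions, emissions).
LEFT_TO_RIGHT = [[0.5, 0.5], [0.0, 1.0]]
MODELS = {
    "word_a": ([1.0, 0.0], LEFT_TO_RIGHT, [[0.9, 0.1], [0.1, 0.9]]),
    "word_b": ([1.0, 0.0], LEFT_TO_RIGHT, [[0.1, 0.9], [0.9, 0.1]]),
}

frames = [[0.1, 0.0], [0.9, 1.1], [1.0, 0.8]]  # one vector per image column
symbols = quantize(frames, CODEBOOK)
print(symbols, classify(symbols, MODELS))
```

In this sketch the quantizer turns each feature vector into a discrete symbol, which is what lets a discrete HMM with a finite emission table be used; recognition then reduces to scoring the symbol sequence under every word model and picking the best one.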
Divisions: | Concordia University > Gina Cody School of Engineering and Computer Science > Computer Science and Software Engineering |
---|---|
Item Type: | Thesis (Masters) |
Authors: | Jifroodian-Haghighi, Puntis |
Pagination: | xiv, 131 leaves : ill. ; 29 cm. |
Institution: | Concordia University |
Degree Name: | M. Comp. Sc. |
Program: | Computer Science and Software Engineering |
Date: | 2010 |
Thesis Supervisor(s): | Suen, C |
Identification Number: | LE 3 C66C67M 2010 J54 |
ID Code: | 979323 |
Deposited By: | Concordia University Library |
Deposited On: | 09 Dec 2014 17:57 |
Last Modified: | 13 Jul 2020 20:12 |