Login | Register

License Plate Detection and Character Recognition using Deep Learning and Font Evaluation

Title:

License Plate Detection and Character Recognition using Deep Learning and Font Evaluation

Ebrahimi Vargoorani, Zahra (2024) License Plate Detection and Character Recognition using Deep Learning and Font Evaluation. Masters thesis, Concordia University.

[thumbnail of Ebrahimi Vargoorani_MSc_S2024.pdf]
Preview
Text (application/pdf)
Ebrahimi Vargoorani_MSc_S2024.pdf - Accepted Version
Available under License Spectrum Terms of Access.
8MB

Abstract

License plate detection and character recognition pose challenges due to environmental
sensitivity, such as lighting, dust, and the impact of the chosen font type on recognition
tasks. Automatic License Plate Detection and Recognition (ALPR) are crucial
in practical applications such as traffic control and parking, vehicle tracking, toll
collection, and law enforcement. While much research has been done using image
processing and machine learning algorithms, deep learning methods need further
exploration due to their recent advances in reliable performance in various scenarios.
Moreover, current proposals are limited to specific regions and dataset applicability.
This study has a dual focus: firstly, we suggest utilizing a Deep Learning technique,
specifically using Faster R-CNN for the license plate detection task and a CNN-RNN
model with CTC loss, and a MobileNet V3 backbone for recognition task. We also
utilized You Only Look Once (YOLO) for license plate detection and recognition tasks.
Secondly, we aim to assess font features within the LP context. This work uses Brazilian
dataset and datasets from two different provinces in Canada and two different states in
the United States of America, including Ontario, Quebec, California, and New York
State. We suggest employing an adaptive algorithm based on Faster R-CNN and CTC
network along with YOLO, fine-tuned with optimized parameters to improve its
effectiveness using two different approaches, including domain generalization.
Alongside presenting the recall ratio findings, this study will perform a thorough error
analysis to gain insights into the nature of false positives. The proposed model
demonstrated a commendable recall ratio of 94% using a single YOLO network.
Specific fonts pose readability challenges for humans, while others present difficulties
for computer systems regarding recognition. In this study, we provide five sets of
outcomes for font assessment: results about font anatomy and those related to the
recognition of commercial products. The font anatomy analysis focuses on five specific
fonts: Driver Gothic, Dreadnought, California Clarendon, Zurich Extra Condensed, and
Mandatory. Additionally, we assess the impact of these fonts in the context of a dataset
made of five different license plates using a commercial product, OpenALPR. The font
anatomy findings unveil significant confusion cases and quality features associated with
chosen fonts.

Divisions:Concordia University > Gina Cody School of Engineering and Computer Science > Computer Science and Software Engineering
Item Type:Thesis (Masters)
Authors:Ebrahimi Vargoorani, Zahra
Institution:Concordia University
Degree Name:M. Sc.
Program:Computer Science
Date:1 August 2024
Thesis Supervisor(s):Suen, Ching Yee
ID Code:994677
Deposited By: Zahra Ebrahimi Vargoorani
Deposited On:24 Oct 2024 16:17
Last Modified:24 Oct 2024 16:17
All items in Spectrum are protected by copyright, with all rights reserved. The use of items is governed by Spectrum's terms of access.

Repository Staff Only: item control page

Downloads per month over past year

Research related to the current document (at the CORE website)
- Research related to the current document (at the CORE website)
Back to top Back to top