Login | Register

Building Reliable Frameworks for 3D Object Classification Based on Bayesian and Deep Learning Approaches

Title:

Building Reliable Frameworks for 3D Object Classification Based on Bayesian and Deep Learning Approaches

Eita, Ahmed Yasser (2023) Building Reliable Frameworks for 3D Object Classification Based on Bayesian and Deep Learning Approaches. Masters thesis, Concordia University.

[thumbnail of Eita_MASc_S2024.pdf]
Preview
Text (application/pdf)
Eita_MASc_S2024.pdf - Accepted Version
Available under License Spectrum Terms of Access.
1MB

Abstract

In the past decade, 3D objects have gained remarkable importance in everyday applications, and the ability to recognize them has therefore became a vital task in numerous fields. Ever since the emergence of 3D object recognition, there have been certain drawbacks that each newly invented model is striving to overcome. Among those shortcomings are; the ability to capture all critical features of the object, lack of spatial attributes consideration, insufficient visual relationships between semantic features, the necessity for expensive resources, and slow manipulation consequently. Computer Vision researchers have accomplished an excellent performance with multiple models, however, there is still an area for improvement. In this thesis, we are proposing two different novel 3D multi-view object classification methodologies inspired by Natural Language Processing (NLP) well-known approaches. The reason for this motivation is due to the NLP models’ impressive capability in capturing the underlying characteristics in texts and the semantic feature relationships from sequential data types. The first model is a statistical approach, named F-GDA, which deploys Generalized Dirichlet (GD) distribution in all its priors to compose a fully flexible framework and the later one, named VAeViT, incorporates the reputed deep learning architectures; Variational Autoencoder (VAE) and Vision Transformer (ViT) to form a comprehensive structure. Each model has been innovatively invented to resolve some major limitations confronted by the model’s methodology. Both models were evaluated on benchmark datasets and have proven reliably effective in classifying 3D multi-view objects and outperformed the state-of-the-art methodologies in the field.

Divisions:Concordia University > Gina Cody School of Engineering and Computer Science > Concordia Institute for Information Systems Engineering
Item Type:Thesis (Masters)
Authors:Eita, Ahmed Yasser
Institution:Concordia University
Degree Name:M.A. Sc.
Program:Quality Systems Engineering
Date:22 September 2023
Thesis Supervisor(s):Bouguila, Nizar
Keywords:3D object classification, Bayesian approach, Deep Learning, Generalized Dirichlet, Variational Autoencoder, Vision Transformer
ID Code:993058
Deposited By: Ahmed Yasser Mohamed Sabri Eita
Deposited On:05 Jun 2024 16:51
Last Modified:05 Jun 2024 16:51
All items in Spectrum are protected by copyright, with all rights reserved. The use of items is governed by Spectrum's terms of access.

Repository Staff Only: item control page

Downloads per month over past year

Research related to the current document (at the CORE website)
- Research related to the current document (at the CORE website)
Back to top Back to top