Login | Register

An Efficient Neural Network Architecture and Training Protocol for 3D Point Cloud Classification

Title:

An Efficient Neural Network Architecture and Training Protocol for 3D Point Cloud Classification

Paul, Sneha ORCID: https://orcid.org/0000-0001-7731-4196 (2023) An Efficient Neural Network Architecture and Training Protocol for 3D Point Cloud Classification. Masters thesis, Concordia University.

[thumbnail of Paul_MASc_S2023.pdf]
Preview
Text (application/pdf)
Paul_MASc_S2023.pdf - Accepted Version
Available under License Spectrum Terms of Access.
5MB

Abstract

The point cloud is a set of data points in a 3D coordinate system with an irregular data format. As a result, they are needed to be transformed into a collection of images before being fed into models. This unnecessarily increases the volume of the data and increases complexities. The existing literature on point cloud uses a fixed number of points sampled from the whole point cloud as the input. However, with large point cloud data, it is important to consider more points as input to have a better understanding of the scene. The computational expense increases if the input number of points increases for existing networks. Our research contributes to the existing point cloud classification literature in two directions. First, we develop a training protocol for improved point cloud training accuracy on top of the existing PointNet \cite{qi2017pointnet} architecture over the ModelNet10 dataset. A few variations of encoder models have been proposed in this regard. Also, an extensive hyperparameter study and ablation study are done. These experiments achieve a 6.10\% improvement over the baseline model. After that, we propose DualNet, a novel 3D point cloud network that resolves the trade-off between the number of input points and the computational expense of 3D data. The DualNet consists of two branches: DensetNet and SparseNet. The SparseNet is a comparatively large network in terms of number of parameters, that samples a small number of points from the whole point cloud. Whereas the DenseNet is a lightweight network that takes a large number of points as input. SparseNet is composed of more number of channels than DenseNet making it more computationally expensive than DenseNet. While the accuracy of the model shows good improvement when the number of points increases, the overall computational cost of DenseNet does not increase much in such settings. DualNet shows 0.81\% and 0.45\% increase in the SOTA results on ModelNet40 and ScanObjectNN respectively. In respect of computational complexity, our model takes about 40\% less time compared to SOTA.

Divisions:Concordia University > Gina Cody School of Engineering and Computer Science > Concordia Institute for Information Systems Engineering
Item Type:Thesis (Masters)
Authors:Paul, Sneha
Institution:Concordia University
Degree Name:M.A. Sc.
Program:Quality Systems Engineering
Date:6 January 2023
Thesis Supervisor(s):Patterson, Zachary and Bouguila, Nizar
ID Code:991522
Deposited By: Sneha Paul
Deposited On:21 Jun 2023 14:36
Last Modified:26 Jan 2024 01:00
All items in Spectrum are protected by copyright, with all rights reserved. The use of items is governed by Spectrum's terms of access.

Repository Staff Only: item control page

Downloads per month over past year

Research related to the current document (at the CORE website)
- Research related to the current document (at the CORE website)
Back to top Back to top