Login | Register

Distributional Feature Mapping in Data Classification


Distributional Feature Mapping in Data Classification

Rahman, Md. Hafizr ORCID: https://orcid.org/0000-0001-5154-0769 (2020) Distributional Feature Mapping in Data Classification. Masters thesis, Concordia University.

[thumbnail of Rahman_MASc_S2020.pdf]
Text (application/pdf)
Rahman_MASc_S2020.pdf - Accepted Version
Available under License Spectrum Terms of Access.


Performance of a machine learning algorithm depends on the representation of the input data. In computer vision problems, histogram based feature representation has significantly improved the classification tasks. L1 normalized histograms can be modelled by Dirichlet and related distributions to transform input space to feature space. We propose a mapping technique that contains prior knowledge about the distribution of the data and increases the discriminative power of the classifiers in supervised learning such as Support Vector Machine (SVM). The mapping technique for proportional data which is based on Dirichlet, Generalized Dirichlet, Beta Liouville, scaled Dirichlet and shifted scaled Dirichlet distributions can be incorporated with traditional kernels to improve the base kernels accuracy. Experimental results show that the proposed technique for proportional data increases accuracy for machine vision tasks such as natural scene recognition, satellite image classification, gender classification, facial expression recognition and human action recognition in videos. In addition, in object tracking, learning parametric features of the target object using Dirichlet and related distributions may help to capture representations invariant to noise. This further motivated our study of such distributions in object tracking. We propose a framework for feature representation on probability simplex for proportional data utilizing the histogram representation of the target object at initial frame. A set of parameter vectors determine the appearance features of the target object in the subsequent frames.

Motivated by the success of distribution based feature mapping for proportional data, we extend this technique for semi-bounded data utilizing inverted Dirichlet, generalized inverted Dirichlet and inverted Beta Liouville distributions. Similar approach is taken into account for count data where Dirichlet multinomial and generalized Dirichlet multinomial distributions are used to map density features with input features.

Divisions:Concordia University > Gina Cody School of Engineering and Computer Science > Concordia Institute for Information Systems Engineering
Item Type:Thesis (Masters)
Authors:Rahman, Md. Hafizr
Institution:Concordia University
Degree Name:M.A. Sc.
Program:Quality Systems Engineering
Date:28 February 2020
Thesis Supervisor(s):Bouguila, Nizar
Keywords:Feature Mapping, SVM, Proportional Data, Semi-Bounded Data, Count Data
ID Code:986531
Deposited By: Md Hafizur Rahman
Deposited On:26 Jun 2020 13:34
Last Modified:26 Jun 2020 13:34
All items in Spectrum are protected by copyright, with all rights reserved. The use of items is governed by Spectrum's terms of access.

Repository Staff Only: item control page

Downloads per month over past year

Research related to the current document (at the CORE website)
- Research related to the current document (at the CORE website)
Back to top Back to top