Brnawy, Rahmah (2015) An Efficient Technique for Clustering Data with Mixed Attribute Types. Masters thesis, Concordia University.
Preview |
Text (application/pdf)
2MBAn_Efficient_Technique_for_Clustering_Data_with_Mixed_Attribute_Types-Rahmah.Brnawy.pdf - Accepted Version |
Abstract
ABSTRACT
An Efficient Technique for Clustering Data with Mixed Attribute Types
Rahmah Brnawy
Clustering is a technique used to extract useful information and discover patterns from data. Existing clustering techniques have often focused on datasets with attributes that are either numeric or categorical but not both. The problem of clustering mixed numeric and categorical datasets has received increased attention more recently and a number of solutions have been proposed. In this research, we study these solutions and propose two clustering algorithms. The first algorithm that we present is called Cluc+, which extends and improves Cluc, an existing algorithm proposed for clustering pure categorical data. Using Cluc+, we then develop a new algorithm, called k-mixed for clustering data with mixed numeric and categorical attribute types. We conduct numerous experiments to evaluate the performance of our proposed algorithms using real-life benchmark datasets. Our results indicate increased efficiency and accuracy of the proposed solution techniques.
Divisions: | Concordia University > Gina Cody School of Engineering and Computer Science > Computer Science and Software Engineering |
---|---|
Item Type: | Thesis (Masters) |
Authors: | Brnawy, Rahmah |
Institution: | Concordia University |
Degree Name: | M. Comp. Sc. |
Program: | Computer Science |
Date: | 17 June 2015 |
ID Code: | 980119 |
Deposited By: | RAHMAH BRNAWY |
Deposited On: | 03 Nov 2015 15:54 |
Last Modified: | 18 Jul 2019 15:22 |
Repository Staff Only: item control page