A cohesion-based clustering technique for categorical data

Nemalhabib, Aida (2006) A cohesion-based clustering technique for categorical data. Masters thesis, Concordia University.

PDF - Accepted Version
3270Kb

Abstract

Clustering is a technique which aims to partition a given dataset of objects into groups of similar objects. In this work, we consider categorical data, which are unordered unlike numerical data. This makes clustering such data a more challenging task. We propose a clustering technique for categorical data, which uses a novel similarity function, called cohesion, to measure the degree to which objects "stick" to clusters. We have implemented this technique, to which we refer as CLUC (CLUstering with Cohesion). To evaluate CLUC, we compared its results with those produced by well-known clustering algorithms. The results of our extensive experiments on real and synthetic datasets show that CLUC generates high quality clusters which conform better to clusterings by human experts. For some well-known real datasets, CLUC even discovers clusterings identical to those provided by experts. Our results also indicate that CLUC is order insensitive in general and is scalable when the dataset grows in size (the number of objects) and/or dimensions (attributes).

Divisions: Concordia University > Faculty of Engineering and Computer Science > Computer Science and Software Engineering
Item Type: Thesis (Masters)
Authors: Nemalhabib, Aida
Pagination: x, 87 leaves : ill. ; 29 cm.
Institution: Concordia University
Degree Name: M. Comp. Sc.
Program: Computer Science and Software Engineering
Date: 2006
Thesis Supervisor(s): Shiri, Nematollaah
ID Code: 8846
Deposited By: Concordia University Libraries
Deposited On: 18 Aug 2011 14:37
Last Modified: 18 Aug 2011 15:07
All items in Spectrum are protected by copyright, with all rights reserved. The use of items is governed by Spectrum's terms of access.
