Kim, Kevin (2019) Empirical Evaluation Of Parallelizing Correlation Algorithms For Sequential Telecommunication Devices Data. Masters thesis, Concordia University.
Preview |
Text (application/pdf)
3MBkim_MSC_S2020.pdf - Accepted Version Available under License Spectrum Terms of Access. |
Abstract
Context: Connected devices within IoT is a source of generating big data. The data measured from devices consists of large number of features from hundreds to thousands. Analyzing these features is both data and computing intensive. Distributed and parallel processing frameworks such as Apache Spark provide in-memory processing technologies to design feature analytic workflows. However, algorithms for discovering data patterns and trends over time series are not necessarily ready to cooperate issues such as data partition, data shuffling that rise from distribution and parallelism. Aim: This thesis aims to explore the relation between algorithm characteristics and parallelisms as well as the effects on clustering results and the system performance. Method: System level techniques were developed to address particularly the data partition, load-balancing and data shuffling issues. Furthermore, these techniques are applied to adopt clustering algorithms on distributed parallel computing frameworks. In the evaluation, two workflows were built in which each consists of a clustering algorithm and its corresponding metrics for measuring distances of any two time series data. Result: These system level techniques improve the overall performance and execution of the workflows. Conclusion: The distribution and parallel workflows address both algorithmic factors and parallelism factors to improve accuracy and performance of processing big time series data of connected devices.
Divisions: | Concordia University > Gina Cody School of Engineering and Computer Science > Electrical and Computer Engineering |
---|---|
Item Type: | Thesis (Masters) |
Authors: | Kim, Kevin |
Institution: | Concordia University |
Degree Name: | M.A. Sc. |
Program: | Electrical and Computer Engineering |
Date: | December 2019 |
Thesis Supervisor(s): | Liu, Yan |
ID Code: | 986621 |
Deposited By: | KEVIN KIM |
Deposited On: | 30 Jun 2021 15:02 |
Last Modified: | 01 Jun 2022 00:00 |
Repository Staff Only: item control page