Login | Register

Optimization of Random Forest Based Methods Applying the Genetic Algorithms

Title:

Optimization of Random Forest Based Methods Applying the Genetic Algorithms

Aback, Zahra (2018) Optimization of Random Forest Based Methods Applying the Genetic Algorithms. Masters thesis, Concordia University.

[img]
Preview
Text (application/pdf)
Aback_MSc_F2018.pdf - Accepted Version
629kB

Abstract

In this century access to large and complex datasets is much easier. These datasets are large in dimension and volume, and researchers are interested in methods that are able to handle this type of data and at the same time produce accurate results. Machine learning methods are particularly efficient for this type of data, where the emphasis is on data analysis, and not on fitting a statistical model. A very popular method from this group is Random Forests which have been applied in different areas of study on two types of problems: classification and regression. The former is more popular, while the latter can be applied for data analysis. Moreover, many efficient techniques for missing value imputation were added to Random Forest over time. One of these methods which can handle all types of variables is MissForest. There are several studies that applied different approaches to improve the performance of classification type of Random Forests, but there are not many studies available for regression type. In the present study, it is evaluated if the performance of regression type of Random Forests and MissForests could be improved by applying Genetic Algorithms as an optimization method. The experiments were conducted on five datasets to minimize the mean square error (MSE) of the Random Forest and imputation errors of the MissForest. The results showed the superiority of the proposed method in comparison to the classical Random Forest methods.

Divisions:Concordia University > Faculty of Arts and Science > Mathematics and Statistics
Item Type:Thesis (Masters)
Authors:Aback, Zahra
Institution:Concordia University
Degree Name:M. Sc.
Program:Mathematics
Date:5 August 2018
Thesis Supervisor(s):Sen, Arusharka
ID Code:984115
Deposited By: ZAHRA ABACK
Deposited On:12 Nov 2018 17:56
Last Modified:12 Nov 2018 17:56
All items in Spectrum are protected by copyright, with all rights reserved. The use of items is governed by Spectrum's terms of access.

Repository Staff Only: item control page

Downloads per month over past year

Back to top Back to top