Login | Register

Log-Based Anomaly Detection: Comparative Study of Real-World System Logs using Machine Learning And Deep Learning Approaches

Title:

Log-Based Anomaly Detection: Comparative Study of Real-World System Logs using Machine Learning And Deep Learning Approaches

Nipa, Nadira Anjum (2025) Log-Based Anomaly Detection: Comparative Study of Real-World System Logs using Machine Learning And Deep Learning Approaches. Masters thesis, Concordia University.

[thumbnail of Nipa_MASc_F2025.pdf]
Preview
Text (application/pdf)
Nipa_MASc_F2025.pdf - Accepted Version
Available under License Spectrum Terms of Access.
1MB

Abstract

The reliability and security of today’s smart and autonomous systems increasingly rely on effective anomaly detection capabilities. Logs generated by intelligent devices during runtime offer valuable insights for system monitoring and troubleshooting. Nonetheless, the enormous quantity and complexity of these logs render manual anomaly inspection impractical and error-prone. To address this, various automated log-based anomaly detection methods have been developed. However, many of these approaches are evaluated in controlled environments with publicly available datasets, which differ significantly from the noisy, unstructured, and unlabeled logs encountered in industrial settings. This thesis explores and adapts existing machine learning and deep learning techniques for anomaly detection in real-world system logs produced by an intelligent autonomous display device. Initially, we conduct a comparative analysis of machine learning and deep learning methods using a small manually labeled dataset to evaluate the detection accuracy and computational efficiency. Our results highlight the most suitable approaches for enabling proactive maintenance and enhancing system reliability. Expanding on this, we evaluate advanced deep learning methods across weakly supervised, semi-supervised, and unsupervised learning paradigms, using heuristically labeled logs and benchmark them against fully supervised baselines to examine the trade-offs between label dependency, detection performance, and industrial applicability. Finally, we propose a systematic approach for managing unlabeled and noisy log data, providing practical guidelines for selecting suitable learning strategies based on label availability, data quality, and real-world constraints. The findings of this work provide valuable insights for the implementation of scalable, accurate, and robust log-based anomaly detection in industrial IoT environments.

Divisions:Concordia University > Gina Cody School of Engineering and Computer Science > Concordia Institute for Information Systems Engineering
Item Type:Thesis (Masters)
Authors:Nipa, Nadira Anjum
Institution:Concordia University
Degree Name:M.A. Sc.
Program:Quality Systems Engineering
Date:10 July 2025
Thesis Supervisor(s):Bouguila, Dr. Nizar and Patterson, Dr. Zachary
ID Code:995927
Deposited By: Nadira Anjum Nipa
Deposited On:04 Nov 2025 17:40
Last Modified:04 Nov 2025 17:40
All items in Spectrum are protected by copyright, with all rights reserved. The use of items is governed by Spectrum's terms of access.

Repository Staff Only: item control page

Downloads per month over past year

Research related to the current document (at the CORE website)
- Research related to the current document (at the CORE website)
Back to top Back to top