Login | Register

Techniques to Improve the Parsing of Unstructured Logs for AIOps

Title:

Techniques to Improve the Parsing of Unstructured Logs for AIOps

Sedki, Issam (2025) Techniques to Improve the Parsing of Unstructured Logs for AIOps. PhD thesis, Concordia University.

[thumbnail of Sedki_PhD_S2025.pdf]
Preview
Text (application/pdf)
Sedki_PhD_S2025.pdf - Accepted Version
Available under License Spectrum Terms of Access.
4MB

Abstract

Artificial Intelligence for IT Operations (AIOps) is revolutionizing IT management by incorporating AI, machine learning, and big data analytics to automate and enhance system operations. Logs are the backbone of AIOps—they provide the fundamental data on system events, user activities, and performance metrics that AIOps needs for proactive monitoring, predictive analytics, and maintaining system health. Log management, especially log parsing, is therefore critical for identifying anomalies, diagnosing failures, and ensuring overall operational efficiency. However, challenges such as diverse log formats, insufficient logging guidelines, the sheer volume of logs, and the need for real-time insights significantly limit the precision, scalability, and effectiveness of AIOps.
This thesis develops novel methods for universal log parsing, introduces accurate evaluation metrics, proposes a comprehensive taxonomy of log characteristics, and addresses privacy compliance, collectively advancing the efficacy, scalability, and trustworthiness of AIOps.
Inaccurate log parsing can lead to inaccurate insights—misleading or incorrect conclusions drawn from analysis. Such errors may cause AIOps to misclassify events, overlook crucial anomalies, or generate noise that obscures genuine issues. To address this, the first major contribution of this thesis is the development of ULP (Universal Log Parser), which leverages a frequent token counting method to identify recurring patterns and extract log templates efficiently. By reducing computational complexity, ULP enables faster, more accurate log parsing, making it highly effective for large-scale IT environments—a key capability for the automation and responsiveness required in AIOps.
The second contribution is AML (Accuracy Metric for Log Parsing), a structured framework for evaluating the accuracy of log parsing. Traditional metrics are insufficient for heterogeneous log formats, leading to errors and subsequent misguided AIOps decisions. AML offers nuanced metrics that account for both omission and commission errors, enabling detailed and reliable comparisons across different log parsers.
The third contribution is a taxonomy of log characteristics, categorizing logs based on their structural and contextual properties. This taxonomy not only guides parser design by clarifying how logs differ across applications but also informs logging practices, helping practitioners tailor log writing strategies for better analytics.
The fourth contribution focuses on enhancing log privacy compliance, a crucial aspect in AIOps —especially as automated processes handle large volumes of sensitive log data. The thesis provides guidelines for evaluating and managing privacy risks associated with log data, ensuring that the automation capabilities of AIOps remain compliant with stringent privacy regulations and best practices.
Together, these contributions form a framework for advancing log parsing within AIOps. It is a holistic approach—encompassing efficient universal parsing, robust accuracy evaluation, a guiding taxonomy, and built-in privacy compliance—that addresses the entire life cycle of log-based analytics. This framework ultimately strengthens the capabilities of IT operations to be more proactive, responsive, and compliant, paving the way for the next generation of AIOps-driven IT management.

Divisions:Concordia University > Gina Cody School of Engineering and Computer Science > Electrical and Computer Engineering
Item Type:Thesis (PhD)
Authors:Sedki, Issam
Institution:Concordia University
Degree Name:Ph. D.
Program:Electrical and Computer Engineering
Date:27 March 2025
Thesis Supervisor(s):Hamou-Lhadj, Abdelwahab and Ait Mohamed, Otmane
ID Code:995238
Deposited By: Issam Sedki
Deposited On:17 Jun 2025 14:53
Last Modified:17 Jun 2025 14:53
All items in Spectrum are protected by copyright, with all rights reserved. The use of items is governed by Spectrum's terms of access.

Repository Staff Only: item control page

Downloads per month over past year

Research related to the current document (at the CORE website)
- Research related to the current document (at the CORE website)
Back to top Back to top