Login | Register

Network Traffic Classification Under Label Scarcity and Privacy Constraints Using Federated Self-Supervised and Confident Learning

Title:

Network Traffic Classification Under Label Scarcity and Privacy Constraints Using Federated Self-Supervised and Confident Learning

Eslami, Ehsan ORCID: https://orcid.org/0009-0003-2189-2650 (2025) Network Traffic Classification Under Label Scarcity and Privacy Constraints Using Federated Self-Supervised and Confident Learning. Masters thesis, Concordia University.

[thumbnail of Eslami_MA_S2026.pdf]
Preview
Text (application/pdf)
Eslami_MA_S2026.pdf - Accepted Version
Available under License Spectrum Terms of Access.
9MB

Abstract

Network Traffic Classification (NTC) is essential for network management and security; however, it faces challenges such as label scarcity, pseudo-label noise, privacy concerns, non-independent and non-identically distributed (non-IID) data, and class imbalance in distributed environments. This thesis proposes two interconnected frameworks to address these issues. The centralized approach integrates traffic-adapted self-supervised learning (SSL) with confident learning (CL), employing a constraint-consistent autoencoder (AE) and enhanced tabular contrastive learning (TabCL) for robust pseudo-label generation from unlabeled flows, followed by CL-based denoising using per-class quantile thresholds, calibration-aware weighting, and balanced retention. This centralized SSL+CL pipeline is proven to be highly effective, establishing a strong performance benchmark.

The federated extension, FedSSL-NTC, distributes this process across clients, incorporating FedProx for non-IID stability, class-weighted losses for imbalance, sample-size-weighted (SSW) Federated Averaging (FedAvg) for fair aggregation, and a tailored secure aggregation (SecAgg) protocol for privacy preservation, ensuring no raw data sharing. Evaluations on a self-generated, unlabeled dataset combined with ISCX VPN-nonVPN and the UCDavis–QUIC benchmark demonstrate high effectiveness. Experimental results demonstrate that FedSSL-NTC achieves near-centralized accuracy. These contributions advance label-efficient, noise-robust, and privacy-aware NTC, bridging gaps in centralized and distributed paradigms for real-world applications in IoT, 5G/6G, and encrypted traffic scenarios.

Divisions:Concordia University > Gina Cody School of Engineering and Computer Science > Electrical and Computer Engineering
Item Type:Thesis (Masters)
Authors:Eslami, Ehsan
Institution:Concordia University
Degree Name:M.A. Sc.
Program:Electrical and Computer Engineering
Date:12 December 2025
Thesis Supervisor(s):Hamouda, Walaa
ID Code:996722
Deposited By: Ehsan Eslami
Deposited On:29 Jun 2026 14:40
Last Modified:29 Jun 2026 14:40
All items in Spectrum are protected by copyright, with all rights reserved. The use of items is governed by Spectrum's terms of access.

Repository Staff Only: item control page

Downloads per month over past year

Research related to the current document (at the CORE website)
- Research related to the current document (at the CORE website)
Back to top Back to top