Login | Register

Next-Generation Data Centers: Experimental Analysis of Topologies and Algorithms for Next-Generation Data Centers

Title:

Next-Generation Data Centers: Experimental Analysis of Topologies and Algorithms for Next-Generation Data Centers

Azzizi, Abdeltif (2025) Next-Generation Data Centers: Experimental Analysis of Topologies and Algorithms for Next-Generation Data Centers. Masters thesis, Concordia University.

[thumbnail of Azzizi_MCompSC_F2025.pdf]
Preview
Text (application/pdf)
Azzizi_MCompSC_F2025.pdf - Accepted Version
Available under License Spectrum Terms of Access.
1MB

Abstract

In recent years, data center networks (DCNs) have faced growing pressure from AI and ML workloads with intensive communication patterns and stringent latency requirements. Traditional hierarchical architectures like Clos (Fat-Tree) increasingly struggle with scalability bottlenecks, operational complexity, and congestion under bursty traffic. To address these challenges, this work explores the Structured Re-Arranged Topology (STRAT), which combines expander-graph-inspired path diversity with deterministic structure to enable efficient, scalable, and fault-tolerant designs. Unlike rigid designs, STRAT supports incremental growth, reduced cabling complexity, and better load distribution. This thesis evaluates STRAT not only in simulation but also on real programmable-switch hardware, demonstrating its practical viability. A key contribution is DEALER, a congestion-aware, data-plane-friendly forwarding algorithm leveraging programmable switches. DEALER uses a distributed distance-vector protocol and local queue occupancy to balance load among equal-cost and slightly longer paths, achieving significant improvements over ECMP in high-load scenarios while running at line rate on commercial ASICs. To further enhance STRAT, this work integrates a hybrid electrical-optical fabric with Optical Circuit Switching (OCS) links, guided by proactive, ML-based flow classification. An XGBoost model predicts elephant flows early, enabling their diversion to pre-configured optical paths with minimal control overhead. Simulations show reductions in tail latency and improved throughput. Together, these contributions offer a practical, holistic redesign for DCNs, uniting scalable graph-based topologies, programmable forwarding logic, and ML-guided optical hybridization to meet the performance, efficiency, and scalability demands of modern AI and cloud workloads.

Divisions:Concordia University > Gina Cody School of Engineering and Computer Science > Computer Science and Software Engineering
Item Type:Thesis (Masters)
Authors:Azzizi, Abdeltif
Institution:Concordia University
Degree Name:M. Comp. Sc.
Program:Computer Science
Date:June 2025
Thesis Supervisor(s):Assi, Chadi
ID Code:995716
Deposited By: Abdeltif Azzizi
Deposited On:04 Nov 2025 15:34
Last Modified:04 Nov 2025 15:34
All items in Spectrum are protected by copyright, with all rights reserved. The use of items is governed by Spectrum's terms of access.

Repository Staff Only: item control page

Downloads per month over past year

Research related to the current document (at the CORE website)
- Research related to the current document (at the CORE website)
Back to top Back to top