Login | Register

AI-Aware and QoE-Driven Adaptive Transport for Real-Time Applications

Title:

AI-Aware and QoE-Driven Adaptive Transport for Real-Time Applications

Sahu, Aman ORCID: https://orcid.org/0009-0008-4773-7233 (2025) AI-Aware and QoE-Driven Adaptive Transport for Real-Time Applications. Masters thesis, Concordia University.

[thumbnail of Sahu_MA_S2026.pdf]
Text (application/pdf)
Sahu_MA_S2026.pdf - Accepted Version
Restricted to Repository staff only until 31 July 2026.
Available under License Spectrum Terms of Access.
945kB

Abstract

Real-time applications—including interactive video streaming, cloud gaming, and multimodal
large language model (MLLM) assistants—require low-latency, reliable delivery over highly variable
wireless networks. Yet current transport and application mechanisms are often optimized in
isolation: HTTP-based streaming introduces segment-level delays, generic WebRTC and QUIC
configurations lack application awareness, and MLLM pipelines frequently wait for full video upload
before inference begins. This thesis examines whether adaptive, application-informed transport
can reduce end-to-end latency and improve reliability across these workloads.
The first part of the thesis presents RabbitLLM, an end-to-end system for real-time MLLM
inference. RabbitLLM combines WebRTC-based frame-wise video streaming with chunked KVcache
prefilling and progressive decoding, enabling LLM processing to begin before full video
transmission completes. Compared to conventional TCP pipelines, it achieves lower transmission
delay, reduced end-to-end latency, and more efficient bandwidth usage while maintaining semantic
output quality.
The second part introduces TAROT, an optimization-driven adaptive forward error correction
(FEC) framework for QUIC-based video streaming. TAROT selects redundancy parameters dynamically
based on network conditions and timing constraints, yielding higher perceptual video
quality and more stable playback than static RaptorQ or Reed–Solomon strategies across diverse
cellular traces.
Together, these results show that cross-layer adaptive transport—integrating WebRTC, QUIC,
and application-level execution dynamics can significantly improve latency, efficiency, and perceptual
quality for next-generation real-time multimodal and video streaming systems.

Divisions:Concordia University > Gina Cody School of Engineering and Computer Science > Computer Science and Software Engineering
Item Type:Thesis (Masters)
Authors:Sahu, Aman
Institution:Concordia University
Degree Name:M. Comp. Sc.
Program:Computer Science
Date:30 November 2025
Thesis Supervisor(s):Bentaleb, Abdelhak
ID Code:996591
Deposited By: Aman Sahu
Deposited On:29 Jun 2026 14:59
Last Modified:29 Jun 2026 14:59
All items in Spectrum are protected by copyright, with all rights reserved. The use of items is governed by Spectrum's terms of access.

Repository Staff Only: item control page

Downloads per month over past year

Research related to the current document (at the CORE website)
- Research related to the current document (at the CORE website)
Back to top Back to top