Login | Register

Transformer-Based Models for Identifying Customer Needs in User-Generated Content: Performance Gaps, Unintended Bias, and Broader Implications

Title:

Transformer-Based Models for Identifying Customer Needs in User-Generated Content: Performance Gaps, Unintended Bias, and Broader Implications

Kashi, Mehrshad ORCID: https://orcid.org/0009-0008-8899-7621 (2025) Transformer-Based Models for Identifying Customer Needs in User-Generated Content: Performance Gaps, Unintended Bias, and Broader Implications. Masters thesis, Concordia University.

[thumbnail of Kashi_MA_S2025.pdf]
Preview
Text (application/pdf)
Kashi_MA_S2025.pdf - Accepted Version
Available under License Spectrum Terms of Access.
3MB

Abstract

This thesis reviews and evaluates intelligent methods for identifying customer needs in user-generated content (UGC). It first surveys prior work and shows that many studies share generic goals yet overlook the complexity and taxonomy of needs in their evaluation setups. To clarify scope, the thesis distinguishes between using Machine Learning (ML) as a tool to support marketing workflows and treating customer-needs identification itself as an Natural Language Processing (NLP) task with clear definitions and constructs. Building on this perspective, a large experimental study assesses Transformer-based models for generalizability, robustness, fairness, and sample efficiency across varied settings. Results indicate competitive accuracy, with gains in F1 up to 18% over baselines, but also consistent limitations: shared error patterns, difficulty with rare or unseen needs, reliance on lexical cues that weakens cross-domain performance, and no guaranteed gains in sample efficiency from larger models. Cross-domain results benefit most from richer, diverse domain training, while adding more in-domain data does not improve transfer. Beyond technical metrics, the thesis highlights adoption barriers, costs, data constraints, task complexity, and ethical considerations and argues for evaluation frameworks that reflect taxonomy, transparency, and fairness. It concludes with practical guidance that bridges marketing theory and NLP practice to support responsible, reproducible deployment.

Divisions:Concordia University > Gina Cody School of Engineering and Computer Science > Electrical and Computer Engineering
Item Type:Thesis (Masters)
Authors:Kashi, Mehrshad
Institution:Concordia University
Degree Name:M.A. Sc.
Program:Electrical and Computer Engineering
Date:26 August 2025
Thesis Supervisor(s):Ait Mohamed, Otmane and Lahmiri, Salim
ID Code:996055
Deposited By: Mehrshad Kashi
Deposited On:04 Nov 2025 16:08
Last Modified:04 Nov 2025 16:08
All items in Spectrum are protected by copyright, with all rights reserved. The use of items is governed by Spectrum's terms of access.

Repository Staff Only: item control page

Downloads per month over past year

Research related to the current document (at the CORE website)
- Research related to the current document (at the CORE website)
Back to top Back to top