Disentangling the AI Black-Box Model: From Direct to Indirect Influence

Title:

Disentangling the AI Black-Box Model: From Direct to Indirect Influence

Ishkhanian, Serly (2025) Disentangling the AI Black-Box Model: From Direct to Indirect Influence. Masters thesis, Concordia University.

Preview

Text (application/pdf)
Ishkhanian_MA_S2025.pdf - Accepted Version
Available under License Spectrum Terms of Access.

4MB

Abstract

Due to the widespread use of artificial intelligence and machine learning models across numerous domains that directly impact individuals’ lives, it is essential to ensure that these models are making their decisions in a non-discriminatory manner. That is, biases encoded in the underlying training data are not reflected in the output of the model. This is, however, often hard to control since the predictive accuracy of these models come at the cost of interpretability.
The purpose of this thesis is to investigate and apply methods for interpreting black-box machine learning models to bring more transparency into their decision-making process by analyzing both the direct and indirect influence of input features on model predictions. We begin by studying the theoretical background of the SHAP (SHapley Additive exPlanations) framework, Shapley values, and their implementation in measuring direct influence. We explore the SHAP Python package for both local and global explanations and visualization of direct feature influence on both synthetic and real-world datasets. We then transition to studying indirect influence using the Disentangled Influence Audit procedure, which uses adversarial training to learn disentangled latent representations for auditing hidden dependencies. After presenting all the background information, we implement this procedure and evaluate these methods through numerical experiments on synthetic functions and real-world datasets including the Adult Income and the Montréal Housing datasets. In addition to these experiments, we also examine how “data-hungry” these auditing methods are by examining their convergence as a function of the number of samples required. Our results demonstrate the importance of auditing both direct and indirect pathways of influence to promote interpretability and fairness in complex models.

Divisions:	Concordia University > Faculty of Arts and Science > Mathematics and Statistics
Item Type:	Thesis (Masters)
Authors:	Ishkhanian, Serly
Institution:	Concordia University
Degree Name:	M.A.
Program:	Mathematics
Date:	30 June 2025
Thesis Supervisor(s):	Brugiapaglia, Simone and Wang, Weiqi
ID Code:	996123
Deposited By:	Serly Ishkhanian
Deposited On:	04 Nov 2025 17:05
Last Modified:	04 Nov 2025 17:05

Repository Staff Only: item control page

Download Statistics

Downloads per month over past year

Research related to the current document (at the CORE website)

Spectrum Research Repository

Disentangling the AI Black-Box Model: From Direct to Indirect Influence

Disentangling the AI Black-Box Model: From Direct to Indirect Influence

Abstract