Login | Register

Three Essays on Stochastic Processes, Reinforcement Learning, and Financial Applications

Title:

Three Essays on Stochastic Processes, Reinforcement Learning, and Financial Applications

Ahmadi, Zaniar ORCID: https://orcid.org/0000-0003-0145-1252 (2025) Three Essays on Stochastic Processes, Reinforcement Learning, and Financial Applications. PhD thesis, Concordia University.

[thumbnail of Ahmadi_PhD_F2025.pdf]
Preview
Text (application/pdf)
Ahmadi_PhD_F2025.pdf - Accepted Version
Available under License Spectrum Terms of Access.
2MB

Abstract

In this dissertation, we undertake three research projects.
In the first study, by adopting a perturbation approach, we find expressions of potential densities for refracted skew Brownian motion (skew Brownian motion with two-valued drift). As applications, we recover its transition density and study its long-time asymptotic behaviors. In addition, we also compare with previous results on transition densities for skew Brownian motions. We propose two approaches for generating quasi-random samples by approximating the cumulative distribution function and discuss their risk measurement application.

In the second study, we introduce a dynamic portfolio optimization model for an investor that is averse to both financial risk and the greenhouse gas (GHG) emissions footprint from the portfolio. The dynamic asset allocation is obtained by solving for the optimal mean-variance-emission trade-off with reinforcement learning (RL).
As an application of the model, we consider the case of an investor seeking the optimal allocation across the eleven sectors of the S&P 500. We conduct an analysis for multiple investors, each with varying levels of risk and GHG aversion. Optimal RL portfolios are shown to substantially outperform equal-weigthed, optimal static and myopic dynamic benchmarks by (i) properly identifying sectors offering favorable trade-off between financial and environmental performance that are in-line with the investor's preferences, (ii) adjusting the allocation dynamically to reflect changes in market and emission conditions, and (iii) considering the interaction between the various rebalancing decisions over time. Moreover, the RL optimization model demonstrates consistent performance throughout the entire investment period, not just at the final time horizon, making it a robust strategy.

Finally, in the last study, we investigate reinforcement learning (RL) as a framework for the dynamic hedging of swaptions, contrasting its performance with traditional sensitivity-based rho-hedging. We design agents under three distinct objective functions: mean squared error, downside risk, and Conditional Value-at-Risk to capture alternative risk preferences and evaluate how these objectives shape hedging styles.
Relying on a three-factor arbitrage-free dynamic Nelson-Siegel model for our simulation experiments, our findings show that most hedging effectiveness is achieved with two swaps as hedging instruments. The underlying swap consistently anchors the hedge by replicating exposure to the level factor, while a second swap complements it by mitigating residual slope and curvature risks, or by serving as a targeted tail-risk hedge under CVaR objectives. The different objectives induce distinct behaviors: the MSE agent relies on level exposures, the downside risk agent balances gain protection with selective risk-taking, and the CVaR agent prioritizes tail protection while opportunistically harvesting premia. Compared with rho-hedging, RL strategies avoid over-hedging, adapt dynamically across payoff states, reduce turnover when transaction costs are present, and remain robust under model misspecification. These results highlight RL’s potential to deliver more efficient and resilient swaption hedging strategies.

Divisions:Concordia University > Faculty of Arts and Science > Mathematics and Statistics
Item Type:Thesis (PhD)
Authors:Ahmadi, Zaniar
Institution:Concordia University
Degree Name:Ph. D.
Program:Mathematics
Date:11 November 2025
Thesis Supervisor(s):Godin, Frederic and Zhou, Xiaowen
Keywords:Reinforcement Learning, Portffolio Optimization, Swaption Hedging, Refracted Skew Brownian Motion
ID Code:996451
Deposited By: Zaniar Ahmadi
Deposited On:29 Jun 2026 17:55
Last Modified:29 Jun 2026 17:55
All items in Spectrum are protected by copyright, with all rights reserved. The use of items is governed by Spectrum's terms of access.

Repository Staff Only: item control page

Downloads per month over past year

Research related to the current document (at the CORE website)
- Research related to the current document (at the CORE website)
Back to top Back to top