Multimodal Learning-Based Frameworks for Sensor-Free Force-Aware Perception and Autonomous Navigation in Cardiac Catheterization

Title:

Multimodal Learning-Based Frameworks for Sensor-Free Force-Aware Perception and Autonomous Navigation in Cardiac Catheterization

Fekri, Pedram ORCID: https://orcid.org/0000-0003-1966-8724 (2026) Multimodal Learning-Based Frameworks for Sensor-Free Force-Aware Perception and Autonomous Navigation in Cardiac Catheterization. PhD thesis, Concordia University.

Preview

Text (application/pdf)
Fekri_PhD_S2026.pdf - Accepted Version
Available under License Spectrum Terms of Access.

9MB

Abstract

Cardiac catheterization is a procedure in minimally invasive cardiovascular interventions, where
safe and effective performance relies on accurate force awareness, catheter visualization, and navigation
within complex vascular geometries. This doctoral research investigates learning-based,
sensor-free frameworks using standard clinical imaging modalities, with the objective of advancing
toward autonomous catheter-based systems. The work is conducted as a multidisciplinary study
at the intersection of artificial intelligence, deep learning–based computer vision, medical robotics,
mechatronics, and autonomous systems, based on the premise that catheter deflections observed
from multiple imaging viewpoints encode sufficient information to infer three-dimensional contact
forces and support goal-directed navigation. A lightweight fusion-based convolutional neural network
is first proposed to estimate three-dimensional contact forces directly from stereo catheter
images without reliance on physical sensors or explicit mechanical modeling. This is extended
to a multitask encoder–decoder architecture that simultaneously performs catheter segmentation
and force estimation from biplane fluoroscopic images within a single end-to-end framework, supported
by a synthetic X-ray image generator designed to resemble clinical fluoroscopy. The framework
is further extended through a Vision Transformer–based architecture with cross-attention for
joint stereo segmentation and force estimation. Finally, autonomous catheter navigation is explored
through a goal-conditioned, multimodal vision-to-action model trained using imitation learning for
perception-driven catheter steering in a physical robotic setup. Overall, this dissertation demonstrates
that force estimation, catheter perception, and navigation can be learned within sensor-free
frameworks, supporting future autonomous robotic catheterization systems.

Divisions:	Concordia University > Gina Cody School of Engineering and Computer Science > Mechanical, Industrial and Aerospace Engineering
Item Type:	Thesis (PhD)
Authors:	Fekri, Pedram
Institution:	Concordia University
Degree Name:	Ph. D.
Program:	Mechanical Engineering
Date:	January 2026
Thesis Supervisor(s):	Dargahi, Javad and Zadeh, Mehrdad
ID Code:	996852
Deposited By:	Pedram Fekri
Deposited On:	29 Jun 2026 17:57
Last Modified:	29 Jun 2026 17:57

Repository Staff Only: item control page

Download Statistics

Downloads per month over past year

Research related to the current document (at the CORE website)

Spectrum Research Repository

Multimodal Learning-Based Frameworks for Sensor-Free Force-Aware Perception and Autonomous Navigation in Cardiac Catheterization

Multimodal Learning-Based Frameworks for Sensor-Free Force-Aware Perception and Autonomous Navigation in Cardiac Catheterization

Abstract