Login | Register

New paradigms for radiological segmentation with AI foundation models: automatic prompting and interactive VR agent design

Title:

New paradigms for radiological segmentation with AI foundation models: automatic prompting and interactive VR agent design

Spiegler, Pascal ORCID: https://orcid.org/0009-0002-6600-957X (2025) New paradigms for radiological segmentation with AI foundation models: automatic prompting and interactive VR agent design. Masters thesis, Concordia University.

[thumbnail of Spiegler_MCompSc_F2025.pdf]
Preview
Text (application/pdf)
Spiegler_MCompSc_F2025.pdf - Accepted Version
Available under License Spectrum Terms of Access.
46MB

Abstract

This thesis investigates data-efficient and interactive methods for medical image segmentation, addressing two major related hurdles: the scarcity of high-quality annotations for fully-supervised lesion segmentation and the labor/expertise-intensive nature of manual segmentation workflows. Our first contribution, YOLO-URSAM, is a weakly supervised intracranial hemorrhage (ICH) segmentation model that leverages the Segment-Anything Model (SAM), where we fine-tune YOLOv8 on bounding-box annotated CT scans, introduce automatic point prompting for SAM, and apply perturbation-based majority-voting for uncertainty rectification. YOLO-URSAM achieves 0.933 detection accuracy, 0.796 AUC, and a mean Dice score of 0.629, surpassing existing weakly supervised methods and popular supervised models (U-Net and Swin-UNETR) on available public data. Our second contribution, SAMIRA, is a virtual reality system with a conversational AI agent for semi-automated 3D radiological segmentation. Users issue simple voice commands to initialize masks, correct them using a human-in-the-loop approach, then view the 3D segmentations as a true-to-scale 3D mesh. In a user study, SAMIRA achieved a System Usability Scale score of 90.0±9.0, demonstrated a low cognitive load, and was praised for its intuitive guidance, educational benefits, and immersive visualization. Together, these methods combine AI foundation models, prompting, and limited human interaction to deliver accurate and efficient segmentation for clinical imaging.

Divisions:Concordia University > Gina Cody School of Engineering and Computer Science > Computer Science and Software Engineering
Item Type:Thesis (Masters)
Authors:Spiegler, Pascal
Institution:Concordia University
Degree Name:M. Comp. Sc.
Program:Computer Science
Date:7 August 2025
Thesis Supervisor(s):Xiao, Yiming
ID Code:996218
Deposited By: Pascal Spiegler
Deposited On:04 Nov 2025 15:40
Last Modified:04 Nov 2025 15:40
All items in Spectrum are protected by copyright, with all rights reserved. The use of items is governed by Spectrum's terms of access.

Repository Staff Only: item control page

Downloads per month over past year

Research related to the current document (at the CORE website)
- Research related to the current document (at the CORE website)
Back to top Back to top