Login | Register

Defending Object Detection Models against Image Distortions

Title:

Defending Object Detection Models against Image Distortions

Ofori-Oduro, Mark (2024) Defending Object Detection Models against Image Distortions. PhD thesis, Concordia University.

[thumbnail of Ofori-Oduro_PhD_F2024.pdf]
Preview
Text (application/pdf)
Ofori-Oduro_PhD_F2024.pdf - Accepted Version
Available under License Spectrum Terms of Access.
9MB

Abstract

Object detection has significantly advanced with deep learning but faces challenges under image distortions like noise, compression, blur, fog, and snow. This issue is critical in applications such as self-driving cars and healthcare. While defence methods aim to enhance robustness under distortions, maintaining performance on clean images remains a challenge.
To address this challenge, we propose a novel defence approach that generates copies of the original training images and adds distortion-like content to these copies at the pixel level. We balance the number of distorted pixels to prevent bias during learning. Our approach includes two augmentation methods to generate augmented samples: AISbod, which uses artificial immune systems (AIS), and GSES, which employs kernel density estimation (KDE).

Our AISbod uses AIS to distort the original sample (antigen) through cycles of “select, clone, mutate, select” until the augmented data (antibody) reaches a specified similarity to the antigen. However, AIS is limited in diversifying generated antibodies and is computationally expensive. Therefore, in GSES, we create samples by selecting pixels from the original samples, estimating the pixel distribution from multiple distorted versions of the original samples via KDE, and then replacing the selected pixels with new values sampled from the estimated distribution. GSES generates more diverse data than AISbod.

We evaluate our methods on 15 image distortions using state-of-the-art object detection models like DINO and YOLOv7. Our methods improve accuracy under distorted and clean images and remain consistent across datasets and detection models. For instance, DINO on the COCO dataset shows a 4.50% improvement under clean samples, 8.40% on average across all distortions, 2.50% under snow, and 29.30% under impulse noise. The observed improvement is due to the weight regularization effect of our methods, which is evident in the smoother convergence of training and validation loss curves, indicating reduced learning fluctuations and a more stable optimization path. Additionally, the narrower gap between the curves suggests reduced over-fitting, leading to better generalization to unseen data. Simulations indicate that our approach outperforms related defence methods against distortions and extends beyond object detection, improving accuracy in image classification and object tracking models.

Divisions:Concordia University > Gina Cody School of Engineering and Computer Science > Electrical and Computer Engineering
Item Type:Thesis (PhD)
Authors:Ofori-Oduro, Mark
Institution:Concordia University
Degree Name:Ph. D.
Program:Electrical and Computer Engineering
Date:17 July 2024
Thesis Supervisor(s):Amer, Maria
ID Code:994497
Deposited By: Mark Ofori-Oduro
Deposited On:24 Oct 2024 16:54
Last Modified:24 Oct 2024 16:54
All items in Spectrum are protected by copyright, with all rights reserved. The use of items is governed by Spectrum's terms of access.

Repository Staff Only: item control page

Downloads per month over past year

Research related to the current document (at the CORE website)
- Research related to the current document (at the CORE website)
Back to top Back to top