Detecting fashion apparels and their landmarks

Title:

Detecting fashion apparels and their landmarks

Saini, Himani (2019) Detecting fashion apparels and their landmarks. Masters thesis, Concordia University.

Preview

Text (application/pdf)
Saini_MCompSc_S2020.pdf - Accepted Version
Available under License Spectrum Terms of Access.

5MB

Abstract

Fashion landmarks are the functional key-points on the apparels that can be used for a more discriminative visual analysis of the apparel images. Such a framework can facilitate apparel alignment in displaying apparel images on the websites or help build a system to ensure dress code in a particular environment. However, challenges such as background clutter, human poses, scales and apparel variation can render such a task difficult. We present a conceptually simple, flexible, and general framework for apparels’ landmark detection that can be simultaneously used for apparel classification and localization. In addition to the position of the landmarks in the apparels, we also classify the landmarks as visible or occluded in the same framework. The fashion landmark detection task is similar to joint localization and detection problems like human pose estimation, hence our approach extends stacked hourglass architecture, originally proposed to solve human pose estimation. We perform all these tasks in parallel using multi-task learning. Our proposed convolutional neural network is end-to-end differentiable and simple to train, since all these tasks are performed on the same architecture without any additional parameters to learn. Over the past few years, many modifications have been proposed to improve this architecture. We also compare the performances of some of these different variations of stacked hourglass architectures. These architectures leverage both global and local features captured by the deep convolutional neural networks to better localize the apparel in the image as well as the landmarks in those apparels. We test and analyze our results on DeepFashion dataset. We also weigh the trade-offs of the detecting the landmarks in a category-aware environment, i.e., pre-classified apparels and category-agnostic environment, i.e., unclassified apparels.

Divisions:	Concordia University > Gina Cody School of Engineering and Computer Science
Item Type:	Thesis (Masters)
Authors:	Saini, Himani
Institution:	Concordia University
Degree Name:	M. Comp. Sc.
Program:	Computer Science
Date:	25 September 2019
Thesis Supervisor(s):	Yu, Jia Yuan
ID Code:	986026
Deposited By:	Himani Saini
Deposited On:	27 Oct 2022 13:50
Last Modified:	27 Oct 2022 13:50