Automatic Handwriting Analysis for Classifying Multi-Label Personality Traits using Transformer OCR

Title:

Automatic Handwriting Analysis for Classifying Multi-Label Personality Traits using Transformer OCR

Adeli Shamsabad, Marzieh ORCID: https://orcid.org/0009-0005-7680-6689 (2025) Automatic Handwriting Analysis for Classifying Multi-Label Personality Traits using Transformer OCR. Masters thesis, Concordia University.

[thumbnail of Adeli Shamsabad_MASc_S2025.pdf]

Preview

Text (application/pdf)
Adeli Shamsabad_MASc_S2025.pdf - Accepted Version
Available under License Spectrum Terms of Access.

12MB

Abstract

Handwriting analysis, or graphology, studies an individual’s psychological traits through handwriting patterns and features. It is used in forensic science, criminology, and disease diagnosis.

Previous studies have evaluated the correlation between psychological questionnaires and manual handwriting analysis, but results were inconsistent due to its limitations and human error. This research addresses these challenges by developing an automated handwriting analysis system using deep learning to predict multi-label personality traits based on the Big Five Factor Model (BFFM).

The proposed model is built on the Transformer OCR (TrOCR) architecture, pre-trained on diverse datasets, including handwritten texts like IAM. In this study, the text generation function is replaced with a classification approach to predict levels (Low, Average, High) of BFFM traits from handwriting samples. The model uses Focal Loss to handle class imbalance and Binary Cross-Entropy with Logits for accurate classification.

The dataset includes 873 French and 181 English handwriting samples from CENPARMI, originally labeled for Extraversion and Conscientiousness. It has been expanded to cover all five BFFM traits: Extraversion, Neuroticism, Agreeableness, Conscientiousness, and Openness to Experience, totaling 1,054 samples. Each sample is segmented into individual lines to improve generalization.

The model's performance is compared with ResNet50 and Vision Transformers (ViT Base 16 - 224 and 384). Results show that TrOCR outperforms them in accuracy and overall performance. For two personality traits, it achieves 90.05% accuracy, AUROC of 0.97, and F-Score of 89%. For all five traits, it reaches 89.01% accuracy, AUROC of 0.95, and F-Score of 87%. Extraversion shows the weakest performance (AUROC of 91), while Agreeableness performs best (AUROC of 97). These results highlight the model's effectiveness in classifying BFFM traits despite class imbalance.

Divisions:	Concordia University > Research Units > Centre for Pattern Recognition and Machine Intelligence
Item Type:	Thesis (Masters)
Authors:	Adeli Shamsabad, Marzieh
Institution:	Concordia University
Degree Name:	M.A. Sc.
Program:	Computer Science
Date:	4 April 2025
Thesis Supervisor(s):	Suen, Ching Yee
ID Code:	995415
Deposited By:	Marzieh Adeli Shamsabad
Deposited On:	17 Jun 2025 17:30
Last Modified:	17 Jun 2025 17:30

Repository Staff Only: item control page

Download Statistics

Downloads per month over past year

Research related to the current document (at the CORE website)

Spectrum Research Repository

Automatic Handwriting Analysis for Classifying Multi-Label Personality Traits using Transformer OCR

Automatic Handwriting Analysis for Classifying Multi-Label Personality Traits using Transformer OCR

Abstract