Studies on Dynamic Loss Functions and Curriculum Learning in OffensEval Datasets

Title:

Studies on Dynamic Loss Functions and Curriculum Learning in OffensEval Datasets

Zhou, Zhanfan (2021) Studies on Dynamic Loss Functions and Curriculum Learning in OffensEval Datasets. Masters thesis, Concordia University.

Preview

Text (application/pdf)
Zhou_MCompSc_F2021.pdf - Accepted Version
Available under License Spectrum Terms of Access.

1MB

Abstract

The spread of offensive language has become a severe social problem and may stress unmeasurable mental health illnesses. The rapid usage of social media worsens the situation. We develop a lite but robust offensive language identification system and evaluate the system on two SemEval offensive language identification shared tasks: SemEval 2019 Task 6 and SemEval 2020 Task 12. In order to take the advantage of a large semi-supervised dataset, and reduce the processing complexity of such huge data, we investigate approaches to adapt a model to the silver standards via curriculum learning and dynamic loss functions. By adapting a model to such data with the curriculum learning or dynamic loss functions, the systems are capable of scattering the focus properly on data of different difficulty levels. Experiments show both help the model learn effectively and acquire more messages from the hard cases without impairing the performance on easy cases. The best run on each task achieves competitive F1 scores of 81.6% and 91.7% on the official test data of SemEval 2019 Task 6 and SemEval 2020 Task 12 respectively with at least 50\% parameters and less data overhead, compared to the state-of-the-art systems.

Divisions:	Concordia University > Gina Cody School of Engineering and Computer Science > Computer Science and Software Engineering
Item Type:	Thesis (Masters)
Authors:	Zhou, Zhanfan
Institution:	Concordia University
Degree Name:	M. Comp. Sc.
Program:	Computer Science
Date:	24 August 2021
Thesis Supervisor(s):	Bergler, Sabine
ID Code:	988799
Deposited By:	Zhanfan Zhou
Deposited On:	29 Nov 2021 16:17
Last Modified:	29 Nov 2021 16:17

Repository Staff Only: item control page

Download Statistics

Downloads per month over past year

Research related to the current document (at the CORE website)

Spectrum Research Repository

Studies on Dynamic Loss Functions and Curriculum Learning in OffensEval Datasets

Studies on Dynamic Loss Functions and Curriculum Learning in OffensEval Datasets

Abstract