Login | Register

A Deep Few-Shot Network for Protein Family Classification

Title:

A Deep Few-Shot Network for Protein Family Classification

Jamali, Saeedeh (2024) A Deep Few-Shot Network for Protein Family Classification. Masters thesis, Concordia University.

[thumbnail of Jamali_MSc_S2024.pdf]
Preview
Text (application/pdf)
Jamali_MSc_S2024.pdf - Accepted Version
Available under License Spectrum Terms of Access.
1MB

Abstract

Protein sequence analysis is arguably a challenging modern bioinformatics problem covering various applications such as disease research, precision medicine, and therapeutics. Given the emergence of sequencing technologies and the resulting large-scale databases, protein family classification is an open problem in bioinformatics. Recent advances in computer science have opened new gates to researchers in various scientific domains. Bioinformatics, as an intermediary research field, takes advantage of these advancements from conventional machine learning methods to large language models, and biostatistics. Utilized machine learning techniques for protein family classification, are dependent on domain experts to generate features which could be time-consuming and challenging. Deep learning algorithms have shown promising results in proteomics; however, their application is limited to the availability of massive data sets for training. Since the required data comes from experiments, it can be highly complex or incomplete. As an alternative, few-shot models can learn and generalize from a few observations. To address the mentioned limitations, in this research, we designed and implemented a deep few-shot network for protein family classification and our result showed outperformance to state-of-the-art baseline models. To the best of our knowledge, this is the first deep network tailored for primary sequence family classification that can highly perform with a very limited number of observations.

Divisions:Concordia University > Faculty of Arts and Science > Mathematics and Statistics
Item Type:Thesis (Masters)
Authors:Jamali, Saeedeh
Institution:Concordia University
Degree Name:M. Sc.
Program:Mathematics
Date:25 March 2024
Thesis Supervisor(s):Chaubey, Yogendra P. and Ebadi, Ashkan
ID Code:993700
Deposited By: Saeedeh Jamali
Deposited On:05 Jun 2024 16:27
Last Modified:05 Jun 2024 16:27
All items in Spectrum are protected by copyright, with all rights reserved. The use of items is governed by Spectrum's terms of access.

Repository Staff Only: item control page

Downloads per month over past year

Research related to the current document (at the CORE website)
- Research related to the current document (at the CORE website)
Back to top Back to top