Farhat, Abdallah (2023) Evaluation of Citation Graph Thematic Dataset Construction and Paper Filtering Methods for Research Literature Recommendation. Masters thesis, Concordia University.
Preview |
Text (application/pdf)
919kBFarhat_MASc_F2023.pdf - Accepted Version Available under License Spectrum Terms of Access. |
Abstract
One of the main challenges faced by new researchers is immersing themselves in the existing literature relevant to their field of interest. The vastness and continuous growth of knowledge in their field can be overwhelming, making it difficult to identify the most pertinent research papers within their research themes. To address this issue, research paper recommender systems have emerged as valuable tools. These systems allow researchers to find relevant papers based on their specific interests or research themes by analyzing various aspects such as titles, abstracts, and full texts. The quality of the dataset used is crucial for the development, testing, and refinement of these systems to ensure optimal results. Dataset quality directly impacts the accuracy and reliability of a recommender system. In this thesis, I propose a novel approach for constructing datasets using citation graph networks. These networks consist of nodes representing research papers and edges representing citations between them. By leveraging citation graph networks, we gain a more comprehensive understanding of the relationships and influences among different papers compared to traditional methods that rely solely on keyword searches. To evaluate the effectiveness of the citation graph network method, I compared it with the traditional keyword search approach for dataset construction. Additionally, I assessed the effectiveness of three recommender system algorithms: user-based collaborative filtering, combined with PageRank and personalized PageRank algorithms. The experimental findings provide clear evidence that utilizing citation graph network datasets significantly enhances the efficacy of research paper recommender systems. This improvement simplifies the process of finding relevant literature for researchers, potentially accelerating scientific discovery.
Divisions: | Concordia University > Gina Cody School of Engineering and Computer Science > Concordia Institute for Information Systems Engineering |
---|---|
Item Type: | Thesis (Masters) |
Authors: | Farhat, Abdallah |
Institution: | Concordia University |
Degree Name: | M.A. Sc. |
Program: | Quality Systems Engineering |
Date: | June 2023 |
Thesis Supervisor(s): | Wang, Chun |
ID Code: | 992323 |
Deposited By: | Abdallah Farhat |
Deposited On: | 17 Nov 2023 14:52 |
Last Modified: | 17 Nov 2023 14:52 |
Repository Staff Only: item control page