Integration of Inconsistency and Content Interaction with Deep Learning to Detect Fake Reviews

Sharifpour, Kiana (2024) Integration of Inconsistency and Content Interaction with Deep Learning to Detect Fake Reviews. Masters thesis, Concordia University.

Text (application/pdf)
Sharifpour_M. Sc._S2025.pdf - Accepted Version
Available under License Spectrum Terms of Access.

1MB

Abstract

This thesis addresses the detection of fake reviews in e-commerce using natural language processing and deep learning. It introduces two frameworks for identifying fraudulent reviews, a critical concern given their impact on consumer behavior and market dynamics. Central to the study is rating-sentiment inconsistency (RSI), a textual feature indicative of potential deception, which is used to enhance the detection of fake content. Both approaches are evaluated on a dataset of Amazon reviews. The first integrates RSI with word embeddings, specifically GloVe and Word2Vec, and improves accuracy by 3.07% for GloVe and 0.67% for Word2Vec; these results also demonstrate the effectiveness of bidirectional gated recurrent units (BiGRUs) in capturing the sequential nature of textual data. The second incorporates inconsistency features into Doc2Vec representations and achieves a 1.55% increase in accuracy over models without this feature. Both methods use grid search to tune hyperparameters, which further improves model performance. Together, the two methods underscore the importance of content-based feature integration and demonstrate the practical application of inconsistency metrics in fake review detection. The observed accuracy improvements support the effectiveness of the proposed frameworks, offering new insights into strengthening the integrity of online review systems and advancing natural language processing in commercial settings.
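
To make the idea of rating-sentiment inconsistency concrete, the following is a minimal sketch of how such a feature might be computed and appended to a document vector. It is not the thesis's actual formulation, which this record does not specify. The function names, the 1 to 5 star scale, the assumption that a sentiment polarity in [-1, 1] has already been produced by some sentiment analyzer, and the scaling of the score to [0, 1] are all illustrative assumptions.

```python
from typing import List, Sequence


def normalize_rating(stars: float, min_stars: float = 1.0, max_stars: float = 5.0) -> float:
    """Map a star rating onto [-1, 1] so it is comparable with a polarity score.

    The 1-5 scale is an assumption made for this sketch.
    """
    if not min_stars <= stars <= max_stars:
        raise ValueError("rating is outside the expected scale")
    return 2.0 * (stars - min_stars) / (max_stars - min_stars) - 1.0


def rating_sentiment_inconsistency(stars: float, polarity: float) -> float:
    """Hypothetical RSI score in [0, 1].

    0 means the star rating and the text sentiment agree;
    1 means they are maximally opposed (e.g. 5 stars with fully negative text).
    `polarity` is assumed to be precomputed by any sentiment analyzer
    that outputs values in [-1, 1].
    """
    if not -1.0 <= polarity <= 1.0:
        raise ValueError("polarity must lie in [-1, 1]")
    return abs(normalize_rating(stars) - polarity) / 2.0


def append_rsi(doc_vector: Sequence[float], stars: float, polarity: float) -> List[float]:
    """Return the document vector with the RSI score added as one extra feature."""
    return list(doc_vector) + [rating_sentiment_inconsistency(stars, polarity)]


if __name__ == "__main__":
    # A 5-star review whose text reads as strongly negative is highly inconsistent.
    print(rating_sentiment_inconsistency(5, -1.0))
    # A 3-star review with neutral text is fully consistent.
    print(rating_sentiment_inconsistency(3, 0.0))
    # Toy 2-dimensional "document embedding" extended with the RSI feature.
    print(append_rsi([0.1, 0.2], 5, -1.0))
```

In a setup like the second framework described above, the extended vector would then be passed to a classifier in place of the plain document embedding; how the thesis actually combines the two is not stated in this record.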

Divisions:	Concordia University > John Molson School of Business > Supply Chain and Business Technology Management
Item Type:	Thesis (Masters)
Authors:	Sharifpour, Kiana
Institution:	Concordia University
Degree Name:	M. Sc.
Program:	Business Administration (Supply Chain and Business Technology Management specialization)
Date:	6 December 2024
Thesis Supervisor(s):	Lahmiri, Salim
Keywords:	Fake review detection, review inconsistency, deep learning (DL), word embeddings (WE), natural language processing (NLP), text classification
ID Code:	994992
Deposited By:	Kiana Sharifpour
Deposited On:	17 Jun 2025 17:43
Last Modified:	17 Jun 2025 17:43
