TopicGAN: Online fake review detection using topic embedding

Sirish Prabakar

Thesis

Open access

California State University, Sacramento

Master of Science (MS), California State University, Sacramento

06/16/2023

Handle: https://hdl.handle.net/20.500.12741/rep:10985

Keywords: Generative adversarial networks; Fake review detection; Semi-supervised learning; Text classification; Machine Learning

Abstract

In recent years, online businesses and websites have become the main target of fake online reviews, which are intentionally written to push business ratings up or down. Most existing work on fake review detection focuses on supervised methods that use the lexical and syntactic patterns of the reviews. In this paper, we propose TopicGAN, a GAN-based semi-supervised framework for online fake review detection using topic modeling. Specifically, we first extract spatial named entities from the reviews and employ fuzzy string matching to obtain their embeddings. Second, we train an embedded topic model and represent the words and spatial named entities that appear in reviews by their corresponding topic distributions. TopicGAN builds on two discriminators: one differentiates between real and fake reviews, and the other differentiates between fake reviews from the dataset and fake reviews from the generator. In this way, the generator competes with the discriminators in a min-max game until convergence. This architecture, coupled with topic modeling and other novel features, allows TopicGAN to compete with state-of-the-art semi-supervised methods on all performance metrics for detecting both real and fake reviews.
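The fuzzy string matching step mentioned in the abstract can be illustrated with a minimal sketch. This is not the implementation from the thesis: the embedding table `EMBEDDINGS`, the helper `lookup_embedding`, and the `cutoff` value are hypothetical names chosen for illustration, and Python's standard `difflib` is assumed as the matcher. The idea is only that a misspelled or variant entity name is mapped to the closest known key so that its embedding can be retrieved.

```python
from difflib import get_close_matches

# Hypothetical toy embedding table (assumption): in practice the vectors
# would come from a trained embedding or topic model.
EMBEDDINGS = {
    "sacramento": [0.1, 0.9],
    "san francisco": [0.8, 0.2],
    "los angeles": [0.5, 0.5],
}


def lookup_embedding(entity, table=EMBEDDINGS, cutoff=0.8):
    """Return (matched_key, vector) for the closest key in `table`.

    Returns None when no key is similar enough to `entity`.
    """
    key = entity.strip().lower()
    # An exact match needs no fuzzy search.
    if key in table:
        return key, table[key]
    # get_close_matches ranks candidates by SequenceMatcher ratio and
    # keeps only those scoring at least `cutoff`.
    matches = get_close_matches(key, list(table), n=1, cutoff=cutoff)
    if not matches:
        return None
    return matches[0], table[matches[0]]
```

With this sketch, a misspelling such as `lookup_embedding("Sacremento")` resolves to the `"sacramento"` entry, while an unrelated name such as `"Tokyo"` yields `None`. The cutoff trades recall against false matches and would need tuning on real review data.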

Files and links (1)

PrabakarSirish_Spring2022_508CompliantCopy (PDF, 902.04 kB)

Text; Project; Open Access

Details

Title: TopicGAN: Online fake review detection using topic embedding
Creators: Sirish Prabakar
Contributors: Haiquan Chen (Advisor); Anna Baynes (Committee Member)
Academic Unit: Computer Science Department
Theses and Dissertations: Master of Science (MS); Computer Science; California State University, Sacramento; 05/06/2022; 2022
Publisher: California State University, Sacramento
Publication Details: 06/16/2023
Identifiers: 99258056263901671; https://hdl.handle.net/20.500.12741/rep:10985
Resource Type: Masters Project
Language: English
Number of pages: 46
Accessibility Statement: This document has been made accessible/508 compliant by Sacramento State University Library. For questions, please contact lib-accessibility@csus.edu.