Sentiment Classification of Product Reviews Using SVM and Decision Tree Induction
DOI:
https://doi.org/10.7152/acro.v14i1.14113Abstract
This paper reports a study in automatic sentiment classification, i.e. automatically classifying documents as expressing positive or negative sentiments/opinions. The study investigates the effectiveness of using SVM (Support Vector Machine) and Decision Tree induction on various text features to classify product reviews into recommended (positive sentiment) and not recommended (negative sentiment). Compared with traditional topical classification, it was hypothesized that syntactic and semantic processing of text would be more important for sentiment classification. In this study, five different approaches, unigrams (individual words), part-of-speech tagging, association rules, use of negation, and use of WordNet synsets (identifying a set of synonyms) were investigated. A sample of 1,800 miscellaneous product reviews was retrieved from Review Centre (www.reviewcentre.com. 2003) for the study. 1,200 reviews were used for training, and 600 for validation. Using SVM, the baseline unigrams approach obtained an accuracy rate of 81.3%. The use of WordNet synsets obtained marginally better result of 81.7%. The other text features did not yield better results. Error analysis suggests 3 approaches for improving classification accuracy: making inference from superficial words, solving the problems of "comments on parts" and "negation". Finally, Decision Tree induction was used to generate a list of indicative words that can identify the polarity of articles."Downloads
Published
2003-10-01
Issue
Section
Articles
License
Authors who publish with this journal agree to the following terms:- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).