Electronic Product Feature-Based Sentiment Analysis Using Nu-SVM Method
Abstract
Sentiment in a product online review is useful and influence decision-making a person may take in buying any product as well as that of organization in determining the number of product to produce. In an opinion, reviewer may provide positive and negative reviews at the same time that can be ambiguous. This is because opinion targets are often not the product as a whole; instead they are only part of a product called as feature, which have advantages and disadvantages based on the reviewers point of view. In this paper, the goal is to produce sentiment of a mobile phone opinion based on its feature. Opinion data used in this thesis are in English taken from www.cnet.com. Feature extraction is conducted by searching for phrases that match the dependency relation template, which is followed by feature filtering. The sentiment identification, positive and negative probability value, as well as target class label of the data preparation become the Nu SVM classifier input parameters. In the study of NU SVM, some data are treated as unlabeled data. The evaluation towards sentiment identification obtained from the study shows F1 Measure of 86.25% for positive class and 77.71% for negative class. The accuracy for feature identification, however, is 82%.
Downloads
References
Barbosa, L., Kumar, R., Pang, B. and Tomkins, A., 2009, May. For a few dollars less: Identifying review pages sans human labels. In Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics (pp. 494-502). Association for Computational Linguistics.
Ceska, Z. and Fox, C., 2011. The influence of text pre-processing on plagiarism detection. Association for Computational Linguistics.
Chaovalit, P. and Zhou, L., 2005, January. Movie review mining: A comparison between supervised and unsupervised classification approaches. In System Sciences, 2005. HICSS'05. Proceedings of the 38th Annual Hawaii International Conference on (pp. 112c-112c). IEEE. [crossref]
Chen,Pai-Hsuen, Chih-Jen Lin, and Bernhard Scholkopf. A Tutorial on -Support Vector Machines. Department of Computer Science and Information Engineering National Taiwan University, Taipei 106, Taiwan.
Dey, L. and Haque, S.M., 2009. Opinion mining from noisy text data. International Journal on Document Analysis and Recognition (IJDAR), 12(3), pp.205-226. [crossref]
Feldman, R. and Sanger, J., IThe Text Mining Handbook.
Liu B.Opinion Mining.Department of Computer Science, University of Illinois at Chicago, 851 S. Morgan Street, Chicago, IL 60607-0753.
Liu, B., 2010. Sentiment Analysis and Subjectivity. Handbook of natural language processing, 2, pp.627-666.
Marcus, M.P., Marcinkiewicz, M.A. and Santorini, B., 1993. Building a large annotated corpus of English: The Penn Treebank. Computational linguistics, 19(2), pp.313-330.
Ohana, B. and Tierney, B., 2009, October. Sentiment classification of reviews using SentiWordNet. In 9th. IT & T Conference (p. 13).
Pang, B., Lee, L. and Vaithyanathan, S., 2002, July. Thumbs up?: sentiment classification using machine learning techniques. In Proceedings of the ACL-02 conference on Empirical methods in natural language processing-Volume 10 (pp. 79-86). Association for Computational Linguistics. [crossref]
Pang, B. and Lee, L., 2008. Opinion mining and sentiment analysis. Foundations and trends in information retrieval, 2(1-2), pp.1-135. [crossref]
Permadi, Y., 2008. Kategorisasi Teks Menggunakan N-gram untuk Dokumen Berbahasa Indonesia.
Popescu, A.M. and Etzioni, O., 2007. Extracting product features and opinions from reviews. In Natural language processing and text mining (pp. 9-28). Springer London. [crossref]
Soumen, C., 2003. Mining the web: Discovering knowledge from hypertext data.
Santosa, B., 2007. Data Mining Teknik Pemanfaatan Data untuk Keperluan Bisnis. Yogyakarta: Graha Ilmu.
Taboada, M., Brooke, J., Tofiloski, M., Voll, K. and Stede, M., 2011. Lexicon-based methods for sentiment analysis. Computational linguistics, 37(2), pp.267-307. [crossref]
Turney, P.D., 2002, July. Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. In Proceedings of the 40th annual meeting on association for computational linguistics (pp. 417-424). Association for Computational Linguistics.
Ning, Y. and Sandra, K., 2010. Semi-supervised Learning for Opinion Detection. Indiana University: Indiana.
This work is licensed under a Creative Commons Attribution 4.0 International License.
Manuscript submitted to IJoICT has to be an original work of the author(s), contains no element of plagiarism, and has never been published or is not being considered for publication in other journals. Author(s) shall agree to assign all copyright of published article to IJoICT. Requests related to future re-use and re-publication of major or substantial parts of the article must be consulted with the editors of IJoICT.