Analisis Sentimen pada Steam Review Menggunakan Multinomial Naïve Bayes dan Seleksi Fitur Gini Index Text
Abstract
Video game is one of the entertainment medias chosen by most people today, many of which are played through computer devices. On computer devices, many video games are obtained through one of the game distribution platforms, namely Steam. However, Steam has several shortcomings, including those related to Steam reviews. On Steam reviews, you can see the rating of the game, but the rating does not really show the actual quality or condition of the game. As one example, there are users who give a high rating to a game, but in the comments column the user actually mentions the shortcomings of the game. To reduce or anticipate unclear reviews for users who want to try or buy the game, sentiment analysis on reviews is used. In this research, the output produced is information on the results of sentiment classification in filtering reviews, using the Multinomial Naïve Bayes algorithm and combined with the Gini Index feature selection. Sentiment classification is divided into two classes, namely recommended and not recommended classes. In this study, to test the sentiment classification system, a dataset containing reviews in the form of review sentences from Steam is used. The test results using Multinomial Naïve Bayes and Gini Index, can achieve the best accuracy of 60.29%.
Keywords
References
B., V., & M., B. (2016). Analysis of Various Sentiment Classification Techniques. International Journal of Computer Applications, 140(3), 22–27. https://doi.org/10.5120/ijca2016909259
Berrar, D. (2019). Bayes’ theorem and naive Bayes classifier.
Butar, T. M. E. Y. B., Fauzi, M. A., & Indriati, I. (2019). Penentuan Rating Review Film Menggunakan Metode Multinomial Naive Bayes Classifier dengan Feature Selection Berbasis Chi-Square dan Galavotti-Sebastiani-Simi Coefficient. Jurnal Pengembangan Teknologi Informasi Dan Ilmu Komputer, 3(1), 447–453.
Chai, C. P. (2023). Comparison of Text Preprocessing Methods. Natural Language Engineering, 29(3), 509–553. https://doi.org/10.1017/S1351324922000213
Chebil, W., Wedyan, M., Alazab, M., Alturki, R., & Elshaweesh, O. (2023). Improving semantic information retrieval using multinomial naive Bayes classifier and Bayesian networks. Information, 14(5), 272.
Chen, J., Huang, H., Tian, S., & Qu, Y. (2009). Feature selection for text classification with Naïve Bayes. Expert Systems with Applications, 36(3, Part 1), 5432–5435. https://doi.org/10.1016/j.eswa.2008.06.054
Iqbal, F. (2018). Sentiment Analysis Using Ensemble Learners and Gini Index. International Journal of Engineering and Techniques, 4(2), 586–591.
Jia, X., & Sun, J. (2012). An Imroved Text Classification Method Based on Gini Index. Journal of Theoretical & Applied Information Technology, 43(2).
Lin, P., & Luo, X. (2020). A Survey of Sentiment Analysis Based on Machine Learning. In X. Zhu, M. Zhang, Y. Hong, & R. He (Eds.), Natural Language Processing and Chinese Computing (pp. 372–387). Cham: Springer International Publishing. https://doi.org/10.1007/978-3-030-60450-9_30
Liu, B. (2022). Sentiment Analysis and Opinion Mining. Springer Nature.
Lux, E. (2012). Feature selection for text classification with Naive Bayes. Retrieved from https://dspace.cuni.cz/handle/20.500.11956/46007
Park, H., Kwon, S., & Kwon, H.-C. (2010). Complete Gini-Index Text (GIT) feature-selection algorithm for text classification. The 2nd International Conference on Software Engineering and Data Mining, 366–371.
Pudjihartono, N., Fadason, T., Kempa-Liehr, A. W., & O’Sullivan, J. M. (2022). A Review of Feature Selection Methods for Machine Learning-Based Disease Risk Prediction. Frontiers in Bioinformatics, 2, 927312. https://doi.org/10.3389/fbinf.2022.927312
Sahu, T. P., & Ahuja, S. (2016). Sentiment analysis of movie reviews: A study on feature selection & classification algorithms. 2016 International Conference on Microelectronics, Computing and Communications (MicroCom), 1–6. https://doi.org/10.1109/MicroCom.2016.7522583
Shang, W., Huang, H., Zhu, H., Lin, Y., Qu, Y., & Wang, Z. (2007). A novel feature selection algorithm for text categorization. Expert Systems with Applications, 33(1), 1–5. https://doi.org/10.1016/j.eswa.2006.04.001
Varela, P. d. (2012). Sentiment Analysis. Lisboa: Instituto Superior Técnico.
Varela, P. L., Martins, A., Aguiar, P., & Figueiredo, M. (2013). An empirical study of feature selection for sentiment analysis. 9th Conference on Telecommunications, Conftele, Castelo Branco.
Vujović, Ž. & others. (2021). Classification Model Evaluation Metrics. International Journal of Advanced Computer Science and Applications, 12(6), 599–606.
DOI: 10.24269/jkt.v8i2.2981
Refbacks
- There are currently no refbacks.