Sentiment Analysis Using Naive Bayes Algorithm with Feature Selection Particle Swarm Optimization (PSO) and Genetic Algorithm
Keywords: Sentiment Analysis, Twitter, Naive Bayes, Feature Selection, Particle Swarm Optimization, Genetic Algorithm
This study applies sentiment analysis to examine the opinions, points of view, judgments, attitudes, and emotions toward entities and their aspects expressed in text. Twitter is one of the most widely used social media platforms for communication and is a common subject of research. The main problem in sentiment analysis is selecting and using the best features to obtain maximum results. Naive Bayes is among the most widely known classification methods; however, it is very sensitive to the features it is given. Therefore, this study compares feature selection using Particle Swarm Optimization and a Genetic Algorithm to improve the accuracy of the Naive Bayes algorithm. The analysis compares results before and after feature selection. Validation uses a cross-validation technique, while a confusion matrix is used to measure accuracy. The results show that Naive Bayes accuracy increased most with Particle Swarm Optimization feature selection, from 60.26% to 77.50%, while the Genetic Algorithm raised it from 60.26% to 70.71%. Particle Swarm Optimization is therefore the better feature selection method, with an accuracy increase of 17.24 percentage points.
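As a minimal sketch of how the reported figures relate, the snippet below computes accuracy from a confusion matrix and the gain in percentage points over the baseline. The function names and the 2x2 example matrix are hypothetical and chosen only for illustration; the abstract does not give the study's confusion matrices, so only the reported accuracy percentages are taken from it.

```python
def accuracy_from_confusion(matrix):
    """Accuracy = correctly classified samples (diagonal) / all samples."""
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    total = sum(sum(row) for row in matrix)
    return correct / total


def gain_in_points(before_pct, after_pct):
    """Absolute gain in percentage points, rounded to two decimals."""
    return round(after_pct - before_pct, 2)


# Hypothetical 2x2 confusion matrix (rows: actual, columns: predicted).
# These counts are made up for illustration and are not the study's data.
example_matrix = [[40, 10],
                  [5, 45]]
example_accuracy = accuracy_from_confusion(example_matrix)

# Accuracies reported in the abstract, in percent.
baseline = 60.26
pso_gain = gain_in_points(baseline, 77.50)
ga_gain = gain_in_points(baseline, 70.71)

print(f"example accuracy: {example_accuracy:.2%}")
print(f"PSO gain: {pso_gain} points, GA gain: {ga_gain} points")
```

Applied to the reported accuracies, the same subtraction gives 17.24 percentage points for Particle Swarm Optimization and 10.45 percentage points for the Genetic Algorithm.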
Copyright (c) 2021 Abi Rafdi, Herman Mawengkang, Syahril Efendi
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.