Sentiment Analysis Using Naive Bayes Algorithm with Feature Selection Particle Swarm Optimization (PSO) and Genetic Algorithm

Authors

  • Abi Rafdi Faculty of Computer Science and Information Technology, University of Sumatera Utara, Indonesia
  • Herman Mawengkang Faculty of Computer Science and Information Technology, University of Sumatera Utara, Indonesia
  • Syahril Efendi Faculty of Computer Science and Information Technology, University of Sumatera Utara, Indonesia

DOI:

https://doi.org/10.25008/ijadis.v2i2.1224

Keywords:

Sentiment Analysis, Twitter, Naive Bayes, Feature Selection, Particle Swarm Optimization, Genetic Algorithm

Abstract

This study analyzes Sentiment to see opinions, points of view, judgments, attitudes, and emotions towards creatures and aspects expressed through texts. One of Social Media is like Twitter is one of the most widely used means of communication as a research topic. The main problem with sentiment analysis is voting and using the best feature options for maximum results. Either, the most widely known classification method is Naive Bayes. However, Naive Bayes is very sensitive to significant features. That way, in this test, a comparison of feature selection is carried out using Particle Swarm Optimization and Genetic Algorithm to improve the accuracy performance of the Naive Bayes algorithm. Analyses are performed by comparing before and after testing using feature selection. Validation uses a cross-validation technique, while the confusion matrix ??is appealed to measure accuracy. The results showed the highest increase for Naïve Bayes algorithm accuracy when using the feature selection of the Particle Swarm Optimization Algorithm from 60.26% to 77.50%, while the genetic algorithm from 60.26% to 70.71%. Therefore, the choice of the best characteristics is Particle Swarm Optimization which is superior with an increase in accuracy of 17.24%.

Downloads

Download data is not yet available.

Plum Analytics

   

Dimensions

            

References

Vikas, BO & Mungara, J. "Enhanced Extraction and Summarization Techniques with User Review Data for Product Recommendations to Customers. " International Journal of Scientific Research in Science, Engineering and Technology, vol 2, p. 25-30, 2016.

A., Pappu Rajan., & SP Victor. Web Sentiment Analysis to Print Positive or Negative Words Using Twitter Data, International Journal of Computer Applications, vol. 96, p. 6, 2014. https://doi.org/10.5120/16801-6518

Ramadhani, Rif'at Ahdi., Indriani, Fatma & T. Nugrahadi, Dodon., Comparison of Naive Bayes smoothing methods for Twitter sentiment analysis, 3rd International Conference on Information Technology, Information System and Electrical Engineering (ICITISEE) 2016. https://doi.org/10.1109/ICACSIS.2016.7872720

Zhang, X., Shi, Z., Liu, X., & Li, X. A Hybrid Feature Selection Algorithm For Processing Unbalanced Classification Data. IEEE International Conference on the Smart Internet of Things (SmartIoT) 2018, 269-275, 2018. https://doi.org/10.1109/SmartIoT.2018.00055

Pant, H & Srivastava, R. A Survey of Feature Selection Methods For Unbalance Datasets. International Journal of Computer Engineering and Applications, vol 9 no. 2, 197-204, 2015.

The Lion, Ardiles and Murnawan. Analysis of Decision Support System Models for Proposed Activities at the District Level Development Planning Forum, IEEE Conference on Energy Internet and Energy System Integration (EI2), 2017. https://doi.org/10.1109/CITSM.2016.577522

Semuel Istia, Sean & Dwi Purnomo, Hindriyanto. " A Sentiment Analysis of Law Enforcement Performance Using Support Vector Machine and K-Nearest Neighbor" , 3rd International Conference on Information Technology, Information System and Electrical Engineering (ICITISEE) 2018. https://doi.org/10.1109/ICITISEE.2018.8720969

Iqbal, Farkhund., Maqbool Hashmi, Jahanzeb & CM Fung, Benjamin. A Hybrid Framework for Sentiment Analysis Using Genetic Algorithm Based Feature Reduction, IEEE Access. vol 7, pp 14637 - 14652 , 2019. https://doi.org/ 10.1109/ACCESS.2019.2892852

Amrane, M., Oukid, S., Gagaoua, I. & Ensari, T. 2018. Classification of Breast Cancer Using Machine Learning. IEEE Electric Electronics, Computer Science, Biomedical Engineerings' Meeting (EBBT), p. 115-116, 2018. https://doi.org/10.1109/EBBT.2018.8391453

A, Sarlan., C, Nadam., & S, Basri. Twitter sentiment analysis, Conf. proc. - Int.6 conf. Inf. Technology. Multimed. UNITEN Cultivation. Make. Enabling Technology. Through Internet of Things, ICIMU 2014, no. November 2016 https://doi.org/10.1109/ICIMU.2014.7066632

Normawati, D., & Winarti, S. Feature Selection Using Data Mining Based on Variable Precision Rough Set (VPRS) for Diagnosis of Coronary Heart Disease. Scientific Journal of Computer Electrical Engineering and Informatics, vol 3 no 2, page 100, 2018. https://doi.org/10.26555/jiteki.v3i2.8072

R. Permatasari and N. A. Rakhmawati, "Features Selection for Entity Resolution in Prostitution on Twitter", Int. J. Adv. Data Inf. Syst., vol. 2, no. 1, pp. 53-61, Mar. 2021. https://doi.org/10.25008/ijadis.v2i1.1214

R. Novendri, A. S. . Callista, D. N. Pratama, and C. E. . Puspita, "Sentiment Analysis of YouTube Movie Trailer Comments Using Naïve Bayes", Bulletin of Comp. Sci. Electr. Eng., vol. 1, no. 1, pp. 26-32, Jun. 2020. https://doi.org/10.25008/bcsee.v1i1.5

W. Bourequat and H. Mourad, "Sentiment Analysis Approach for Analyzing iPhone Release using Support Vector Machine", Int. J. Adv. Data Inf. Syst., vol. 2, no. 1, pp. 36-44, Apr. 2021. https://doi.org/10.25008/ijadis.v2i1.1216

Downloads

Published

2021-10-30

How to Cite

Rafdi, A., Mawengkang, H., & Efendi, S. . (2021). Sentiment Analysis Using Naive Bayes Algorithm with Feature Selection Particle Swarm Optimization (PSO) and Genetic Algorithm. International Journal of Advances in Data and Information Systems, 2(2), 96-104. https://doi.org/10.25008/ijadis.v2i2.1224
Abstract views : 147 times