Sentiment Analysis Approach for Analyzing iPhone Release using Support Vector Machine
Keywords: sentiment analysis, text mining, SVM, Support Vector Machine, YouTube, iPhone
Sentiment analysis is the process of automatically understanding, extracting, and processing textual data to obtain the sentiment it expresses, in this study the sentiment of comment sentences posted on Twitter. Such analysis is needed because the growing use of social media increasingly shapes public opinion. Public opinion can therefore be analyzed with data science techniques, among them Natural Language Processing (NLP) and text mining, also known as text analytics. This study applies text mining to Twitter posts about the iPhone release in five stages: scraping, labeling, preprocessing (case folding, tokenization, and filtering), TF-IDF weighting, and sentiment classification using a Support Vector Machine (SVM). The SVM is widely used as a baseline for text-related tasks and gives satisfactory results here: accuracy, precision, recall, and F1 score are 89.21%, 92.43%, 95.53%, and 93.95%, respectively.
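The stages named in the abstract can be illustrated with a minimal, standard-library-only sketch. It is not the authors' implementation: the stop-word list, the toy tweets and their labels, the smoothed IDF formula, and the choice of a bias-free linear SVM trained by subgradient descent on the hinge loss are all assumptions made for illustration, since the abstract does not state the kernel, solver, or filtering rules used.

```python
import math
import re
from collections import Counter

# Hypothetical stop-word list; the study does not say which list it used.
STOPWORDS = {"the", "a", "an", "is", "it", "this", "to", "and", "of", "i"}

def preprocess(text):
    """Case folding, tokenization, then stop-word filtering."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return [t for t in tokens if t not in STOPWORDS]

def fit_idf(docs):
    """Smoothed inverse document frequency for every term in the corpus."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    return {term: math.log((1 + n) / (1 + k)) + 1.0 for term, k in df.items()}

def tfidf(doc, idf):
    """L2-normalized sparse TF-IDF vector; unseen terms are dropped."""
    tf = Counter(t for t in doc if t in idf)
    vec = {t: c * idf[t] for t, c in tf.items()}
    norm = math.sqrt(sum(v * v for v in vec.values())) or 1.0
    return {t: v / norm for t, v in vec.items()}

def score(w, x):
    """Dot product between the weight vector and a sparse document vector."""
    return sum(w.get(t, 0.0) * v for t, v in x.items())

def train_linear_svm(vectors, labels, lam=0.01, eta=0.1, epochs=100):
    """Subgradient descent on the L2-regularized hinge loss (no bias term)."""
    w = {}
    for _ in range(epochs):
        for x, y in zip(vectors, labels):
            violated = y * score(w, x) < 1.0   # margin check before the update
            for t in w:                        # shrink step from the L2 penalty
                w[t] *= 1.0 - eta * lam
            if violated:                       # hinge-loss subgradient step
                for t, v in x.items():
                    w[t] = w.get(t, 0.0) + eta * y * v
    return w

def predict(w, idf, text):
    """Return +1 (positive) or -1 (negative) for a raw text."""
    return 1 if score(w, tfidf(preprocess(text), idf)) > 0 else -1

def f1_score(precision, recall):
    """Harmonic mean of precision and recall."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0

def precision_recall_f1(y_true, y_pred, positive=1):
    """Precision, recall, and F1 for the class labeled `positive`."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall, f1_score(precision, recall)

# Invented toy tweets with hand-assigned labels (+1 positive, -1 negative).
TWEETS = [
    ("Love the new iPhone camera, it is amazing", 1),
    ("The new iPhone is great and fast", 1),
    ("Amazing battery life, great phone", 1),
    ("The new iPhone is too expensive and boring", -1),
    ("Terrible battery, I hate this phone", -1),
    ("Boring design, bad price", -1),
]

docs = [preprocess(text) for text, _ in TWEETS]
idf = fit_idf(docs)
weights = train_linear_svm([tfidf(d, idf) for d in docs], [y for _, y in TWEETS])
```

With these definitions, `predict(weights, idf, "great camera, amazing")` classifies a new text, and `precision_recall_f1` turns true and predicted labels into the metrics reported above. As a consistency check, `f1_score(0.9243, 0.9553)` gives about 0.9395, which agrees with the reported F1 score of 93.95% as the harmonic mean of the reported precision and recall.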
Copyright (c) 2021 Wasim Bourequat, Hassan Mourad
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.