Comparison of SVM & Naïve Bayes Methods in Sentiment Analysis of Electric Vehicle Subsidy Policy Based on X Data
DOI:
https://doi.org/10.23960/jesr.v6i1.158
Keywords:
Electric Vehicle Subsidy Policy, Sentiment Analysis, SVM, Naïve Bayes, Social Media X
Abstract
The electric vehicle subsidy policy has become a widely discussed issue on the social media platform X. The Indonesian government provides these subsidies to encourage wider adoption of electric vehicles, with the overarching goal of reducing air pollution. The subsidies nevertheless continue to draw both support and opposition from the public, and X offers a wealth of data on the topic that is suitable for text mining. This research compares the performance of the Support Vector Machine (SVM) and Naïve Bayes methods for sentiment analysis of discussions about the electric vehicle subsidy policy on X. The models are evaluated in two ways: on a test set consisting of 20% of the total dataset of 5553 data points, and with 10-fold cross-validation. On the 20% test data, SVM gives the better confusion-matrix results, with the highest values achieved using the RBF kernel: accuracy 83.02%, precision 84.61%, and recall 83.02%. Under 10-fold cross-validation, SVM again outperforms Naïve Bayes, especially with the RBF kernel, which yields an average accuracy of 82.88% over the 10 iterations.
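The evaluation described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the toy labels, and the choice of support-weighted averaging for precision and recall are all assumptions made here for illustration. The abstract does not state which averaging scheme was used, although weighted averaging would be consistent with the reported recall matching the reported accuracy. The sketch also shows how a 10-fold split partitions 5553 data points.

```python
from typing import Dict, List, Sequence, Tuple


def confusion_matrix(y_true: Sequence[str], y_pred: Sequence[str],
                     labels: Sequence[str]) -> List[List[int]]:
    """Build a matrix whose rows are true labels and columns are predictions."""
    index = {label: i for i, label in enumerate(labels)}
    matrix = [[0] * len(labels) for _ in labels]
    for t, p in zip(y_true, y_pred):
        matrix[index[t]][index[p]] += 1
    return matrix


def weighted_metrics(matrix: List[List[int]]) -> Dict[str, float]:
    """Accuracy plus support-weighted precision and recall (assumed scheme)."""
    n = len(matrix)
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[i][i] for i in range(n))
    precision = 0.0
    recall = 0.0
    for i in range(n):
        support = sum(matrix[i])                         # items truly in class i
        predicted = sum(matrix[r][i] for r in range(n))  # items predicted as class i
        tp = matrix[i][i]
        weight = support / total
        precision += weight * (tp / predicted if predicted else 0.0)
        recall += weight * (tp / support if support else 0.0)
    return {"accuracy": correct / total, "precision": precision, "recall": recall}


def kfold_indices(n_samples: int, k: int = 10) -> List[Tuple[List[int], List[int]]]:
    """Partition range(n_samples) into k contiguous folds of near-equal size.

    Returns one (train_indices, test_indices) pair per fold.
    """
    sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    folds = []
    start = 0
    for size in sizes:
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        folds.append((train, test))
        start += size
    return folds


# Hypothetical toy labels, not data from the study.
labels = ["neg", "neu", "pos"]
y_true = ["pos", "pos", "pos", "neg", "neg", "neu"]
y_pred = ["pos", "pos", "neg", "neg", "neg", "pos"]

metrics = weighted_metrics(confusion_matrix(y_true, y_pred, labels))
folds = kfold_indices(5553, k=10)
fold_sizes = [len(test) for _, test in folds]
```

With support-weighted averaging, recall always equals accuracy, because each class's recall is weighted by its share of the true labels; precision can differ. In the cross-validation scheme, each of the 10 folds serves once as the test set, and the reported average accuracy is the mean over those 10 runs.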
License
Copyright © JESR (Journal of Engineering and Scientific Research). This article is an open access article distributed under a CC-BY Creative Commons Attribution 4.0 International License, as presented on the link https://creativecommons.org/licenses/by/4.0/deed.