Comparison of SVM & Naïve Bayes Methods in Sentiment Analysis of Electric Vehicle Subsidy Policy Based on X Data

I Wayan Darma  Wiguna; Devi Valentino  Waas; I Komang Arya Ganda  Wiguna; Made Leo  Radhitya

doi:10.23960/jesr.v6i1.158

I Wayan Darma Wiguna
Faculty Of Information Engineering, Institut Bisnis dan Teknologi Indonesia
Devi Valentino Waas
I Komang Arya Ganda Wiguna
Made Leo Radhitya

DOI: https://doi.org/10.23960/jesr.v6i1.158

Keywords Electric Vehicle Subsidy Policy, Sentiment Analysis, SVM, Naïve Bayes, Social Media X

Abstract Views (Last 12 Months)

399 Abstract Views

445 Downloads

Abstract

The policy of subsidizing electric vehicles has become a widely discussed issue on social media platform X. The provision of electric vehicle subsidies by the Indonesian government aims to stimulate higher adoption of electric vehicles, with the overarching goal of mitigating air pollution. However, the presence of electric vehicle subsidies continues to elicit both support and opposition among the public. On social media platform X, there is a wealth of data suitable for text mining, particularly concerning the current hot topic of electric vehicle subsidies. This research aims to compare the performance of Support Vector Machine (SVM) and Naïve Bayes methods in conducting sentiment analysis on discussions related to the electric vehicle subsidy policy on social media platform X. The testing technique involves using 20% of the total dataset, comprising 5553 data points, and employing 10-fold cross-validation. The results from the 20% test data indicate that the Support Vector Machine (SVM) method's confusion matrix performance is superior, with the highest values achieved using the RBF kernel: accuracy 83.02%, precision 84.61%, and recall 83.02%. In the performance evaluation testing with 10-fold cross-validation, the Support Vector Machine (SVM) method outperforms, especially with the RBF kernel, yielding an average accuracy of 82.88% over 10 iterations.

Downloads

Download data is not yet available.

References

Korlantas Polri, “Dashborad ERI,” 2023. http://rc.korlantas.polri.go.id:8900/eri2017/laprekappolres.php?kdpolda=18&poldanya=LAMPUNG (accessed May 01, 2023).

A. Toha, P. Purwono, and W. Gata, “Model Prediksi Kualitas Udara dengan Support Vector Machines dengan Optimasi Hyperparameter GridSearch CV,” Bul. Ilm. Sarj. Tek. Elektro, vol. 4, no. 1, pp. 12–21, 2022.

IQAir, “Informasi Indeks Kualitas Udara (AQI) dan Polusi Udara di Indonesia | IQAir.” 2023. [Online]. Available: https://www.iqair.com/id/indonesia

Perpres, “PERPRES No. 55 Tahun 2019 tentang Percepatan Program Kendaraan Bermotor Listrik Berbasis Baterai (Battery Electric Vehicle) untuk Transportasi Jalan [JDIH BPK RI],” 2019, 2019. https://peraturan.bpk.go.id/Home/Details/116973/perpres-no-55-tahun-2019 (accessed May 01, 2023).

Permenperin, “Pedoman Pemberian Bantuan Pemerintah Untuk Pembelian Kendaraan Bermotor Listrik Berbasis Baterai Roda Dua,” 2023.

S. Taheri and M. Mammadov, “Learning the naive Bayes classifier with optimization models,” Int. J. Appl. Math. Comput. Sci., vol. 23, no. 4, pp. 787–795, 2013.

D. Gunawan, D. Riana, D. Ardiansyah, F. Akbar, and S. Alfarizi, “Komparasi Algoritma Support Vector Machine Dan Naïve Bayes Dengan Algoritma Genetika Pada Analisis Sentimen Calon Gubernur Jabar 2018-2023. V (1), 135–138.” 2020.

R. N. Devita, H. W. Herwanto, and A. P. Wibawa, “Perbandingan kinerja metode naive bayes dan k-nearest neighbor untuk klasifikasi artikel berbahasa indonesia,” J. Teknol. Inf. dan Ilmu Komput, vol. 5, no. 4, 2018.

D. Suyanto, “Data Mining untuk klasifikasi dan klasterisasi data,” Bandung Inform. Bandung, 2017.

S. Shevira, I. Made, A. D. Suarjaya, and P. Wira Buana, “Pengaruh Kombinasi dan Urutan Pre-Processing pada Tweets Bahasa Indonesia,” JITTER-Jurnal Ilm. Teknol. dan Komput., vol. 3, no. 2, 2022.

F. Koto and G. Y. Rahmaningtyas, “Inset lexicon: Evaluation of a word list for Indonesian sentiment analysis in microblogs,” Proc. 2017 Int. Conf. Asian Lang. Process. IALP 2017, vol. 2018-Janua, no. December, pp. 391–394, 2018, doi: 10.1109/IALP.2017.8300625.

A. Nurwalikadani, “Implementasi Algoritme Smote Dan Klasifikasi Random Forest Pada Imbalanced Data Metilasi Sequence Protein Lisin,” 2022.

R. Amalia, M. A. Bijaksana, and D. Darmantoro, “Negation handling in sentiment classification using rule-based adapted from Indonesian language syntactic for Indonesian text in Twitter,” in Journal of Physics: Conference Series, IOP Publishing, 2018, p. 12039.

B. Santosa, “Data mining teknik pemanfaatan data untuk keperluan bisnis,” Yogyakarta Graha Ilmu, vol. 978, no. 979, p. 756, 2007.

W. A. Firmansyach, U. Hayati, and Y. A. Wijaya, “Analisa Terjadinya Overfitting Dan Underfitting Pada Algoritma Naive Bayes Dan Decision Tree Dengan Teknik Cross Validation,” JATI (Jurnal Mhs. Tek. Inform., vol. 7, no. 1, pp. 262–269, 2023.

N. Aliyah Salsabila, Y. Ardhito Winatmoko, A. Akbar Septiandri, and A. Jamal, “Colloquial Indonesian Lexicon,” in 2018 International Conference on Asian Language Processing (IALP), 2018, pp. 226–229. doi: 10.1109/IALP.2018.8629151.

M. A. Rosid, A. S. Fitrani, I. R. I. Astutik, N. I. Mulloh, and H. A. Gozali, “Improving text preprocessing for student complaint document classification using sastrawi,” in IOP Conference Series: Materials Science and Engineering, IOP Publishing, 2020, p. 12017.

V. Kumar and B. Subba, “A TfidfVectorizer and SVM based sentiment analysis framework for text data corpus,” in 2020 national conference on communications (NCC), IEEE, 2020, pp. 1–6.

C. P. Yanti, N. Wayan, E. Agustini, N. Luh, W. Sri, and R. Ginantra, “Perbandingan Metode K-NN Dan Metode Random Forest Untuk Analisis Sentimen pada Tweet Isu Minyak Goreng di Indonesia,” vol. 7, no. April, pp. 756–765, 2023, doi: 10.30865/mib.v7i2.5900.

Comparison of SVM & Naïve Bayes Methods in Sentiment Analysis of Electric Vehicle Subsidy Policy Based on X Data

Abstract

Downloads

References

Make a Submission

Browse

Information

JESR: p-ISSN: 2685-0338; e-ISSN: 2685-1695