Comparison of SVM & Naïve Bayes Methods in Sentiment Analysis of Electric Vehicle Subsidy Policy Based on X Data

Authors

  • I Wayan Darma Wiguna Faculty Of Information Engineering, Institut Bisnis dan Teknologi Indonesia
  • Devi Valentino Waas
  • I Komang Arya Ganda Wiguna
  • Made Leo Radhitya

DOI:

https://doi.org/10.23960/jesr.v6i1.158 - Abstract View: 203

Keywords:

Electric Vehicle Subsidy Policy, Sentiment Analysis, SVM, Naïve Bayes, Social Media X

Abstract

The policy of subsidizing electric vehicles has become a widely discussed issue on social media platform X. The provision of electric vehicle subsidies by the Indonesian government aims to stimulate higher adoption of electric vehicles, with the overarching goal of mitigating air pollution. However, the presence of electric vehicle subsidies continues to elicit both support and opposition among the public. On social media platform X, there is a wealth of data suitable for text mining, particularly concerning the current hot topic of electric vehicle subsidies. This research aims to compare the performance of Support Vector Machine (SVM) and Naïve Bayes methods in conducting sentiment analysis on discussions related to the electric vehicle subsidy policy on social media platform X. The testing technique involves using 20% of the total dataset, comprising 5553 data points, and employing 10-fold cross-validation. The results from the 20% test data indicate that the Support Vector Machine (SVM) method's confusion matrix performance is superior, with the highest values achieved using the RBF kernel: accuracy 83.02%, precision 84.61%, and recall 83.02%. In the performance evaluation testing with 10-fold cross-validation, the Support Vector Machine (SVM) method outperforms, especially with the RBF kernel, yielding an average accuracy of 82.88% over 10 iterations.

Downloads

Download data is not yet available.

Author Biographies

I Wayan Darma Wiguna, Faculty Of Information Engineering, Institut Bisnis dan Teknologi Indonesia

The policy of subsidizing electric vehicles has become a widely discussed issue on social media platform X. The provision of electric vehicle subsidies by the Indonesian government aims to stimulate higher adoption of electric vehicles, with the overarching goal of mitigating air pollution. However, the presence of electric vehicle subsidies continues to elicit both support and opposition among the public. On social media platform X, there is a wealth of data suitable for text mining, particularly concerning the current hot topic of electric vehicle subsidies. This research aims to compare the performance of Support Vector Machine (SVM) and Naïve Bayes methods in conducting sentiment analysis on discussions related to the electric vehicle subsidy policy on social media platform X. The testing technique involves using 20% of the total dataset, comprising 5553 data points, and employing 10-fold cross-validation. The results from the 20% test data indicate that the Support Vector Machine (SVM) method's confusion matrix performance is superior, with the highest values achieved using the RBF kernel: accuracy 83.02%, precision 84.61%, and recall 83.02%. In the performance evaluation testing with 10-fold cross-validation, the Support Vector Machine (SVM) method outperforms, especially with the RBF kernel, yielding an average accuracy of 82.88% over 10 iterations.

Devi Valentino Waas

The policy of subsidizing electric vehicles has become a widely discussed issue on social media platform X. The provision of electric vehicle subsidies by the Indonesian government aims to stimulate higher adoption of electric vehicles, with the overarching goal of mitigating air pollution. However, the presence of electric vehicle subsidies continues to elicit both support and opposition among the public. On social media platform X, there is a wealth of data suitable for text mining, particularly concerning the current hot topic of electric vehicle subsidies. This research aims to compare the performance of Support Vector Machine (SVM) and Naïve Bayes methods in conducting sentiment analysis on discussions related to the electric vehicle subsidy policy on social media platform X. The testing technique involves using 20% of the total dataset, comprising 5553 data points, and employing 10-fold cross-validation. The results from the 20% test data indicate that the Support Vector Machine (SVM) method's confusion matrix performance is superior, with the highest values achieved using the RBF kernel: accuracy 83.02%, precision 84.61%, and recall 83.02%. In the performance evaluation testing with 10-fold cross-validation, the Support Vector Machine (SVM) method outperforms, especially with the RBF kernel, yielding an average accuracy of 82.88% over 10 iterations.

I Komang Arya Ganda Wiguna

The policy of subsidizing electric vehicles has become a widely discussed issue on social media platform X. The provision of electric vehicle subsidies by the Indonesian government aims to stimulate higher adoption of electric vehicles, with the overarching goal of mitigating air pollution. However, the presence of electric vehicle subsidies continues to elicit both support and opposition among the public. On social media platform X, there is a wealth of data suitable for text mining, particularly concerning the current hot topic of electric vehicle subsidies. This research aims to compare the performance of Support Vector Machine (SVM) and Naïve Bayes methods in conducting sentiment analysis on discussions related to the electric vehicle subsidy policy on social media platform X. The testing technique involves using 20% of the total dataset, comprising 5553 data points, and employing 10-fold cross-validation. The results from the 20% test data indicate that the Support Vector Machine (SVM) method's confusion matrix performance is superior, with the highest values achieved using the RBF kernel: accuracy 83.02%, precision 84.61%, and recall 83.02%. In the performance evaluation testing with 10-fold cross-validation, the Support Vector Machine (SVM) method outperforms, especially with the RBF kernel, yielding an average accuracy of 82.88% over 10 iterations.

Made Leo Radhitya

The policy of subsidizing electric vehicles has become a widely discussed issue on social media platform X. The provision of electric vehicle subsidies by the Indonesian government aims to stimulate higher adoption of electric vehicles, with the overarching goal of mitigating air pollution. However, the presence of electric vehicle subsidies continues to elicit both support and opposition among the public. On social media platform X, there is a wealth of data suitable for text mining, particularly concerning the current hot topic of electric vehicle subsidies. This research aims to compare the performance of Support Vector Machine (SVM) and Naïve Bayes methods in conducting sentiment analysis on discussions related to the electric vehicle subsidy policy on social media platform X. The testing technique involves using 20% of the total dataset, comprising 5553 data points, and employing 10-fold cross-validation. The results from the 20% test data indicate that the Support Vector Machine (SVM) method's confusion matrix performance is superior, with the highest values achieved using the RBF kernel: accuracy 83.02%, precision 84.61%, and recall 83.02%. In the performance evaluation testing with 10-fold cross-validation, the Support Vector Machine (SVM) method outperforms, especially with the RBF kernel, yielding an average accuracy of 82.88% over 10 iterations.

References

Korlantas Polri, “Dashborad ERI,” 2023. http://rc.korlantas.polri.go.id:8900/eri2017/laprekappolres.php?kdpolda=18&poldanya=LAMPUNG (accessed May 01, 2023).

A. Toha, P. Purwono, and W. Gata, “Model Prediksi Kualitas Udara dengan Support Vector Machines dengan Optimasi Hyperparameter GridSearch CV,” Bul. Ilm. Sarj. Tek. Elektro, vol. 4, no. 1, pp. 12–21, 2022.

IQAir, “Informasi Indeks Kualitas Udara (AQI) dan Polusi Udara di Indonesia | IQAir.” 2023. [Online]. Available: https://www.iqair.com/id/indonesia

Perpres, “PERPRES No. 55 Tahun 2019 tentang Percepatan Program Kendaraan Bermotor Listrik Berbasis Baterai (Battery Electric Vehicle) untuk Transportasi Jalan [JDIH BPK RI],” 2019, 2019. https://peraturan.bpk.go.id/Home/Details/116973/perpres-no-55-tahun-2019 (accessed May 01, 2023).

Permenperin, “Pedoman Pemberian Bantuan Pemerintah Untuk Pembelian Kendaraan Bermotor Listrik Berbasis Baterai Roda Dua,” 2023.

S. Taheri and M. Mammadov, “Learning the naive Bayes classifier with optimization models,” Int. J. Appl. Math. Comput. Sci., vol. 23, no. 4, pp. 787–795, 2013.

D. Gunawan, D. Riana, D. Ardiansyah, F. Akbar, and S. Alfarizi, “Komparasi Algoritma Support Vector Machine Dan Naïve Bayes Dengan Algoritma Genetika Pada Analisis Sentimen Calon Gubernur Jabar 2018-2023. V (1), 135–138.” 2020.

R. N. Devita, H. W. Herwanto, and A. P. Wibawa, “Perbandingan kinerja metode naive bayes dan k-nearest neighbor untuk klasifikasi artikel berbahasa indonesia,” J. Teknol. Inf. dan Ilmu Komput, vol. 5, no. 4, 2018.

D. Suyanto, “Data Mining untuk klasifikasi dan klasterisasi data,” Bandung Inform. Bandung, 2017.

S. Shevira, I. Made, A. D. Suarjaya, and P. Wira Buana, “Pengaruh Kombinasi dan Urutan Pre-Processing pada Tweets Bahasa Indonesia,” JITTER-Jurnal Ilm. Teknol. dan Komput., vol. 3, no. 2, 2022.

F. Koto and G. Y. Rahmaningtyas, “Inset lexicon: Evaluation of a word list for Indonesian sentiment analysis in microblogs,” Proc. 2017 Int. Conf. Asian Lang. Process. IALP 2017, vol. 2018-Janua, no. December, pp. 391–394, 2018, doi: 10.1109/IALP.2017.8300625.

A. Nurwalikadani, “Implementasi Algoritme Smote Dan Klasifikasi Random Forest Pada Imbalanced Data Metilasi Sequence Protein Lisin,” 2022.

R. Amalia, M. A. Bijaksana, and D. Darmantoro, “Negation handling in sentiment classification using rule-based adapted from Indonesian language syntactic for Indonesian text in Twitter,” in Journal of Physics: Conference Series, IOP Publishing, 2018, p. 12039.

B. Santosa, “Data mining teknik pemanfaatan data untuk keperluan bisnis,” Yogyakarta Graha Ilmu, vol. 978, no. 979, p. 756, 2007.

W. A. Firmansyach, U. Hayati, and Y. A. Wijaya, “Analisa Terjadinya Overfitting Dan Underfitting Pada Algoritma Naive Bayes Dan Decision Tree Dengan Teknik Cross Validation,” JATI (Jurnal Mhs. Tek. Inform., vol. 7, no. 1, pp. 262–269, 2023.

N. Aliyah Salsabila, Y. Ardhito Winatmoko, A. Akbar Septiandri, and A. Jamal, “Colloquial Indonesian Lexicon,” in 2018 International Conference on Asian Language Processing (IALP), 2018, pp. 226–229. doi: 10.1109/IALP.2018.8629151.

M. A. Rosid, A. S. Fitrani, I. R. I. Astutik, N. I. Mulloh, and H. A. Gozali, “Improving text preprocessing for student complaint document classification using sastrawi,” in IOP Conference Series: Materials Science and Engineering, IOP Publishing, 2020, p. 12017.

V. Kumar and B. Subba, “A TfidfVectorizer and SVM based sentiment analysis framework for text data corpus,” in 2020 national conference on communications (NCC), IEEE, 2020, pp. 1–6.

C. P. Yanti, N. Wayan, E. Agustini, N. Luh, W. Sri, and R. Ginantra, “Perbandingan Metode K-NN Dan Metode Random Forest Untuk Analisis Sentimen pada Tweet Isu Minyak Goreng di Indonesia,” vol. 7, no. April, pp. 756–765, 2023, doi: 10.30865/mib.v7i2.5900.

Downloads

Published

2024-06-28

How to Cite

[1]
I. W. D. . Wiguna, D. V. . Waas, I. K. A. G. . Wiguna, and M. L. . Radhitya, “Comparison of SVM & Naïve Bayes Methods in Sentiment Analysis of Electric Vehicle Subsidy Policy Based on X Data”, JESR, vol. 6, no. 1, pp. 23 –, Jun. 2024.

Issue

Section

Articles