Analisis Sentimen Kebijakan Pemberlakuan Cukai pada Minuman Berpemanis dalam Kemasan Menggunakan Metode Multinomial Naive Bayes
DOI:
https://doi.org/10.62951/router.v3i4.704Keywords:
Borderline-SMOTE, MBDK, Multinomial Naive Bayes, N-gram, Stratified Cross ValidationAbstract
Sugar consumption in Indonesia remains high, with diabetes affecting 20.4 million people. This condition has prompted the government to introduce an excise policy on Minuman Berpemanis Dalam Kemasan (MBDK) to reduce sugar intake. Social media, particularly the X platform, serves as a medium for the public to express their opinions regarding this policy. This study aims to analyze public sentiment toward the MBDK excise policy using a lexicon-based approach for data labeling and the Multinomial Naive Bayes algorithm with unigram and bigram feature extraction. The initial results show that the highest performance was achieved using 5-Fold Cross Validation, with an average accuracy of 83%, precision of 84%, recall of 75%, and an F1-Score of 77%. After applying data balancing using Stratified Cross Validation combined with Borderline-SMOTE and limiting the features to the 700 most frequent terms, the model’s performance improved. The best results were obtained with 10-Fold Cross Validation, achieving 86% accuracy, 84% precision, 83% recall, and an F1-Score of 83%. These findings indicate that the Multinomial Naive Bayes model can effectively classify public sentiment regarding the MBDK excise policy after the data balancing process.
Downloads
References
Alfandi Safira, & Hasan, F. N. (2023). Analisis sentimen masyarakat terhadap paylater menggunakan metode Naive Bayes classifier. ZONAsi: Jurnal Sistem Informasi, 5(1), 59–70. https://doi.org/10.31849/zn.v5i1.12856
Emiliana, N., & Setiarini, A. (2024). Hubungan konsumsi minuman berpemanis dengan kejadian obesitas pada anak dan remaja: A systematic literature review. Holistik Jurnal Kesehatan, 18(4), 509–517.
Eni Tri, H., & Ari, S. (2021). Analisis sentimen respon masyarakat terhadap kabar harian COVID-19 pada Twitter Kementerian Kesehatan. Jurnal Teknologi dan Sistem Informasi (JTSI), 2(3), 32–37. http://repository.teknokrat.ac.id/3224/
Fikri, M. I., Sabrila, T. S., & Azhar, Y. (2020). Comparison of naïve Bayes and support vector machine methods in Twitter sentiment analysis. Smatika Jurnal, 10(2), 71–76.
Galih, I. K., & Santi, I. G. (2025). Penerapan support vector machine untuk klasifikasi tingkat risiko kebakaran hutan. Jurnal Ilmiah, 3, 763–774.
Hadaina, F., & Budiyanto, U. (2022). Implementasi metode multinomial naïve Bayes untuk sentiment analysis terhadap data ulasan produk Colearn pada Google Play Store. Dalam Seminar Nasional Mahasiswa Fakultas Teknologi Informasi (SENAFTI) (pp. 660–666). https://senafti.budiluhur.ac.id/index.php
Harsemadi, I. G., Dharmendra, I. K., & Wijaya, I. M. P. P. (2023). Klasifikasi emosi pada tweet berbahasa Indonesia. Prosiding Seminar, 86.
Khaira, U., Johanda, R., Utomo, P. E., Suratno, T., & Info, A. (2020). Sentiment analysis of cyberbullying on Twitter using SentiStrength. Jurnal Ilmiah, 3(1), 21–27.
Ningsih, W., Alfianda, B., Rahmaddeni, R., & Wulandari, D. (2024). Perbandingan algoritma SVM dan naïve Bayes dalam analisis sentimen Twitter pada penggunaan mobil listrik di Indonesia. MALCOM: Indonesian Journal of Machine Learning and Computer Science, 4(2), 556–562. https://doi.org/10.57152/malcom.v4i2.1253
Noviyanti, N. P., Ngurah, I. G., & Cahyadi, A. (2025). Penerapan multinomial naïve Bayes dan chi-square pada analisis sentimen makan bergizi gratis. Jurnal Ilmiah, 3, 785–794.
Radhitya, M. L. (n.d.). Klasifikasi genre musik menggunakan text mining (pp. 1–9).
Ridwan, R., Hermaliani, E. H., & Ernawati, M. (2024). Penerapan metode SMOTE untuk mengatasi imbalanced data pada klasifikasi ujaran kebencian. Computer Science (CO-SCIENCE), 4(1), 80–88. https://jurnal.bsi.ac.id/index.php/co-science/article/view/2990
Ridwansyah, T. (2022). Implementasi text mining terhadap analisis sentimen masyarakat dunia di Twitter terhadap Kota Medan menggunakan k-fold cross validation dan naïve Bayes classifier. KLIK: Kajian Ilmiah Informatika dan Komputer, 2(5), 178–185. https://doi.org/10.30865/klik.v2i5.362
Sinaga, J., Sinambela, J. L., Purba, B. C., & Pelawi, S. (2024). Gula dan kesehatan: Kajian terhadap dampak kesehatan akibat konsumsi gula berlebih. Mutiara: Jurnal Ilmiah Multidisiplin Indonesia, 2(1), 54–68. https://doi.org/10.61404/jimi.v2i1.84
Vincent, R., Maulana, I., & Komarudin, O. (2024). Perbandingan klasifikasi naïve Bayes dan support vector machine dalam analisis sentimen dengan multiclass di Twitter. JATI (Jurnal Mahasiswa Teknik Informatika), 7(4), 2496–2505. https://doi.org/10.36040/jati.v7i4.7152
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Router : Jurnal Teknik Informatika dan Terapan

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.


