Sentiment Analysis of the Excise Policy on Packaged Sweetened Beverages Using the Multinomial Naive Bayes Method

Authors

  • Muhammad Firdaus, Universitas Muhammadiyah Jember
  • Ulya Anisatur Rosyidah, Universitas Muhammadiyah Jember
  • Luluk Handayani, Universitas Muhammadiyah Jember

DOI:

https://doi.org/10.62951/router.v3i4.704

Keywords:

Borderline-SMOTE, MBDK, Multinomial Naive Bayes, N-gram, Stratified Cross Validation

Abstract

Sugar consumption in Indonesia remains high, and diabetes affects 20.4 million people. This has prompted the government to introduce an excise policy on packaged sweetened beverages (Minuman Berpemanis Dalam Kemasan, MBDK) to reduce sugar intake. Social media, particularly the X platform, is where much of the public voices its opinions on this policy. This study analyzes public sentiment toward the MBDK excise policy, using a lexicon-based approach to label the data and the Multinomial Naive Bayes algorithm with unigram and bigram feature extraction. In the initial experiments, the best performance came from 5-Fold Cross Validation, with an average accuracy of 83%, precision of 84%, recall of 75%, and an F1-Score of 77%. Performance improved after the data were balanced using Stratified Cross Validation combined with Borderline-SMOTE and the features were limited to the 700 most frequent terms. The best results were then obtained with 10-Fold Cross Validation: 86% accuracy, 84% precision, 83% recall, and an F1-Score of 83%. These findings indicate that the Multinomial Naive Bayes model can effectively classify public sentiment regarding the MBDK excise policy once the data are balanced.
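The classification step described in the abstract can be illustrated with a minimal sketch. This is not the authors' code: it is a pure-Python Multinomial Naive Bayes with unigram and bigram features and a vocabulary capped at the most frequent terms. Whitespace tokenization, Laplace smoothing with `alpha=1.0`, and the toy example texts and labels are all assumptions made for illustration. Lexicon-based labeling, Borderline-SMOTE, and stratified cross validation are not shown.

```python
import math
from collections import Counter


def ngrams(text, n_max=2):
    """Return unigram and bigram features (assumes whitespace tokenization)."""
    tokens = text.lower().split()
    feats = list(tokens)
    for n in range(2, n_max + 1):
        feats += [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]
    return feats


class MultinomialNB:
    """Multinomial Naive Bayes with Laplace smoothing over a capped vocabulary."""

    def __init__(self, max_features=700, alpha=1.0):
        self.max_features = max_features  # 700 follows the abstract
        self.alpha = alpha                # smoothing value is an assumption

    def fit(self, texts, labels):
        docs = [ngrams(t) for t in texts]
        # Keep only the most frequent terms across the training corpus.
        totals = Counter(f for d in docs for f in d)
        self.vocab = {t for t, _ in totals.most_common(self.max_features)}
        self.classes = sorted(set(labels))
        self.log_prior = {}
        self.log_like = {}
        for c in self.classes:
            class_docs = [d for d, y in zip(docs, labels) if y == c]
            self.log_prior[c] = math.log(len(class_docs) / len(docs))
            counts = Counter(f for d in class_docs for f in d if f in self.vocab)
            denom = sum(counts.values()) + self.alpha * len(self.vocab)
            self.log_like[c] = {
                t: math.log((counts[t] + self.alpha) / denom) for t in self.vocab
            }
        return self

    def predict(self, text):
        # Terms outside the capped vocabulary are ignored at prediction time.
        feats = [f for f in ngrams(text) if f in self.vocab]
        scores = {
            c: self.log_prior[c] + sum(self.log_like[c][f] for f in feats)
            for c in self.classes
        }
        return max(scores, key=scores.get)


# Hypothetical toy data, not taken from the study's dataset.
train_texts = [
    "cukai minuman manis bagus untuk kesehatan",
    "setuju cukai gula bagus",
    "cukai minuman manis memberatkan rakyat",
    "tolak cukai harga naik memberatkan",
]
train_labels = ["positive", "positive", "negative", "negative"]
model = MultinomialNB(max_features=700).fit(train_texts, train_labels)
```

Calling `model.predict(...)` on a new text returns the label with the highest log-posterior score. In a real experiment, the vocabulary cap and any oversampling would be fitted on the training folds only, so that no information from the test fold leaks into the model.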

Published

2025-12-16

How to Cite

Firdaus, M., Rosyidah, U. A., & Handayani, L. (2025). Analisis Sentimen Kebijakan Pemberlakuan Cukai pada Minuman Berpemanis dalam Kemasan Menggunakan Metode Multinomial Naive Bayes. Router: Jurnal Teknik Informatika Dan Terapan, 3(4), 01–10. https://doi.org/10.62951/router.v3i4.704
