Analisis Sentimen Ulasan Shopee di Google Play dengan TF-IDF dan Logistic Regression
DOI:
https://doi.org/10.31004/riggs.v4i2.2850Keywords:
Analisis Sentimen, Shopee, Google Play, TF-IDF, Logistic RegressionAbstract
Pertumbuhan aplikasi belanja daring mendorong meningkatnya jumlah ulasan pengguna yang tersedia di Google Play Store. Ulasan ini tidak hanya mencerminkan pengalaman pengguna, tetapi juga berperan sebagai masukan penting bagi pengembang aplikasi. Penelitian ini menganalisis sentimen ulasan aplikasi Shopee Indonesia (package com.shopee.id) periode 2024–2025 dengan menggunakan pendekatan Term Frequency–Inverse Document Frequency (TF-IDF) dan algoritma Logistic Regression. Data sebanyak 5.000 ulasan dikumpulkan melalui pustaka google-play-scraper dan dilabeli otomatis berdasarkan skor rating: 1–2 negatif, 3 netral, dan 4–5 positif. Proses preprocessing meliputi normalisasi teks, penghapusan stopword bahasa Indonesia, serta stemming. Distribusi data yang timpang (4049 positif, 796 negatif, 155 netral) ditangani dengan Random Oversampling pada data latih. Evaluasi dilakukan menggunakan stratified split 80:20 dan 5-fold cross-validation. Hasil menunjukkan Logistic Regression memberikan performa terbaik dengan akurasi 85,11% dan macro-F1 0,58 pada data uji, lebih baik dibandingkan SVM dan Naive Bayes. Confusion matrix memperlihatkan bahwa kelas positif dapat dikenali dengan baik (F1=0,92), sementara kelas netral sulit diprediksi (F1=0,14). Visualisasi WordCloud digunakan untuk menggambarkan kata dominan pada tiap kelas. Studi ini menegaskan efektivitas Logistic Regression untuk analisis ulasan aplikasi berbahasa Indonesia, meskipun tantangan besar masih terdapat pada ketidakseimbangan kelas minoritas.
Downloads
References
A. Daza, N. D. González Rueda, M. S. Aguilar Sánchez, W. F. Robles Espíritu, and M. E. Chauca Quiñones, “Sentiment Analysis on E-Commerce Product Reviews Using Machine Learning and Deep Learning Algorithms: A Bibliometric Analysis, Systematic Literature Review, Challenges and Future Works,” International Journal of Information Management Data Insights, vol. 4, no. 2, p. 100267, Nov. 2024, doi: 10.1016/J.JJIMEI.2024.100267.
P. Vijayaragavan et al., “Sustainable sentiment analysis on E-commerce platforms using a weighted parallel hybrid deep learning approach for smart cities applications,” Sci Rep, vol. 14, no. 1, pp. 1–19, Dec. 2024, doi: 10.1038/S41598-024-78318-1;SUBJMETA.
Q. Li et al., “A Survey on Text Classification: From Traditional to Deep Learning,” ACM Transactions on Intelligent Systems and Technology (TIST), vol. 13, no. 2, p. 31, Apr. 2022, doi: 10.1145/3495162.
C. A. Nurhaliza Agustina, R. Novita, Mustakim, and N. E. Rozanda, “The Implementation of TF-IDF and Word2Vec on Booster Vaccine Sentiment Analysis Using Support Vector Machine Algorithm,” Procedia Comput Sci, vol. 234, pp. 156–163, Jan. 2024, doi: 10.1016/J.PROCS.2024.02.162.
H. Suroyo and E. J. Pratama, “Comparison of Text Representation Methods for Sentiment Analysis Using Support Vector Machine,” Journal of Advances in Information and Industrial Technology, vol. 7, no. 1, pp. 21–30, May 2025, doi: 10.52435/JAIIT.V7I1.610.
P. S. Ghatora, S. E. Hosseini, S. Pervez, M. J. Iqbal, and N. Shaukat, “Sentiment Analysis of Product Reviews Using Machine Learning and Pre-Trained LLM,” Big Data and Cognitive Computing 2024, Vol. 8, Page 199, vol. 8, no. 12, p. 199, Dec. 2024, doi: 10.3390/BDCC8120199.
M. Mujahid et al., “Data oversampling and imbalanced datasets: an investigation of performance for machine learning and feature engineering,” J Big Data, vol. 11, no. 1, pp. 1–32, Dec. 2024, doi: 10.1186/S40537-024-00943-4/TABLES/15.
A. R. Susanti and E. N. Ilahi, “Sentiment Analysis of User Reviews of E-commerce Applications: Case Study on the Shoppe Platform,” Journal of Social Science, vol. 5, no. 4, pp. 983–988, Jul. 2024, doi: 10.46799/JSS.V5I4.885.
Y. Mao, Q. Liu, and Y. Zhang, “Sentiment analysis methods, applications, and challenges: A systematic literature review,” Journal of King Saud University - Computer and Information Sciences, vol. 36, no. 4, p. 102048, Apr. 2024, doi: 10.1016/J.JKSUCI.2024.102048.
A. Siddiqua, V. Bindumathi, G. Raghu, and Y. S. V. Bhargav, “Aspect-based Sentiment Analysis (ABSA) using Machine Learning Algorithms,” 3rd IEEE International Conference on Distributed Computing and Electrical Circuits and Electronics, ICDCECE 2024, 2024, doi: 10.1109/ICDCECE60827.2024.10549140.
F. Suandi et al., “Enhancing Sentiment Analysis Performance Using SMOTE and Majority Voting in Machine Learning Algorithms,” pp. 126–138, Dec. 2024, doi: 10.2991/978-94-6463-620-8_10.
P. Monika, C. Kulkarni, N. Harish Kumar, S. Shruthi, and V. Vani, “Machine learning approaches for sentiment analysis,” Int J Health Sci (Qassim), pp. 1286–1300, Apr. 2022, doi: 10.53730/ijhs.v6ns4.6119.
R. Kosasih and A. Alberto, “Sentiment analysis of game product on shopee using the TF-IDF method and naive bayes classifier,” ILKOM Jurnal Ilmiah, vol. 13, no. 2, pp. 101–109, Aug. 2021, doi: 10.33096/ILKOM.V13I2.721.101-109.
N. P. Arthamevia, Adiwijaya, and M. D. Purbolaksono, “Aspect-Based Sentiment Analysis in Beauty Product Reviews Using TF-IDF and SVM Algorithm,” 2021 9th International Conference on Information and Communication Technology, ICoICT 2021, pp. 197–201, Aug. 2021, doi: 10.1109/ICOICT52021.2021.9527489.
C. Wang, X. Zhu, and L. Yan, “Sentiment Analysis for E-Commerce Reviews Based on Deep Learning Hybrid Model,” ACM International Conference Proceeding Series, pp. 38–46, Aug. 2022, doi: 10.1145/3556384.3556391.
G. O. Assunção, R. Izbicki, and M. O. Prates, “Is augmentation effective to improve prediction in imbalanced text datasets?,” Apr. 2023, Accessed: Sep. 19, 2025. [Online]. Available: https://arxiv.org/pdf/2304.10283
A. H. I. B. A. Azrir, N. Palanichamy, S. C. Haw, and K. W. Ng, “Improving Sentiment Analysis of Shopee Reviews with Informal Language and Slang,” Journal of Logistics, Informatics and Service Science, vol. 11, no. 3, pp. 151–169, 2024, doi: 10.33168/JLISS.2024.0311.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Shaifany Fatriana Kadir, Ahmad Fairuzabadi

This work is licensed under a Creative Commons Attribution 4.0 International License.


















