Food and Beverage Product Review Sentiment Analysis on E-Commerce with Word Embedding and LSTM
Abstract
Sentiment analysis is a widely used method to understand customer opinions about a product. This study aims to analyze the sentiment of food and beverage product reviews on the Tokopedia marketplace using the Long Short-Term Memory (LSTM) approach and word embedding. The data used consisted of customer reviews that were categorized into three sentiment classes, namely positive, neutral, and negative. The model was developed through a series of stages of preprocessing, embedding, training with LSTM, as well as performance evaluation using accuracy and F1-score metrics. The results show that the developed model is able to classify sentiment with a fairly high level of accuracy. Based on the results of the final test on 5,000 data, the model managed to classify 122 data as negative, 130 data as neutral, and 4,871 data as positive, although it still showed an imbalance in class classification. Further analysis through word cloud visualization showed that words like "delicious", "steady", and "good" dominated the positive sentiment, while words like "disappointed", "broken", and "slow" often appeared in negative sentiment. This study provides valuable insights for businesses in understanding customer opinions and improving the quality of products and services.
References
Aditia, N. N., Hendrati, I. M., & Wardaya, W. (2023). Penerapan Digital Branding Dalam Optimalisasi Peningkatan Penjualan Produk UMKM Olahan Bunda Hamiz Di Kelurahan Klampis Ngasem. Kacanegara Jurnal Pengabdian Pada Masyarakat, 6(4), 433. https://doi.org/10.28989/kacanegara.v6i4.1724
Dang, C., García, M., & Prieta, F. (2021). Hybrid deep learning models for sentiment analysis. Complexity, 2021(1). https://doi.org/10.1155/2021/998692
Falasari, A., & Muslim, M. A. (2022). Optimize Naïve Bayes Classifier Using Chi Square and Term Frequency Inverse Document Frequency for Amazon Review Sentiment Analysis. Journal of Soft Computing Exploration, 3(1), 31–36. https://doi.org/10.52465/joscex.v3i1.68
Gondhi, N., Sharma, E., Alharbi, A., Verma, R., & Shah, M. (2022). Efficient long short-term memory-based sentiment analysis of e-commerce reviews. Computational Intelligence and Neuroscience, 2022, 1–9. https://doi.org/10.1155/2022/3464524
Gu, T., Xu, G., & Luo, J. (2020). Sentiment analysis via deep multichannel neural networks with variational information bottleneck. IEEE Access, 8, 121014–121021. https://doi.org/10.1109/access.2020.3006569
Hochreiter, S., & Schmidhuber, J. (1997). Long Short-Term Memory. Neural Computation, 9(8), 1735–1780.
Husada, H. C., & Paramita, A. S. (2021). Analisis Sentimen Pada Maskapai Penerbangan di Platform Twitter Menggunakan Algoritma Support Vector Machine (SVM). Teknika, X(1), 18–26.
Liu, B. (2012). Sentiment Analysis and Opinion Mining (Vol. 5).
Medhat, W., Hassan, A., & Korashy, H. (2014). Sentiment analysis algorithms and applications: A survey. Ain Shams Engineering Journal, 5(4), 1093–1113.
Meidi, N. P., Setyariningsih, E., & Utami, B. (2022). Analisis Faktor-Faktor Yang Mempengaruhi Keputusan Pembelian Pada Marketplace Shopee Di Mojokerto. Jurnal Cakrawala Ilmiah, 1(11), 2937–2952. https://doi.org/10.53625/jcijurnalcakrawalailmiah.v1i11.2879
Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S., & Dean, J. (2013). Distributed Representations of Words and Phrases and their Compositionality. Advances in Neural Information Processing Systems, 26.
Nazaruddin, E., Caroline, C., & Sulistyawati, U. (2023). Analyzing customers in e-commerce using Dempster-Shafer method. International Journal Software Engineering and Computer Science (IJSECS), 3(2), 174–183. https://doi.org/10.35870/ijsecs.v3i2.1497
Onan, A. (2020). Sentiment analysis on product reviews based on weighted word embeddings and deep neural networks. Concurrency and Computation: Practice and Experience, 33(23). https://doi.org/10.1002/cpe.5909
Puspita, N. A., Anggraeny, F. T., & Rizki, A. M. (2024). ANALISIS SENTIMEN DENGAN ALGORITMA NAÏVE BAYES CLASSIFIER TERHADAP ULASAN APLIKASI MY F&B ID. Jati (Jurnal Mahasiswa Teknik Informatika), 8(1), 1090–1095. https://doi.org/10.36040/jati.v8i1.8842
Riefky, M., & Pramesti, W. (2020). Sentiment Analysis of Southeast Asian Games (SEA Games) in Philippines 2019 Based on Opinion of Internet User of Social Media Twitter with K-Nearest Neighbor and Support Vector Machine. Jurnal Matematika, Statistika Dan Komputasi, XVII(1), 26–41.
Sachin, S., Tripathi, A., Mahajan, N., Aggarwal, S., & Nagrath, P. (2020). Sentiment analysis using gated recurrent neural networks. SN Computer Science, 1(2). https://doi.org/10.1007/s42979-020-0076-y
Sadikin, M., & Fauzan, A. (2023). Evaluation of machine learning approach for sentiment analysis using Yelp dataset. European Journal of Electrical Engineering and Computer Science, 7(6), 58–64. https://doi.org/10.24018/ejece.2023.7.6.583
Safuan, S., & Ku Ruhana, K.-M. (2024). Handling Semantic Relationships for Classification of Sparse Text: A Review. Engineering Proceedings, 84(1). https://doi.org/10.3390/engproc2025084061
Sari, F. P., & Putri, R. (2021). Pengaruh E-Commerce terhadap Perilaku Konsumen di Indonesia: Studi Kasus Tokopedia. Jurnal Ekonomi Digital, 3(2), 45–60.
Setiyawan, A., Wijayanto, A. W., & He, Y. (2022). Extracting consumer opinion on Indonesian e-commerce: A rating evaluation and lexicon-based sentiment analysis. Proceedings of the International Conference on Data Science and Official Statistics, 2021(1), 2–11. https://doi.org/10.34123/icdsos.v2021i1.22
Shang, S. (2018). An Improved Text Sentiment Analysis Algorithm Based on TF-Gini. Ijpe. https://doi.org/10.23940/ijpe.18.09.p8.20082014
Shobana, J., & Murali, M. (2021). An efficient sentiment analysis methodology based on long short-term memory networks. Complex & Intelligent Systems, 7(5), 2485–2501. https://doi.org/10.1007/s40747-021-00436-4
Tang, D., Qin, B., & Liu, T. (2015). Document Modeling with Gated Recurrent Neural Network for Sentiment Classification. Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP), 1422–1432.
Zain, F. F., & Sibaroni, Y. (2019). Effectiveness of SVM Method by Naïve Bayes Weighting in Movie Review Classification. Khazanah Informatika Jurnal Ilmu Komputer Dan Informatika, 5(2), 108–114. https://doi.org/10.23917/khif.v5i2.7770
Zhang, W., & Gao, F. (2013). Performance Analysis and Improvement of Naïve Bayes in Text Classification Application. 1–4. https://doi.org/10.1109/anthology.2013.6784818
Zhang, Y., Jin, R., & Zhou, Z. H. (2018). Understanding bag-of-words model: A statistical framework. International Journal of Machine Learning and Cybernetics, 9(1), 15–29.
Akter, P., Hossain, S., Siddique, M. T., Ayub, M. I., Nath, A., Nath, P. C., ... & Hassan, M. M. (2025). Sentiment Analysis of Consumer Feedback and Its Impact on Business Strategies by Machine Learning. The American Journal of Applied sciences, 7(01), 6-16. https://doi.org/10.37547/tajas/Volume07Issue01-02
Liu, C., Chen, T., Pu, Q., & Jin, Y. (2025). Text Mining for Consumers’ Sentiment Tendency and Strategies for Promoting Cross-Border E-Commerce Marketing Using Consumers’ Online Review Data. Journal of Theoretical and Applied Electronic Commerce Research, 20(2), 125. https://doi.org/10.3390/jtaer20020125
Sugumaran, D., Udakandage, M. D., Kodippili, S. P., De Alwis, M. M., Attigala, D. L., Ranasinghe, N. N., ... & Liu, Y. (2025). Artificial intelligence in sustainable organic waste treatment: a review. Waste Disposal & Sustainable Energy, 1-22. https://doi.org/10.1007/s42768-025-00246-1
Bellar, O., Baina, A., & Ballafkih, M. (2024). Sentiment analysis: Predicting product reviews for E-commerce recommendations using deep learning and transformers. Mathematics, 12(15), 2403. https://doi.org/10.3390/math12152403
Nor El-Deen, D., El-Sayed, R. S., Hussein, A., & Zaki, M. (2024). Sentiment analysis for movie recommendations: Harnessing opinion mining systems to analyze user reviews. The Egyptian Statistical Journal, 68(1), 1-14.
Chen, X., Xie, H., Tao, X., Wang, F. L., Zhang, D., & Dai, H. N. (2025). A computational analysis of aspect-based sentiment analysis research through bibliometric mapping and topic modeling. Journal of Big Data, 12(1), 40. https://doi.org/10.1186/s40537-025-01068-y
Maulana, R., Aziz, M. A., & Antony, A. (2025). The Effect of User Ease and User Security on Purchase Decisions Mediated by Consumer Trust (Study of Tokopedia Users in Sukabumi City). International Journal of Economics, Management and Accounting (IJEMA), 2(9), 763-780.
Dewi, R. S. (2024, July). Digital Transformation for the Tokopedia Marketplace Application. In Proceeding Medan International Conference on Economic and Business (Vol. 2, pp. 570-575). https://doi.org/10.30596/miceb.v2i0.656
Kurniasari, D., Pradana, F. A., & Wamiliana, W. (2025). Deep Learning-Driven Comparative Study of Word Embedding Techniques: Word2Vec, GloVe, and FastText in Health Condition Reviews. ISRG Journal of Multidisciplinary Studies (ISRGJMS), 3(2), 51-62.
Al-Selwi, S. M., Hassan, M. F., Abdulkadir, S. J., & Muneer, A. (2023). LSTM inefficiency in long-term dependencies regression problems. Journal of Advanced Research in Applied Sciences and Engineering Technology, 30(3), 16-31.
Copyright (c) 2025 Journal La Multiapp

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.



