Sentiment Analysis of the Song Lost - Bring Me the Horizon Based on Reviews on YouTube Using the SVM Algorithm

  • Efraim Moningkey Manado State University, Indonesia
  • Wanki Dwi Warsun Sombo Manado State University, Indonesia
Keywords: Sentiment Analysis, YouTube, SVM, TF-IDF, Music

Abstract

Bring Me The Horizon's song "Lost" received a wide response on YouTube, evident in the thousands of comments containing a variety of responses ranging from support and criticism to neutral opinions. The rapid development of social media has made it easier for people to freely share their views and experiences on musical works without being bound by space and time. YouTube, as one of the largest video-sharing platforms, plays a crucial role in documenting public perception of the song. This study was conducted to analyze listener sentiment towards the song "Lost" based on YouTube comments using the Support Vector Machine (SVM) algorithm and the Term Frequency–Inverse Document Frequency (TF-IDF) feature extraction technique. Comments were collected through web scraping from the song's official video, then processed through case folding, punctuation removal, tokenizing, stopword removal, and stemming to produce clean and uniform data. Term weights were calculated using TF-IDF and then used to label positive, negative, and neutral sentiments. The SVM model was built from training data and tested with test data to evaluate its performance using accuracy, precision, recall, and f1-score metrics so that classification quality could be assessed comprehensively. Based on the test results, the SVM algorithm was able to classify listener comments with 94% accuracy, with a distribution of negative sentiment of 207 comments, neutral comments of 1,280, and positive comments of 732. These findings demonstrate the effectiveness of SVM in analyzing the sentiment of song comments on social media and provide a more comprehensive picture of the public's view of Bring Me the Horizon's song "Lost.".

References

Ahmed, W., Semary, N. A., Amin, K., & Adel Hammad, M. (2023). Sentiment analysis on twitter using machine learning techniques and tf-idf feature extraction: A comparative study. IJCI. International Journal of Computers and Information, 10(3), 52-57. https://doi.org/10.21608/ijci.2023.236052.1128

Alemerien, K., Al-Ghareeb, A., & Alksasbeh, M. Z. (2024). Sentiment analysis of online reviews: a machine learning based approach with TF-IDF vectorization. Journal of Mobile Multimedia, 20(5), 1089-1116. https://doi.org/10.13052/jmm1550-4646.2055

Al-Gaphari, G., AL-Hagree, S., Al-Sanabani, M., Al-Shalabi, A., Al-Helali, B., & Alawdi, A. A. (2025). Enhancing Arabic sentiment analysis during the Qatar world cup: a study of generative pre-trained transformers and machine learning techniques. Social Network Analysis and Mining. https://doi.org/10.1007/s13278-025-01555-3

Amangeldi, D., Usmanova, A., & Shamoi, P. (2024). Understanding environmental posts: Sentiment and emotion analysis of social media data. IEEE Access, 12, 33504-33523. https://doi.org/10.1109/ACCESS.2024.3371585

Buckley, S. (2020). YouTube and the public sphere: What role does YouTube play in contributing towards Habermas’ notion of the public sphere (Doctoral dissertation, Cardiff University).

Darwis, D., Pratiwi, E. S., Ferico, A., & Pasaribu, O. (2020). Penerapan algoritma SVM untuk analisis sentimen pada data Twitter Komisi Pemberantasan Korupsi Republik Indonesia. Jurnal Ilmiah Edutic, 7.

Deumert, A. (2023). Linguistics in a minor key—Of atmospheres, voice (s), and ethics. Applied linguistics, 44(5), 916-929.

Doğan, B., Balcioglu, Y. S., & Elçi, M. (2025). Multidimensional sentiment analysis method on social media data: comparison of emotions during and after the COVID-19 pandemic. Kybernetes, 54(4), 2414-2456. https://doi.org/10.1108/K-09-2023-1808?urlappend=%3Futm_source%3Dresearchgate.net%26utm_medium%3Darticle

Eliwa, E. H. I., Mohamed El Koshiry, A., Abd El-Hafeez, T., & Omar, A. (2024). Secure and transparent lung and colon cancer classification using blockchain and microsoft azure. Advances in Respiratory Medicine, 92(5), 395-420. https://doi.org/10.3390/arm92050037

George, A. S., & Baskar, T. (2024). Leveraging big data and sentiment analysis for actionable insights: A review of data mining approaches for social media. Partners Universal International Innovation Journal, 2(4), 39-59. https://doi.org/10.5281/zenodo.13623777

Gifari, O. I., Adha, M., Hendrawan, I. R., Freddy, F., & Durrand, S. (2022). Analisis sentimen review film menggunakan TF-IDF dan support vector machine. JIFOTECH (Journal of Information Technology), 2(1).

Herbst, J. P., & Mynett, M. (2025). Heaviness in Metal Music Production, Volume I: How and Why It Works. CRC Press.

Husada, H. C., & Paramita, A. S. (2021). Analisis sentimen pada maskapai penerbangan di platform Twitter menggunakan algoritma support vector machine. Teknika, 10(1), 18–26. https://doi.org/10.34148/teknika.v10i1.311

Jansen, B. J., Aldous, K. K., Salminen, J., Almerekhi, H., & Jung, S. G. (2023). Understanding Audiences, Customers, and Users via Analytics. Springer International Publishing AG.

Juslin, P. N., & Laukka, P. (2004). Expression, perception, and induction of musical emotions: A review and a questionnaire study of everyday listening. Journal of new music research, 33(3), 217-238. https://doi.org/10.1080/0929821042000317813

Kim, J., & Kim, M. (2025). Viral dynamics on social media: Enhancing brand engagement through meme marketing strategies. Journal of Current Issues & Research in Advertising, 46(4), 411-428. https://doi.org/10.1080/10641734.2024.2390850

Lange, P. G. (2007). Publicly private and privately public: Social networking on YouTube. Journal of computer-mediated communication, 13(1), 361-380. https://doi.org/10.1111/j.1083-6101.2007.00400.x

Liu, H., Chen, X., & Liu, X. (2022). A study of the application of weight distributing method combining sentiment dictionary and TF-IDF for text sentiment analysis. Ieee Access, 10, 32280-32289. https://doi.org/10.1109/ACCESS.2022.3160172

Manueke, G. W., Lumando, L., & Manueke, E. G. Y. (2023). Analisis sentimen lirik lagu populer dengan pemanfaatan teknik natural language processing. Global Science, 4.

Möller, A. M., Vermeer, S. A., & Baumgartner, S. E. (2024). Cutting through the comment chaos: A supervised machine learning approach to identifying relevant youtube comments. Social Science Computer Review, 42(1), 162-185. https://doi.org/10.1177/08944393231173895

Oham, C., & Ejike, O. G. (2024). Customer interaction and engagement: A theoretical exploration of live promotional tactics in the arts. Unpublished. https://doi.org/10.30574/msarr.2024.12.1.0170

O'Hara, W. (2022). The Techne of YouTube Performance: Musical Structure, Extended Techniques, and Custom Instruments in Solo Pop Covers. Music Theory Online, 28(3). https://doi.org/10.30535/mto.28.3.9

Ppali, S., Branch, B., Boem, A., Ppali, E., Scorer, M., Ang, C. S., ... & Covaci, A. (2025). The virtual concert-goer: Audience perspectives on remote music performances. Proceedings of the ACM on Human-Computer Interaction, 9(7), 1-30. https://doi.org/10.1145/3757431

Raharjo, J. H., & Budiharseno, R. S. (2024). Hitting the right notes: Exploring the relationship between YouTube music videos and digital user engagement. Global Business & Finance Review (GBFR), 29(1), 44-56.

Rasyid, R. A., Sari, D. K., & Games, D. (2024). Faktor pendorong konsumsi musik metal pada generasi millennial di Kota Jambi berdasarkan heaviness suatu analisa tematik. Cantata Deo Jurnal Musik dan Seni, 2(1), 25–37. https://doi.org/10.69748/jmcd.v2i1.173

Sarifiyono, A. P., Purnama, S., Pujianto, A. B., Sinaga, J., & Hendrian, A. S. (2025). Emotional Connection, Social Interaction, and Content Performance Quality in Building Fan Loyalty on Indonesian Indie Music Streaming Platforms. Sinergi International Journal of Management and Business, 3(1), 43-58. https://doi.org/10.61194/ijmb.v3i1.411

Syafia, A. N., Hidayattullah, M. F., & Suteddy, W. (2023). Studi komparasi algoritma SVM dan random forest pada analisis sentimen komentar YouTube BTS. Jurnal Ilmiah, 8(3).

Thet, T. T., Na, J. C., & Khoo, C. S. G. (2010). Aspect based sentiment analysis of movie reviews on discussion boards. Journal of Information Science, 36(6), 823–848. https://doi.org/10.1177/0165551510388123

Utari, E. L., & Wibowo, S. H. (2025). Analisis komparatif algoritma SVM naive Bayes dan LSTM pada sentimen komentar lagu Labour. Jurnal Informatika Teknologi dan Sains. https://doi.org/10.51401/jinteks.v7i3.6213

Wang, X., Wang, H., Bhandari, B., & Cheng, L. (2024). AI-empowered methods for smart energy consumption: A review of load forecasting, anomaly detection and demand response. International Journal of Precision Engineering and Manufacturing-Green Technology, 11(3), 963-993. https://doi.org/10.1007/s40684-023-00537-0?urlappend=%3Futm_source%3Dresearchgate.net%26utm_medium%3Darticle

Xin, L. (2024). Research on the strategy of content innovation and audience experience improvement in media programs in the digital age. Media Commun. Res, 5(4), 1-12. https://dx.doi.org/10.23977/mediacr.2024.050414

Published
2026-02-10
How to Cite
Moningkey, E., & Sombo, W. D. W. (2026). Sentiment Analysis of the Song Lost - Bring Me the Horizon Based on Reviews on YouTube Using the SVM Algorithm. Journal La Multiapp, 7(2), 486-499. https://doi.org/10.37899/journallamultiapp.v7i2.2685