Predicting ZNERGY Athletes' Swimming Times Using Random Forest Based on Lifestyle Physiological Exercise
Abstract
This research was conducted to fill the gap in previous research in swimming performance prediction that rarely combines daily lifestyle aspects with training and physiological data, especially in the context of local swimming clubs in Indonesia that still rely on conventional approaches. The purpose of this research is to develop a prediction model for 50-meter freestyle swimming times using a holistic Random Forest Regressor algorithm. The method applied is a data-oriented methodology for prediction with longitudinal primary data from 10 ZNERGY Aquatic Swimming Club adolescent athletes (81 observations) collected through coach logbooks for training variables and time targets, and digital questionnaires for parents for physiological and lifestyle variables, including a specially developed Nutrition Compliance Index (IKG). The data was processed with Python (Scikit-Learn and Pandas), including pre-processing, lagged feature engineering, and model evaluation with an 80:20 split. The results show the model has high accuracy with an R² of 0.9591, MAE of 1.511 seconds, and RMSE of 2.254 seconds; The most important variable was the previous test time (45.66%), followed by training variables (33.67%), while lifestyle variables contributed little. The implication of this study is the availability of an evidence-based decision support system for coaches to design training programs objectively, optimize athlete progress, and prevent overtraining through a holistic approach that integrates multidimensional data.
References
Abdullah, R. W., & Kusumastuti, R. (2025). Analisis Pengolahan Ekstraksi Fitur Citra Untuk Klasifikasi Jenis Apel Menggunakan Scikit-Learn Dengan Algoritma K-Nearest Neighbor. Jurnal Teknologi Informasi dan Ilmu Komputer, 12(1), 165-174. https://doi.org/10.25126/jtiik.2025129149
AlKasasbeh, W., & Akroush, S. (2024). Investigating the interrelationships among food habits, sports nutrition knowledge, and perceived barriers to healthy eating: a study of adolescent swimmers. Frontiers in Nutrition, 11, 1381801. https://doi.org/10.3389/fnut.2024.1381801
Amendolara, A., Pfister, D., Settelmayer, M., Shah, M., Wu, V., Donnelly, S., ... & Bills, K. (2023). An overview of machine learning applications in sports injury prediction. Cureus, 15(9).
Aurélien, G. (2019). Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow. Concepts, tools, and techniques to build intelligent systems, 2nd ednn.
Bardan, I., & Lakhdari, E. V. (2025). Relationship between Balanced Nutrition and Sports Performance in Swimmers. Bulletin of the Transilvania University of Braşov. Series IX: Sciences of Human Kinetics, 43-50. https://doi.org/10.31926/but.shk.2025.18.67.1.5
Barry, L., Lyons, M., McCreesh, K., Powell, C., & Comyns, T. (2022). International survey of training load monitoring practices in competitive swimming: how, what and why not?. Physical Therapy in Sport, 53, 51-59. https://doi.org/10.1016/j.ptsp.2021.11.005
Bompa, T. O., & Buzzichelli, C. (2021). Periodization of strength training for sports. USA: Human Kinetics Publishers.
Capra, M. E., Stanyevic, B., Giudice, A., Monopoli, D., Decarolis, N. M., Esposito, S., & Biasucci, G. (2024). Nutrition for children and adolescents who practice sport: A narrative review. Nutrients, 16(16), 2803. https://doi.org/10.3390/nu16162803
Carter, A. D., & Bloom, G. A. (2009). Coaching knowledge and success: Going beyond athletic experiences. Journal of sport behavior, 32(4), 419.
Cossich, V. R., Carlgren, D., Holash, R. J., & Katz, L. (2023). Technological breakthroughs in sport: Current practice and future potential of artificial intelligence, virtual reality, augmented reality, and modern data visualization in performance analysis. Applied Sciences, 13(23), 12965. https://doi.org/10.3390/app132312965
Dann, R., Duhig, S., Roberts, L., Kelly, V., Renshaw, I., & Headrick, J. (2024). A principled approach to skill acquisition in competitive surfing: Embracing representative learning design. International Journal of Sports Science & Coaching, 19(6), 2534-2547. https://doi.org/10.1177/17479541241279044
Drew, M. K., & Finch, C. F. (2016). The relationship between training load and injury, illness and soreness: a systematic and literature review. Sports medicine, 46(6), 861-883. https://doi.org/10.1007/s40279-015-0459-8
Fitriani, S., Budiman, E., Fadli, M., Surono, M., & Sulistiani, H. (2025, July). Optimalisasi Metode Random Forest menggunakan Particle Swarm Optimization dalam Prediksi Prestasi Mahasiswa. In Prosiding Seminar Nasional Teknologi Komputer dan Sains (Vol. 3, No. 1, pp. 406-415).
Fonti, F., Ross, J. M., & Aversa, P. (2023). Using sports data to advance management research: A review and a guide for future studies. Journal of Management, 49(1), 325-362. https://doi.org/10.1177/01492063221117525
Ghosh, I., Ramasamy Ramamurthy, S., Chakma, A., & Roy, N. (2023). Sports analytics review: Artificial intelligence applications, emerging technologies, and algorithmic perspective. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 13(5), e1496. https://doi.org/10.1002/widm.1496
Grandou, C., Wallace, L., Impellizzeri, F. M., Allen, N. G., & Coutts, A. J. (2020). Overtraining in resistance exercise: an exploratory systematic review and methodological appraisal of the literature. Sports Medicine, 50(4), 815-828. https://doi.org/10.1007/s40279-019-01242-2
Halson, S. L., & Jeukendrup, A. E. (2004). Does overtraining exist? An analysis of overreaching and overtraining research. Sports medicine, 34(14), 967-981. https://doi.org/10.2165/00007256-200434140-00003
Haselton, M. G., Bryant, G. A., Wilke, A., Frederick, D. A., Galperin, A., Frankenhuis, W. E., & Moore, T. (2009). Adaptive rationality: An evolutionary perspective on cognitive bias. Social Cognition, 27(5), 733-763. https://doi.org/10.1521/soco.2009.27.5.733
Li, Y., & Mu, Y. (2024). Research and performance analysis of random forest-based feature selection algorithm in sports effectiveness evaluation. Scientific Reports, 14(1), 26275. https://doi.org/10.1038/s41598-024-76706-1
Liu, C., Xu, B., Wan, K., Sun, Q., Wang, R., Feng, Y., ... & Wang, R. (2024). Improved prediction of swimming talent through random forest analysis of anthropometric and physiological phenotypes. Phenomics, 4(5), 465-472. https://doi.org/10.1007/s43657-024-00176-8
Lopes, T. J., Sampaio, T., Oliveira, J. P., Pinto, M. P., Marinho, D. A., & Morais, J. E. (2023). Using wearables to monitor swimmers’ propulsive force to get real-time feedback and understand its relationship to swimming velocity. Applied Sciences, 13(6), 4027. https://doi.org/10.3390/app13064027
Mehrabi, A. N., Imani, H., Khademnoe, O., Khantan, M., Lundberg, T. R., & Gorzi, A. (2025). Modelling of performance prediction by analysis of elite swimmers’ anthropometry, peak performance age and age-related performance progression. PLoS One, 20(9), e0332306. https://doi.org/10.1371/journal.pone.0332306
Mesquita, I., Isidro, S., & Rosado, A. (2010). Portuguese coaches’ perceptions of and preferences for knowledge sources related to their professional background. Journal of sports science & medicine, 9(3), 480.
Mujika, I., Pyne, D. B., Wu, P. P. Y., Ng, K., Crowley, E., & Powell, C. (2023). Next-generation models for predicting winning times in elite swimming events: updated predictions for the Paris 2024 Olympic games. International journal of sports physiology and performance, 18(11), 1269-1274. https://doi.org/10.1123/ijspp.2023-0174
Nash, C., & Sproule, J. (2011). Insights into experiences: Reflections of an expert and novice coach. International Journal of Sports Science & Coaching, 6(1), 149-161. https://doi.org/10.1260/1747-9541.6.1.149
Ofoghi, B., Zeleznikow, J., MacMahon, C., & Raab, M. (2013). Data mining in elite sports: a review and a framework. Measurement in Physical Education and Exercise Science, 17(3), 171-186. https://doi.org/10.1080/1091367X.2013.805137
Romdhani, M., Hammouda, O., Smari, K., Chaabouni, Y., Mahdouani, K., Driss, T., & Souissi, N. (2021). Total sleep deprivation and recovery sleep affect the diurnal variation of agility performance: the gender differences. The Journal of Strength & Conditioning Research, 35(1), 132-140. https://doi.org/10.1519/JSC.0000000000002614
Ruiz-Navarro, J. J., Santos, C. C., Born, D. P., López-Belmonte, Ó., Cuenca-Fernández, F., Sanders, R. H., & Arellano, R. (2025). Factors relating to sprint swimming performance: a systematic review. Sports Medicine, 55(4), 899-922. https://doi.org/10.1007/s40279-024-02172-4
Salas, E., Tannenbaum, S. I., Kraiger, K., & Smith-Jentsch, K. A. (2012). The science of training and development in organizations: What matters in practice. Psychological science in the public interest, 13(2), 74-101. https://doi.org/10.1177/1529100612436661
Soltani, M., Haytabakhsh, M. R., Najman, J. M., Williams, G. M., O’Callaghan, M. J., Bor, W., ... & Clavarino, A. (2012). Sleepless nights: the effect of socioeconomic status, physical activity, and lifestyle factors on sleep quality in a large cohort of Australian women. Archives of women's mental health, 15(4), 237-247. https://doi.org/10.1007/s00737-012-0281-3
Susanto, E. R., Inzaghi, M. R., Amarudin, A., & Neneng, N. (2025). Evaluasi Kinerja Model Random Forest Dalam Memprediksi Diabetes Berdasarkan Dataset Kesehatan di Indonesia. Jurnal Pendidikan dan Teknologi Indonesia, 5(7), 1857-1866.
Turfi, A. I. (2023). Analisis Performa Renang Nomor 100meter Gaya Kupu-Kupu Pada Empat Kali Penyelengaraan Olimpiade. Jurnal Prestasi Olahraga, 6(2), 67-74. https://doi.org/10.1234/jpo.v6i2.54917
Van Woezik, R. A., McLaren, C. D., Côté, J., Erickson, K., Law, B., Horning, D. L., ... & Bruner, M. W. (2021). Real versus ideal: Understanding how coaches gain knowledge. International Sport Coaching Journal, 9(2), 189-202. https://doi.org/10.1123/iscj.2021-0010
Wang, S., Li, B., Wu, Y., Ungvari, G. S., Ng, C. H., Fu, Y., ... & Xiang, Y. T. (2017). Relationship of sleep duration with sociodemographic characteristics, lifestyle, mental health, and chronic diseases in a large Chinese adult population. Journal of Clinical Sleep Medicine, 13(3), 377-384. https://doi.org/10.5664/jcsm.6484
Zacca, R., Demarie, S., & Morouço, P. (2024). Training and performance in swimming. Frontiers in physiology, 15, 1402543. https://doi.org/10.3389/fphys.2024.1402543
Copyright (c) 2026 Journal La Multiapp

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.



