Comparison of AlexNet and ResNet50 Model Performance in Classifying Images of Indonesian Traditional Food
Abstract
Image classification using deep learning has become an effective approach in various fields, including visual object recognition such as food identification. This study aims to compare the performance of two well-known Convolutional Neural Network (CNN) architectures, AlexNet and ResNet50, in classifying traditional Indonesian food images. The dataset used in this research is a combination of two sources: a traditional Indonesian cake dataset from Kaggle and an additional set of images of Cirebon's traditional dishes. The final dataset consists of 24 food categories with more than 4,000 images in total. Each image was preprocessed through several steps including resizing to 224x224 pixels, applying data augmentation to training samples to enhance variation, and normalization based on standard input formats of the models. The training process was carried out using the 5-Fold Cross Validation method, while performance was evaluated using accuracy, precision, recall, and F1-score metrics. Experimental results show that ResNet50 consistently outperformed AlexNet across all evaluation metrics. ResNet50 achieved an average accuracy of 92%, compared to 86% obtained by AlexNet. Additionally, ResNet50 demonstrated superior performance in terms of precision, recall, and F1-score. This difference indicates that deeper and more complex architectures like ResNet50 are more effective in learning visual patterns in diverse traditional food images. The study concludes that ResNet50 is a more optimal choice for the task of traditional Indonesian food image classification. These findings serve as a basis for future development of image-based food recognition systems and support the preservation of culinary heritage through artificial intelligence technology.
References
Apriyanto, Y., Md Sharif, M. S., Mohd Shahril, A., Ishak, N., & Hashim, N. F. (2024). Exploring the concept of Indonesia traditional food sustainability. Journal of Tourism, Hospitality and Culinary Arts, 16(1), 351-377.
Azizah, A. N., Budiman, I., Indriani, F., Faisal, M. R., & Herteno, R. (2024). Image classification of traditional Indonesian cakes using convolutional neural network (CNN). Computer Engineering and Applications, 13, 38–49. http://dx.doi.org/10.18495/comengapp.v13i2.469
Bihari, S. (2023). Cultural heritage and indigenous knowledge: Reviving traditions for future generations. Sustainable Development Goals in SAARC Countries: Key Issues, Opportunities and Challenges, 1, 24-32. http://dx.doi.org/10.5281/zenodo.8249545
Chempak Kumar, A., & Mubarak, D. M. N. (2024). Ensembled CNN with artificial bee colony optimization method for esophageal cancer stage classification using SVM classifier. Journal of X-Ray Science and Technology, 32(1), 31-51. https://doi.org/10.3233/XST-230111
Chukwurah, G. O., Okeke, F. O., Isimah, M. O., Enoguanbhor, E. C., Awe, F. C., Nnaemeka-Okeke, R. C., ... & Okeke, C. A. (2025). Cultural Influence of Local Food Heritage on Sustainable Development. World, 6(1), 10. https://doi.org/10.3390/world6010010
Danendra, F. A., Anggraeny, F. T., & Maulana, H. (2023). Image classification of chili leaf disease using CNN architecture AlexNet and SqueezeNet. Syntax: Journal of Informatics, 12, 50–61. http://dx.doi.org/10.1016/j.ecoinf.2022.101663
Demirci, M. Y., Beşli, N., & Gümüşçü, A. (2021). Efficient deep feature extraction and classification for identifying defective photovoltaic module cells in Electroluminescence images. Expert Systems with Applications, 175, 114810. https://doi.org/10.1016/j.eswa.2021.114810
DLY, I. A., Jasril, Sanjaya, S., Handayani, L., & Yanto, F. (2023). Image classification of beef and pork using CNN AlexNet and data augmentation. Journal of Information System Research (JOSH), 4, 1176–1185. https://doi.org/10.47065/josh.v4i4.3702
Elngar, A. A., Arafa, M., Fathy, A., Moustafa, B., Mahmoud, O., Shaban, M., & Fawzy, N. (2021). Image classification based on CNN: a survey. Journal of Cybersecurity and Information Management, 6(1), 18-50. https://doi.org/10.54216/JCIM.060102
Faturrahman, R., Hariyani, Y. S., & Hadiyoso, S. (2023). Classification of Indonesian traditional snacks based on deep learning and transfer learning methods. ELKOMIKA: Journal of Electrical Energy Engineering, Telecommunication Engineering, & Electronics Engineering, 945–957.
Gusti, H. P., Haerani, E., Syafria, F., Yanto, F., & Gusti, S. K. (2024). Implementation of convolutional neural network (ResNet-50) algorithm for classification of benign and malignant skin cancer. MALCOM: Indonesian Journal of Machine Learning and Computer Science, 4, 984–992. https://doi.org/10.57152/malcom.v4i3.1398
Hamzah, A. S., & Hreshee, S. S. (2025). A Scalable Face Recognition Framework: Integrating HOG Features, PCA, and CNN for Offline and Web-Based Systems. International Journal of Intelligent Engineering & Systems, 18(2). https://doi.org/10.22266/ijies2025.0331.28
Hermanto, A. R., Aziz, A., & Sudianto. (2024). Comparison of MobileNetV2 and ResNet50 architectures for date fruit classification. Journal of Information Systems and Technology, 12, 630–637. https://doi.org/10.26418/justin.v12i4.80358
Iqbal, S., N. Qureshi, A., Li, J., & Mahmood, T. (2023). On the analyses of medical images using traditional machine learning techniques and convolutional neural networks. Archives of Computational Methods in Engineering, 30(5), 3173-3233. https://doi.org/10.1007/s11831-023-09899-9
Irfansyah, D., Mustikasari, M., & Suroso, A. (2021). AlexNet convolutional neural network (CNN) architecture for pest classification on coffee plant leaf images. Informatics Journal: IT Development Journal (JPIT), 6, 87–92. http://dx.doi.org/10.30591/jpit.v6i2.2802
Iskandar, J. S., & Kristinto, R. P. (2023). Recognition and classification of various Indonesian cakes using ResNet50V2 architecture in convolutional neural network (CNN). [Journal name missing], 81–92.
Jena, B., Nayak, G. K., & Saxena, S. (2022). Convolutional neural network and its pretrained models for image classification and object detection: A survey. Concurrency and Computation: Practice and Experience, 34(6), e6767. https://doi.org/10.1002/cpe.6767
Jiang, P., Neri, F., Xue, Y., & Maulik, U. (2024). A generalised attention mechanism to enhance the accuracy performance of neural networks. International journal of neural systems, 2450063. https://doi.org/10.1142/s0129065724500631
Lasniari, S., Jasril, Sanjaya, S., Yanto, F., & Affandes, M. (2022). Effect of ResNet-50 convolutional neural network hyperparameters on classification of beef and pork images. National Journal of Computing and Information Technology, 5, 474–481. https://doi.org/10.32672/jnkti.v5i3.4424
Liu, Y., Pu, H., & Sun, D. W. (2021). Efficient extraction of deep image features using convolutional neural network (CNN) for applications in detecting and analysing complex food matrices. Trends in Food Science & Technology, 113, 193-204. https://doi.org/10.1016/j.tifs.2021.04.042
Marpaung, F., Koemadji, Z. A., Hidayat, M., Widianto, A., & Aulia, F. (2023). Selection of Food Identification System Features Using Convolutional Neural Network (CNN) Method. Scientific Journal of Informatics, 10(2), 205-2016. http://dx.doi.org/10.15294/sji.v10i2
Matarat, K. (2024). Enhancing Thai Food Classification: A CNN-Based Approach with Transfer Learning. Mathematical Modelling of Engineering Problems, 11(6). http://dx.doi.org/10.18280/mmep.110625
Permana, F. Y., Aditya, C. S. K., & Chandranegara, D. R. (2024). Segmentation and image classification of skin cancer using convolutional neural network (CNN) with ResNet-50 architecture. REPOSITOR, 6, 391–404.
Pugra, I. W., Kencanawati, A. A. A. M., & Kurniawan, I. G. W. A. (2025). The cultural significance of traditional foods in shaping Indonesian social identity: Challenges and preservation strategies. Journal of Language, Literature, Social and Cultural Studies, 3(1), 21-31. https://doi.org/10.58881/jllscs.v3i1.318
Qawasmeh, B., Oh, J. S., & Kwigizile, V. (2025). Comparative Analysis of AlexNet, ResNet-50, and VGG-19 Performance for Automated Feature Recognition in Pedestrian Crash Diagrams. Applied Sciences, 15(6), 2928. https://doi.org/10.3390/app15062928
Rohim, A., Sari, Y. A., & Tibyani. (2019). Convolutional neural network (CNN) for classification of traditional food images. Journal of Information Technology and Computer Science Development, 3, 7037–7042.
Sharma, S., & Guleria, K. (2022, April). Deep learning models for image classification: comparison and applications. In 2022 2nd International Conference on Advance Computing and Innovative Technologies in Engineering (ICACITE) (pp. 1733-1738). IEEE. http://dx.doi.org/10.1109/ICACITE53722.2022.9823516
Sulistia, F. Y., & Vatresia, A. (2024). Implementation of deep learning using convolutional neural network (CNN) for chicken meat classification with ResNet-50 architecture. Journal of Information Technology and Computer Science (INTECOMS), 7, 685–691. http://dx.doi.org/10.31539/intecoms.v7i3.10087
Tang, W., Sun, J., Wang, S., & Zhang, Y. (2023). Review of AlexNet for medical image classification. arXiv preprint arXiv:2311.08655. https://doi.org/10.48550/arXiv.2311.08655
Ullah, A., Elahi, H., Sun, Z., Khatoon, A., & Ahmad, I. (2022). Comparative analysis of AlexNet, ResNet18 and SqueezeNet with diverse modification and arduous implementation. Arabian journal for science and engineering, 47(2), 2397-2417. http://dx.doi.org/10.1007/s13369-021-06182-6
Vishwakarma, N., Jaiswal, A., & Verma, M. (2025). Introduction to Culinary Heritage. In Quality Assurance of Ethno-Herbals: Cultivating Confidence in Alternative Medicine (pp. 1-15). Bentham Science Publishers. http://dx.doi.org/10.2174/9789815274554125010003
Watanobe, Y., Rahman, M. M., Amin, M. F. I., & Kabir, R. (2023). Identifying algorithm in program code based on structural features using CNN classification model. Applied Intelligence, 53(10), 12210-12236. http://dx.doi.org/10.1007/s10489-022-04078-y
Yuan, S., Song, G., Huang, G., & Wang, Q. (2022). Reshaping hyperspectral data into a two-dimensional image for a CNN model to classify plant species from reflectance. Remote Sensing, 14(16), 3972. https://doi.org/10.3390/rs14163972
Zidni, E., & Akbar, M. (2024). Classification of Pasuruan City's traditional food images using convolutional neural network. Informatics and Artificial Intelligence Journal, 1, 65–72. http://dx.doi.org/10.1109/ICICoS51170.2020.9299039
Copyright (c) 2025 Journal La Multiapp

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.



