Cat Breed Classification with YOLOv11 and Optimized Training
Received: 12 January 2025 | Revised: 3 February 2025 and 7 February 2025 | Accepted: 8 February 2025 | Online: 25 February 2025
Corresponding author: Taoufik Saidani
Abstract
Accurate identification of cat breeds poses a significant challenge due to subtle inter-breed differences and intra-breed variability. This study leverages YOLOv11, the latest version of the YOLO family, to address these challenges through advanced deep-learning techniques. By training on a dataset consisting of five distinct cat breeds (Persian, Maine Coon, Siamese, Pallas's Cat, and Bengal), the model demonstrates exceptional capability in identifying nuanced breed-specific features. Data augmentation techniques were employed to enhance the dataset's diversity, while various optimization algorithms (Adam, Adamax, NAdam, AdamW, RAdam, RMSProp, and SGD) were evaluated to optimize the performance of the model. Experimental results showed that RAdam and SGD emerged as the top-performing optimizers, achieving an average recall of 96.8%, precision of 97.2%, and mAP50 of 98.1%, significantly outperforming other optimization methods. In contrast, RMSProp exhibited the lowest performance, particularly in terms of precision and mean Average Precision (mAP50). Additionally, data augmentation techniques were applied to enhance the diversity of the dataset, improving the robustness of the model. These findings highlight the effectiveness of YOLOv11 in cat breed classification, with potential applications in pet identification, animal conservation, and veterinary diagnostics.
Keywords:
cat breed classification, deep learning, YOLOv11, optimizerDownloads
References
G. Jocher, J. Qiu, and A. Chaurasia, "Ultralytics YOLO." Jan. 2023, [Online]. Available: https://github.com/ultralytics/ultralytics.
N. Qatrunnada, M. Fachrurrozi, and A. S. Utami, "Cat Breeds Classification Using Convolutional Neural Network For Multi-Object Image," Sriwijaya Journal of Informatics and Applications, vol. 3, no. 2, Aug. 2022.
K. Jangid, M. Vishwakarma, K. Pal, and L. Rodrigues, "Cat Breed & Emotion Detection Using Yolo, CNN & Canny Edge Detection," IRE Journals, vol. 7, no. 8, pp. 1–9, Feb. 2024.
R. C. Chen, V. S. Saravanarajan, and H. T. Hung, "Monitoring the behaviours of pet cat based on YOLO model and raspberry Pi," International Journal of Applied Science and Engineering, vol. 18, no. 5, pp. 1–12, 2021.
E. Cengil, A. Cinar, and M. Yildirim, "A Case Study: Cat-Dog Face Detector Based on YOLOv5," in 2021 International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies (3ICT), Zallaq, Bahrain, Sep. 2021, pp. 149–153.
T. Wang, "Enhanced feline facial recognition: advancing cat face detection with YOLOv8 and TensorRT," in Fourth International Conference on Computer Vision and Pattern Analysis (ICCPA 2024), Sep. 2024, vol. 13256, pp. 193–202.
R. Widyastuti and C. K. Yang, "Cat’s Nose Recognition Using You Only Look Once (Yolo) and Scale-Invariant Feature Transform (SIFT)," in 2018 IEEE 7th Global Conference on Consumer Electronics (GCCE), Nara, Japan, Oct. 2018, pp. 55–56.
H. T. Hung and R. C. Chen, "Pet cat behavior recognition based on YOLO model," in 2020 International Symposium on Computer, Consumer and Control (IS3C), Taichung City, Taiwan, Nov. 2020, pp. 391–394.
S. Ennaama, H. Silkan, A. Bentajer, and A. Tahiri, "Enhanced Real-Time Object Detection using YOLOv7 and MobileNetv3," Engineering, Technology & Applied Science Research, vol. 15, no. 1, pp. 19181–19187, Feb. 2025.
H. M. Zayani et al., "Deep Learning for Tomato Disease Detection with YOLOv8," Engineering, Technology & Applied Science Research, vol. 14, no. 2, pp. 13584–13591, Apr. 2024.
S. N. Rao, "YOLOv11 Explained: Next-Level Object Detection with Enhanced Speed and Accuracy," Medium, Oct. 22, 2024. https://medium.com/@nikhil-rao-20/yolov11-explained-next-level-object-detection-with-enhanced-speed-and-accuracy-2dbe2d376f71.
Downloads
How to Cite
License
Copyright (c) 2025 Hafedh Mahmoud Zayani, Amani Kachoukh, Refka Ghodhbani, Nouha khediri, Emane H. Abd. Elkawy, Ikhlass Ammar, Marouan Kouki, Taoufik Saidani

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) after its publication in ETASR with an acknowledgement of its initial publication in this journal.