An Ensemble of Deep Learning Models Using a Genetic Algorithm for Improving Bird Image Segmentation
Received: 7 January 2026 | Revised: 10 February 2026 and 24 February 2026 | Accepted: 6 March 2026 | Online: 6 June 2026
Corresponding author: B. S. Chandrashekar
Abstract
Bird image segmentation and classification from natural-scene images captured by a device are important for ecological research focused on wildlife conservation and monitoring. Manually segmenting a bird image from a complex background is tedious, time-consuming, and process-intensive. Therefore, an automatic segmentation of bird images from the complex scene image, exploiting image processing techniques, has been extensively studied. Although many efforts have been made to propose segmentation techniques in a different context, the techniques cannot be generalized and may not perform well in some cases. Besides, the task of segmentation is very challenging due to occlusion, morphology, and lighting conditions. Therefore, there is a scope to fine-tune and enhance the segmentation ability of algorithms. The present study proposes a novel method of ensembling deep learning models using Genetic Algorithm (GA) for improving bird image segmentation in a complex background. The proposed methodology employs the segmentation results obtained by the individual deep learning models at the pixel level for ensembling. Five popular deep learning models, namely Unet, PSP-net, Link-net, Feature Pyramid Network (FPN), and Deeplabv3+, are used in the study. The CUB-200-2011 benchmark dataset was employed to conduct experiments, and the efficacy of the proposed approach was evaluated using the Intersection over Union (IoU) metric. The segmentation results obtained were very promising, achieving the highest IoU compared to contemporary segmentation approaches.
Keywords:
segmentation, birds, genetic algorithm, chromosome, pixel-wise combinationReferences
J. C. Canales, J. C. Canales, F. García-Lamont, A. Yee-Rendon, J. S. R. Castilla, and L. R. Mazahua, "Optimal Segmentation of Image Datasets by Genetic Algorithms Using Color Spaces," Expert Systems with Applications, vol. 238, Mar. 2024, Art. no. 121950.
H. Nematzadeh, J. García-Nieto, I. Navas-Delgado, and J. F. Aldana-Montes, "Ensemble-Based Genetic Algorithm Explainer with Automized Image Segmentation: A Case Study on Melanoma Detection Dataset," Computers in Biology and Medicine, vol. 155, Mar. 2023, Art. no. 106613.
P. Ghosh, M. Mitchell, J. A. Tanyi, and A. Y. Hung, "Incorporating Priors for Medical Image Segmentation Using a Genetic Algorithm," Neurocomputing, vol. 195, pp. 181–194, Jun. 2016.
M. Nouman, M. Mabrok, M. al-Shatouri, and E. A. Rashed, "Contextual Transformer for Brain Stroke Segmentation in MRI," Computers and Electrical Engineering, vol. 129, Jan. 2026, Art. no. 110795.
H. Zhou, L. Xu, C. Liu, and G. Li, "UDNDSNet: A Unified Deterministic and Non-Deterministic Segmentation Network for Multi-Scene Medical Image Analysis," Knowledge-Based Systems, vol. 331, Jan. 2026, Art. no. 114641.
Y. Ji, H. Li, G. Li, S. Liu, and X. Wan, "Sparse Diffusion Models for Multi-Annotator Medical Image Segmentation," Knowledge-Based Systems, vol. 330, Nov. 2025, Art. no. 114783.
W. Wang, J. Dang, Y. Wang, and R. Pan, "RefineHOS: A High-Performance Hand–Object Segmentation with Fine-Grained Spatial Features," Computer Vision and Image Understanding, vol. 262, Dec. 2025, Art. no. 104548.
H. Yin et al., "Swin Transformer-Based Maritime Objects Instance Segmentation with Dual Attention and Multi-Scale Fusion," Computer Vision and Image Understanding, vol. 262, Dec. 2025, Art. no. 104556.
S. Tang et al., "Segmentation of PET/CT Lung Cancer Lesion Images via a Semi-Supervised Improved SwinUNet Model," Biomedical Signal Processing and Control, vol. 113, Mar. 2026, Art. no. 108797.
Z. Huang, Q. Yang, and Y. Gao, "A Multi-Task Learning Network with Boundary Regression for Automatic Ultrasound Image Segmentation," Biomedical Signal Processing and Control, vol. 113, Mar. 2026, Art. no. 108828.
Z. Chen, Y. Wang, X. Li, and B. Li, "LSMAtt-Net: A Lightweight Brain Tumor Segmentation Approach with Efficiency–Complexity Tradeoff," Biomedical Signal Processing and Control, vol. 113, Mar. 2026, Art. no. 108930.
X. Chen, C. Wang, X. Wu, Q. Hu, X. Li, and S. Li, "S3-OCTA: A Precise Centerline-Based Method for Retinal Vessel Segmentation in OCTA Images," Biomedical Signal Processing and Control, vol. 113, Mar. 2026, Art. no. 109117.
H. P. Ramya Shree, Minavathi, and M. S. Dinesh, "Transformer Based Framework for Nuclei Segmentation and Cell Counting on Microscopic Images of Multiple Organs," International Journal of Intelligent Networks, vol. 6, pp. 283–292, 2025.
A. Chaurasia and E. Culurciello, "LinkNet: Exploiting Encoder Representations for Efficient Semantic Segmentation," in IEEE Visual Communications and Image Processing, St. Petersburg, FL, USA, Dec. 2017, pp. 1–4.
L.-C. Chen, Y. Zhu, G. Papandreou, F. Schroff, and H. Adam, "Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation," in Computer Vision – ECCV 2018, vol. 11211, V. Ferrari, M. Hebert, C. Sminchisescu, and Y. Weiss, Eds. Cham, Switzerland: Springer International Publishing, 2018, pp. 833–851.
T.-Y. Lin, P. Dollar, R. Girshick, K. He, B. Hariharan, and S. Belongie, "Feature Pyramid Networks for Object Detection," in IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, Jul. 2017, pp. 936–944.
O. Ronneberger, P. Fischer, and T. Brox, "U-Net: Convolutional Networks for Biomedical Image Segmentation," in Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015, vol. 9351, N. Navab, J. Hornegger, W. M. Wells, and A. F. Frangi, Eds. Cham, Switzerland: Springer International Publishing, 2015, pp. 234–241.
H. Zhao, J. Shi, X. Qi, X. Wang, and J. Jia, "Pyramid Scene Parsing Network," in IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, Jul. 2017, pp. 6230–6239.
B. Kriuk and M. Yordanov, "GeloVec: Higher Dimensional Geometric Smoothing for Coherent Visual Feature Extraction in Image Segmentation," in Empowering Novel Geometric Algebra for Graphics and Engineering, vol. 16510, E. Hitzer, G. Papagiannakis, and H. Hadfield, Eds. Cham, Switzerland: Springer Nature Switzerland, 2026, pp. 25–36.
R. V. V. Krishna and S. Srinivas Kumar, "Hybridizing Differential Evolution with a Genetic Algorithm for Color Image Segmentation," Engineering, Technology & Applied Science Research, vol. 6, no. 5, pp. 1182–1186, Oct. 2016.
B. Gharnali and S. Alipour, "MRI Image Segmentation Using Conditional Spatial FCM Based on Kernel-Induced Distance Measure," Engineering, Technology & Applied Science Research, vol. 8, no. 3, pp. 2985–2990, Jun. 2018.
C. Wah, S. Branson, P. Welinder, P. Perona, and S. Belongie, "Caltech-UCSD Birds-200-2011 (CUB-200-2011)." California Institute of Technology, 2011, [Online]. Available: https://www.vision.caltech.edu/datasets/cub_200_2011/.
B. S. Chandrashekar, "segmentation-using-Genetic-algorithm." [Online]. Available: https://github.com/chandrashekar6/segmentation-using-Genetic-algorithm-/blob/main/drafting_GA_segmentation_final_(1)-3.ipynb.
Downloads
How to Cite
License
Copyright (c) 2026 B. S. Chandrashekar, H. S. Nagendraswamy, M. P. Pavan Kumar

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) after its publication in ETASR with an acknowledgement of its initial publication in this journal.
