Advanced Object Tracking in Video Surveillance Systems with Adaptive Deep SORT Enhancement
Received: 7 November 2024 | Revised: 2 December 2024 and 9 December 2024 | Accepted: 14 December 2024 | Online: 3 April 2025
Corresponding author: M. Koteswara Rao
Abstract
Object tracking is a crucial feature of video surveillance systems that are essential for maintaining awareness and detecting potential threats. Advanced solutions are needed to overcome the obstacles associated with video object tracking, including the complexity of everyday environments and the massive amount of data. Traditional tracking algorithms often struggle with the complexity of dynamic situations, necessitating the use of deep learning methods. This paper presents an innovative deep learning-based object tracking system that uses Multi-Level Glow-Worm Swarm Convolution Neural Networks (MLGS-CNNs) to detect objects in video frames. Subsequent object tracking is facilitated by the adaptive Deep Simple Online Real-time Tracking (DeepSORT) algorithm by incorporating an optimized Kalman filter instead of a conventional Kalman filter. The Waterwheel Plant Optimization (WPO) method is used to tune the noise covariances of the Kalman filter to further improve the tracking accuracy. Comprehensive performance criteria, including metrics such as Multiple Object Tracking Accuracy (MOTA), Multiple Object Tracking Precision (MOTP), Integrated Detection and False-alarm Rate (IDF1), Mostly Tracked (MT), and Mostly Lost (ML), are used to evaluate the effectiveness of our method.
Keywords:
Waterwheel Plant Optimization (WPO), Kalman filter, adaptive DeepSORTDownloads
References
J. Luo, H. Chen, Q. Ζhang, Y. Xu, H. Huang, and X. Zhao, "An improved grasshopper optimization algorithm with application to financial stress prediction," Applied Mathematical Modelling, vol. 64, pp. 654–668, Dec. 2018.
S. M. A. Hasan and K. Ko, "Depth edge detection by image-based smoothing and morphological operations," Journal of Computational Design and Engineering, vol. 3, no. 3, pp. 191–197, Jul. 2016.
Y. Tan, L. Liu, Q. Liu, J. Wang, X. Ma, and H. Ni, "Automatic breast DCE-MRI segmentation using compound morphological operations," in 2011 4th International Conference on Biomedical Engineering and Informatics, Shanghai, China, 2011, pp. 147–150.
K. K. Verma, P. Kumar, and A. Tomar, "Analysis of moving object detection and tracking in video surveillance system," in 2015 2nd International Conference on Computing for Sustainable Global Development, New Delhi, India, 2015, pp. 1758–1762.
R. Assaf, A. Goupil, V. Vrabie, T. Boudier, and M. Kacim, "Persistent homology for object segmentation in multidimensional grayscale images," Pattern Recognition Letters, vol. 112, pp. 277–284, Sep. 2018.
B. Zhan, D. N. Monekosso, P. Remagnino, S. A. Velastin, and L.-Q. Xu, "Crowd analysis: a survey," Machine Vision and Applications, vol. 19, no. 5–6, pp. 345–357, Oct. 2008.
Xiru W. U., Guoming H., and Lining S. U. N., "Fast Visual Identification and Location Algorithm for Industrial Sorting Robots Based on Deep Learning," Robot, vol. 38, no. 6, pp. 711–719, Nov. 2016.
H. M. Hodgetts, F. Vachon, C. Chamberland, and S. Tremblay, "See no evil: Cognitive challenges of security surveillance and monitoring," Journal of Applied Research in Memory and Cognition, vol. 6, no. 3, pp. 230–243, Sep. 2017.
R. Verschae and J. Ruiz-del-Solar, "Object Detection: Current and Future Directions," Frontiers in Robotics and AI, vol. 2, Nov. 2015, Art. no. 29.
M. Haghighat and M. Abdel-Mottaleb, "Low Resolution Face Recognition in Surveillance Systems Using Discriminant Correlation Analysis," in 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition, Washington, DC, USA, 2017, pp. 912–917.
G. Ciaparrone, F. Luque Sánchez, S. Tabik, L. Troiano, R. Tagliaferri, and F. Herrera, "Deep learning in video multi-object tracking: A survey," Neurocomputing, vol. 381, pp. 61–88, Mar. 2020.
M. Elhoseny, "Multi-object Detection and Tracking (MODT) Machine Learning Model for Real-Time Video Surveillance Systems," Circuits, Systems, and Signal Processing, vol. 39, no. 2, pp. 611–630, Feb. 2020.
H. Ahn and H.-J. Cho, "Research of multi-object detection and tracking using machine learning based on knowledge for video surveillance system," Personal and Ubiquitous Computing, vol. 26, no. 2, pp. 385–394, Apr. 2022.
K. Ullah, I. Ahmed, M. Ahmad, A. U. Rahman, M. Nawaz, and A. Adnan, "Rotation invariant person tracker using top view," Journal of Ambient Intelligence and Humanized Computing, vol. 14, no. 11, pp. 15343–15359, Nov. 2023.
J. Luiten et al., "HOTA: A Higher Order Metric for Evaluating Multi-object Tracking," International Journal of Computer Vision, vol. 129, no. 2, pp. 548–578, Feb. 2021.
Y. Zhang, C. Wang, X. Wang, W. Zeng, and W. Liu, "FairMOT: On the Fairness of Detection and Re-identification in Multiple Object Tracking," International Journal of Computer Vision, vol. 129, no. 11, pp. 3069–3087, Nov. 2021.
Y. Zhang et al., "Long-Term Tracking With Deep Tracklet Association," IEEE Transactions on Image Processing, vol. 29, pp. 6694–6706, 2020.
A. Shah, "UCSD Pedestrian Database." Kaggle, [Online]. Available: https://www.kaggle.com/datasets/aryashah2k/ucsd-pedestrian-database.
Downloads
How to Cite
License
Copyright (c) 2025 M. Koteswara Rao, P. M. Ashok Kumar

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) after its publication in ETASR with an acknowledgement of its initial publication in this journal.