Enhanced Text Detection in Natural Scenes using Advanced Machine Learning Techniques

Shivanand Μ. Patil; V. S. Malemath; Suman Muddapur; Praveen M. Dhulavvagol

doi:10.48084/etasr.10029

Authors

Shivanand Μ. Patil Department of CSE, KLE Dr M S Sheshgiri College of Engineering and Technology (affiliated to VTU Belagavi), Belagavi, India
V. S. Malemath Department of CSE, KLE Dr M S Sheshgiri College of Engineering and Technology (affiliated to VTU Belagavi), Belagavi, India
Suman Muddapur Sangolli Rayanna First Grade Constituent College Belagavi, India
Praveen M. Dhulavvagol School of Computer Science and Engineering, KLE Technological University, Hubli, India

Volume: 15 | Issue: 2 | Pages: 22114-22118 | April 2025 | https://doi.org/10.48084/etasr.10029

Received: 25 December 2024 | Revised: 11 January 2025, 21 January 2025, and 29 January 2025 | Accepted: 31 January 2025 | Online: 3 April 2025

Corresponding author: Praveen M. Dhulavvagol

Abstract

Text detection in natural scenes remains a fundamental challenge in computer vision, impacting applications from mobile navigation to document digitization. Traditional methods struggle with varying text orientations, complex backgrounds, and inconsistent lighting, while recent deep-learning approaches face computational efficiency challenges. This paper presents a novel hybrid machine-learning framework that combines traditional computer vision with advanced machine learning to achieve robust text detection. The framework integrates optimized preprocessing techniques, feature extraction methods, including Histogram Oriented Gradients (HOG) and Maximally Stable Extremal Regions (MSER), and a lightweight convolutional neural network for improved accuracy and efficiency. Experimental evaluation on benchmark datasets demonstrates superior performance, achieving 98% precision, 97.5% recall, and 97.8% F1-score, while maintaining real-time processing capabilities at 45 fps. The framework significantly outperforms existing methods in handling diverse text scenarios, establishing a new standard for natural scene text detection. This research contributes to the advancement of text detection technology and offers practical applications in augmented reality, autonomous navigation, and document processing systems.

Keywords:

text detection, natural scenes, feature extraction, machine learning, CNN, SVM

Downloads

Download data is not yet available.

References

L. Dai and C. Chen, "Intelligent Detection Method of English Text in Natural Scenes in Video," Scientific Programming, vol. 2021, no. 1, 2021, Art. no. 6239112.

L. Sun, Q. Huo, W. Jia, and K. Chen, "A robust approach for text detection from natural scene images," Pattern Recognition, vol. 48, no. 9, pp. 2906–2920, Sep. 2015.

Xiangrong Chen and A. L. Yuille, "Detecting and reading text in natural scenes," in Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004., Washington, DC, USA, 2004, vol. 2, pp. 366–373.

A. Khalil, M. Jarrah, M. Al-Ayyoub, and Y. Jararweh, "Text detection and script identification in natural scene images using deep learning," Computers & Electrical Engineering, vol. 91, May 2021, Art. no. 107043.

J. Yin, J. Zhang, and D. Li, "Natural scene text recognition based on artificial intelligence machine learning," in Second International Conference on Electronic Information Technology (EIT 2023), Wuhan, China, Aug. 2023.

F. Naiemi, V. Ghods, and H. Khalesi, "Scene text detection using enhanced Extremal region and convolutional neural network," Multimedia Tools and Applications, vol. 79, no. 37, pp. 27137–27159, Oct. 2020.

T. Q. Phan, P. Shivakumara, and C. L. Tan, "Text detection in natural scenes using Gradient Vector Flow-Guided symmetry," in Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012), Tsukuba, Japan, Aug. 2012, pp. 3296–3299.

T. He, W. Huang, Y. Qiao, and J. Yao, "Text-Attentional Convolutional Neural Network for Scene Text Detection," IEEE Transactions on Image Processing, vol. 25, no. 6, pp. 2529–2541, Jun. 2016.

R. M. Badiger, R. Yakkundimath, G. Konnurmath, and P. M. Dhulavvagol, "Deep Learning Approaches for Age-based Gesture Classification in South Indian Sign Language," Engineering, Technology & Applied Science Research, vol. 14, no. 2, pp. 13255–13260, Apr. 2024.

H. Turki, M. Elleuch, M. Kherallah, and A. Damak, "Arabic-Latin Scene Text Detection based on YOLO Models," in 2023 International Conference on Innovations in Intelligent Systems and Applications (INISTA), Hammamet, Tunisia, Sep. 2023, pp. 1–6.

D. Cao, Y. Zhong, L. Wang, Y. He, and J. Dang, "Scene Text Detection in Natural Images: A Review," Symmetry, vol. 12, no. 12, Dec. 2020, Art. no. 1956.

S. Zhao, L. Sun, G. Li, Y. Liu, and B. Liu, "A CCD based machine vision system for real-time text detection," Frontiers of Optoelectronics, vol. 13, no. 4, pp. 418–424, Dec. 2020.

H. F. Mahmood, "Text Detection and Recognition from Natural Images," Ph.D. dissertation, Loughborough University, 2020.

A. Mittal, P. P. Roy, P. Singh, and B. Raman, "Rotation and script independent text detection from video frames using sub pixel mapping," Journal of Visual Communication and Image Representation, vol. 46, pp. 187–198, Jul. 2017.

P. M. Dhulavvagol and S. G. Totad, "Performance Enhancement of Distributed Processing Systems Using Novel Hybrid Shard Selection Algorithm," Engineering, Technology & Applied Science Research, vol. 14, no. 2, pp. 13720–13725, Apr. 2024.

Vol. 15 (2025)	Vol. 7 (2017)
Vol. 14 (2024)	Vol. 6 (2016)
Vol. 13 (2023)	Vol. 5 (2015)
Vol. 12 (2022)	Vol. 4 (2014)
Vol. 11 (2021)	Vol. 3 (2013)
Vol. 10 (2020)	Vol. 2 (2012)
Vol. 9 (2019)	Vol. 1 (2011)
Vol. 8 (2018)

Enhanced Text Detection in Natural Scenes using Advanced Machine Learning Techniques

Authors

Abstract

Keywords:

Downloads

References

Downloads

How to Cite

Metrics

License

Most read articles by the same author(s)

Pneumonia Detection in Chest X-Rays using Transfer Learning and TPUs

Deep Learning Approaches for Age-based Gesture Classification in South Indian Sign Language

Enhancing the Scalability of Blockchain Networks using a Data Partitioning Technique

Performance Enhancement of Distributed Processing Systems Using Novel Hybrid Shard Selection Algorithm