An Optimized Approach for Handwritten Arabic Character Recognition based on the SVM Classifier
Received: 15 October 2024 | Revised: 18 November 2024 | Accepted: 25 November 2024 | Online: 3 April 2025
Corresponding author: Biteur Kada
Abstract
Optical Character Recognition (OCR) is an essential technology, capable of addressing complex challenges while simplifying numerous human activities. Although it emerged in the 1970s with various solutions, these efforts primarily focused on Latin-based languages, leaving other writing systems, such as Arabic, largely underexplored. In this context, this study proposes an innovative offline Arabic handwriting recognition system based on a structural segmentation method combined with the use of Support Vector Machines (SVM) for character classification. An in-depth review of different character segmentation methods was followed by an in-depth analysis of the OCR field. This study examined the challenges associated with normalization, a recurring issue in the processing of handwritten scripts. Finally, after comparing the unique characteristics of Arabic handwritten characters with existing segmentation techniques, an approach was developed based on a segmentation algorithm to improve the accuracy and efficiency of the recognition process.
Keywords:
OCR, segmentation, Arabic characters, SVM, post-processingDownloads
References
M. Bennamoun and G. J. Mamic, "Optical Character Recognition," in Object Recognition: Fundamentals and Case Studies, M. Bennamoun and G. J. Mamic, Eds. London: Springer, 2002, pp. 199–220.
S. Faizullah, M. S. Ayub, S. Hussain, and M. A. Khan, "A Survey of OCR in Arabic Language: Applications, Techniques, and Challenges," Applied Sciences, vol. 13, no. 7, Jan. 2023, Art. no. 4584.
M. S. Khorsheed, "Off-Line Arabic Character Recognition – A Review," Pattern Analysis & Applications, vol. 5, no. 1, pp. 31–45, May 2002.
A. A. Alasadi, T. H. Aldhayni, R. R. Deshmukh, A. H. Alahmadi, and A. S. Alshebami, "Efficient Feature Extraction Algorithms to Develop an Arabic Speech Recognition System," Engineering, Technology & Applied Science Research, vol. 10, no. 2, pp. 5547–5553, Apr. 2020.
C. Bahlmann and H. Burkhardt, "The writer independent online handwriting recognition system frog on hand and cluster generative statistical dynamic time warping," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 26, no. 3, pp. 299–310, Mar. 2004.
M. Cheriet, N. Kharma, C. L. Liu, and C. Suen, Character Recognition Systems: A Guide for Students and Practitioners. John Wiley & Sons, 2007.
L. A. Fletcher and R. Kasturi, "A robust algorithm for text string separation from mixed text/graphics images," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 10, no. 6, pp. 910–918, Aug. 1988.
R. M. Haralick and L. G. Shapiro, "Image segmentation techniques," Computer Vision, Graphics, and Image Processing, vol. 29, no. 1, pp. 100–132, Jan. 1985.
N. R. Pal and S. K. Pal, "A review on image segmentation techniques," Pattern Recognition, vol. 26, no. 9, pp. 1277–1294, Sep. 1993.
N. Altwaijry and I. Al-Turaiki, "Arabic handwriting recognition system using convolutional neural network," Neural Computing and Applications, vol. 33, no. 7, pp. 2249–2261, Apr. 2021.
N. Sabbour and F. Shafait, "A segmentation-free approach to Arabic and Urdu OCR," in Document Recognition and Retrieval XX, Feb. 2013, vol. 8658, pp. 215–226.
L. M. Lorigo and V. Govindaraju, "Offline Arabic handwriting recognition: a survey," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 28, no. 5, pp. 712–724, Feb. 2006.
H. A. Abdo, A. Abdu, R. Manza, and S. Bawiskar, "An approach to analysis of arabic text documents into text lines, words, and characters," Indonesian Journal of Electrical Engineering and Computer Science, vol. 26, no. 2, May 2022, Art. no. 754.
N. Cristianini and J. Shawe-Taylor, An Introduction to Support Vector Machines and Other Kernel-based Learning Methods. Cambridge University Press, 2000.
J. Cervantes, F. Garcia-Lamont, L. Rodríguez-Mazahua, and A. Lopez, "A comprehensive survey on support vector machine classification: Applications, challenges and trends," Neurocomputing, vol. 408, pp. 189–215, Sep. 2020.
X. Zhou, J. Li, C. Yang, and J. Hao, "Study on Handwritten Digit Recognition using Support vector machine," IOP Conference Series: Materials Science and Engineering, vol. 452, no. 4, Sep. 2018, Art. no. 042194.
B. Kiessling, G. Kurin, M. T. Miller, and K. Smail, "Advances and Limitations in Open Source Arabic-Script OCR: A Case Study," Digital Studies / Le champ numérique, vol. 11, no. 1, Dec. 2021.
Downloads
How to Cite
License
Copyright (c) 2025 Biteur Kada, Arif Mohammed, Benhammadi Abdelmajid

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain the copyright and grant the journal the right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) after its publication in ETASR with an acknowledgement of its initial publication in this journal.