International Journal of
Physical Sciences

  • Abbreviation: Int. J. Phys. Sci.
  • Language: English
  • ISSN: 1992-1950
  • DOI: 10.5897/IJPS
  • Start Year: 2006

Full Length Research Paper

Locally baseline detection for online Arabic script based languages character recognition

Muhammad Imran Razzak¹*, Muhammad Sher¹ and S. A. Hussain²
¹International Islamic University, Pakistan. ²Air University, Pakistan.

  •  Accepted: 15 June 2010
  •  Published: 31 July 2010

Abstract

Baseline detection is one of the most important steps in character recognition and has a direct influence on the recognition result. Owing to the complexity of Urdu script-based languages, handwritten character recognition is considerably more difficult for them than for other languages. Baseline detection is one of the main issues and a basic step in most preprocessing operations, namely normalization, skew correction and secondary stroke segmentation, as well as in feature extraction. This paper presents a novel method of baseline detection for cursive handwritten Urdu script. The proposed approach is divided into three steps: diacritical mark segmentation, primary baseline estimation and local baseline estimation. The local baseline is estimated using features extracted from the ending shapes of the words. Because of the structural differences between the Nasta'liq and Naskh styles, different rules are formed for baseline estimation in each style.

 

Key words: Baseline, Arabic, Nasta'liq, Naskh, preprocessing, character recognition, skewness.
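
As a rough illustration of the first two steps named in the abstract, the following Python sketch separates likely diacritical marks from primary strokes and then estimates a primary baseline from a horizontal projection histogram. It is not the authors' method: the stroke representation (lists of (x, y) pen points), the length-ratio heuristic, the histogram estimate, and the names `split_diacritics`, `primary_baseline`, `ratio` and `bin_size` are all assumptions made for illustration. The local, word-ending-based step is omitted because the abstract does not state its rules.

```python
from collections import Counter


def split_diacritics(strokes, ratio=0.3):
    """Split online strokes into (primary, secondary).

    Assumed heuristic: a stroke with far fewer pen points than the
    longest stroke is treated as a diacritical (secondary) mark.
    """
    longest = max(len(s) for s in strokes)
    primary = [s for s in strokes if len(s) >= ratio * longest]
    secondary = [s for s in strokes if len(s) < ratio * longest]
    return primary, secondary


def primary_baseline(strokes, bin_size=2):
    """Estimate a single horizontal baseline for the given strokes.

    Pen points are binned by y coordinate; the centre of the densest
    band is returned. Ties go to the band with the smallest y.
    """
    counts = Counter(int(y // bin_size) for s in strokes for _, y in s)
    best_bin, _ = max(counts.items(), key=lambda kv: (kv[1], -kv[0]))
    return (best_bin + 0.5) * bin_size


if __name__ == "__main__":
    # Hypothetical input: one long body stroke and one single-point dot.
    body = [(x, 10) for x in range(8)] + [(8, 11), (9, 14)]
    dot = [(3, 2)]
    main_strokes, marks = split_diacritics([body, dot])
    print(len(marks), primary_baseline(main_strokes))
```

Removing the secondary strokes first keeps dots and other marks from pulling the histogram peak away from the main body of the writing, which is presumably why the abstract lists diacritical mark segmentation before baseline estimation.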