Analysis of Segmentation Methods for Brahmi Script

  • Ajay Pratap Singh Department of Library and Information Science, Banaras Hindu University, Varanasi – 221005, India
  • Ashwin Kumar Kushwaha Department of Library and Information Science, Banaras Hindu University, Varanasi – 221005, India
Keywords: Digitization, Projection profile, Segmentation process, Connected character, Line segmentation, Word segmentation, Character segmentation, Brahmi script

Abstract

Segmentation is an important step for developing any optical character recognition (OCR) system, which has to be redesigned for each script having, non-uniform nature/property. It is used to decompose the image into its sub-units, which act as a basis for character recognition. Brahmi is a non-cursive ancient script, in which characters are not attached to each other and have some spacing between them. This study analyses various segmentation methods for different scripts to develop the best suitable segmentation method for Brahmi. MATLAB software was used for segmentation purpose in the experiment. The sample data belongs to Brahmi script-based ‘Rumandei inscription’. In this paper, we discuss a segmentation methodology for distinct components, namely text lines, words and characters of Rumandei inscription, written in Brahmi script. For segmenting distinct components of inscription different approach were used like horizontal projection profile, vertical projection profile and Relative minima approach. This is fundamental research on an inscription based on Brahmi script, which acts as a foundation for developing a segmentation module of an OCR solution/system of similar scripts in future. Information search and retrieval is an important activity of a library. So, to ensure this support for digitised documents written in ancient script, their character recognition is mandatory through the OCR system.

Author Biographies

Ajay Pratap Singh, Department of Library and Information Science, Banaras Hindu University, Varanasi – 221005, India

Dr Ajay P. Singh holds a PhD in Library and Information Science. Presently, he is working as Professor in the Department of Library and Information Science, Banaras hindu university, Varanasi. He had research experience in OCR, digital preservation, cloud computing applications etc. He has published around 100 research paper in various national and international sources. These results show the follow up research activities of the previously completed UGC Major Project “Designing and Development of an Expert System for Character Recognition of Indian Manuscripts written in Brahmi Script”.

Ashwin Kumar Kushwaha, Department of Library and Information Science, Banaras Hindu University, Varanasi – 221005, India

Mr Ashwin, holds Master degree in Library and Information Science from Banaras hindu university, Varanasi. He is availing UGC Junior Research Fellowship and presently doing PhD degree from Banaras Hindu University. He is doing research on ‘OCR for ancient Indian scripts’. He has completed a number of short-term courses on the present area of research.

Published
2019-03-11
How to Cite
Singh, A., & Kushwaha, A. (2019). Analysis of Segmentation Methods for Brahmi Script. DESIDOC Journal of Library & Information Technology, 39(2), 109-116. https://doi.org/10.14429/djlit.39.2.13615
Section
Research Paper