Index of Garbledness for Automatic Recognition of Plain English Texts (Short Communication)

  • P.K. Saxena Scientific Analysis Group, DRDO, Delhi
  • Pratibha Yadav Scientific Analysis Group, DRDO, Delhi
  • Girish Mishra Scientific Analysis Group, DRDO, Delhi
Keywords: Index of garbledness, automatic recognition, fuzzy sets, fuzzy similarity relation, fuzzy dissimilarity

Abstract

In this paper, an Index of Garbledness (IG) has been defined for automatic recognition of plain English texts based on linguistic characteristics of English language without using a dictionary. It also works for continuous text without word break-up (text without blank spaces between words). These characteristics, being vague in nature, are suitably represented through fuzzy sets. A fuzzy similarity relation and a fuzzy dissimilarity measure have been used to define this Index. Based on a threshold value of the Index, one can test whether the given text (continuous without word break-up) is a plain English text or not. In case the text under consideration is not a plain text, it also gives an indication to what extent it is garbled.

Defence Science Journal, 2010, 60(4), pp.415-419, DOI:http://dx.doi.org/10.14429/dsj.60.501

Author Biographies

P.K. Saxena, Scientific Analysis Group, DRDO, Delhi

Obtained his MSc (Mathematics) from Kanpur University and PhD (Algebra) from Indian Institute of Technology, Kanpur. He joined Scientific Analysis Group (SAG), DRDO in 1981. Presently working as Scientist H & Director, SAG.

Pratibha Yadav, Scientific Analysis Group, DRDO, Delhi

Obtained her MSc (Mathematics) from Delhi University in 1985. Presently she is working as Scientist F at SAG and is heading Traffic  Analysis Group. She has reviewed many research papers for well-known international conferences and journals.

Girish Mishra, Scientific Analysis Group, DRDO, Delhi

Obtained his MSc (Mathematics) and MPhil (Special Functions) from University of Rajasthan, Jaipur in 1998 and 2003 respectively. Presently, he is working as Scientist C at SAG.

Published
2010-07-09
How to Cite
Saxena, P., Yadav, P., & Mishra, G. (2010). Index of Garbledness for Automatic Recognition of Plain English Texts (Short Communication). Defence Science Journal, 60(4), 415-419. https://doi.org/10.14429/dsj.60.501