Index of Garbledness for Automatic Recognition of Plain English Texts (Short Communication)
Abstract
This paper defines an Index of Garbledness (IG) for the automatic recognition of plain English text. The index is based on linguistic characteristics of the English language and does not use a dictionary. It also works for continuous text, that is, text without blank spaces between words. Because these characteristics are vague in nature, they are represented through fuzzy sets. The index itself is defined using a fuzzy similarity relation and a fuzzy dissimilarity measure. By comparing the index against a threshold value, one can test whether a given text (continuous, without word breaks) is plain English. If the text is not plain English, the index also indicates the extent to which it is garbled.
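The abstract does not give the definition of the index, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a single linguistic characteristic (single-letter frequency), represents it as a fuzzy set by scaling frequencies to membership grades in [0, 1], and uses the normalised Hamming distance between fuzzy sets as a stand-in dissimilarity measure. The names `ENGLISH_FREQ`, `membership`, `garbledness`, `is_plain_english`, the approximate frequency table, and the threshold of 0.3 are all hypothetical choices made for illustration.

```python
from collections import Counter
import string

# Approximate relative frequencies (percent) of letters in English text.
# Assumed reference values for illustration; not taken from the paper.
ENGLISH_FREQ = {
    "e": 12.7, "t": 9.1, "a": 8.2, "o": 7.5, "i": 7.0, "n": 6.7, "s": 6.3,
    "h": 6.1, "r": 6.0, "d": 4.3, "l": 4.0, "c": 2.8, "u": 2.8, "m": 2.4,
    "w": 2.4, "f": 2.2, "g": 2.0, "y": 2.0, "p": 1.9, "b": 1.5, "v": 1.0,
    "k": 0.8, "j": 0.15, "x": 0.15, "q": 0.1, "z": 0.07,
}


def membership(freqs):
    """Scale a letter -> weight mapping into membership grades in [0, 1]."""
    peak = max(freqs.values())
    return {ch: freqs.get(ch, 0.0) / peak for ch in string.ascii_lowercase}


# Fuzzy set describing how "typical" each letter is in English (assumption).
REFERENCE = membership(ENGLISH_FREQ)


def garbledness(text):
    """Return a dissimilarity score in [0, 1]; higher means more garbled.

    Spaces and punctuation are ignored, so continuous text without word
    breaks is handled the same way as ordinary text.
    """
    letters = [ch for ch in text.lower() if ch in string.ascii_lowercase]
    if not letters:
        raise ValueError("text contains no letters")
    observed = membership(Counter(letters))
    # Normalised Hamming distance between the two fuzzy sets.
    total = sum(abs(REFERENCE[ch] - observed[ch]) for ch in string.ascii_lowercase)
    return total / len(string.ascii_lowercase)


def is_plain_english(text, threshold=0.3):
    """Compare the score with a threshold (0.3 is an arbitrary choice)."""
    return garbledness(text) <= threshold


if __name__ == "__main__":
    sample = ("itwasthebestoftimesitwastheworstoftimes"
              "itwastheageofwisdomitwastheageoffoolishness")
    print(garbledness(sample), is_plain_english(sample))
    print(garbledness("zzzzqqqqxxxxjjjj"), is_plain_english("zzzzqqqqxxxxjjjj"))
```

A single-letter measure like this cannot distinguish plain text from a transposition of it, since both have the same letter counts. A practical index would presumably need to combine several characteristics, which is the kind of situation the fuzzy similarity relation mentioned in the abstract is meant to address.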
Defence Science Journal, 2010, 60(4), pp.415-419, DOI:http://dx.doi.org/10.14429/dsj.60.501