International Conference on Computer Technology and Development, 3rd (ICCTD 2011)
373 Farsi/Arabic Document Image Retrieval Through Sub-Letter Shape Coding
Download citation file:
In this paper, A Novel method for Recognition free Farsi document retrieval is proposed. In this method, the retrieval is done through recognition of sub-letters and other elements of letters such as dots and some signs like Sarkesh. So at first in pre processing phase, lines and words are extracted using blank space between them. In the next phase, each word is divided to its sub-words. A sub-word is a combination of joint letters. For each sub-word, connectors of sub-letters are removed from the initial body of it and remains are recognized as sub-letters by using of their extracted features....