Skip to Main Content
ASME Press Select Proceedings

International Conference on Computer Technology and Development, 3rd (ICCTD 2011)

By
Jianhong Zhou
Jianhong Zhou
Search for other works by this author on:
ISBN:
9780791859919
No. of Pages:
2000
Publisher:
ASME Press
Publication date:
2011

In this paper, A Novel method for Recognition free Farsi document retrieval is proposed. In this method, the retrieval is done through recognition of sub-letters and other elements of letters such as dots and some signs like Sarkesh. So at first in pre processing phase, lines and words are extracted using blank space between them. In the next phase, each word is divided to its sub-words. A sub-word is a combination of joint letters. For each sub-word, connectors of sub-letters are removed from the initial body of it and remains are recognized as sub-letters by using of their extracted features. The recognized sub-letters are encoded using a dictionary that has been defined in this system. Finally, the document content is encoded and this code can be used for retrieval of existing words in this document. Experimental results show advantages of this method in the retrieval of Persian printed documents.

This content is only available via PDF.
Close Modal
This Feature Is Available To Subscribers Only

Sign In or Create an Account

Close Modal
Close Modal