ASME Press Select Proceedings
International Conference on Computer Technology and Development, 3rd (ICCTD 2011)
Jianhong Zhou
ASME Press

This paper proposes a novel method for recognition-free Farsi document retrieval. Retrieval is performed by recognizing sub-letters and other letter elements, such as dots and signs like Sarkesh. In the pre-processing phase, lines and words are extracted using the blank space between them. In the next phase, each word is divided into its sub-words, where a sub-word is a combination of joined letters. For each sub-word, the connectors between sub-letters are removed from its initial body, and the remaining parts are recognized as sub-letters using their extracted features. The recognized sub-letters are encoded using a dictionary defined in this system. Finally, the document content is encoded, and this code can be used to retrieve words that occur in the document. Experimental results show the advantages of this method for retrieving printed Persian documents.
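
The pre-processing step described above (extracting lines and words from the blank space between them) can be illustrated with projection profiles. The following is only a minimal sketch under stated assumptions, not the authors' implementation: it assumes the page is already binarized into a grid of 0 (background) and 1 (ink), and the names `runs_of_ink`, `split_lines`, `split_words`, and the `min_gap` threshold are hypothetical, introduced here for illustration.

```python
from typing import List, Tuple

# Assumed input: a binarized page, 1 = ink, 0 = background.
Image = List[List[int]]


def runs_of_ink(profile: List[int], min_gap: int = 1) -> List[Tuple[int, int]]:
    """Return (start, end) ranges, end exclusive, of inked stretches.

    Stretches separated by fewer than `min_gap` blank positions are merged.
    """
    runs: List[Tuple[int, int]] = []
    start = None
    for i, count in enumerate(profile):
        if count > 0 and start is None:
            start = i                      # an inked stretch begins
        elif count == 0 and start is not None:
            runs.append((start, i))        # the stretch ends at a blank
            start = None
    if start is not None:
        runs.append((start, len(profile)))

    merged: List[Tuple[int, int]] = []
    for run in runs:
        if merged and run[0] - merged[-1][1] < min_gap:
            merged[-1] = (merged[-1][0], run[1])   # gap too small: same unit
        else:
            merged.append(run)
    return merged


def split_lines(image: Image) -> List[Tuple[int, int]]:
    """Find text lines as row ranges separated by fully blank rows."""
    row_profile = [sum(row) for row in image]
    return runs_of_ink(row_profile)


def split_words(image: Image, top: int, bottom: int,
                min_gap: int = 2) -> List[Tuple[int, int]]:
    """Find words in one line as column ranges separated by wide blank gaps.

    `min_gap` is a hypothetical threshold: narrower gaps are treated as
    spacing inside a word (for example between sub-words).
    """
    width = len(image[0]) if image else 0
    col_profile = [sum(image[r][c] for r in range(top, bottom))
                   for c in range(width)]
    return runs_of_ink(col_profile, min_gap)
```

Each line range returned by `split_lines` can be passed to `split_words` as `top` and `bottom`. Because Farsi is written right to left, the column ranges would need to be read in reverse to follow reading order. Sub-word segmentation and connector removal, which the paper applies next, are not covered by this sketch.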

Key Words
1. Introduction
2. Previous Related Works
3. Farsi Script
4. Pre-Processing
5. Processing
6. Experimental Results
7. Conclusion
This content is only available via PDF.