Skip to Main Content
Skip Nav Destination
ASME Press Select Proceedings

International Conference on Computer Research and Development, 5th (ICCRD 2013)

Editor
Fan Yama
Fan Yama
Search for other works by this author on:
ISBN:
9780791860182
No. of Pages:
278
Publisher:
ASME Press
Publication date:
2013

Deficit of classified texts for different languages especially Persian language is one of the main problems in developing text information retrieval and statistical machine translation systems. A new method of text classification in Persian language will be introduced in this paper. For this purpose, in the first step the keywords of different parts of texts will be extracted employing the classified documents of Wikipedia and through a probabilistic method. Then by employing two methods of machine learning which are based on k_Nearest Neighbor and decision tree algorithms, the texts will be classified in their specialized fields. The evaluation results reveal that this method is appropriate for classification of Persian texts. The results also show that the nearest neighbor method is more applicable to classify texts in comparison with decision tree method.

1. Introduction
2. Related Work
3. The Proposed Method
4. Experimental Results
5. Conclusions
References
This content is only available via PDF.
You do not currently have access to this chapter.
Close Modal

or Create an Account

Close Modal
Close Modal