International Conference on Mechanical Engineering and Technology (ICMET-London 2011)
128 Segmentation of Chinese Text for Web Content Filtering
For some years we have been developing an effective English and Chinese bilingual web content categorization engine. Because the two languages differ in nature, the algorithms used to process them also differ significantly. In this paper, we evaluate a number of segmentation methods for Chinese text with the express purpose of analyzing textual web content for effective web content filtering. Based on the evaluation results, a specific method is adapted for the task.
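
As an illustration of what a Chinese segmentation method does, the sketch below shows forward maximum matching, a common dictionary-based approach. It is not necessarily one of the methods evaluated in this paper; the function name `forward_max_match`, the `max_len` parameter, and the tiny lexicon are all assumptions made for the example.

```python
def forward_max_match(text, lexicon, max_len=4):
    """Segment `text` greedily, left to right, preferring the longest lexicon word.

    Illustrative sketch only: `lexicon` is a hypothetical set of known words,
    and `max_len` is an assumed upper bound on word length in characters.
    """
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first and shrink until the lexicon matches.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            # A single character is always accepted so unknown text still advances.
            if size == 1 or candidate in lexicon:
                tokens.append(candidate)
                i += size
                break
    return tokens


# Hypothetical toy lexicon; a real system would load a large dictionary.
lexicon = {"中文", "网络", "内容", "过滤", "分词"}
tokens = forward_max_match("中文网络内容过滤", lexicon)
```

With this toy lexicon the input is split into four two-character words. Characters that are missing from the lexicon fall through as single-character tokens, which is one reason dictionary coverage matters for filtering accuracy.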