ASME Press Select Proceedings

International Conference on Information Technology and Computer Science, 3rd (ITCS 2011)

Editor
V. E. Muhin
National Technical University of Ukraine
W. B. Hu
Wuhan University
ISBN:
9780791859742
No. of Pages:
656
Publisher:
ASME Press
Publication date:
2011

Named entity identification is a fundamental task in natural language processing. This paper proposes a named entity identification method for Uighur based on the maximum entropy model. A maximum entropy model can make full use of diverse, arbitrary language features. Feature sets used for Chinese and English generally integrate only part of speech, word form, and similar information. Drawing on the characteristics of Uighur, the proposed method segments Uighur words and adds the resulting word stem and affix information to the maximum entropy model as features. Experimental results show that the proposed method significantly improves named entity identification accuracy, recall, and F value.
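As a rough illustration of how stem and affix information can enter a maximum entropy model as features, the following minimal sketch may help. It is not the authors' implementation: the suffix list, the naive suffix-stripping segmenter, the feature names, and the hand-set weights are all assumptions made for this example, and a real system would learn the weights from annotated data and use a proper Uighur morphological segmenter.

```python
import math

# Assumption: a small, incomplete list of Uighur case and plural suffixes
# in Latin script, used only for illustration.
SUFFIXES = ("ning", "din", "tin", "lar", "ler",
            "ni", "ga", "ge", "qa", "ke", "da", "de", "ta", "te")


def split_stem_affix(word):
    """Naive longest-suffix stripping; a stand-in for real segmentation."""
    lowered = word.lower()
    for suffix in sorted(SUFFIXES, key=len, reverse=True):
        # Require at least two characters of stem to remain.
        if lowered.endswith(suffix) and len(lowered) - len(suffix) >= 2:
            return lowered[:-len(suffix)], suffix
    return lowered, ""


def extract_features(tokens, i):
    """Build string-valued features for the token at position i."""
    stem, affix = split_stem_affix(tokens[i])
    prev_word = tokens[i - 1].lower() if i > 0 else "<BOS>"
    return [
        f"word={tokens[i].lower()}",
        f"stem={stem}",
        f"affix={affix}",
        f"prev={prev_word}",
    ]


def maxent_probabilities(features, weights, labels):
    """Compute p(y | x) = exp(sum_k w[(f_k, y)]) / Z(x) for each label y."""
    scores = {y: sum(weights.get((f, y), 0.0) for f in features)
              for y in labels}
    max_score = max(scores.values())  # subtract the max for numerical stability
    exp_scores = {y: math.exp(s - max_score) for y, s in scores.items()}
    z = sum(exp_scores.values())
    return {y: v / z for y, v in exp_scores.items()}


if __name__ == "__main__":
    tokens = ["Men", "Qeshqerge", "bardim"]
    features = extract_features(tokens, 1)
    # Hypothetical hand-set weights; a trained model would estimate these.
    weights = {("stem=qeshqer", "LOC"): 2.0, ("affix=ge", "LOC"): 1.0}
    print(features)
    print(maxent_probabilities(features, weights, ["LOC", "PER", "O"]))
```

Because the stem feature is shared by every inflected form of a name, a weight learned for one form can carry over to forms that never appeared in training, which is one plausible reason stem and affix features could help for a morphologically rich language.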
