Skip to Main Content
ASME Press Select Proceedings

International Conference on Measurement and Control Engineering 2nd (ICMCE 2011)

Yi Xie
Yi Xie
Search for other works by this author on:
No. of Pages:
ASME Press
Publication date:

This paper proposes a new method to perform preprocessing in web usage mining. The data used for this experiment is web server logs from an online newspaper in Malaysia. The preprocessing stage consists of data cleaning and user identification. In this project, Python 2.6 is used as the main language to perform the data cleaning operations. Detailed explanation on data cleaning is illustrated, as well as the steps taken to conduct user identification. The results of data cleaning and user identification based on our experiment are also discussed. The output of this study is a log file which has been cleaned, and can be used in the next stage of web usage mining; which is pattern discovery.

This content is only available via PDF.
Close Modal
This Feature Is Available To Subscribers Only

Sign In or Create an Account

Close Modal
Close Modal