ASME Press Select Proceedings

International Conference on Instrumentation, Measurement, Circuits and Systems (ICIMCS 2011)

Chen Ming
ASME Press

Automatic text classification is the task of assigning unseen documents to a predefined set of classes or categories. Text representation for classification has traditionally relied on tf.idf because of its simplicity and good performance. Multi-label text classification has traditionally been tackled in the literature either by transforming the problem so that binary techniques apply or by adapting binary algorithms to work with multiple labels. We present tf.rrfl, a novel text representation for multi-label classification. Our proposal modifies the data set given as input to the algorithm, producing a different input for each label being evaluated. We tested tf.rrfl on a known benchmark and compared it with alternative techniques. The results show an improvement over the alternative approaches in terms of Hamming loss.
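For readers unfamiliar with the evaluation metric named above, the following is a minimal sketch of Hamming loss as it is commonly defined for multi-label classification: the fraction of document-label pairs on which the predicted label indicator differs from the true one. It is not taken from the paper. The function name `hamming_loss`, the 0/1 indicator encoding, and the toy data are assumptions made for illustration only.

```python
from typing import Sequence


def hamming_loss(y_true: Sequence[Sequence[int]],
                 y_pred: Sequence[Sequence[int]]) -> float:
    """Fraction of (document, label) slots where prediction and truth differ.

    Each row is one document; each column is a 0/1 indicator for one label.
    This encoding is an assumption for the sketch, not the paper's format.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same number of rows")

    mismatches = 0
    total = 0
    for true_row, pred_row in zip(y_true, y_pred):
        if len(true_row) != len(pred_row):
            raise ValueError("each row pair must have the same number of labels")
        total += len(true_row)
        # Count label slots where the prediction disagrees with the truth.
        mismatches += sum(1 for t, p in zip(true_row, pred_row) if t != p)

    if total == 0:
        raise ValueError("no labels to evaluate")
    return mismatches / total


# Toy data (hypothetical): 2 documents, 3 labels each.
y_true = [[1, 0, 1],
          [0, 1, 0]]
y_pred = [[1, 1, 1],
          [0, 1, 1]]
loss = hamming_loss(y_true, y_pred)  # 2 mismatched slots out of 6
```

Lower values are better, and a value of 0 means every label of every document was predicted correctly.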
