Intelligent Engineering Systems through Artificial Neural Networks Volume 18
66 Topographic Processing of Very Large Text Datasets
Download citation file:
A common challenge today, arising especially in the field of text and web mining, is the proper visualization of very large datasets to explore their structure and gain information that otherwise would remain concealed, buried due to the sheer amount of data. These large non-Euclidean datasets cannot be held at once within random-access memory during computation, so fast batch variants of common mapping methods like Self-Organizing Maps (SOM) or Neural Gas (NG) cannot be applied. In this work, we present fast approximate (semi-)supervised versions of SOM and NG for non-Euclidean data that are able to handle large text datasets by...