Exploiting data topology in visualization and clustering of self-organizing maps
Loading...

Date
2009
Authors
Kadim TaÅŸdemir
Erzsébet Merényi
Journal Title
Journal ISSN
Volume Title
Publisher
Open Access Color
Green Open Access
Yes
OpenAIRE Downloads
OpenAIRE Views
Publicly Funded
No
Abstract
The self-organizing map (SOM) is a powerful method for visualization cluster extraction and data mining. It has been used successfully for data of high dimensionality and complexity where traditional methods may often be insufficient. In order to analyze data structure and capture cluster boundaries from the SOM one common approach is to represent the SOM's knowledge by visualization methods. Different aspects of the information learned by the SOM are presented by existing methods but data topology which is present in the SOM's knowledge is greatly underutilized. We show in this paper that data topology can be integrated into the visualization of the SOM and thereby provide a more elaborate view of the cluster structure than existing schemes. We achieve this by introducing a weighted Delaunay triangulation (a connectivity matrix) and draping it over the SOM. This new visualization CONNvis also shows both forward and backward topology violations along with the severity of forward ones which indicate the quality of the SOM learning and the data complexity. CONNvis greatly assists in detailed identification of cluster boundaries. We demonstrate the capabilities on synthetic data sets and on a real 8-D remote sensing spectral image. © 2009 IEEE. © 2009 Elsevier B.V. All rights reserved.
Description
Keywords
Clustering, Data Mining, Self-organizing Map (som), Topology Preservation, Visualization, Cluster Boundaries, Cluster Structures, Clustering, Connectivity Matrixes, Data Complexity, Delaunay Triangulations, Existing Methods, High Dimensionalities, Self-organizing Map (som), Spectral Images, Synthetic Data Sets, Topology Preservation, Visualization Methods, Cluster Analysis, Conformal Mapping, Data Structures, Education, Knowledge Management, Mining, Remote Sensing, Self Organizing Maps, Strength Of Materials, Topology, Visualization, Data Visualization, Cluster boundaries, Cluster structures, Clustering, Connectivity matrixes, Data complexity, Delaunay triangulations, Existing methods, High dimensionalities, Self-organizing map (SOM), Spectral images, Synthetic data sets, Topology preservation, Visualization methods, Cluster analysis, Conformal mapping, Data structures, Education, Knowledge management, Mining, Remote sensing, Self organizing maps, Strength of materials, Topology, Visualization, Data visualization
Fields of Science
0209 industrial biotechnology, 0202 electrical engineering, electronic engineering, information engineering, 02 engineering and technology
Citation
WoS Q
Scopus Q

OpenCitations Citation Count
140
Source
IEEE Transactions on Neural Networks
Volume
20
Issue
Start Page
549
End Page
562
Collections
PlumX Metrics
Citations
CrossRef : 123
Scopus : 162
PubMed : 9
Captures
Mendeley Readers : 89
Google Scholar™


