A parameter-free similarity graph for spectral clustering


Inkaya T.

EXPERT SYSTEMS WITH APPLICATIONS, vol.42, no.24, pp.9489-9498, 2015 (SCI-Expanded)

  • Publication Type: Article / Full Article
  • Volume: 42, Issue: 24
  • Publication Date: 2015
  • DOI: 10.1016/j.eswa.2015.07.074
  • Journal Name: EXPERT SYSTEMS WITH APPLICATIONS
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus
  • Page Numbers: pp.9489-9498
  • Keywords: Spectral clustering, Similarity graph, k-nearest neighbor, epsilon-neighborhood, Fully connected graph, CONSTRUCTION, DENSITY
  • Affiliated with Bursa Uludağ University: Yes

Abstract

Spectral clustering is a popular clustering method due to its simplicity and superior performance on data sets with non-convex clusters. The method is based on the spectral analysis of a similarity graph. Previous studies show that clustering results are sensitive to the selection of the similarity graph and its parameter(s). In particular, when data sets contain arbitrarily shaped clusters with varying densities, it is difficult to determine the proper similarity graph and its parameters without a priori information. To address this issue, we propose a parameter-free similarity graph, namely Density Adaptive Neighborhood (DAN). DAN combines distance, density and connectivity information, and it reflects the local characteristics of the data. We test the performance of DAN with a comprehensive experimental study. We compare k-nearest neighbor (KNN), mutual KNN, epsilon-neighborhood, fully connected graph, minimum spanning tree, Gabriel graph, and DAN in terms of clustering accuracy. We also examine the robustness of DAN to the number of attributes and to transformations such as decimation and distortion. Our experimental study with various artificial and real data sets shows that DAN improves the spectral clustering results, and it is superior to the competing approaches. Moreover, it facilitates the application of spectral clustering to various domains without a priori information. © 2015 Elsevier Ltd. All rights reserved.
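
To make the parameter sensitivity mentioned in the abstract concrete, the following minimal Python sketch builds three of the baseline similarity graphs named there (KNN, mutual KNN, and epsilon-neighborhood) on a toy data set with one dense and one sparse group. It does not implement DAN, whose construction is not described in this record. The function names, the toy points, and the values of `k` and `eps` are illustrative assumptions only.

```python
import math


def _knn_sets(points, k):
    """For each point, return the set of indices of its k nearest neighbors."""
    n = len(points)
    sets = []
    for i in range(n):
        others = [j for j in range(n) if j != i]
        others.sort(key=lambda j: math.dist(points[i], points[j]))
        sets.append(set(others[:k]))
    return sets


def knn_graph(points, k):
    """KNN graph: edge (i, j) if either point lists the other among its k nearest."""
    nn = _knn_sets(points, k)
    n = len(points)
    return {(i, j) for i in range(n) for j in range(i + 1, n)
            if j in nn[i] or i in nn[j]}


def mutual_knn_graph(points, k):
    """Mutual KNN graph: edge (i, j) only if both points list each other."""
    nn = _knn_sets(points, k)
    n = len(points)
    return {(i, j) for i in range(n) for j in range(i + 1, n)
            if j in nn[i] and i in nn[j]}


def epsilon_graph(points, eps):
    """Epsilon-neighborhood graph: edge (i, j) if the distance is at most eps."""
    n = len(points)
    return {(i, j) for i in range(n) for j in range(i + 1, n)
            if math.dist(points[i], points[j]) <= eps}


# Hypothetical toy data: a dense group (indices 0-2) and a sparse group (3-5).
points = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1),
          (5.0, 5.0), (6.0, 5.0), (5.0, 6.0)]

# A small eps connects only the dense group; the sparse points stay isolated.
small_eps_edges = epsilon_graph(points, eps=0.2)
# A larger eps connects both groups internally without linking them.
large_eps_edges = epsilon_graph(points, eps=1.5)
# With k=2 each point's neighbors lie inside its own group.
knn_edges = knn_graph(points, k=2)
# With k=3 every point must reach into the other group, creating cross edges.
knn_edges_k3 = knn_graph(points, k=3)
mutual_edges_k3 = mutual_knn_graph(points, k=3)
```

On this toy example, a single `eps` that suits the dense group leaves the sparse group disconnected, and raising `k` from 2 to 3 introduces edges between the two groups. This is the kind of parameter dependence that motivates a parameter-free graph.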