SmallSteps : an adaptive distance-based clustering algorithm

  • Gy. Koch
  • József Dombi

Abstract

In this article we propose a new distance-based clustering algorithm. Distance-based clustering methods operate on data sets in similarity space, where the similarities or dissimilarities between the objects are given by a matrix. These algorithms have at least O(n²) time complexity, where n is the number of objects. One of the latest distance-based methods is Chameleon, which in practice works well only on larger data sets and fails on relatively small ones. This is at odds with the fact that O(n²) time complexity makes distance-based algorithms unsuitable for huge data sets. We therefore developed a new distance-based method (SmallSteps) that can also handle a relatively small number of objects. In our solution we look for connected graphs whose edges have a maximum weight computed on the environments of the objects. The method is capable of detecting clusters of different shapes, sizes or densities. It can automatically determine the number of clusters, and it has a special ability to divide clusters into subclusters.
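The abstract does not give the details of SmallSteps, so the following is only a minimal illustration of the general idea it names: building a graph from a distance matrix with an edge threshold that depends on each object's local environment, then reading clusters off as connected components. It is not the authors' algorithm. The function name `adaptive_components`, the parameters `k` and `alpha`, and the use of the k-th nearest-neighbour distance as the "environment" are all assumptions made for this sketch.

```python
from typing import List, Sequence


def adaptive_components(
    dist: Sequence[Sequence[float]], k: int = 2, alpha: float = 1.5
) -> List[int]:
    """Label objects by connected components of a locally thresholded graph.

    Hypothetical sketch: `k` and `alpha` are illustrative parameters,
    not taken from the SmallSteps paper.
    """
    n = len(dist)

    # Assumed notion of "environment": the distance from each object
    # to its k-th nearest neighbour serves as a local scale.
    scale = []
    for i in range(n):
        others = sorted(dist[i][j] for j in range(n) if j != i)
        scale.append(others[min(k, len(others)) - 1] if others else 0.0)

    # Union-find over the objects.
    parent = list(range(n))

    def find(x: int) -> int:
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    # One pass over all pairs, hence O(n^2) work on the distance matrix.
    # An edge is kept only if it is short relative to both local scales.
    for i in range(n):
        for j in range(i + 1, n):
            if dist[i][j] <= alpha * min(scale[i], scale[j]):
                parent[find(i)] = find(j)

    # Relabel component roots as 0, 1, 2, ... in order of first appearance.
    labels: dict = {}
    return [labels.setdefault(find(i), len(labels)) for i in range(n)]


if __name__ == "__main__":
    points = [0.0, 1.0, 2.0, 10.0, 11.0, 12.0]
    matrix = [[abs(a - b) for b in points] for a in points]
    print(adaptive_components(matrix))
```

In this sketch the number of clusters is never passed in; it falls out of the number of connected components. Because the threshold scales with the local neighbour distances, a sparse group and a dense group can both stay connected under the same `alpha`.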

Published
2001-01-01
How to Cite
Koch, G., & Dombi, J. (2001). SmallSteps : an adaptive distance-based clustering algorithm. Acta Cybernetica, 15(2), 241-256. Retrieved from https://cyber.bibl.u-szeged.hu/index.php/actcybern/article/view/3577
Section
Regular articles