Publication

Soft Cluster Ensembles

Source:

Advances in Fuzzy Clustering and Its Applications, Wiley (2007)

Abstract:

Cluster Ensembles is a framework for combining multiple partitionings obtained from separate clustering runs into a final consensus clustering. This framework has attracted much interest recently because of its numerous practical applications, and a variety of approaches including Graph Partitioning, Maximum Likelihood, Genetic algorithms, and Voting-Merging have been proposed. The vast majority of these approaches accept hard clusterings as input. There are, however, many clustering algorithms such as EM and fuzzy c-means that naturally output soft partitionings of data, and forcibly hardening these partitions before obtaining a consensus potentially involves loss of valuable information. In this chapter we propose several consensus algorithms that accept soft clusterings and experiment over many real-life datasets to show, both conceptually and empirically, that using soft clusterings as input does offer significant advantages, especially when dealing with vertically partitioned data.