Clustering: Basic Concepts - Major Approaches - 《机器学习》

Based on different approaches we can categorise known clustering algorithms into: 根据不同的方法，我们可以将已知的聚类算法分为

Partitioning approach 分区的方法:
- Construct various partitions and then evaluate them by some criterion, e.g., minimising the sum of square errors
- Typical methods: k-means, k-medoids, CLARANS
Hierarchical approach 分层的方法:
- Create a hierarchical decomposition of the set of data (or objects) using some criterion
- Typical methods: Diana, Agnes, BIRCH, CAMELEON
Density-based approach:
- Based on connectivity and density functions
- Typical methods: DBSCAN (Density-based spatial clustering of applications with noise), OPTICS, DenClue
Grid-based approach:
- based on a multiple-level granularity structure
- Typical methods: STING, WaveCluster, CLIQUE
Model-based:
- A model is hypothesised for each of the clusters and tries to find the best fit of that model to each other
- Typical methods: EM, SOM, COBWEB
Frequent pattern-based:
- Based on the analysis of frequent patterns
- Typical methods: p-Cluster
User-guided or constraint-based:
- Clustering by considering user-specified or application-specific constraints
- Typical methods: COD (obstacles), constrained clustering
Link-based clustering:
- Objects are often linked together in various ways
- Massive links can be used to cluster objects: SimRank, LinkClus