In the next chapters, we'll show how to i) choose the appropriate clustering algorithm for your data; and ii) compute p-values for hierarchical clustering.

Recall that the goal of partitioning clustering algorithms (Part @ref(partitioning-clustering)) is to split the data set into clusters of objects. In this section, we’ll describe two commonly used indices for assessing the goodness of clustering: the silhouette width and the Dunn index.

These internal measures can also be used to determine the optimal number of clusters in the data.
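
To make the two indices concrete, here is a minimal sketch in plain Python (standard library only). It assumes Euclidean distance, the usual definition of the silhouette width s(i) = (b - a) / max(a, b), and the simplest form of the Dunn index (smallest between-cluster distance divided by the largest cluster diameter). The function names, the toy `points`, and the hand-assigned `candidates` partitions are hypothetical and only serve the illustration; in practice the labels would come from running a clustering algorithm for each candidate number of clusters.

```python
from math import dist  # Euclidean distance (Python 3.8+)


def silhouette_width(points, labels):
    """Average silhouette width; assumes at least two clusters."""
    clusters = set(labels)
    total = 0.0
    for i, p in enumerate(points):
        # Mean distance from p to the other members of each cluster.
        mean_dist = {}
        for c in clusters:
            others = [q for j, q in enumerate(points)
                      if labels[j] == c and j != i]
            if others:
                mean_dist[c] = sum(dist(p, q) for q in others) / len(others)
        own = labels[i]
        if own not in mean_dist:
            continue  # singleton cluster: s(i) is taken to be 0
        a = mean_dist[own]                                      # cohesion
        b = min(d for c, d in mean_dist.items() if c != own)    # separation
        if max(a, b) > 0:
            total += (b - a) / max(a, b)
    return total / len(points)


def dunn_index(points, labels):
    """Min between-cluster distance / max within-cluster diameter.

    Assumes at least one cluster has two distinct points.
    """
    min_separation = float("inf")
    max_diameter = 0.0
    for i in range(len(points)):
        for j in range(i + 1, len(points)):
            d = dist(points[i], points[j])
            if labels[i] == labels[j]:
                max_diameter = max(max_diameter, d)
            else:
                min_separation = min(min_separation, d)
    return min_separation / max_diameter


# Toy data (hypothetical): two visibly separated groups in the plane.
points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]

# Hand-assigned partitions standing in for clustering results at k = 2 and k = 3.
candidates = {
    2: [0, 0, 0, 1, 1, 1],
    3: [0, 0, 0, 1, 1, 2],
}

for k, labels in candidates.items():
    print(k, silhouette_width(points, labels), dunn_index(points, labels))

# Larger values are better for both indices, so pick the k that maximizes them.
best_k_silhouette = max(candidates, key=lambda k: silhouette_width(points, candidates[k]))
best_k_dunn = max(candidates, key=lambda k: dunn_index(points, candidates[k]))
```

Both indices reward compact, well-separated clusters, so on this toy data they agree on the same number of clusters. On real data they can disagree, and the pairwise loops above scale quadratically with the number of observations, so this sketch is meant for understanding the definitions rather than for large data sets.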

is a standard tool in analytics and is an important feature for helping you develop and fine-tune data mining models.