Clustering groups items so that those in the same group (cluster) share meaningful similarities, such as specific features or properties. By surfacing patterns in the data, clustering supports informed decision-making.
Why can clustering data be beneficial?
Because items in the same cluster share meaningful similarities, clustering is a useful tool for uncovering hidden patterns in the data.
Relevance AI's platform provides you with a no-code workflow to cluster your vectorized data with a few clicks. Make sure to follow the vectorize workflow guide if your dataset does not include vectors.
Once you have uploaded and vectorized your data, select your dataset, click Cluster under Workflows, and follow the guide. The image below shows how to cluster a dataset based on the description field using the K-means algorithm.
After running this workflow, clustering results are automatically added to your dataset under a new field (_cluster_.description_mpnet_vector_.kmeans-10 for the example used in this guide). Check the results under Dataset -> Monitor -> Clusters.
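For readers curious about what the workflow does conceptually, the following is a minimal sketch of K-means in plain Python. It is not the platform's implementation. The toy documents, their two-dimensional vectors, the `cluster-N` label format, and the nested layout of the result field are all assumptions made for illustration. The layout simply mirrors the field name above, with `kmeans-2` in place of `kmeans-10` because the toy dataset has only four items.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean_vector(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(component) / n for component in zip(*vectors)]


def kmeans(vectors, k, iterations=20, seed=0):
    """Basic K-means (Lloyd's algorithm); returns one cluster index per vector."""
    rng = random.Random(seed)
    # Start from k distinct input vectors chosen at random.
    centroids = [list(v) for v in rng.sample(vectors, k)]
    labels = [0] * len(vectors)
    for _ in range(iterations):
        # Assignment step: attach each vector to its nearest centroid.
        labels = [
            min(range(k), key=lambda c: squared_distance(v, centroids[c]))
            for v in vectors
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [v for v, label in zip(vectors, labels) if label == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = mean_vector(members)
    return labels


# Hypothetical, hand-made vectors standing in for real description embeddings.
documents = [
    {"description": "red running shoes", "description_mpnet_vector_": [0.9, 0.1]},
    {"description": "blue trail sneakers", "description_mpnet_vector_": [0.8, 0.2]},
    {"description": "steel frying pan", "description_mpnet_vector_": [0.1, 0.9]},
    {"description": "cast iron skillet", "description_mpnet_vector_": [0.2, 0.8]},
]

vector_field = "description_mpnet_vector_"
k = 2
labels = kmeans([doc[vector_field] for doc in documents], k)

# Assumed layout: _cluster_ -> <vector field> -> <algorithm>-<k>
alias = f"kmeans-{k}"
for doc, label in zip(documents, labels):
    doc.setdefault("_cluster_", {}).setdefault(vector_field, {})[alias] = f"cluster-{label}"

for doc in documents:
    print(doc["description"], "->", doc["_cluster_"][vector_field][alias])
```

On real data the vectors would have hundreds of dimensions, and the number of clusters would be chosen in the workflow (10 in the example used in this guide).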
We will learn about subclustering on the next page.