5/5 - (1 vote)

Write a program to cluster a set of points using K-means. Consider, K=3, clusters. Consider Euclidean distance as the distance measure. Randomly initialize a cluster mean as one of the data points. Iterate for 10 iterations. After iterations are over, print the final cluster means for each of the clusters.

Use the ground truth cluster label present in the data set to compute and print the Jacquard distance of the obtained clusters with the ground truth clusters for each of the three clusters.

Data Set Description:

Data Filename: data4_19.csv

The data set contains 150 data points, there are three clusters where each cluster refers to a type of iris plant. The first four columns represent the attributes listed below. Note that only the first four columns should be used as attributes. The last column is the ground truth cluster name and is to be used for evaluating the cluster quality.

sepal length in cm
sepal width in cm
petal length in cm
petal width in cm
Ground truth cluster name:

Iris Setosa

Iris Versicolour

Iris Virginica

Reviews

There are no reviews yet.

Only logged in customers who have purchased this product may leave a review.

Whatsapp Us

[Solved] CS60050 Assignment 4- K-Means Clustering

Reviews

Whatsapp Us

[Solved] CS60050 Assignment 4- K-Means Clustering

Reviews

Related products

[Solved] Machine Learning (CS60050) Assignment 3: Clustering

[Solved] Machine Learning (CS60050) Assignment 1: Linear Regression

[Solved] CS60050 Assignment 1- Decision trees

[Solved] CS60050 Assignment3- SVM & ANN

[Solved] Machine Learning (CS60050) Assignment 4: Neural Network

[Solved] CS60050 Assignment 3- Adaboost algorithm