PDA

View Full Version : add cluster with weka



francy_faraone
10-16-2007, 08:02 AM
Hallo.
I have a problem: after running the clustering algorithm SimpleKmean, I want to add the cluster attribute to the initial data set. The problem is that there isn't correspondence between the results with SimpleKMean and the results after applying the unsupervised attribute filter "addcluster". I show the results I refer to, below:


Clustered Instances from SimpleKMean:
0 108
1 79
2 71
3 76
4 149
(total 483)


Clustered instances after applying the filter:
cluster1 86
cluster2 79
cluster3 93
cluster4 83
cluster5 142
(total 483)

I have used the same settings both for the clustering algorithm and for the filter, but it seems that the distribution of the istances is different. Why? Which result do I have to consider?

Kind regards.

Maria Francesca.

Mark
10-16-2007, 07:32 PM
Hi there,

I can't seem to reproduce the problem using the current development version of Weka (3.5.6). I've run SimpleKMeans on the iris data (with class removed) generating 5 clusters from the Explorer, the command line, and the AddCluster filter. All three give me the same distribution of instances into clusters. What version of Weka are you using?

Cheers,
Mark.

francy_faraone
10-17-2007, 10:04 AM
My version of Weka is 3.5.2. Do you think is better for me to download your version?

Thank you very much.

Mark
10-17-2007, 05:34 PM
I've re-done my experiment using 3.5.2 and still don't have a problem (I also tried 5 clusters on the glass data set as well). Is there any chance you can send me your data set?

Cheers,
Mark.

francy_faraone
10-18-2007, 04:33 AM
Dear, I tried again and now it works. Thank you very much!! Anyway I'll download the new version of Weka.