eliza.mmatei

03-26-2007, 05:13 AM

Hello everyone!

I'm a student from Romania, and I'm writing an application in C# for my diploma that includes the k-means algorithm.

I'm stuck on something, and I would very much appreciate it if you could help me.

I want to apply clustering to survey data to classify people or products. However, the variables I want to run the k-means algorithm on are nominal (like education or favourite color), ordinal, and binary (sex: M/F).

In this case, which distance formula can I use? Or should I transform the data into numeric data first, and if so, how do I do that? All the examples and comments I found about the k-means algorithm were about numeric data, and I understand how those work, but what about mixed data?

Thank you very much,

Eliza :)

