Quote: Original post by Timkin
Kirk, did you actually read that sentence once you'd written it?
Well, yes. Consider:
Quote: Original post by anda
This is my problem:
Say I have a list of 20 people, who I have to split up into 5 groups of 4.
Each person has 5 integer attributes (labelled eg 0,1,2,3,4)
I want to form the 5 groups where for each of those attributes the average is pretty close compared to the other groups' averages of each of those attributes.
eg
Group 1 (avg for attribute1 is 4, avg for attribute2 is 3, etc)
Group 2 (avg for attribute1 is 4.5, avg for attribute2 is 2.8, etc)
etc
Quote: Original post by Timkin
you're dictating that you want 5 classes based on 5 attributes... and you want to minimise the variance in each attribute within a class (find things that are most alike).
From my reading of the OP's post, this approach does not agree with the original intent. Perhaps I'm reading that incorrectly - could you explain, please?
Then:
Quote: Original post by Timkin
The definition of a cluster is algorithm independent. I can choose to cluster based on single attributes, or any function of any number of attributes. There does NOT have to be a 1-1 linear mapping from attributes to classes. You have made the mistake of assuming that each object (person in the OPs example) has to exist in only one class. I could just as easily cluster in any functional space defined by any mapping of the attribute/object space.
That is true, but you've already stated in your first post:
Quote: Original post by Timkin
you're dictating that you want 5 classes based on 5 attributes... and you want to minimise the variance in each attribute within a class (find things that are most alike).
Which is exactly what I said:
Quote: Original post by KirkD
Clustering will group individuals with similar attribute values