Hi Mark,
I am trying to verify the K value found in IBK(crossvalidate = True and KNN set to max (number of instances) is correct.
When using the crossvalidate IBK, I use that on "Use training set" under Classify. Say that IBK optimize KNN gives me a correlation of .5 and optimal K of 10 NN.
Then I try to verify by switching crossvalidate to false and setting KNN to 10. Then I run a LOOCV under "Cross-validation" but get wildly different results from the correlation value found by optimizing on "Use training set".
On a similar experiment using CVparmSelection with IBK. I set K 1 100 100. There are 100 instances. I do the CVparmSelection on "Use training set" with number of folds set to number of instances so it is a LOOCV,
But when I get the K value found by CVParmSelection and then just run plain IBK with that value on "Cross-validation" with a LOOCV, the correlations between the finding the optimum by CVPS and using that optimum are again way different.
What am I misunderstanding?
Thanks,
Mike