Hi,
I'm doing document classification. I'd like to perform analysis on feature selection methods using either the explorer or experiment environment. So to avoid overly-optimistic results I've manually divided my dataset to training and testing sets. I used meta classifier to apply StringToWordVector on both sets without any incompatibility issues. However, now I don't know how to apply the feature selection (Ranker + InfGain/chiSquare) on the training set ONLY.
If anyone can tell me where or how to do multiple feature selection methods with different number of selected attribute to produce a graph that will be great.
Another thing What is the difference between NaiveBayesMultinomial and NaiveBayesMultinomialUpdatable?
thanks in advance...


Reply With Quote
