Hitachi Vantara Pentaho Community Forums
Thread: How to do data mining in real life with Weka?

  1. #1
    Join Date
    May 2017

    How to do data mining in real life with Weka?

    I know how to use training and test files.

    I do classification with Random Forest.

    I need to know which file format to use for instances that have no class label, how to load them into Weka, and how to classify each instance.

  2. #2
    Join Date
    Aug 2006


    For new instances without class labels, you need to create an ARFF file with exactly the same structure as the training data: the same number of attributes, attribute names, attribute order, and declared nominal values. The class value can be set to missing, i.e., '?' (without the quotes), in each instance. You can then output predictions in the Explorer, use the AddClassification filter to append predictions, or use a Knowledge Flow process with a PredictionAppender component. Note that evaluation is not possible in this scenario, because there are no "ground truth" labels to compare the predictions against.
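
As a rough illustration of the file layout described above, here is a minimal Java sketch that copies the header of a training ARFF file verbatim and writes each new instance with '?' in the class position. It uses only the standard library, not the Weka API. The class name `UnlabeledArff`, the helper names, and the tiny weather-style data are made up for illustration, and the sketch assumes the class is the last attribute and that each new row is already a comma-separated list of the non-class values.

```java
import java.util.List;

public class UnlabeledArff {

    // Returns the header of an ARFF document: every line up to and including
    // the @data line, copied verbatim so the structure matches the training file.
    static String headerOf(String trainingArff) {
        StringBuilder header = new StringBuilder();
        for (String line : trainingArff.split("\\R")) {
            header.append(line).append('\n');
            if (line.trim().equalsIgnoreCase("@data")) {
                return header.toString();
            }
        }
        throw new IllegalArgumentException("no @data line found");
    }

    // Builds an ARFF document for unlabeled instances.
    // Assumption: the class attribute is the LAST attribute, so the missing
    // class value '?' is appended at the end of every row.
    static String build(String trainingArff, List<String> featureRows) {
        StringBuilder out = new StringBuilder(headerOf(trainingArff));
        for (String row : featureRows) {
            out.append(row).append(",?").append('\n');
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // Hypothetical training file; only its header is reused.
        String training = String.join("\n",
            "@relation weather",
            "@attribute outlook {sunny,overcast,rainy}",
            "@attribute temperature numeric",
            "@attribute play {yes,no}",
            "@data",
            "sunny,85,no",
            "overcast,83,yes");

        // New instances without a class label.
        System.out.print(build(training, List.of("rainy,70", "sunny,64")));
    }
}
```

Running `main` prints the original header followed by the rows `rainy,70,?` and `sunny,64,?`. A file in this shape is what the Explorer, the AddClassification filter, or a PredictionAppender would then be pointed at to obtain predictions.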



Copyright © 2005 - 2019 Hitachi Vantara Corporation. All Rights Reserved.