Hitachi Vantara Pentaho Community Forums
Results 1 to 2 of 2

Thread: Compare data set with target from other spreadsheet

  1. #1
    Join Date
    May 2016
    Posts
    1

    Default Compare data set with target from other spreadsheet

    I am brand-spanking new to Weka and really need help with this:
    I have these two spreadsheets of data:
    Spreadsheet A contains a list of the thousands of types of birds found in North America, each with an ID and attributes (beak, wing shape, etc.) about them.
    Spreadsheet B contains 2015's recorded sightings from a national birding organization. Whenever a sighting is made, the bird's ID is recorded along with information about the sighting.
    Now, Weka will let me use the Explorer to make numerical and visual comparisons such as "beak color vs. average weight" by opening spreadsheet A.
    But how can I make the comparison "beak color vs. 2015's sightings"? That is, analysis on the percentage of sightings with an orange beak, yellow beak, etc.
    To do this, I need both the data from spreadsheet A and spreadsheet B, but I cannot find an efficient way to do this.
    Note: Alternatively, please feel free to suggest another tool for this.
    Last edited by neno; 06-01-2016 at 10:57 AM. Reason: it duplicated my post?

  2. #2
    Join Date
    Aug 2006
    Posts
    1,741

    Default

    It doesn't sound like you need machine learning for this. I would say you'd want both spreadsheets loaded into a relational database so that you can join the two tables on the ID field, and perhaps group by ID in the sightings table in order to count sightings by ID.

    Cheers,
    Mark.

Tags for this Thread

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  
Privacy Policy | Legal Notices | Safe Harbor Privacy Policy

Copyright © 2005 - 2019 Hitachi Vantara Corporation. All Rights Reserved.