Dina
11-02-2007, 12:46 AM
Hello,
I am new to Weka and so far I can't find an easy solution to my problem: I have a database for movie rental stores, and I'd like to analyze two of its tables:
Table Stores(
Store_id,
StoreType_id)
Table StoresMovies(
Store_id,
Movie_Id)
I have about 10-15 different StoreTypes (e.g. Family Friendly, Adults Oriented, etc.). The database contains about 1,000 Stores and the Movies each one carries. Each store has about 100 Movies, so the StoresMovies table has about 100,000 rows covering 10,000 unique Movies.
Using this as training data, I'd like to predict the StoreType of newly added Stores based on the Movies they have. Some movies are carried by several StoreTypes, while others may point to one specific StoreType (this should be learned from the training data).
I couldn't find a way to feed this data to Weka, as it seems to process only one table. If I had a single table with one row per Store and one Yes/No column per Movie, I would know how to work with it, but building that dataset from my data isn't easy.
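For illustration, the one-table layout described above could probably be built outside Weka with a short script. The following is a minimal Python sketch, not a tested Weka workflow. It assumes the two tables have been exported as lists of (Store_id, StoreType_id) and (Store_id, Movie_Id) pairs, and it writes Weka's sparse ARFF format so that only the movies a store actually has are listed. The function name to_sparse_arff, the movie_ and t prefixes, and the use of numeric 0/1 columns in place of Yes/No are assumptions made for this example.

```python
from collections import defaultdict


def to_sparse_arff(stores, stores_movies, relation="store_types"):
    """Pivot two relational tables into one sparse ARFF string.

    stores:        iterable of (store_id, store_type_id) pairs
    stores_movies: iterable of (store_id, movie_id) pairs
    (Hypothetical helper; the input shapes are assumed exports of the
    Stores and StoresMovies tables.)
    """
    stores = list(stores)
    stores_movies = list(stores_movies)

    # One attribute (column) per unique movie, in a fixed order.
    movie_ids = sorted({m for _, m in stores_movies})
    col = {m: i for i, m in enumerate(movie_ids)}

    # The class attribute (store type) goes last.
    type_ids = sorted({t for _, t in stores})
    class_idx = len(movie_ids)

    # Group movies by store so each store becomes a single row.
    movies_by_store = defaultdict(set)
    for s, m in stores_movies:
        movies_by_store[s].add(m)

    lines = [f"@relation {relation}", ""]
    for m in movie_ids:
        # Numeric 0/1 stands in for No/Yes; omitted sparse entries mean 0.
        lines.append(f"@attribute movie_{m} numeric")
    labels = ",".join(f"t{t}" for t in type_ids)
    lines.append("@attribute store_type {" + labels + "}")
    lines += ["", "@data"]

    for s, t in stores:
        # Sparse rows list "index value" pairs in ascending index order.
        cells = [f"{col[m]} 1" for m in sorted(movies_by_store[s], key=col.get)]
        cells.append(f"{class_idx} t{t}")
        lines.append("{" + ", ".join(cells) + "}")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Tiny made-up example: two stores, three movies.
    demo_stores = [(1, 10), (2, 20)]
    demo_links = [(1, 100), (1, 101), (2, 101), (2, 102)]
    print(to_sparse_arff(demo_stores, demo_links))
```

With the sizes mentioned above, the sparse file would hold roughly 100,000 movie entries instead of the 1,000 x 10,000 = 10,000,000 cells of a fully written-out Yes/No table. For a newly added store, the same movie columns would have to be reused, with the store type left as a missing value (?).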
I would really appreciate any help!