03-04-2006, 01:21 AM
Does anyone know where I could get my hands on a large sample database for doing some real testing? To do anything useful with something like WEKA you need a lot of data. I'm after a simple schema (no more than 30 tables) but lots of data. An export from an open source bug database, maybe?


03-06-2006, 02:03 PM
You may want to try the US Census data available at http://www.census.gov/Press-Release/www/2002/demoprofiles.html. The data is provided as CSV files, but you should be able to import them into your RDBMS with its usual import tools.
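
If your RDBMS has no convenient bulk loader, a short script can do the import. Here is a rough sketch in Python using the standard csv and sqlite3 modules. It assumes SQLite as the target, a CSV whose first row is a header, and every column stored as text; the load_csv function, the "demo" table, and the two sample rows are made up for illustration and are not real census data.

```python
import csv
import io
import sqlite3


def load_csv(conn, table, csv_file):
    """Create `table` from the CSV header row, insert the data rows, return the row count."""
    reader = csv.reader(csv_file)
    header = next(reader)

    def quote(name):
        # Quote identifiers so headers with spaces or punctuation still work.
        return '"{}"'.format(name.replace('"', '""'))

    columns = ", ".join("{} TEXT".format(quote(h)) for h in header)
    placeholders = ", ".join("?" for _ in header)
    conn.execute("CREATE TABLE {} ({})".format(quote(table), columns))

    # Skip rows whose field count does not match the header (blank or ragged lines).
    rows = (row for row in reader if len(row) == len(header))
    conn.executemany(
        "INSERT INTO {} VALUES ({})".format(quote(table), placeholders), rows
    )
    conn.commit()
    return conn.execute("SELECT COUNT(*) FROM {}".format(quote(table))).fetchone()[0]


# Hypothetical stand-in for a downloaded CSV file.
sample = io.StringIO("state,population\nAlpha,100\nBeta,250\n")
conn = sqlite3.connect(":memory:")
n = load_csv(conn, "demo", sample)
```

For a real file, pass open("yourfile.csv", newline="") instead of the StringIO object and connect to a database file instead of ":memory:". Once the data is in, you can cast the columns to proper types with ordinary SQL.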