marc_swingler
04-05-2006, 07:09 AM
Attachment: spoon.zip (http://forums.pentaho.org/archived_att/kettle/spoon.zip) Matt,
I have a table, CAR_PKT_GEOM in one db which contains ~450K rows. Eventually I want to transform this data before moving it to a second DB. For now though I just want to try moving it from one place to another using a straight Table Input and Table Output. If I limit the number of rows to 50K being read from the origin DB everything goes great. (takes about 70sec) If I limit the number of rows to 100K it slows a little, but its not bad. (3.5min) If I limit the number of rows to 200K it slows a more. (11min) Limiting the query to 400K or setting the Limit size to 0 causes the process to grind to a halt. (I get an IO Exception after 8hrs). I don't want to partition the table in the target DB, so how do I speed the process up? I'm considering just grabbing 50K rows at a time and looping unitl finished, but I'm not sure it's the best solution. Any help appreciated. (ktr and log file attached)
Marc
I have a table, CAR_PKT_GEOM in one db which contains ~450K rows. Eventually I want to transform this data before moving it to a second DB. For now though I just want to try moving it from one place to another using a straight Table Input and Table Output. If I limit the number of rows to 50K being read from the origin DB everything goes great. (takes about 70sec) If I limit the number of rows to 100K it slows a little, but its not bad. (3.5min) If I limit the number of rows to 200K it slows a more. (11min) Limiting the query to 400K or setting the Limit size to 0 causes the process to grind to a halt. (I get an IO Exception after 8hrs). I don't want to partition the table in the target DB, so how do I speed the process up? I'm considering just grabbing 50K rows at a time and looping unitl finished, but I'm not sure it's the best solution. Any help appreciated. (ktr and log file attached)
Marc