kettle_anonymous
07-28-2006, 03:08 AM
I'm developing a datawarehouse and I'd like to use kettle in order to populate my tables.
I have lange cvs files (30/40MB) with a date attribute. I have tryed to configure spoon for my transformations and now I'm able to populate my db (with some problems).
I have some question about:
1) First. I have never worked with datawarehouse and I have some problems with the "key mappings". In the source database I have some keys in my datawarehouse another keys (ID int4). How can I define my transformations in order to insert/update correctly my data? Where can I find good documentation about this problem?
2) if, in my source database, I have for example 2 fields Field1 (string) and Field2 (string) can I create a Field3 with a concatenation operation:
example: Field1 = cat Field2 = dog --> Field3 = catdog
3) My csv files are very large and incremental files. If I have already inserted the data up to 2006/05/01, I'd like to read only the data with date > 2006/05/01. How can I do this with a csv input file?
Thanks in advance
I have lange cvs files (30/40MB) with a date attribute. I have tryed to configure spoon for my transformations and now I'm able to populate my db (with some problems).
I have some question about:
1) First. I have never worked with datawarehouse and I have some problems with the "key mappings". In the source database I have some keys in my datawarehouse another keys (ID int4). How can I define my transformations in order to insert/update correctly my data? Where can I find good documentation about this problem?
2) if, in my source database, I have for example 2 fields Field1 (string) and Field2 (string) can I create a Field3 with a concatenation operation:
example: Field1 = cat Field2 = dog --> Field3 = catdog
3) My csv files are very large and incremental files. If I have already inserted the data up to 2006/05/01, I'd like to read only the data with date > 2006/05/01. How can I do this with a csv input file?
Thanks in advance