kedar mehta
04-04-2009, 08:01 PM
Hello Folks,
I have been trying to create a job – to implement usecase, but I was not able to find appropriate tools.
I have kettle 3.0 (finally, We upgraded!). I am using spoon/ job. I have implemented similar thing using MS SQL server BI studio. Not sure if I can do this with kettle job.
Usecase is like,
File keeps arriving on the same location. I need to archive them all. The files will be mostly csv files & o/s is windows. Based on their creation date.
The job should initiate when -
Free space on that particular windows driver gets decrease then 20 Gigs
(if this was Unix, I could have done it easily. By using Unix job or autosys stuff)
Every time it should keep zipping file based on it’s creation date. It should zip all the files , but it shoudn’t compress file which arrived in last 30 days. There should be a filter which should send a mail Based on status of compression – if it happens.
Start point à keep checking file space usage. When it crosses 20 Gigs, initialize ().
(Alternate – run this job once in a week)
Do
Find files older then 30 days à Zip themà if success then do a mail with that status.
à if failure then mail with failure logs.
I had hard time in finding conditional filters & file creation time finding methods in job. Please guide me further.
If I cant get files, based on their creation time, then I might have to use some other stuff. I don’t wanna complicate this simple task unnecessarily.
I have attached a row skeleton of job which I have created.
Thanks in ad !
I have been trying to create a job – to implement usecase, but I was not able to find appropriate tools.
I have kettle 3.0 (finally, We upgraded!). I am using spoon/ job. I have implemented similar thing using MS SQL server BI studio. Not sure if I can do this with kettle job.
Usecase is like,
File keeps arriving on the same location. I need to archive them all. The files will be mostly csv files & o/s is windows. Based on their creation date.
The job should initiate when -
Free space on that particular windows driver gets decrease then 20 Gigs
(if this was Unix, I could have done it easily. By using Unix job or autosys stuff)
Every time it should keep zipping file based on it’s creation date. It should zip all the files , but it shoudn’t compress file which arrived in last 30 days. There should be a filter which should send a mail Based on status of compression – if it happens.
Start point à keep checking file space usage. When it crosses 20 Gigs, initialize ().
(Alternate – run this job once in a week)
Do
Find files older then 30 days à Zip themà if success then do a mail with that status.
à if failure then mail with failure logs.
I had hard time in finding conditional filters & file creation time finding methods in job. Please guide me further.
If I cant get files, based on their creation time, then I might have to use some other stuff. I don’t wanna complicate this simple task unnecessarily.
I have attached a row skeleton of job which I have created.
Thanks in ad !