getmerge for PDI



tlynchpin
01-29-2013, 01:54 PM
Often with Pentaho MapReduce I need to concatenate the part-0000 files in an HDFS path, the way "hadoop fs -getmerge" does. I currently do this manually with TextFileInput, but often I don't need to process the data in the files; I just want to write them to a single file, like getmerge. I couldn't figure out how to do this with the "Hadoop Copy Files" job entry.
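
For reference, here is roughly what the command-line equivalent looks like outside of PDI. This is only a sketch: the function names are made up for illustration, the paths passed in are placeholders, and it assumes the `hadoop` client is on the PATH. It is not a PDI job entry, just the behaviour I'd like to reproduce.

```shell
#!/bin/sh
# Sketch only: function names and any paths passed in are hypothetical.

# Merge every file under an HDFS directory into one LOCAL file.
# Note that -getmerge writes to the local file system, not back to HDFS.
merge_parts_to_local() {
  hadoop fs -getmerge "$1" "$2"
}

# Merge the part-* files into a single file that stays in HDFS.
# The quoted glob is handed to hadoop, which expands it against HDFS;
# "-put -" reads the concatenated stream from stdin.
merge_parts_in_hdfs() {
  hadoop fs -cat "$1/part-*" | hadoop fs -put - "$2"
}
```

Usage would be something like `merge_parts_in_hdfs /user/me/job-output /user/me/merged.txt` (both paths invented here). The pipe streams everything through the client machine, so it may be slow for very large outputs.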