Hello All,

When I try to run the following sample

http://wiki.pentaho.com/display/BAD/...taho+MapReduce

I am always getting an error like

2014/01/18 17:10:34 - Spoon - Starting job...
2014/01/18 17:10:34 - aggregate_mr_output_format - Start of job execution
2014/01/18 17:10:34 - aggregate_mr_output_format - Starting entry [Pentaho MapReduce]
2014/01/18 17:10:35 - Pentaho MapReduce - ERROR (version 5.0.2, build 1 from 2013-12-04_15-52-25 by buildguy) : org/apache/hadoop/mapred/lib/MultipleTextOutputFormat
2014/01/18 17:10:35 - Pentaho MapReduce - ERROR (version 5.0.2, build 1 from 2013-12-04_15-52-25 by buildguy) : java.lang.NoClassDefFoundError: org/apache/hadoop/mapred/lib/MultipleTextOutputFormat
2014/01/18 17:10:35 - Pentaho MapReduce - at java.lang.ClassLoader.defineClass1(Native Method)
2014/01/18 17:10:35 - Pentaho MapReduce - at java.lang.ClassLoader.defineClass(Unknown Source)
2014/01/18 17:10:35 - Pentaho MapReduce - at java.security.SecureClassLoader.defineClass(Unknown Source)
2014/01/18 17:10:35 - Pentaho MapReduce - at java.net.URLClassLoader.defineClass(Unknown Source)
2014/01/18 17:10:35 - Pentaho MapReduce - at java.net.URLClassLoader.access$100(Unknown Source)
2014/01/18 17:10:35 - Pentaho MapReduce - at java.net.URLClassLoader$1.run(Unknown Source)
2014/01/18 17:10:35 - Pentaho MapReduce - at java.net.URLClassLoader$1.run(Unknown Source)
2014/01/18 17:10:35 - Pentaho MapReduce - at java.security.AccessController.doPrivileged(Native Method)
2014/01/18 17:10:35 - Pentaho MapReduce - at java.net.URLClassLoader.findClass(Unknown Source)
2014/01/18 17:10:35 - Pentaho MapReduce - at java.lang.ClassLoader.loadClass(Unknown Source)
2014/01/18 17:10:35 - Pentaho MapReduce - at java.lang.ClassLoader.loadClass(Unknown Source)
2014/01/18 17:10:35 - Pentaho MapReduce - at org.pentaho.di.core.plugins.KettleURLClassLoader.loadClass(KettleURLClassLoader.java:98)
2014/01/18 17:10:35 - Pentaho MapReduce - at java.lang.ClassLoader.loadClass(Unknown Source)
2014/01/18 17:10:35 - Pentaho MapReduce - at java.lang.Class.forName0(Native Method)
2014/01/18 17:10:35 - Pentaho MapReduce - at java.lang.Class.forName(Unknown Source)
2014/01/18 17:10:35 - Pentaho MapReduce - at org.pentaho.hadoop.shim.HadoopConfigurationClassLoader.loadClass(HadoopConfigurationClassLoader.java:99)
2014/01/18 17:10:35 - Pentaho MapReduce - at java.lang.ClassLoader.loadClass(Unknown Source)
2014/01/18 17:10:35 - Pentaho MapReduce - at org.pentaho.di.job.entries.hadooptransjobexecutor.JobEntryHadoopTransJobExecutor.execute(JobEntryHadoopTransJobExecutor.java:673)
2014/01/18 17:10:35 - Pentaho MapReduce - at org.pentaho.di.job.Job.execute(Job.java:678)
2014/01/18 17:10:35 - Pentaho MapReduce - at org.pentaho.di.job.Job.execute(Job.java:815)
2014/01/18 17:10:35 - Pentaho MapReduce - at org.pentaho.di.job.Job.execute(Job.java:500)
2014/01/18 17:10:35 - Pentaho MapReduce - at org.pentaho.di.job.Job.run(Job.java:407)
2014/01/18 17:10:35 - Pentaho MapReduce - Caused by: java.lang.ClassNotFoundException: org.apache.hadoop.mapred.lib.MultipleTextOutputFormat
2014/01/18 17:10:35 - Pentaho MapReduce - at java.net.URLClassLoader$1.run(Unknown Source)
2014/01/18 17:10:35 - Pentaho MapReduce - at java.net.URLClassLoader$1.run(Unknown Source)
2014/01/18 17:10:35 - Pentaho MapReduce - at java.security.AccessController.doPrivileged(Native Method)
2014/01/18 17:10:35 - Pentaho MapReduce - at java.net.URLClassLoader.findClass(Unknown Source)
2014/01/18 17:10:35 - Pentaho MapReduce - at java.lang.ClassLoader.loadClass(Unknown Source)
2014/01/18 17:10:35 - Pentaho MapReduce - at java.lang.ClassLoader.loadClass(Unknown Source)
2014/01/18 17:10:35 - Pentaho MapReduce - ... 22 more
2014/01/18 17:10:35 - aggregate_mr_output_format - Finished job entry [Pentaho MapReduce] (result=[false])
2014/01/18 17:10:35 - aggregate_mr_output_format - Job execution finished
2014/01/18 17:10:35 - Spoon - Job has ended.


I found that it is unable to find the class org.apache.hadoop.mapred.lib.MultipleTextOutputFormat. I have copied hadoop-core.jar file to the lib folder of pentaho installation. When I try to run now, it is throwing an error like

2014/01/18 17:13:25 - Spoon - Starting job...
2014/01/18 17:13:25 - aggregate_mr_output_format - Start of job execution
2014/01/18 17:13:25 - aggregate_mr_output_format - Starting entry [Pentaho MapReduce]
2014/01/18 17:13:26 - Pentaho MapReduce - ERROR (version 5.0.2, build 1 from 2013-12-04_15-52-25 by buildguy) : class YearMultipleTextOutputFormat not org.apache.hadoop.mapred.OutputFormat
2014/01/18 17:13:26 - Pentaho MapReduce - ERROR (version 5.0.2, build 1 from 2013-12-04_15-52-25 by buildguy) : java.lang.RuntimeException: class YearMultipleTextOutputFormat not org.apache.hadoop.mapred.OutputFormat
2014/01/18 17:13:26 - Pentaho MapReduce - at org.apache.hadoop.conf.Configuration.setClass(Configuration.java:931)
2014/01/18 17:13:26 - Pentaho MapReduce - at org.apache.hadoop.mapred.JobConf.setOutputFormat(JobConf.java:654)
2014/01/18 17:13:26 - Pentaho MapReduce - at org.pentaho.hadoop.shim.common.ConfigurationProxy.setOutputFormat(ConfigurationProxy.java:84)
2014/01/18 17:13:26 - Pentaho MapReduce - at org.pentaho.di.job.entries.hadooptransjobexecutor.JobEntryHadoopTransJobExecutor.execute(JobEntryHadoopTransJobExecutor.java:674)
2014/01/18 17:13:26 - Pentaho MapReduce - at org.pentaho.di.job.Job.execute(Job.java:678)
2014/01/18 17:13:26 - Pentaho MapReduce - at org.pentaho.di.job.Job.execute(Job.java:815)
2014/01/18 17:13:26 - Pentaho MapReduce - at org.pentaho.di.job.Job.execute(Job.java:500)
2014/01/18 17:13:26 - Pentaho MapReduce - at org.pentaho.di.job.Job.run(Job.java:407)
2014/01/18 17:13:26 - aggregate_mr_output_format - Finished job entry [Pentaho MapReduce] (result=[false])
2014/01/18 17:13:26 - aggregate_mr_output_format - Job execution finished
2014/01/18 17:13:26 - Spoon - Job has ended.

Could you please let me know how to resolve this? Your help is much appreciated.

I am using Pentaho Data Integration Enterprise Edition 5.0.2.

Regards,
Vijay