- Welcome to the Hadoop Integration Forum
- Hadoop TFI - streaming or dumping/extracting from temp files
- Question regarding test data
- Introduction to PDI Webcast recording posted
- Poll: Have you successfully installed the Hadoop beta build?
- Have you successfully tried the new Input/Output steps and Copy Files job entry?
- Hadoop Data Source for Cubes and Reports
- Hive Connection on Windows - Configuration of Data Source with JDBC driver
- HBase and ETL
- Hadoop file output, Enclosure issue.
- Apache Contribution Update
- maximum data size of hadoop cluster
- PDI error occurs while executing table input with Hive
- Hadoop version dependency?
- Creating dashboards; and, using metadata models when creating analyzer reports.
- 3.7RC1 installation problem
- Looking for documentation of Hadoop Integration
- Can't connect to Hive from BI server User console
- Is Pentaho for Hadoop fast?
- Linking up PDI to Hadoop
- Report Designer 3.7 RC1 - Hive parameter swallows previous character
- PDI client spoon.sh crashes on CDH3
- Hadoop Transformation Job Executor: Base class does not have a trans
- Push down data integration workload to MR job
- How to partition the input data?
- Request to download "Pentaho Data Integration for Hadoop 4.1 GA" for Linux
- Hadoop steps not present in CE?
- Hadoop File Input, basic question
- Is it possible to load data in parallel into one table using Hadoop on PDI?
- Help: Hadoop Job Executor
- Need suggestion.
- Error in configuring object
- Hadoop Transformation Job Executor: Can not get the relative path
- Hadoop Transformation Job Executor: Null Pointer Exception
- Many transformation gadgets do not work properly in Pentaho Hadoop!!
- user of Hadoop Job Executor Step
- Pentaho PHD 4.1.0 GA and Cloudera's CDH3b3
- Where is the PHD for Hadoop?
- PDI 4.1.1 GA considerations
- Hadoop Job Executor job step
- Hadoop job executor can not take custom output format
- Hadoop integration in Pentaho CE
- Help, it's slower than a granny
- Loading weblog data in Hadoop/Hive by using kettle 4.1 transformation
- Pentaho with hive datasource FAILED
- Hive using JDBC: Error retrieving next row
- Error while starting Report Designer while using the Enterprise evaluation license
- Thrift Connection Support in PDI's HBase Output Step
- PHD 4.2 necessary for PDI EE server 4.2.0 RC1
- Insert csv file to Hadoop Hive table
- Help connecting Mondrian and Hive
- help using EMR
- help with Hadoop Job Executor
- Problem with accessing HDFS through Hadoop File Input
- Welcome to the Big Data Forum!
- Pentaho Data Integration with Hadoop
- Pentaho Distribution that runs outside of Hadoop
- Hadoop with PDI CE 4.2.1
- Facing Issues while publishing report from HADOOP HIVE Database ........HELP ME !!!!!
- Using Pentaho Analytics with Amazon AWS (Hive)
- Error retrieving next row on connecting to hive table through Pentaho Report Designer
- Problem executing MapReduce Transformation Weblogs_parse_mr
- how can I run pentaho job inside hadoop?
- integrating Cassandra with Pentaho map Reduce in 4.3
- PDI hadoop file browser no list
- Pentaho supports Mapreduce or mapred??
- job/task tracker logging with exports HADOOP_CLASSPATH
- running XSLT transformation within pentaho map reduce job
- Online training on Pentaho BI at the cheapest cost
- Cassandra Kettle 4.3.0 Preview - Read large number of rows
- Unable to load class for step/plugin with id [HadoopEnterPlugin]
- Cassandra input and output - enhancements, suggestions etc
- Scheduling options on a cluster
- Feedback and use cases please
- FYI: Big Data Contests
- Connecting to Remote Cassandra Node (errors)
- Transform Several MSAccess DB to only one DB problems
- Error retrieving data from Hive DB type
- Hadoop File Input step cannot connect to CDH3 cluster
- Pentaho MapReduce
- Tasktracker does not pick up pentaho jars from HADOOP_CLASSPATH in cloudera CDH3
- Unable to find pages.xml in order to create parametrized report with Mongo DB
- HBase output failing
- Unable to preview the content of a file loaded from HDFS on Spoon
- Pentaho Mapreduce
- Working around hadoop chunking.
- Hadoop Copy files Step in Job
- Cassandra input/output error
- Hadoop Copy Files Errors
- HBase data read with contains clause in key
- HBase read with date range in start and stop key
- Pentaho BigData Data Integration 4.3 crashed giving memory leak.
- Pentaho Big Data in Windows (standalone/cluster)
- How does Kettle write the Hadoop job XML
- How does reduce transformation work
- How to get UTC time inserted as in my table?
- Cassandra Input and parameters
- HBase Input - Mapping binary HBase row keys as Input Fields
- Unable to Connect to Hbase
- MongoDB output Bug
- Amazon Dynamo Db
- Cassandra - SSTableOutput step and an SSTableUpload based on SSTableLoader/BulkLoader
- Spoon could not connect to remote node of cassandra cluster
- How to run a transformation with Pentaho MapReduce
- Limit Big Data
- How to pass output field from previous step into MongoDB Input step json query?
- Getting error while using HBase output..
- MongoDB-Spoon "Group By" query help
- RDBMS to HBASE schema migration
- How do I add leading zeros to a numeric data field in Hive QL?
- HBase Input error
- Big Data features open-sourced?
- Creating Amazon EMR Job
- Debugging Kettle and Big Data plugin in Eclipse
- Hadoop Job Executor error (PDI CE 4.3)
- ClassNotFoundException: KettleException
- Spoon Error on JSON Input -- "No script engine for Javascript" (jrunscript)
- Step limitations for Pentaho Map Reduce
- Cassandra Output Testing Feedback- SNAPHOT pentaho-big-data-plugin build 150
- Issues configuring Pentaho pdi 4.3 db connection with hadoop - hive instances
- Error in reading rows from Pentaho's datasource to hive
- Data load from HDFS to relational DB is very slow
- OutOfMemoryError creating PDI connection to Hive with CDH4
- How to use MapReduce with Hbase as Input and output as HDFS in Pentaho kettle 4.3.0?
- Hbase Transformation created in one machine is not running in another
- Hbase Transformation created in one machine is not running in another
- S3 CSV Input - best way to read all files in an S3 folder (EMR files)
- How to delete a record in HBase table using Pentaho
- How to delete specific rows from hbase using pentaho
- pdi-4.3.0 to CDH 4.0.1. hive connection issue
- Problem while using Hadoop File Input and Hadoop File Output
- Connecting PDI to Hadoop on VMWare
- java.lang.IllegalArgumentException: Invalid DFS directory - Error in MapReduce
- How to share UDF jar files cross different jobs using Pig executor?
- HDFS File Input
- hbase input connection issue
- Problem with "Using Pentaho MapReduce to Generate an Aggregate Dataset" tutorial
- how to load .sql file directly into hive ?
- Problem connecting to HBase or Hadoop
- Spoon remote execution when using DI and Hadoop in a private network
- Spoon remote execution when using DI and Hadoop in a private network
- Trouble connecting to Google BigQuery jdbc
- cassandra input component error
- Error While Running the Map Reduce
- Hadoop File Input 'Browse -> Open File' does not respect hadoop configuration files?
- MapR M5 + Pentaho Spoon 4.3 - Could not resolve file
- MapR integration
- MapR Integration
- Mongodb update
- Connection error of Hive and pentaho
- Connection errors and pentaho Hive
- How to use Hadoop computing power while you create transformations on pdi-ce-4.2.0-st
- Does PDI CE 4.2.0 created transformation able to use Hadoop Computing power ?
- Problem connecting PDI4.4 to Hive / Hadoop
- connection hdfs and hadoop input step
- trying to follow the tutorial - Using Pentaho to Parse Weblog Data
- CQL 3 - Cassandra
- Bulk load
- Following the tutorial - Loading Data into HDFS
- Following the tutorial - Loading Data into HDFS
- Integration with Cloudera Impala
- Kettle transformation from Text Input to HBase output - Mapping issue
- Kettle transformation from Text Input to HBase output - Mapping issue
- authentication issues with MongoDB output
- Apache Hadoop version 0.20.X for Pentaho on Windows
- Passing parameter file to pig script.
- Cassandra output step produces error: Cant find a deserializer for type "{0}"
- MongoDB output step: write safety
- Using Metamarkets "Druid" as a data source in Pentaho
- How to Configure Pentaho DI (pdi-ce-4.4.0 stable)with Hadoop-1.0.4
- How to specify Hadoop user to run MapReduce Job
- Accessing a dependency on the nodes of a M/R cluster
- Pentaho Map Reduce - Invalid byte 2 of 4-byte UTF-8 sequence Error
- getmerge for PDI
- Clarification of limitations of Cassandra Input Step
- Statistical Calculations in the data tier?
- MongoDB input query by ObjectId
- Hadoop Copy Files - How to set HDFS User?
- Mongo input help - join mongo collections
- Mongo Slave?
- MongoDB Output!
- MapReduce Job fails java.lang.RuntimeException: Error in configuring object
- How to get access to static class from jar when running job with Map Reduce
- Getting only one processed file from MapReduce output step for multiple input files.
- HDFS connection issue
- mapreduce mongodb
- Is there a map reduce sample for XML input files?
- CDH4 Hive JDBC issue on windows
- Can connect to HDFS but cannot access the actual files (application freezes)
- MapReduce On PDI
- How to set JVM memory parameters for the nodes?
- Instaview?
- PRD can't connect to cdh4.2.0 hive