PDA

View Full Version : Big Data



  1. Welcome to the Hadoop Integartion Forum
  2. Hadoop TFI - streaming or dumping/extracting from temp files
  3. Question regarding test data
  4. Introduction to PDI Webcast recording posted
  5. Poll: Have you successfully installed the Hadoop beta build?
  6. Have you successfully tried the new Input/Output steps and Copy Files job entry?
  7. Hadoop Data Source for Cubes and Reports
  8. Hive Connection on Windows - Configuration of Data Source with JDBC driver
  9. HBase and ETL
  10. Hadoop file output, Enclosure issue.
  11. Apache Contribution Update
  12. maximum data size of hadoop cluster
  13. PDI error occures while executing table input with Hive
  14. Hadoop version dependency?
  15. Creating dashboards; and, using metadata models when creating analyzer reports.
  16. 3.7RC1 installation problem
  17. Looking documentaion of Hadoop Integration
  18. Can't connect hive from BI server User console
  19. Is Pentaho for Hadoop fast?
  20. Linking up PDI to Hadoop
  21. Report Designer 3.7 RC1 - Hive parameter swallows previous chararacter
  22. PDI client spoon.sh crashes on CDH3
  23. Hadoop Transformation Job Executor: Base class does not have a trans
  24. Push down data integration workload to MR job
  25. How to partition the input data?
  26. Request to download "Pentaho Data Integration for Hadoop 4.1 GA" for Linux
  27. Hadoop steps not present in CE?
  28. Hadoop File Input, basic question
  29. Is it possible parallel load data into one table using Hadoop on PDI?
  30. Help: Hadoop Job Executor
  31. Need suggestion.
  32. Error in configuring object
  33. Hadoop Transformation Job Executor: Can not get the relative path
  34. Hadoop Transformation Job Executor: Null Pointer Exception
  35. Many transofrmation gadgets do not work properly in Pentaho Hadoop!!
  36. user of Hadoop Job Executor Step
  37. Pentaho PHD 4.1.0 GA and Cloudera's CDH3b3
  38. Where is the PHD for Hadoop?
  39. PDI 4.1.1 GA considerations
  40. Hadoop Job Executor job step
  41. Hadoop job executor can not take custom output format
  42. Hadoop integration in Pentaho CE
  43. Help, its slower than a granny
  44. Loading weblog data in Hadoop/Hive by using kettle 4.1 transformation
  45. Pentaho with hive datasource FAILED
  46. Hive using JDBC: Error retrieving next row
  47. Error while starting Report Designer while using the Enterprise evaluation license
  48. Thrift Connection Support in PDI's HBase Output Step
  49. PHD 4.2 necessary for PDI EE server 4.2.0 RC1
  50. Insert csv file to Hadoop Hive table
  51. Help connecting Mondrian and Hive
  52. help using EMR
  53. help with Hadoop Job Executor
  54. Problem with accessing HDFS through Hadoop File Input
  55. Welcome to the Big Data Forum!
  56. Pentaho Data Integration with Hadoop
  57. Pentaho Distribution that runs outside of Hadoop
  58. Hadoop with PDI CE 4.2.1
  59. Facing Issues while publishing report from HADOOP HIVE Database ........HELP ME !!!!!
  60. Using Pentaho Analytics with Amazon AWS (Hive)
  61. Error retrieving next row on connecting to hive table through Pentaho Report Designer
  62. Problem executing MapReduce Transformation Weblogs_parse_mr
  63. how can I run pentaho job inside hadoop?
  64. integrating Cassandra with Pentaho map Reduce in 4.3
  65. PDI hadoop file browser no list
  66. Pentaho supports Mapreduce or mapred??
  67. job/task tracker logging with exports HADOOP_CLASSPATH
  68. running XSLT transformation within pentaho map reduce job
  69. Online training on Pentaho BI at the cheapest cost
  70. Cassandra Kettle 4.3.0 Preview - Read large number of rows
  71. Unable to load class for step/plugin with id [HadoopEnterPlugin]
  72. Cassandra input and output - enhancements, suggestions etc
  73. Scheduling options on a cluster
  74. Feedback and use cases please
  75. FYI: Big Data Contests
  76. Connecting to Remote Cassandra Node (errors)
  77. Transform Several MSAccess DB to only one DB problems
  78. Error retrieving data from Hive DB type
  79. Hadoop File Input step cannot connect to CDH3 cluster
  80. Pentaho MapReduce
  81. Tasktracker does not pick up pentaho jars from HADOOP_CLASSPATH in cloudera CDH3
  82. Unable to find pages.xml in order to create parametrized report with Mongo DB
  83. HBase output failing
  84. Unable to priview the content of a file loaded from HDFS on Spoon
  85. Pentaho Mapreduce
  86. Working around hadoop chunking.
  87. Hadoop Copy files Step in Job
  88. Cassandra input/output error
  89. Hadoop Copy Files Errors
  90. HBase data read with contains clause in key
  91. HBase read with date range in start and stop key
  92. Pentaho BigData Data Integration 4.3 crashed giving memory leak.
  93. Pentaho Big Data in Windows(standalone/clustur)
  94. How do kettle write hadoop job xml
  95. How does reduce transformation work
  96. How to get UTC time inserted as in my table?
  97. Cassandra Input and paramters
  98. HBase Input - Mapping binary HBase row keys as Input Fields
  99. Unable to Connect to Hbase
  100. MongoDB output Bug
  101. Amazon Dynamo Db
  102. Cassandra - SSTableOutput step and an SSTableUpload based on SSTableLoader/BulkLoader
  103. Spoon could not connect to remote node of cassandra cluster
  104. How to run a transformation with Pentaho MapReduce
  105. Limit Big Data
  106. How to pass output field from previous step into MongoDB Input step json query?
  107. Getting error while using HBase output..
  108. MongoDB-Spoon "Group By" query help
  109. RDBMS to HBASE schema migration
  110. How do I add leading zeros to a numeric data field in Hive QL?
  111. HBase Input error
  112. Big Data features open-sourced?
  113. Creating Amazon EMR Job
  114. Debugging Kettle and Big Data plugin in Eclipse
  115. Hadoop Job Executor error (PDI CE 4.3)
  116. ClassNotFoundException: KettleException
  117. Spoon Error on JSON Input -- "No script engine for Javascript" (jrunscript)
  118. Step limitations for Pentaho Map Reduce
  119. Cassandra Output Testing Feedback- SNAPHOT pentaho-big-data-plugin build 150
  120. Issues configuring Pentaho pdi 4.3 db connection with hadoop - hive instances
  121. Error in reading rows from Pentaho's datasource to hive
  122. Data load from HDFS to relational DB is very slow
  123. OutOfMemoryError creating PDI connection to Hive with CDH4
  124. How to use MapReduce with Hbase as Input and output as HDFS in Pentaho kettle 4.3.0?
  125. Hbase Transformation created in one machine is not running in another
  126. Hbase Transformation created in one machine is not running in another
  127. S3 CSV Input - best way to read all files in an S3 folder (EMR files)
  128. How to delete a record in HBase table using Pentaho
  129. How to delete specific rows from hbase using pentaho
  130. pdi-4.3.0 to CDH 4.0.1. hive connection issue
  131. Problem while using Hadoop File Input and Hadoop File Output
  132. Connecting PDI to Hadoop on VMWare
  133. java.lang.IllegalArgumentException: Invalid DFS directory - Error in MapReduce
  134. How to share UDF jar files cross different jobs using Pig executor?
  135. HDFS File Input
  136. hbase input connection issue
  137. Problem with "Using Pentaho MapReduce to Generate an Aggregate Dataset" tutorial
  138. how to load .sql file directly into hive ?
  139. Problem connection Hbase or hadoop
  140. Spoon remote execution when using DI and Hadoop in a private network
  141. Spoon remote execution when using DI and Hadoop in a private network
  142. Trouble connecting to Google BigQuery jdbc
  143. cassandra input component error
  144. Error While Running the Map Reduce
  145. Hadoop File Input 'Browse -> Open File' does not respect hadoop configuration files?
  146. MapR M5 + Pentaho Spoon 4.3 - Could not resolve file
  147. MapR integration
  148. MapR Integration
  149. Mongodb update
  150. Connection error of Hive and pentaho
  151. Connection errors and pentaho Hive
  152. How to use Hadoop computing power while you create transformations on pdi-ce-4.2.0-st
  153. Does PDI CE 4.2.0 created transformation able to use Hadoop Computing power ?
  154. Problem connecting PDI4.4 to Hive / Hadoop
  155. connection hdfs and hadoop input step
  156. trying to follow the tutorial - Using Pentaho to Parse Weblog Data
  157. CQL 3 - Cassandra
  158. Bulk load
  159. Following the tutorial - Loading Data into HDFS
  160. Following the tutorial - Loading Data into HDFS
  161. Integration with Cloudera Impala
  162. Kettle transformation from Text Input to HBase output - Mapping issue
  163. Kettle transformation from Text Input to HBase output - Mapping issue
  164. authentication issues with MongoDB output
  165. Apache Hadoop version 0.20.X for Pentaho on Windows
  166. Passing parameter file to pig script.
  167. Cassandra output step produces error: Cant find a deserializer for type "{0}"
  168. MongoDB output step: write safety
  169. Using Mtamarkets "Druid" as a data source in Pentaho
  170. How to Configure Pentaho DI (pdi-ce-4.4.0 stable)with Hadoop-1.0.4
  171. How to specify Hadoop user to run MapReduce Job
  172. Accessing a depencency on the nodes of a M/R cluster
  173. Pentaho Map Reduce - Invalid byte 2 of 4-byte UTF-8 sequence Error
  174. getmerge for PDI
  175. Clarification of limitations of Cassandra Input Step
  176. Statistical Calculations in the data tier?
  177. MongoDB input query by ObjectId
  178. Hadoop Copy Files - How to set HDFS User?
  179. Mongo input help - join mongo collections
  180. Mongo Slave?
  181. MongoDB Output!
  182. MapReduce Job failes java.lang.RuntimeException: Error in configuring object
  183. How to get access to static class from jar when running job with Map Reduce
  184. Getting only one processed file from MapReduce output step for multiple input files.
  185. HDFS connection issue
  186. mapreduce mongodb
  187. Is there a map reduce sample for XML input files?
  188. CDH4 Hive JDBC issue on windows
  189. Can connect to HDFS but cannot access the actual files (application freezes)
  190. MapReduce On PDI
  191. How to set JVM memory parameters for the nodes?
  192. Instaview?
  193. PRD can't connect to cdh4.2.0 hive