Hitachi Vantara Pentaho Community Forums
Results 1 to 4 of 4

Thread: Loading weblog data in Hadoop/Hive by using kettle 4.1 transformation

  1. #1

    Default Loading weblog data in Hadoop/Hive by using kettle 4.1 transformation

    Hello All, I've not found much documentation on this but i'm trying to load the weblog data in Hadoop/Hive by using kettle 4.1

    I'm able to create an hadoop/hive database connection but when i use the table output in transformation, it error out on the first line of the input file with the following error




    2011/06/29 09:34:15 - hadoopweblogtableout.0 - ERROR (version 4.1.0-stable, build 14410 from 2010-11-16 16.43.28 by buildguy) : Error inserting/updating row
    2011/06/29 09:34:15 - hadoopweblogtableout.0 - ERROR (version 4.1.0-stable, build 14410 from 2010-11-16 16.43.28 by buildguy) : Method not supported
    2011/06/29 09:34:15 - hadoopweblogtableout.0 - ERROR (version 4.1.0-stable, build 14410 from 2010-11-16 16.43.28 by buildguy) :

    Is it possible to load data in Hadoop/Hive hdfs by using kettle transformation?

    Meanwhile i can try using the shell script and load the file.

    Please let me know.

    Highly appreciate it.

  2. #2

    Default

    no, the table output step does not currently work for inserting data into Hive via JDBC. At the time we added Hive support, it Hive did not support the necessary INSERT structure needed by the Table Output step. I believe the latest Hive (0.7) now provides this, so we could likely now update our flavor of the driver to support this. I would recommend opening an enhancement for this feature in our Jira system.

    -Jake

  3. #3

    Default

    Thanks Jake !

    I also did fine one more post dated: Aug 2010 from Jordan Ganoff :

    Support for Hive via the Table Output Step is on our roadmap; however, that functionality will not be included in the GA release. Best estimate I can give you is "soon".

    Best Regards,

    Jordan Ganoff

    So it's not done yet? correct?

    Meanwhile i'll send the request in the Jira system.

    Thanks Again

  4. #4
    Join Date
    Aug 2010
    Posts
    87

    Default

    Correct. Due to limitations in the current state of the Hive JDBC driver this is not possible.

    Best,
    Jordan

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  
Privacy Policy | Legal Notices | Safe Harbor Privacy Policy

Copyright © 2005 - 2019 Hitachi Vantara Corporation. All Rights Reserved.