Hitachi Vantara Pentaho Community Forums
Results 1 to 5 of 5

Thread: Excel input step not reading the 'xlsx' format

  1. #1

    Default Excel input step not reading the 'xlsx' format

    Hi all,

    I am using PDI-4.1 CE. I am struck with the problem that the excel input step is not reading
    xlsx format.


    I tried some steps,

    1. replaced POI jar file with latest one in libext folder .Version :3.7 & 3.8 beta version.

    2.Then I converted my excel to ODS format and in excel input step ,"spread sheet type(engine)" tab i changed to "open office ODS"

    3. changed the "spread sheet type(engine)" tab to Excel 2007 XLSX (Apache POI) for xlsx file.

    All these above dint help me,and I am getting error as follows,

    "jxl.read.biff.BiffException: Unable to recognize OLE stream
    Unable to recognize OLE stream"

    Finally i converted the file to csv format. But some of my fields are datetime, When converted to csv i can get only date. So i cant go ahead with this as time with date is very important for me.

    So can anyone suggest me how I can go ahead, when i convert my xlsx fileto lower formats my record counts is getting reduced drastically as lower version can't have more than 60k records.


    Hoping for a positive reply,

    Thanks & regards,

    Dhanesh

  2. #2
    Join Date
    Apr 2008
    Posts
    4,683

    Default

    Quote Originally Posted by dhaneshmkumar View Post
    2.Then I converted my excel to ODS format
    Leave it as XLSX... Don't change the file to ODS.
    If you change the file to ODS, you have to tell the Excel Input that the file is ODS
    **THIS IS A SIGNATURE - IT GETS POSTED ON (ALMOST) EVERY POST**
    I'm no expert.
    Take my comments at your own risk.

    PDI user since PDI 3.1
    PDI on Windows 7 & Linux

    Please keep in mind (and this may not apply to this thread):
    No forum member is going to do your work for you. We will help you sort out how to do a specific part of the work, as best we can, in the timelines that our work will allow us.
    Signature Updated: 2014-06-30

  3. #3

    Default

    hi,

    So some please suggest me how can I proceed using xlsx file. replacing poi jar files in libext folder dint help.

    regards,
    Dhanesh

  4. #4
    Join Date
    Nov 1999
    Posts
    9,729

  5. #5
    Join Date
    Mar 2008
    Posts
    22

    Default

    Matt,

    I am having the same problem reading an Excel file. I am getting the message "jxl.read.biff.BiffException: Unable to recognize OLE stream". I am using Kettle-4.2.0-RC1. Strangely enough my transformation works in Windows but gives this error message in RedHat Linux. Using the same xlsx file and same transformation. BTW bought your book and it has helped me alot.

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  
Privacy Policy | Legal Notices | Safe Harbor Privacy Policy

Copyright © 2005 - 2019 Hitachi Vantara Corporation. All Rights Reserved.