Hitachi Vantara Pentaho Community Forums
Results 1 to 5 of 5

Thread: Unwanted Skip of Files in Excel Input

  1. #1
    Join Date
    Oct 2014
    Posts
    18

    Default Unwanted Skip of Files in Excel Input

    Hi,
    following is my transformation overview:
    1- Input Excel .xls file (it have only 1 sheet and 4 rows)
    2- Transform data through JS
    3- Write data from text file output.

    Now in Input excel, I used Excel 2007 XLSX (Apache POI Streaming). In doing Preview Rows, PDI is showing me only 2 rows out of total 4 rows.

    As only 2 rows are visible in transformation out of total 4, the other major problem is that, the few columns values for available 2 rows are coming from those two rows which are not in preview.

    Please help me to sort out the issue.

    Thanks in advance.

    Regards,
    Yuvam

  2. #2
    Join Date
    Sep 2007
    Posts
    834

    Default

    Check the Content tab, and fix it accordingly. For example: uncheck "Header" if you don't have one, etc.

  3. #3
    Join Date
    Oct 2014
    Posts
    18

    Default

    Hi Maria,

    I real problem is that
    - I have input files with 1 million records ()excel input)
    - Output as text file (.csv format) and 50000 rows in each file.
    With such condition PDI should generate 20 files with 50K rows in each.

    But PDI is generating only 10 files with 50K in each. PDI escape alternate rows as found after analysis.

    Big problem : The rows which are available in the output files have values for few columns coming from the (-1) columns from their actual position in input file.

    Thanks
    Yuvam

  4. #4
    Join Date
    Jun 2012
    Posts
    5,534

    Default

    You should give us the demo described in your opening post, the one with the four row input.
    So long, and thanks for all the fish.

  5. #5
    Join Date
    Oct 2014
    Posts
    18

    Default

    Ok, I will share the sample snap shoot soon.

    I also found in the preview of the data from Excel input, it is also showing the alternate rows and mixed the values of some columns from the missing rows.
    I am using "Excel 2007 XLSX (Apache POI Streaming)" for my .xlsx file input which have only 1 sheet.

    thanks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  
Privacy Policy | Legal Notices | Safe Harbor Privacy Policy

Copyright © 2005 - 2019 Hitachi Vantara Corporation. All Rights Reserved.