Hitachi Vantara Pentaho Community Forums
Results 1 to 14 of 14

Thread: CSV Input File - Fails during Preview fields

  1. #1

    Default CSV Input File - Fails during Preview fields

    In a new transformation with only the CSV file input box, I configure it to grab a csv file from my computer. If I do 'Get fields' and select up to 50 records or so, it does just fine. However if I leave it at the default of 100, it freezes at 57 (says "Scanning line 57") and hangs there. Kettle totally crashes there and I have to re-open it. Any ideas what's occurring here? I've confirmed the actual format of the data is fine.

  2. #2

    Default

    I take that back. It was treating an id field as numeric whereas it should have been a string. However, I try to edit it to say String and it seems to crash hard. Occasionally I can get it to work, but seldom.

    2008/01/11 18:16:42:680 PST [INFO] DefaultFileReplicator - Using "/tmp/vfs_cache" as temporary files store.
    INFO 11-01 18:16:45,430 (LogWriter.java:println:403) -Spoon - Main window is created.
    INFO 11-01 18:16:45,434 (LogWriter.java:println:403) -Spoon - Asking for repository
    INFO 11-01 18:16:45,462 (LogWriter.java:println:403) -RepositoriesMeta - Reading repositories XML file: /Users/jjohnson/.kettle/repositories.xml
    INFO 11-01 18:16:45,714 (LogWriter.java:println:403) -VersionChecker - OK
    INFO 11-01 18:16:56,309 (LogWriter.java:println:403) -RepositoriesMeta - Reading repositories XML file: /Users/jjohnson/.kettle/repositories.xml
    INFO 11-01 18:17:24,263 (LogWriter.java:println:403) -Trans - This is not a replay transformation
    INFO 11-01 18:17:24,268 (LogWriter.java:println:403) -Trans - This transformation can be replayed with replay date: 2008/01/11 18:17:24
    INFO 11-01 18:17:24,269 (LogWriter.java:println:403) -Trans - Initialising 2 steps...
    INFO 11-01 18:17:24,352 (LogWriter.java:println:403) -CSV file input.0 - Starting to run...
    INFO 11-01 18:17:24,354 (LogWriter.java:println:403) -dummy.0 - Starting to run...
    INFO 11-01 18:17:24,883 (LogWriter.java:println:403) -CSV file input.0 - Finished processing (I=1003, O=0, R=0, W=1001, U=0, E=0)
    INFO 11-01 18:17:24,883 (LogWriter.java:println:403) -dummy.0 - Finished processing (I=0, O=0, R=750, W=750, U=0, E=0)
    Invalid memory access of location c00000f7 eip=960f0286
    ./spoon.sh: line 146: 560 Segmentation fault $JAVA_BIN $OPT org.pentaho.di.ui.spoon.Spoon "${1+$@}"

  3. #3
    Join Date
    May 2006
    Posts
    4,882

    Default

    Are you using Apple by any chance?

    Regards,
    Sven

  4. #4

    Default

    Yes I am. I'm running Leopard on an iMac

  5. #5
    Join Date
    May 2006
    Posts
    4,882

    Default

    It's a "known" problem... http://jira.pentaho.org/browse/PDI-567

    Maybe you can try http://landonf.bikemonkey.org/static/soylatte/ as alternative JVM as in http://www.ibridge.be/?p=91

    Regards,
    Sven

  6. #6

    Default

    Okay, I installed OpenMotif and Soylatte (from binary).

    I'm kind of confused on what I have to do next in terms of configuration. I edited the .bash_profile to:

    PATH=/usr/local/bin:$PATH
    PATH=/opt/local/bin:$PATH
    PATH=/usr/local/soylatte16-i386/soylatte16-i386-1.0.1:$PATH
    JAVA_HOME=/usr/local/soylatte16-i386/soylatte16-i386-1.0.1/bin:$JAVA_HOME


    but I don't know if this is correct. I'm kind of a newbie on Linux/Mac and not familiar with soylatte and where exactly to configure the JAVA_HOME. Also, do I need spoon to specify which java to use (that is, to use soylatte instead of JRE)?

  7. #7

    Default

    Still crashes at same place when trying to adjust the field types on a csv input. Note:

    which java:
    /usr/local/soylatte16-i386/soylatte16-i386-1.0.1/bin/java

    Here's the error log of the crash:

    Process: java_swt [4362]
    Path: /kettle/Kettle-3.0.1 Folder/libswt/osx/java_swt
    Identifier: java_swt
    Version: ??? (???)
    Code Type: X86 (Native)
    Parent Process: sh [4355]

    Date/Time: 2008-01-12 23:51:26.891 -0800
    OS Version: Mac OS X 10.5.1 (9B18)
    Report Version: 6

    Exception Type: EXC_BAD_ACCESS (SIGSEGV)
    Exception Codes: KERN_INVALID_ADDRESS at 0x00000000c5c3bce7
    Crashed Thread: 0

    Application Specific Information:

    Java information:
    Version: Java HotSpot(TM) Client VM (1.5.0_13-119 mixed mode, sharing)
    Virtual Machine version: Java HotSpot(TM) Client VM (1.5.0_13-119) for macosx-x86, built on Sep 28 2007 23:59:21 by root with gcc 4.0.1 (Apple Inc. build 5465)
    Exception type: Bus Error (0xa) at pc=0x960f0286

    Current thread (0x010014a0): JavaThread "main" [_thread_in_native, id=-1605783712]
    Stack: [0xbf800000,0xc0000000)
    Java frames: (J=compiled Java code, j=interpreted, Vv=VM code)
    j org.eclipse.swt.internal.carbon.OS.CallNextEventHandler(II)I+0
    j org.eclipse.swt.widgets.Table.kEventMouseDown(III)I+24
    j org.eclipse.swt.widgets.Widget.mouseProc(III)I+68
    j org.eclipse.swt.widgets.Display.mouseProc(III)I+464
    v ~StubRoutines::call_stub
    j org.eclipse.swt.internal.carbon.OS.SendEventToEventTarget(II)I+0
    J org.eclipse.swt.widgets.Display.readAndDispatch()Z
    v ~RuntimeStub::alignment_frame_return Runtime1 stub
    ...etc

  8. #8

    Default

    I also just tried upgrading to the developer version of Java 1.6 for Leopard, removed the related lines in the .bash_profile and restarted. Same issue with crash.

  9. #9
    Join Date
    May 2006
    Posts
    4,882

    Default

    Looking at your last stack trace it's seems you're still using the "apple" java, so probably some PATH setting is not ok to run soylatte.

    Regards,
    Sven

  10. #10
    Join Date
    Nov 1999
    Posts
    9,729

    Default

    Hey guys, I don't think it's caused by Java, this is a pure Apple Cocoa bug. It's been floating around for a while. I guess we'll have to wait for an update to Leopard.
    The issue occurs at the same point for Apple Java & SoyLatte because it is not Java related.

    Please note that my Tiger runs just fine and does not exhibit the problem ;-)

    All the best,

    Matt

  11. #11

    Default

    Matt/Sven,

    Is there some other way to set the JAVA_PATH inside kettle? Matt, if you're sure it's a Cocoa bug I won't bother. Thanks

  12. #12
    Join Date
    Nov 1999
    Posts
    9,729

    Default

    There is no Java code used to drive the GUI elements. Eclipse/SWT uses native widgets.
    Actually, I think I was wrong about the Cocoa usage. It seems like SWT/OSX uses Carbon to run.
    There seems to be a Cocoa port underway, but I have no idea about the progress.

    In any case, I bought my Mac Mini for the sole purpose of fixing these Mac bugs. I can work around a lot of bugs although it seems like a simple drop-down menu might be difficult to fix.

    All the best,

    Matt

  13. #13

    Default

    I've installed Kettle on Windows for now. Can you tell me how I specify to use a tab character as the delimiter instead of the comma? Thanks

  14. #14
    Join Date
    May 2006
    Posts
    4,882

    Default

    Use the text file input ... on there there should be a button to insert a tab. Copy the tab and put it in the csv input step. Alternatively edit the xml file.

    Also added it to http://jira.pentaho.org/browse/PDI-606 for a future release.

    Regards,
    Sven

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  
Privacy Policy | Legal Notices | Safe Harbor Privacy Policy

Copyright © 2005 - 2019 Hitachi Vantara Corporation. All Rights Reserved.