Hitachi Vantara Pentaho Community Forums
Results 1 to 14 of 14

Thread: Is sort rows broken in kettle-3.0.0-RC2?

  1. #1
    Join Date
    Nov 2007
    Posts
    11

    Default Is sort rows broken in kettle-3.0.0-RC2?

    I can't seem to get the sort rows to work on even the simplest of tasks.
    I'm a new user of kettle...am I missing something?
    I'm attaching a sample transform.

    The error I get is:

    2007/11/02 23:00:35 - Sort rows.0 - The number of binary string to data type conversions done in this sort block is 0
    2007/11/02 23:00:35 - Sort rows.0 - QuickSort algorithm has finished.
    2007/11/02 23:00:35 - Sort rows.0 - ERROR (version 3.0.0-RC2, build 299036 from 2007/10/31 23:59:46) : Unexpected error :
    2007/11/02 23:00:35 - Sort rows.0 - ERROR (version 3.0.0-RC2, build 299036 from 2007/10/31 23:59:46) : org.pentaho.di.core.exception.KettleValueException:
    2007/11/02 23:00:35 - Sort rows.0 - ERROR (version 3.0.0-RC2, build 299036 from 2007/10/31 23:59:46) : field1 String(1)<binary-string> : There was a data type error: the data type of java.lang.String object [0] does not correspond to value meta [String(1)<binary-string>]
    2007/11/02 23:00:35 - Sort rows.0 - ERROR (version 3.0.0-RC2, build 299036 from 2007/10/31 23:59:46) :
    2007/11/02 23:00:35 - Sort rows.0 - ERROR (version 3.0.0-RC2, build 299036 from 2007/10/31 23:59:46) : at org.pentaho.di.core.row.ValueMeta.getString(ValueMeta.java:1258)
    2007/11/02 23:00:35 - Sort rows.0 - ERROR (version 3.0.0-RC2, build 299036 from 2007/10/31 23:59:46) : at org.pentaho.di.core.row.RowMeta.getString(RowMeta.java:169)
    2007/11/02 23:00:35 - Sort rows.0 - ERROR (version 3.0.0-RC2, build 299036 from 2007/10/31 23:59:46) : at org.pentaho.di.core.row.RowMeta.getString(RowMeta.java:529)
    2007/11/02 23:00:35 - Sort rows.0 - ERROR (version 3.0.0-RC2, build 299036 from 2007/10/31 23:59:46) : at org.pentaho.di.trans.steps.sort.SortRows.processRow(SortRows.java:371)
    2007/11/02 23:00:35 - Sort rows.0 - ERROR (version 3.0.0-RC2, build 299036 from 2007/10/31 23:59:46) : at org.pentaho.di.trans.steps.sort.SortRows.run(SortRows.java:484)
    2007/11/02 23:00:35 - Sort rows.0 - Finished processing (I=0, O=0, R=4, W=0, U=0, E=1)
    2007/11/02 23:00:35 - Spoon - The transformation has finished!!
    Attached Files Attached Files

  2. #2
    Join Date
    May 2006
    Posts
    4,882

    Default

    It works for me ... do you use the very latest RC2 as mentioned in the top of this forum.

    Regards,
    Sven

  3. #3
    Join Date
    Nov 2007
    Posts
    11

    Default

    Yes, RC2 on Windows.
    I've tried with jre1.5.0_10 and jre1.6.0_03.
    I'm using Kettle in the "no repository" mode...but I don't think that should make a difference?

    I'll download RC2 again from the link above just to be sure.

  4. #4
    Join Date
    Nov 2007
    Posts
    11

    Default

    Yes, same result using RC2 from this forum.
    Are you running on Windows? I've got an Ubuntu installation if you think that might be worth a try?

    Here's the complete log if that might help anyone.
    Any ideas are appreciated, thanks for responding.




    -----------------------------------------------------------

    2007/11/03 09:11:34 - SharedObjects - Reading the shared objects file [file:///C:/Documents and Settings/Dan/.kettle/shared.xml]
    2007/11/03 09:11:34 - Spoon - Transformation opened.
    2007/11/03 09:11:34 - Spoon - Launching transformation [temp]...
    2007/11/03 09:11:34 - Spoon - Started the transformation execution.
    2007/11/03 09:11:34 - temp - Dispatching started for transformation [temp]
    2007/11/03 09:11:34 - temp - Nr of arguments detected:0
    2007/11/03 09:11:34 - temp - This is not a replay transformation
    2007/11/03 09:11:34 - temp - I found 2 different steps to launch.
    2007/11/03 09:11:34 - temp - Allocating rowsets...
    2007/11/03 09:11:34 - temp - Allocating rowsets for step 0 --> CSV file input
    2007/11/03 09:11:34 - temp - prevcopies = 1, nextcopies=1
    2007/11/03 09:11:34 - temp - Transformation allocated new rowset [CSV file input.0 - Sort rows.0]
    2007/11/03 09:11:34 - temp - Allocated 1 rowsets for step 0 --> CSV file input
    2007/11/03 09:11:34 - temp - Allocating rowsets for step 1 --> Sort rows
    2007/11/03 09:11:34 - temp - Allocated 1 rowsets for step 1 --> Sort rows
    2007/11/03 09:11:34 - temp - Allocating Steps & StepData...
    2007/11/03 09:11:34 - temp - Transformation is about to allocate step [CSV file input] of type [CsvInput]
    2007/11/03 09:11:34 - CSV file input.0 - distribution activated
    2007/11/03 09:11:34 - CSV file input.0 - Starting allocation of buffers & new threads...
    2007/11/03 09:11:34 - CSV file input.0 - Step info: nrinput=0 nroutput=1
    2007/11/03 09:11:34 - CSV file input.0 - output rel. is 1:1
    2007/11/03 09:11:34 - CSV file input.0 - Found output rowset [CSV file input.0 - Sort rows.0]
    2007/11/03 09:11:34 - CSV file input.0 - Finished dispatching
    2007/11/03 09:11:34 - temp - Transformation has allocated a new step: [CSV file input].0
    2007/11/03 09:11:34 - temp - Transformation is about to allocate step [Sort rows] of type [SortRows]
    2007/11/03 09:11:34 - Sort rows.0 - distribution activated
    2007/11/03 09:11:34 - Sort rows.0 - Starting allocation of buffers & new threads...
    2007/11/03 09:11:34 - Sort rows.0 - Step info: nrinput=1 nroutput=0
    2007/11/03 09:11:34 - Sort rows.0 - Got previous step from [Sort rows] #0 --> CSV file input
    2007/11/03 09:11:34 - Sort rows.0 - input rel is 1:1
    2007/11/03 09:11:34 - Sort rows.0 - Found input rowset [CSV file input.0 - Sort rows.0]
    2007/11/03 09:11:34 - Sort rows.0 - Finished dispatching
    2007/11/03 09:11:34 - temp - Transformation has allocated a new step: [Sort rows].0
    2007/11/03 09:11:34 - temp - This transformation can be replayed with replay date: 2007/11/03 09:11:34
    2007/11/03 09:11:34 - temp - Initialising 2 steps...
    2007/11/03 09:11:34 - Sort rows.0 - Running on slave server #0/1.
    2007/11/03 09:11:34 - CSV file input.0 - Running on slave server #0/1.
    2007/11/03 09:11:34 - temp - Step [CSV file input.0] initialized flawlessly.
    2007/11/03 09:11:34 - temp - Step [Sort rows.0] initialized flawlessly.
    2007/11/03 09:11:34 - temp - Transformation has allocated 2 threads and 1 rowsets.
    2007/11/03 09:11:34 - CSV file input.0 - Starting to run...
    2007/11/03 09:11:34 - CSV file input.0 - Finished processing (I=5, O=0, R=0, W=4, U=0, E=0)
    2007/11/03 09:11:34 - Sort rows.0 - Starting to run...
    2007/11/03 09:11:34 - Sort rows.0 - Starting quickSort algorithm...
    2007/11/03 09:11:34 - Sort rows.0 - The number of binary string to data type conversions done in this sort block is 0
    2007/11/03 09:11:34 - Sort rows.0 - QuickSort algorithm has finished.
    2007/11/03 09:11:34 - Sort rows.0 - ERROR (version 3.0.0-RC2, build 299036 from 2007/10/31 23:59:46) : Unexpected error :
    2007/11/03 09:11:34 - Sort rows.0 - ERROR (version 3.0.0-RC2, build 299036 from 2007/10/31 23:59:46) : org.pentaho.di.core.exception.KettleValueException:
    2007/11/03 09:11:34 - Sort rows.0 - ERROR (version 3.0.0-RC2, build 299036 from 2007/10/31 23:59:46) : field1 String(1)<binary-string> : There was a data type error: the data type of java.lang.String object [0] does not correspond to value meta [String(1)<binary-string>]
    2007/11/03 09:11:34 - Sort rows.0 - ERROR (version 3.0.0-RC2, build 299036 from 2007/10/31 23:59:46) :
    2007/11/03 09:11:34 - Sort rows.0 - ERROR (version 3.0.0-RC2, build 299036 from 2007/10/31 23:59:46) : at org.pentaho.di.core.row.ValueMeta.getString(ValueMeta.java:1258)
    2007/11/03 09:11:34 - Sort rows.0 - ERROR (version 3.0.0-RC2, build 299036 from 2007/10/31 23:59:46) : at org.pentaho.di.core.row.RowMeta.getString(RowMeta.java:169)
    2007/11/03 09:11:34 - Sort rows.0 - ERROR (version 3.0.0-RC2, build 299036 from 2007/10/31 23:59:46) : at org.pentaho.di.core.row.RowMeta.getString(RowMeta.java:529)
    2007/11/03 09:11:34 - Sort rows.0 - ERROR (version 3.0.0-RC2, build 299036 from 2007/10/31 23:59:46) : at org.pentaho.di.trans.steps.sort.SortRows.processRow(SortRows.java:371)
    2007/11/03 09:11:34 - Sort rows.0 - ERROR (version 3.0.0-RC2, build 299036 from 2007/10/31 23:59:46) : at org.pentaho.di.trans.steps.sort.SortRows.run(SortRows.java:484)
    2007/11/03 09:11:34 - Sort rows.0 - Finished processing (I=0, O=0, R=4, W=0, U=0, E=1)
    2007/11/03 09:11:34 - Spoon - The transformation has finished!!

  5. #5
    Join Date
    Nov 1999
    Posts
    9,729

    Default

    It works fine on OSX (Tiger) too ;-)

    However, I did implement an optimisation that was offered by a certain anonymous large corporation to improve the performance.

    Perhaps that made the difference. (although I certainly don't think so)

    The platform doesn't make a difference either.

    Matt

  6. #6
    Join Date
    Nov 2007
    Posts
    11

    Default

    Hmm....
    Should I try a build with the source from trunk?

  7. #7
    Join Date
    Nov 1999
    Posts
    9,729

    Default

    That's always an option, why not.

  8. #8
    Join Date
    Nov 2007
    Posts
    11

    Default

    Still no luck.
    What version of the jdk are you running under?

  9. #9
    Join Date
    May 2006
    Posts
    4,882

    Default

    very strange... I run JDK 1.6.0_01 for the moment.. you do need a SUN Jdk of course

    Regards,
    Sven

  10. #10
    Join Date
    Nov 2007
    Posts
    11

    Default

    I'm on update 3.

    I did just notice that I can sort data from a database
    just fine.

    I also noticed the transformation seems to work when I uncheck the lazy conversion checkbox on the CSV file input.

    That's an acceptable work around for me!

    Interesting .

  11. #11
    Join Date
    Nov 1999
    Posts
    9,729

    Default

    It's not acceptable to me. If there is a problem we should try and fix it.
    If you tell us there is a problem we typically listen to you.
    So file a bug report for good measure. I'm sure the problem will surface eventually.

    Matt

  12. #12

    Default

    Hi,

    There is a bug report for that :

    http://jira.pentaho.org/browse/PDI-409

    Rgds

    Samatar

  13. #13
    Join Date
    May 2006
    Posts
    4,882

    Default

    JIRA 409 is on v2.5.2, not on this problem in 3.0

    Regards,
    Sven

  14. #14
    Join Date
    Nov 2007
    Posts
    11

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  
Privacy Policy | Legal Notices | Safe Harbor Privacy Policy

Copyright © 2005 - 2019 Hitachi Vantara Corporation. All Rights Reserved.