Hitachi Vantara Pentaho Community Forums
Results 1 to 3 of 3

Thread: Normalizing data in a table

  1. #1
    Join Date
    Oct 2015
    Posts
    10

    Default Normalizing data in a table

    Good day,

    I am quite new to Pentaho.
    i have a data in form of .csv which each column has different value range.For example :

    A B C
    1000 10 0.5
    765 5 0.35
    800 7 0.09

    My desired output is as below :

    A B C
    1 1 1 (which is 0.5/0.5)
    0.765 0.5 0.7
    0.8 0.7 0.18 (which is 0.09/0.5)

    I will be thankful if there is any guide to direct me will be helpful.
    Any scripting in Java or Phyton would be helpful too since I am learning both.
    i am still learning my programming too.

    best regards,
    john

    PDI 5.4.0.1-130
    Last edited by exoticsty; 11-18-2015 at 08:13 PM.

  2. #2
    Join Date
    Apr 2008
    Posts
    1,771

    Default

    Have a look at the Calculator step:
    http://wiki.pentaho.com/display/EAI/Calculator
    -- Mick --

  3. #3
    Join Date
    Jun 2012
    Posts
    5,534

    Default

    Calculator can help with the division, but doesn't do any good for mapping field values to the [0,1] range.
    Depending on the actual data row normalization and denormalization might be a clever approach (not shown in demo).
    Attached Files Attached Files
    So long, and thanks for all the fish.

Tags for this Thread

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  
Privacy Policy | Legal Notices | Safe Harbor Privacy Policy

Copyright © 2005 - 2019 Hitachi Vantara Corporation. All Rights Reserved.