Hitachi Vantara Pentaho Community Forums

Search Forums:

Type: Posts; User: gutlez; Keyword(s):

Page 1 of 96 1 2 3 4

Search Forums: Search took 0.07 seconds; generated 40 minute(s) ago.

  1. Always look at the age of a thread before you...

    Always look at the age of a thread before you post. This thread is from 7+ years ago. Even if the poster found a solution, they are unlikely to remember what it was now.

    Searching on google...
  2. You can simply feed the multiple incoming streams...

    You can simply feed the multiple incoming streams into one step. Similar to in SQL though, you must have identical field layout.

    Select col1 as col1, cast(date1 as date) as col2 from table1...
  3. Replies
    3
    Views
    204

    In recent versions, Spoon and Kitchen are...

    In recent versions, Spoon and Kitchen are actually the same program.
    Unless you've modified the batch / shell files that start one or the other, they start the same way.

    From current 8.2's...
  4. Replies
    3
    Views
    204

    Best guess is that you are not running these...

    Best guess is that you are not running these (both Kitchen and Spoon) on the same machine.

    The machine that you are running the Kitchen call from does not seem to have the Big Data plugin...
  5. Can you connect to TCP1433 on the DB server from...

    Can you connect to TCP1433 on the DB server from your PDI box?
    Do you have a Proxy Server in the environment?
    Are you certain that the password is correct?

    What basic troubleshooting have you...
  6. Replies
    4
    Views
    249

    Since PDI implements log4J, you shouldbe able to...

    Since PDI implements log4J, you shouldbe able to simply configure it into the configuration files.

    This SO page might help:...
  7. The original poster said they were using File...

    The original poster said they were using File Repository. It is listed as an option that users can use.

    Using a bunch of files in a directory is NOT the same as using File Repository.
    My point...
  8. I believe that this comes from using a file...

    I believe that this comes from using a file repository.
    The file repository is a bit of a pain... You have to use it consistently or not at all.

    When you run the job from spoon, you are using...
  9. Replies
    2
    Views
    264

    Based on...

    Based on https://github.com/pentaho/pentaho-kettle/blob/master/core/src/main/java/org/pentaho/di/core/database/Database.java#L1037
    It looks like PDI only supports positional parameters, not named...
  10. Replies
    4
    Views
    7,197

    Bumping threads from years ago is unlikely to...

    Bumping threads from years ago is unlikely to work out well. Authors come and go, and may not be around to answer questions any more.

    The web page referenced is still running, but following the...
  11. Replies
    2
    Views
    398

    REST API Chunking is a common problem, and one...

    REST API Chunking is a common problem, and one that I haven't seen solved cleanly here yet.
    Part of the problem is that you're not really releasing the row sets until you hit the end of the REST...
  12. Also a security note: FTP is not secure. You...

    Also a security note: FTP is not secure. You can compromise the entire machine by leaving an FTP port open on the internet.

    This forum is not really about system architecture, but about PDI and...
  13. Copying files from one Remote Windows machine to...

    Copying files from one Remote Windows machine to another Remote Windows machine (relative to PDI) is not really a good use of PDI.

    Note: You would need to have an FTP or SFTP service installed an...
  14. Replies
    4
    Views
    391

    If you absolutely have to use this RegEx (Bad...

    If you absolutely have to use this RegEx (Bad Idea!)
    Try:
    .*?([a-zA-Z0-9._-\+]+@[a-zA-Z0-9._-]+\.[a-zA-Z0-9_-]+).*?

    Since you know your line contains "EMail:" you can redirect your row using a...
  15. Replies
    4
    Views
    391

    It's becuase PDI adds an implicit ^ and $ to...

    It's becuase PDI adds an implicit ^ and $ to either end of the RegEx
    so your RegEx that is actually being run is ^([a-zA-Z0-9._-]+@[a-zA-Z0-9._-]+\.[a-zA-Z0-9_-]+)$

    Try that in your Online...
  16. Replies
    6
    Views
    674

    Not that it's a lot of help, but......

    Not that it's a lot of help, but...
    https://github.com/pentaho/pentaho-kettle/blob/master/engine/src/main/java/org/pentaho/di/trans/step/BaseStep.java#L2569
  17. Replies
    3
    Views
    562

    Much like the PRD can help you convert a DB Table...

    Much like the PRD can help you convert a DB Table into a formatted PDF, Mondrian can help you convert a structured DB to Statistical "Slice&Dice"

    Basically:
    OLTP(DB) -> PDI -> OLAP(DB) ->...
  18. Replies
    1
    Views
    323

    Use Metadata Injection and Row Denormalizer. ...

    Use Metadata Injection and Row Denormalizer.

    Row Denormalizer needs to know how many columns to make. The Metadata Injection can figure that out and send it to the template transformation.
  19. Replies
    1
    Views
    356

    The tools are telling you that one of the steps...

    The tools are telling you that one of the steps gives 4 columns.
    The other step only gives one column.

    This is not allowed.

    From the logs you've posted, we can see that each step is giving...
  20. I feel like you have asked this before. The...

    I feel like you have asked this before.

    The hint is in the expansion of the acronym: PDF
    PDF is Page Description Format
    It doesn't have standard columns and rows, so very few tools will actually...
  21. Because all variables in PDI are strings. ...

    Because all variables in PDI are strings.

    setVariable does an implicit .toString() on anything that is not a string. Parse it into a date when you are declaring your sixty_second_start...
  22. Replies
    1
    Views
    290

    You can use the generic DB connection to be able...

    You can use the generic DB connection to be able to specify a JDBC driver, but you will lose some PDI functionality since it will be falling back to lowest-common-denominator SQL.
    PDI will not know...
  23. Replies
    4
    Views
    478

    The upstream steps will always send the data. ...

    The upstream steps will always send the data.

    What if you refactor a little bit?



    public boolean processRow(StepMetaInterface smi, StepDataInterface sdi) throws KettleException
    {
    if...
  24. It's hard to tell what the product direction is....

    It's hard to tell what the product direction is.

    The changes that you linked to were made as part of a fix for https://jira.pentaho.com/browse/BISERVER-12433
    But then it seems like there was...
  25. Thread: Transpose data

    by gutlez
    Replies
    4
    Views
    1,084

    Because your expected output is re-denormalized....

    Because your expected output is re-denormalized.

    So.
    Remove the Art from the Normalize step.
    That gives you output like:



    typefield
    Art
  26. Probably has to do with CompanyRole and...

    Probably has to do with CompanyRole and CompanyType being adjacent branches, rather than being Parent/Child.

    You will likely need to chain a few Get Data from XML steps together to get a standard...
  27. Replies
    4
    Views
    321

    You have to convert your JSON to standard rows....

    You have to convert your JSON to standard rows. (JSON Input Step) - You may need to link a few of these together to get standardized rows.
    Then you can filter out the rows that don't belong (Filter...
  28. Replies
    1
    Views
    264

    The problem is in the code... ...

    The problem is in the code...

    https://github.com/pentaho/pentaho-kettle/blob/master/core/src/main/java/org/pentaho/di/core/database/VerticaDatabaseMeta.java#L262

    and...
  29. Replies
    4
    Views
    321

    Convert your JSON to standard PDI rows Then...

    Convert your JSON to standard PDI rows
    Then filter your rows by Type=Ext to allow external rows to continue through
    Write your output as JSON
  30. Replies
    2
    Views
    417

    If you consider this in the streaming nature of...

    If you consider this in the streaming nature of PDI, you'll see why this is not an easy ask.

    1) Read Line from file_import.csv
    2) Perform transformations on line
    3) Write Line to file
    3.1) If...
  31. This is similar to what marabu posted, but...

    This is similar to what marabu posted, but recreated for this thread.

    KTR File:

    <?xml version="1.0" encoding="UTF-8"?><transformation>
    <info>
    <name>Demo</name>
    <description/>
    ...
  32. Nope... Not that one. That is how to join two...

    Nope... Not that one.
    That is how to join two lines into one.
  33. I know that there's an example on this forum...

    I know that there's an example on this forum somewhere by a poster by the name of marabu.

    It uses a trinary operator looking for a line start flag, and sets a column value to 1 or 0 based on that...
  34. Replies
    6
    Views
    491

    Hint: Detect Empty Rows step will send a row of...

    Hint: Detect Empty Rows step will send a row of all nulls if no rows reach it.
    It would probably be a good feature ask ( http://jira.pentaho.com ) to have an optional "Clear Result Rows at...
  35. Replies
    2
    Views
    263

    From what I recall, rebuilding the row is an...

    From what I recall, rebuilding the row is an expensive step.
    So your best bet would be to do all your joins, and then before sending the data to the dashboard, do one Select Values to rebuild the...
  36. Thread: Longest match

    by gutlez
    Replies
    4
    Views
    349

    I feel like tree string splitting this with...

    I feel like tree string splitting this with string structures would be more CPU efficient (and also time efficient), and potentially a way to hard-code it, but I really hate hard-coding.

    EG. you...
  37. Thread: Longest match

    by gutlez
    Replies
    4
    Views
    349

    So... I'm a dork. Two streams: Incoming Row...

    So... I'm a dork.

    Two streams:
    Incoming Row
    LookupStream

    Add a field (Key Length) to the lookup row.

    Do a cartesian Join on Incoming Row.KeyField Starts With LookupStream.KeyValue
    Sort...
  38. Thread: Longest match

    by gutlez
    Replies
    4
    Views
    349

    Perhaps: Break Strings to Character Lines, and...

    Perhaps:
    Break Strings to Character Lines, and do a comparison between them.
    Then do a running total of matching and total characters
    Filter for max value where Ematch = Etotal

    But that would...
  39. Replies
    6
    Views
    402

    Which sense of big data do you mean? Formula can...

    Which sense of big data do you mean?
    Formula can be pretty slow if you're doing string searches in a very long field, but as far as I've experienced (note: I only really go up to the 100K rows...
  40. Replies
    6
    Views
    402

    So... What if some creative type enters W@TCH as...

    So... What if some creative type enters W@TCH as the first 5 characters?
  41. Replies
    6
    Views
    402

    How many lines are you hoping to push through it...

    How many lines are you hoping to push through it in a run?
    How often will you be running the data?

    How often will the characters that you're testing change?
    How often will the number of...
  42. Remember to change your spreadsheet type before...

    Remember to change your spreadsheet type before you try to find the files (don't worry... I make this mistake *ALL* the time)
    If you change it from "Excel 97 - 2003 XLS (JXL)" to "Excel 2007 XLSX...
  43. Replies
    1
    Views
    702

    If you read the other posts on this subject, you...

    If you read the other posts on this subject, you will find that you would need to have X running in order to use Spoon on a Linux machine
    This is not recommended.

    Do your design work on a...
  44. Did you do a web-search first? Your error is...

    Did you do a web-search first?

    Your error is complaining about not understanding the JSON column type.

    JSON is not a column type that PDI natively speaks. Even if you were were to upgrade to...
  45. That looks like it should work, but you haven't...

    That looks like it should work, but you haven't mentioned your version... Someone else noted that there were issues with variables in 8.1
    If you post the actual transformations and job on Drive,...
  46. Yeah, I have the same issue sometimes...

    Yeah, I have the same issue sometimes...
  47. None that I'm aware of. Table Inputs are...

    None that I'm aware of.
    Table Inputs are designed to run independently as soon as possible. You would need to have some sort of flag into the SQL to indicate if it's supposed to run... otherwise,...
  48. Considering that Merge Join is supposed to be...

    Considering that Merge Join is supposed to be able to do a number of different match joins, you would have to hold all rows in memory the whole time...
    You can't release any rows until you have all...
  49. There is a way to set up the "Join Rows" step for...

    There is a way to set up the "Join Rows" step for this... But I can't remember the right way.

    Since one data set has only one row, you want to join all the other rows against it. (This will...
  50. Probably because... Microsoft wrote the XML/A...

    Probably because...
    Microsoft wrote the XML/A standard, and promptly failed to adhere to it.

    It might take some time to get it working, but it should work -- unfortunately, I don't have SSAS...
Results 1 to 50 of 4772
Page 1 of 96 1 2 3 4
Privacy Policy | Legal Notices | Safe Harbor Privacy Policy

Copyright © 2005 - 2019 Hitachi Vantara Corporation. All Rights Reserved.