US and Worldwide: +1 (866) 660-7555
Results 1 to 9 of 9

Thread: Problem with RSS INPUT

  1. #1
    Join Date
    Jul 2012
    Posts
    5

    Default Problem with RSS INPUT

    Hi !

    I have a problem with this RSS:

    http://www.fotocasa.es/Press/Rss/Rss2_0.aspx

    When I put this in the "RSS input" module, this make a error all the time...

    If i do the same with others RSS directions:

    http://www.pentaho.com/feeds/press/
    http://ep00.epimg.net/rss/elpais/portada.xml

    all ok!

    but with this... i dont know what is the problem

    snif, snif...

    I need help!

    Penter

  2. #2
    Join Date
    Jul 2012
    Posts
    5

    Default

    My version is the 4.3 and the error is the next:

    2012/07/18 04:13:31 - fotocasa.0 - Reading of URL http://hogar.fotocasa.es/hogar/RSS/Rss2_0.aspx failed.
    2012/07/18 04:13:31 - fotocasa.0 - ERROR (version 4.3.0-stable, build 16786 from 2012-04-24 14.11.32 by buildguy) : Unexpected Exception : it.sauronsoftware.feed4j.FeedXMLParseException: org.dom4j.DocumentException: Connection reset Nested exception: Connection reset
    2012/07/18 04:13:31 - fotocasa.0 - ERROR (version 4.3.0-stable, build 16786 from 2012-04-24 14.11.32 by buildguy) : it.sauronsoftware.feed4j.FeedXMLParseException: org.dom4j.DocumentException: Connection reset Nested exception: Connection reset
    2012/07/18 04:13:31 - fotocasa.0 - ERROR (version 4.3.0-stable, build 16786 from 2012-04-24 14.11.32 by buildguy) : at it.sauronsoftware.feed4j.FeedParser.parse(FeedParser.java:53)
    2012/07/18 04:13:31 - fotocasa.0 - ERROR (version 4.3.0-stable, build 16786 from 2012-04-24 14.11.32 by buildguy) : at org.pentaho.di.trans.steps.rssinput.RssInput.readNextUrl(RssInput.java:178)
    2012/07/18 04:13:31 - fotocasa.0 - ERROR (version 4.3.0-stable, build 16786 from 2012-04-24 14.11.32 by buildguy) : at org.pentaho.di.trans.steps.rssinput.RssInput.getOneRow(RssInput.java:210)
    2012/07/18 04:13:31 - fotocasa.0 - ERROR (version 4.3.0-stable, build 16786 from 2012-04-24 14.11.32 by buildguy) : at org.pentaho.di.trans.steps.rssinput.RssInput.processRow(RssInput.java:334)
    2012/07/18 04:13:31 - fotocasa.0 - ERROR (version 4.3.0-stable, build 16786 from 2012-04-24 14.11.32 by buildguy) : at org.pentaho.di.trans.step.RunThread.run(RunThread.java:50)
    2012/07/18 04:13:31 - fotocasa.0 - ERROR (version 4.3.0-stable, build 16786 from 2012-04-24 14.11.32 by buildguy) : at java.lang.Thread.run(Unknown Source)
    2012/07/18 04:13:31 - fotocasa.0 - ERROR (version 4.3.0-stable, build 16786 from 2012-04-24 14.11.32 by buildguy) : Caused by: org.dom4j.DocumentException: Connection reset Nested exception: Connection reset
    2012/07/18 04:13:31 - fotocasa.0 - ERROR (version 4.3.0-stable, build 16786 from 2012-04-24 14.11.32 by buildguy) : at org.dom4j.io.SAXReader.read(SAXReader.java:484)
    2012/07/18 04:13:31 - fotocasa.0 - ERROR (version 4.3.0-stable, build 16786 from 2012-04-24 14.11.32 by buildguy) : at org.dom4j.io.SAXReader.read(SAXReader.java:291)
    2012/07/18 04:13:31 - fotocasa.0 - ERROR (version 4.3.0-stable, build 16786 from 2012-04-24 14.11.32 by buildguy) : at it.sauronsoftware.feed4j.FeedParser.parse(FeedParser.java:37)
    2012/07/18 04:13:31 - fotocasa.0 - ERROR (version 4.3.0-stable, build 16786 from 2012-04-24 14.11.32 by buildguy) : ... 5 more
    2012/07/18 04:13:31 - fotocasa.0 - ERROR (version 4.3.0-stable, build 16786 from 2012-04-24 14.11.32 by buildguy) : Error desconocido
    2012/07/18 04:13:31 - fotocasa.0 - ERROR (version 4.3.0-stable, build 16786 from 2012-04-24 14.11.32 by buildguy) : org.pentaho.di.core.exception.KettleException:
    2012/07/18 04:13:31 - fotocasa.0 - ERROR (version 4.3.0-stable, build 16786 from 2012-04-24 14.11.32 by buildguy) : it.sauronsoftware.feed4j.FeedXMLParseException: org.dom4j.DocumentException: Connection reset Nested exception: Connection reset
    2012/07/18 04:13:31 - fotocasa.0 - ERROR (version 4.3.0-stable, build 16786 from 2012-04-24 14.11.32 by buildguy) : org.dom4j.DocumentException: Connection reset Nested exception: Connection reset
    2012/07/18 04:13:31 - fotocasa.0 - ERROR (version 4.3.0-stable, build 16786 from 2012-04-24 14.11.32 by buildguy) :
    2012/07/18 04:13:31 - fotocasa.0 - ERROR (version 4.3.0-stable, build 16786 from 2012-04-24 14.11.32 by buildguy) : at org.pentaho.di.trans.steps.rssinput.RssInput.processRow(RssInput.java:389)
    2012/07/18 04:13:31 - fotocasa.0 - ERROR (version 4.3.0-stable, build 16786 from 2012-04-24 14.11.32 by buildguy) : at org.pentaho.di.trans.step.RunThread.run(RunThread.java:50)
    20

  3. #3
    Join Date
    Jun 2012
    Posts
    1,473

    Default

    Well, it's not only RSS Input that complains:

    http://validator.w3.org/appc/check.c...%2FRss2_0.aspx
    pdi-ce-4.3.0-stable
    OpenJDK IcedTea 2.3.7 (7u21)
    ubuntu 12.04 LTS (x86_64)

  4. #4
    Join Date
    Jul 2012
    Posts
    5

    Default

    could be some solution to use this RSS?

  5. #5
    Join Date
    Mar 2008
    Posts
    133

    Default

    Maybe use an XML step (not sure if they support URLs, I think they do) or fetch with an HTTP step and process it. You can always create a transformation to fix the format if you really want to use the RSS Input step.

  6. #6
    Join Date
    Jun 2012
    Posts
    1,473

    Default

    That spanish server is very picky about who is asking.
    Could be some headers are missing to succeed.

    PS: User-Agent is required!

    PPS: The next problem is a BOM sent by the server.

    PPS: Beauty is in the eye of the beholder...
    Attached Files Attached Files
    Last edited by marabu; 07-18-2012 at 12:37 PM.
    pdi-ce-4.3.0-stable
    OpenJDK IcedTea 2.3.7 (7u21)
    ubuntu 12.04 LTS (x86_64)

  7. #7
    Join Date
    Jul 2012
    Posts
    5

    Default

    I am trying to pass information from http://noticias.fotocasa.es/news/RSS/Rss2_0.aspx


    to a excel file, I can not fix by passing to the RSS or XML using Http module...

    this is crazy.

  8. #8
    Join Date
    Jun 2012
    Posts
    1,473

    Default

    Just replace the Text Output step by Excel Writer, add some fields and you are done.
    pdi-ce-4.3.0-stable
    OpenJDK IcedTea 2.3.7 (7u21)
    ubuntu 12.04 LTS (x86_64)

  9. #9
    Join Date
    Jul 2012
    Posts
    5

    Default

    Thank you!! you are a great man!!

Tags for this Thread

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •