html xml extractor

Discussion in 'XML' started by Marco, Jul 7, 2003.

  1. Marco

    Marco Guest

    Hi,

    I am searching for a tool that extract information from a HTLM page
    and format it in xml format

    For instance for this page:
    http://money.guardian.co.uk/pensions/story/0,6453,993138,00.html

    get an xml file
    with a <title> with the title of the article
    with a <text> with the text of the article
    with a <auuthor> with the text of the article

    Do you know such a tool?

    Marco
     
    Marco, Jul 7, 2003
    #1
    1. Advertising

  2. Marco

    FC Guest

    "Marco" <> wrote in message
    news:...
    > Hi,
    >
    > I am searching for a tool that extract information from a HTLM page
    > and format it in xml format
    >
    > For instance for this page:
    > http://money.guardian.co.uk/pensions/story/0,6453,993138,00.html
    >
    > get an xml file
    > with a <title> with the title of the article
    > with a <text> with the text of the article
    > with a <auuthor> with the text of the article
    >
    > Do you know such a tool?
    >
    > Marco



    There is a tool called HTML tidy, if I am not wrong, it converts from HTML
    into XHTML.
    Everything else is up to you.
    Search using HTML tidy.

    Bye,
    Flavio
     
    FC, Jul 12, 2003
    #2
    1. Advertising

Want to reply to this thread or ask your own question?

It takes just 2 minutes to sign up (and it's free!). Just click the sign up button to choose a username and then you can ask your own questions on the forum.
Similar Threads
  1. =?Utf-8?B?VmlqYXk=?=

    Web Extractor

    =?Utf-8?B?VmlqYXk=?=, Apr 8, 2004, in forum: ASP .Net
    Replies:
    0
    Views:
    368
    =?Utf-8?B?VmlqYXk=?=
    Apr 8, 2004
  2. Luigi Donatello Asero

    Semantics extractor

    Luigi Donatello Asero, Feb 14, 2004, in forum: HTML
    Replies:
    5
    Views:
    563
    Eric Bohlman
    Feb 14, 2004
  3. ma740988
    Replies:
    6
    Views:
    420
    ma740988
    Aug 26, 2004
  4. LioBandio

    Extractor needed

    LioBandio, Jul 9, 2006, in forum: Java
    Replies:
    0
    Views:
    404
    LioBandio
    Jul 9, 2006
  5. Desireco
    Replies:
    13
    Views:
    209
    William James
    Mar 10, 2006
Loading...

Share This Page