wikipedia parser...

Discussion in 'Java' started by boris, Sep 6, 2011.

  1. boris

    boris Guest

    Hi all,
    Can anyone recommend a working Wikipedia parser (to text or HTML)? I've
    tried to get wikitext working, but it looks like it has some problems.
    Thanks.
    boris, Sep 6, 2011
    #1

  2. On 06/09/2011 3:03 PM, boris wrote:
    > hi all,
    > can anyone recommend working! wikipedia parser (to text or html). I've
    > tried to get wikitext working, but it looks like it has some problems..
    > thanks.


    So when you tried "java wikitext parser" in Google, what results did you
    get and which parsers did you try?
    Travers Naran, Sep 7, 2011
    #2

  3. boris

    Roedy Green Guest

    On Tue, 06 Sep 2011 18:03:29 -0400, boris wrote, quoted or indirectly
    quoted someone who said:

    >hi all,
    >can anyone recommend working! wikipedia parser (to text or html). I've
    >tried to get wikitext working, but it looks like it has some problems..
    >thanks.


    If you only need to extract a specific set of things, just
    download the page with http://mindprod.com/products1.html#HTTP

    Then pick out what you want with regexes and indexOf.

    See http://mindprod.com/jgloss/regex.html
    --
    Roedy Green Canadian Mind Products
    http://mindprod.com
    The modern conservative is engaged in one of man's oldest exercises in moral philosophy; that is,
    the search for a superior moral justification for selfishness.
    ~ John Kenneth Galbraith (born: 1908-10-15 died: 2006-04-29 at age: 97)
    Roedy Green, Sep 8, 2011
    #3
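    The regex-and-indexOf approach suggested in post #3 could look roughly like the sketch below. It is only an illustration, not code from the thread: the class name WikiTextPicker, both method names, and the sample markup are hypothetical, and it assumes the raw wiki markup has already been downloaded into a String. It handles plain [[Target]] and [[Target|label]] links and single-line "| name = value" template fields, nothing more.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical helper: picks a few items out of raw wiki markup. */
final class WikiTextPicker {

    // Matches [[Target]] and [[Target|label]]; group 1 is the link target.
    private static final Pattern LINK =
            Pattern.compile("\\[\\[([^\\]|]+)(?:\\|[^\\]]*)?\\]\\]");

    /** Returns the targets of all internal links, in order of appearance. */
    static List<String> linkTargets(String wikitext) {
        List<String> targets = new ArrayList<>();
        Matcher m = LINK.matcher(wikitext);
        while (m.find()) {
            targets.add(m.group(1).trim());
        }
        return targets;
    }

    /**
     * Returns the value of a template field written exactly as
     * "| name = value" on one line, or null if no such field exists.
     * Assumption: this exact spacing; real pages vary.
     */
    static String fieldValue(String wikitext, String name) {
        String marker = "| " + name + " =";
        int start = wikitext.indexOf(marker);
        if (start < 0) {
            return null;
        }
        int valueStart = start + marker.length();
        int end = wikitext.indexOf('\n', valueStart);
        if (end < 0) {
            end = wikitext.length();
        }
        return wikitext.substring(valueStart, end).trim();
    }

    public static void main(String[] args) {
        // Invented sample markup, for illustration only.
        String sample = "{{Infobox language\n| name = Java\n| year = 1995\n}}\n"
                + "'''Java''' is a [[programming language]] from [[Sun Microsystems|Sun]].";
        System.out.println(linkTargets(sample));       // [programming language, Sun Microsystems]
        System.out.println(fieldValue(sample, "year")); // 1995
    }
}
```

    This works when you know in advance which few items you want. It does not cope with nested templates, multi-line field values, or the rest of wiki markup, which is why a real parser is still needed for converting whole pages to text or HTML.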
  4. boris

    Roedy Green Guest

    On Tue, 06 Sep 2011 18:03:29 -0400, boris wrote, quoted or indirectly
    quoted someone who said:

    >hi all,
    >can anyone recommend working! wikipedia parser (to text or html). I've
    >tried to get wikitext working, but it looks like it has some problems..
    >thanks.


    Have a look at http://mindprod.com/applet/americantax.html
    There are screen scrapers for each state that extract sales tax
    information. You can use one as a starting point for what you need.
    Roedy Green, Sep 8, 2011
    #4
  5. boris

    Arne Vajhøj Guest

    On 9/8/2011 12:29 AM, Roedy Green wrote:
    > On Tue, 06 Sep 2011 18:03:29 -0400, boris
    > <> wrote, quoted or indirectly quoted
    > someone who said :
    >> can anyone recommend working! wikipedia parser (to text or html). I've
    >> tried to get wikitext working, but it looks like it has some problems..
    >> thanks.

    >
    > If there is only a set of things you are trying to extract, just
    > download the page with http://mindprod.com/products1.html#HTTP


    He is not saying anything about needing help with HTTP
    requests.

    > Then pick out what you want with regexes and indexOf.
    >
    > See http://mindprod.com/jgloss/regex.html


    It will obviously work, but it is a do-it-yourself approach.

    Arne
    Arne Vajhøj, Sep 9, 2011
    #5
  6. boris

    Arne Vajhøj Guest

    On 9/8/2011 12:31 AM, Roedy Green wrote:
    > On Tue, 06 Sep 2011 18:03:29 -0400, boris
    > <> wrote, quoted or indirectly quoted
    > someone who said :
    >> can anyone recommend working! wikipedia parser (to text or html). I've
    >> tried to get wikitext working, but it looks like it has some problems..
    >> thanks.

    >
    > have a look at http://mindprod.com/applet/americantax.html
    > There are screenscrapers for each state to extract sales tax
    > information. You can use one as a starting point for what you need.


    Do any of them use wiki markup?

    Arne
    Arne Vajhøj, Sep 9, 2011
    #6