HTML parser

Gustavo G. Rondina

Hi all

I am using libcurl to grab an HTML file from a remote HTTP site. How
can I parse this file in order to produce a "formatted" output? Is
there a library around that does this?

Thanks
 
Cedric LEMAIRE

> I am using libcurl to grab an HTML file from a remote HTTP site. How
> can I parse this file in order to produce a "formatted" output? Is
> there a library around that does this?

You can use CodeWorker, a universal parsing tool and a versatile
source code generator, freeware available at
"http://www.codeworker.org".

You describe how to parse the HTML page in an extended-BNF script,
which extracts only the data you are interested in. You then write a
template-based script that generates the output (plain text in this
case) from the extracted data and saves it to a file.

It is highly declarative and well suited to extracting data from
HTML pages.

If you don't want to call the CodeWorker interpreter as an external
tool, it is also available as a C++ library. I don't know how easy it
is to link a C++ library into a C program, though.
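
If pulling in an external tool is more than you need, a rough plain-C
fallback is to strip the markup yourself. The sketch below is only an
illustration: strip_tags is a hypothetical helper name, not part of
libcurl or CodeWorker, and it assumes the whole page is already in a
NUL-terminated buffer. It drops everything between '<' and '>' and
collapses whitespace; it does not decode entities such as &amp;amp; and
it does not skip script, style, or comment contents.

```c
#include <assert.h>
#include <ctype.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper: copy the text content of `html` into `out`,
 * dropping anything between '<' and '>' and collapsing runs of
 * whitespace into one space. A tag is treated as a word break.
 * Writes at most out_size - 1 characters plus the terminating '\0'
 * and returns the number of characters written. */
size_t strip_tags(const char *html, char *out, size_t out_size)
{
    assert(html != NULL && out != NULL && out_size > 0);

    size_t n = 0;          /* characters written so far */
    int in_tag = 0;        /* currently between '<' and '>' */
    int pending_space = 0; /* emit one space before the next character */

    for (const char *p = html; *p != '\0'; ++p) {
        unsigned char c = (unsigned char)*p;

        if (in_tag) {
            if (c == '>')
                in_tag = 0;
            continue;
        }
        if (c == '<') {
            in_tag = 1;
            pending_space = (n > 0); /* no leading space in the output */
            continue;
        }
        if (isspace(c)) {
            pending_space = (n > 0);
            continue;
        }
        if (pending_space) {
            if (n + 1 >= out_size)
                break;               /* keep room for the '\0' */
            out[n++] = ' ';
            pending_space = 0;
        }
        if (n + 1 >= out_size)
            break;
        out[n++] = (char)c;
    }
    out[n] = '\0';
    return n;
}
```

Called on the buffer filled by your libcurl write callback, this turns
"<p>Hello <b>world</b></p>" into "Hello world". Because every tag counts
as a word break, inline markup inside a word gets split, so treat it as
a starting point rather than a real HTML parser.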
 
