url filtering

V

vertigo

Hello

I want to do some text analysis based on html documents grabbed from
internet.
Is there any library which could allow me easily getting text from html
documents
(cutting javascript, html tags and other not nececary data) ?

Thanx
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

Forum statistics

Threads
473,755
Messages
2,569,536
Members
45,007
Latest member
obedient dusk

Latest Threads

Top