K
kjhjhjhjadsasda
Im trying to write a perl script that in a meaningful way extracts text
content from a webpage. Ive tried through modules and reg expr but
havent found a good way yet.
To avoid "crappy" text slipping through, is there a way of extracting
only sentences? ex:
-clean the html from tags
-extract sentences through identifying number of words between
punctuations or something similar.
Any other ideas on how to nicely pick out content text from a webpage?
Thanks
M
content from a webpage. Ive tried through modules and reg expr but
havent found a good way yet.
To avoid "crappy" text slipping through, is there a way of extracting
only sentences? ex:
-clean the html from tags
-extract sentences through identifying number of words between
punctuations or something similar.
Any other ideas on how to nicely pick out content text from a webpage?
Thanks
M