G
googler
I want to get the content of specific web pages and do some processing
on them. I found that the LWP class can help with the first part. I
have never used LWP before and found some simple code like the one
below that returns a web page content.
my $url = 'http://www.yahoo.com';
use LWP::Simple;
my $content = get $url;
I am interested in only the text part of the web page (that is,
without any tags, cross links etc). Is there an easy way to get this
(without having to search through the entire content and filtering out
the part that I don't need)?
on them. I found that the LWP class can help with the first part. I
have never used LWP before and found some simple code like the one
below that returns a web page content.
my $url = 'http://www.yahoo.com';
use LWP::Simple;
my $content = get $url;
I am interested in only the text part of the web page (that is,
without any tags, cross links etc). Is there an easy way to get this
(without having to search through the entire content and filtering out
the part that I don't need)?