Return HTML between tags with HTML::TokeParser?

Maqo · Feb 23, 2005

Is it possible to use HTML::TokeParser to return the raw HTML between
two <A> tags, as opposed to just the text? My source file contains
several blocks of code--containing anchor links for each--that I'm
trying to extract by section while maintaining formatting.

My code:

my $p = HTML::TokeParser->new("file.txt" || die "Can't open file.");
while (my $t = $p->get_tag("a")) {
my $name = $t->[1]{name};
next unless $name && ($name eq "anchor");
print "$name : " . $p->get_text("a");

Example HTML source:

<A NAME='anchor1'></a>Some text and HTML formatting 
<A NAME='anchor2'></a>Some text and HTML formatting 
....
<A NAME='anchor10'></a>Some text and HTML formatting 

The above code returns the "text and formatting" portions nicely,
albeit only as text. Is there an easy way to do this using
HTML::Parser to return the desired portion, with HTML markup included?
Many thanks.

A. Sinan Unur · Feb 23, 2005

Is it possible to use HTML::TokeParser to return the raw HTML between
two <A> tags, as opposed to just the text? My source file contains
several blocks of code--containing anchor links for each--that I'm
trying to extract by section while maintaining formatting.

My code:

my $p = HTML::TokeParser->new("file.txt" || die "Can't open file.");

Cute but counter-productive. Please post real code.

while (my $t = $p->get_tag("a")) {
my $name = $t->[1]{name};
next unless $name && ($name eq "anchor");
print "$name : " . $p->get_text("a");

Example HTML source:

<A NAME='anchor1'></a>Some text and HTML formatting 

The above code returns the "text and formatting" portions nicely,
albeit only as text.

Once the bugs are fixed, the code above runs successfully and produces
no output at all. That is exactly what I expected to see based on the
sample data you provided. Problem solved.

Have you read the posting guidelines?

Sinan

Michael Wagg · Feb 23, 2005

A. Sinan Unur said:
Cute but counter-productive. Please post real code.

With the exception of the input filename (which was changed from
"digest.html"), this is the exact code being used.

while (my $t = $p->get_tag("a")) {
my $name = $t->[1]{name};
next unless $name && ($name eq "anchor");
print "$name : " . $p->get_text("a");

Example HTML source:

<A NAME='anchor1'></a>Some text and HTML formatting 

Am I missing something here? There is no text between <a> and </a>
above.

The above code returns the text between one open tag and the next open
tag (<A> -> <A>), not between one open tag and the subsequent closing
tag (<A> -> </A>).
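One way to keep the markup is to stop using get_text and instead walk the tokens with get_token, concatenating the original source text that each token carries. The following is only a sketch under some assumptions: the sub name sections_by_anchor and the %RAW_TEXT_AT table are made up for illustration, the document is passed in as a string rather than read from a file, and anchors are assumed to be named "anchor" followed by digits, as in the sample HTML.

```perl
use strict;
use warnings;
use HTML::TokeParser;

# Position of the original source text inside each token type that
# get_token returns (S = start tag, E = end tag, T = text, C = comment,
# D = declaration, PI = processing instruction).
my %RAW_TEXT_AT = (S => 4, E => 2, T => 1, C => 1, D => 1, PI => 2);

# Hypothetical helper: returns a list of [name, raw_html] pairs, one per
# <a name="anchorN"> found in the HTML string.
sub sections_by_anchor {
    my ($html) = @_;
    my $p = HTML::TokeParser->new(\$html) or die "Can't parse document\n";
    my @sections;

    while (my $tag = $p->get_tag('a')) {
        my $name = $tag->[1]{name};
        next unless defined $name && $name =~ /^anchor\d+$/;

        my $raw = '';
        while (my $tok = $p->get_token) {
            # Stop at the next named anchor and push it back so the
            # outer get_tag call sees it again.
            if ($tok->[0] eq 'S' && $tok->[1] eq 'a' && defined $tok->[2]{name}) {
                $p->unget_token($tok);
                last;
            }
            $raw .= $tok->[ $RAW_TEXT_AT{ $tok->[0] } ];
        }

        $raw =~ s{^\s*</a>}{}i;    # drop the anchor's own closing tag
        push @sections, [$name, $raw];
    }
    return @sections;
}

my $sample = "<A NAME='anchor1'></a>Some <b>bold</b> text\n"
           . "<A NAME='anchor2'></a>More <i>text</i>\n";

print "$_->[0] : $_->[1]" for sections_by_anchor($sample);
```

Pushing the next anchor back with unget_token keeps the outer loop unchanged from the original code; the only real difference is that the inner loop collects raw token text instead of calling get_text.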

Sam Holden · Feb 23, 2005

With the exception of the input filename (which was changed from
"digest.html"), this is the exact code being used.

That's a really silly || with a constant true value on the left.

Why would you bother with code that can not be executed? Especially
when all it could possibly serve to do is to trick other people,
and perhaps yourself, into thinking there's error checking when
there isn't.
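To make the point concrete, here is a small sketch comparing the two forms. The sub fake_new is a hypothetical stand-in for a constructor that returns undef when the file cannot be opened, which is how HTML::TokeParser->new behaves; the file name is invented so that it does not exist.

```perl
use strict;
use warnings;

# Hypothetical stand-in for a constructor that returns undef on failure.
sub fake_new {
    my ($file) = @_;
    return -e $file ? +{} : undef;
}

my $missing = "no-such-file-$$.txt";

# Broken form: "||" is applied to the file name, which is a true value,
# so the die on the right-hand side is never evaluated.
my $p1 = eval { fake_new($missing || die "Can't open file.\n") };
my $broken_died = $@ ? 1 : 0;

# Fixed form: the low-precedence "or" tests what the constructor returned.
my $p2 = eval { fake_new($missing) or die "Can't open $missing\n" };
my $fixed_died = $@ ? 1 : 0;

print "broken form died: $broken_died\n";
print "fixed form died:  $fixed_died\n";
```

With a missing file, only the second form reports the failure; the first silently hands back undef, and the script then fails later on the first method call.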

A. Sinan Unur · Feb 23, 2005

With the exception of the input filename (which was changed from
"digest.html"), this is the exact code being used.

my $p = HTML::TokeParser->new("file.txt")
or die "Can't open file.";

while (my $t = $p->get_tag("a")) {
my $name = $t->[1]{name};
next unless $name && ($name eq "anchor");

Now I realize why it doesn't return anything: There are no anchors named
'anchor' in the data you provided.
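A short illustration of the mismatch, assuming the anchors are always named "anchor" followed by a number as in the sample data (the extra name "top" is invented to show a non-match):

```perl
use strict;
use warnings;

my @names = qw(anchor1 anchor2 anchor10 top);

# String equality only matches a name that is exactly "anchor"
my @exact = grep { $_ eq 'anchor' } @names;

# A pattern match accepts "anchor" followed by one or more digits
my @prefixed = grep { /^anchor\d+$/ } @names;

print "eq matches:    ", scalar(@exact), "\n";
print "regex matches: @prefixed\n";
```

Swapping the eq test for a pattern like this in the original loop would let the numbered anchors through.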

Sorry, I don't have time to look at the rest of the stuff right now.
