An example using htmllib?

Dfenestr8 · Nov 8, 2003

Hi.

I want a routine that strips a line of HTML of all its tags. E.g., I want
it to turn ....

"<p><b>This is an <h1><blink>IRRITATING</blink></h1> line of </b>text</p>"

.... into ......

"This is an IRRITATING line of text"

I've been told I should use htmllib. I've tried reading the htmllib docs
in the Library Reference, but I have to say, it just confuses me.

Does anyone know of a page that shows some simple examples of the sort of
thing I want to do?

Or, is it possible to use the example provided in the docs to achieve
this? Here's the example below ...

    from HTMLParser import HTMLParser

    class MyHTMLParser(HTMLParser):

        def handle_starttag(self, tag, attrs):
            print "Encountered the beginning of a %s tag" % tag

        def handle_endtag(self, tag):
            print "Encountered the end of a %s tag" % tag
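
For reference, the same parser class can probably be adapted to strip tags by overriding handle_data, which receives the text between tags, and ignoring the tag handlers altogether. The sketch below is only one possible approach, not the documented recipe. It assumes Python 3, where the module is html.parser rather than HTMLParser, and the names TagStripper and strip_tags are hypothetical, chosen just for this illustration.

```python
from html.parser import HTMLParser


class TagStripper(HTMLParser):
    """Collects only the text between tags (hypothetical helper name)."""

    def __init__(self):
        super().__init__()
        self.chunks = []  # pieces of text seen so far

    def handle_data(self, data):
        # Called for text outside of tags; the tags themselves are ignored
        # because handle_starttag/handle_endtag are not overridden.
        self.chunks.append(data)

    def get_text(self):
        return "".join(self.chunks)


def strip_tags(html):
    """Return html with all tags removed (illustrative helper)."""
    parser = TagStripper()
    parser.feed(html)
    parser.close()  # flush any text still buffered by the parser
    return parser.get_text()


line = "<p><b>This is an <h1><blink>IRRITATING</blink></h1> line of </b>text</p>"
print(strip_tags(line))  # prints: This is an IRRITATING line of text
```

Joining the collected chunks without a separator keeps the original spacing of the line, which is why the result matches the desired output exactly. Note that this keeps all text, including the contents of script or style elements, so it may need extra handling for full pages.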
