printing indented html code

L

Lowell Kirsh

Is there a module or library anyone knows of that will print html code
indented? What I'd like would be for a function or class which works
like this:

htmlIndent(sys.stdout, '<html><head>foobar</head>...')

and will print somethinkg like this to stdout:

<html>
<head>
foobar
</head>
...

My current way of doing this kind of stuff is less than ideal and will
write such a function if it doesn't exist.

Thanks,
Lowell
 
T

TechBookReport

Lowell said:
Is there a module or library anyone knows of that will print html code
indented? What I'd like would be for a function or class which works
like this:

htmlIndent(sys.stdout, '<html><head>foobar</head>...')

and will print somethinkg like this to stdout:

<html>
<head>
foobar
</head>
...

My current way of doing this kind of stuff is less than ideal and will
write such a function if it doesn't exist.

Thanks,
Lowell

There are lots of HTML pretty printers around, but for starters take a
look at this: http://www.oreilly.com/catalog/pythonsl/chapter/ch05.html

HTH
==========================================================================
TechBookReport - http://www.techbookreport.com
 
K

Konstantin Veretennicov

Is there a module or library anyone knows of that will print html code
indented?

Depends on whether you have to deal with xhtml, valid html or just
any, possibly invalid, "pseudo-html" that abounds on the net.

Here's an example of (too) simple html processing using standard
HTMLParser module:

from HTMLParser import HTMLParser

class Indenter(HTMLParser):

def __init__(self, out):
HTMLParser.__init__(self)
self.out = out
self.indent_level = 0

def write_line(self, text):
print >> self.out, '\t' * self.indent_level + text

def handle_starttag(self, tag, attrs):
self.write_line(
'<%s %s>' % (tag, ' '.join('%s=%s' % (k, v) for k, v in attrs))
)
self.indent_level += 1

def handle_endtag(self, tag):
self.indent_level -= 1
self.write_line('</%s>' % tag)

handle_data = write_line

# handle everything else...
# http://docs.python.org/lib/module-HTMLParser.html

if __name__ == '__main__':
import sys
i = Indenter(sys.stdout)
i.feed('<html><head>foobar</head><body color=0>body</body></html>')
i.close()

- kv
 
L

Lowell Kirsh

Looks good. I'll give it a try.

Konstantin said:
Depends on whether you have to deal with xhtml, valid html or just
any, possibly invalid, "pseudo-html" that abounds on the net.

Here's an example of (too) simple html processing using standard
HTMLParser module:

from HTMLParser import HTMLParser

class Indenter(HTMLParser):

def __init__(self, out):
HTMLParser.__init__(self)
self.out = out
self.indent_level = 0

def write_line(self, text):
print >> self.out, '\t' * self.indent_level + text

def handle_starttag(self, tag, attrs):
self.write_line(
'<%s %s>' % (tag, ' '.join('%s=%s' % (k, v) for k, v in attrs))
)
self.indent_level += 1

def handle_endtag(self, tag):
self.indent_level -= 1
self.write_line('</%s>' % tag)

handle_data = write_line

# handle everything else...
# http://docs.python.org/lib/module-HTMLParser.html

if __name__ == '__main__':
import sys
i = Indenter(sys.stdout)
i.feed('<html><head>foobar</head><body color=0>body</body></html>')
i.close()

- kv
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

No members online now.

Forum statistics

Threads
473,769
Messages
2,569,579
Members
45,053
Latest member
BrodieSola

Latest Threads

Top