Converting HTML to ASCII


gf gf

Hi. I'm looking for a Python lib to convert HTML to
ASCII. Of course, a quick Google search showed
several options (although, I must say, fewer than I
would expect, considering how easy this is to do in
*other* languages... :| ), but I have 2 requirements
which none of them seem to meet:

1) Be able to handle badly formed, or illegal, HTML
as well as possible. Some of the converters I tried
ended up dying on a weird character (that is, a
high-ASCII char). Others somehow got confused and dumped
the JavaScript as well.

2) Not embellish the text in any way - no asterisks,
no bracket links, no __ for underlines.

Can anyone direct me to something which could help me
with this?

--Thanks a mil.




William Park

man lynx
man links
man w3m
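
(lynx, links and w3m are text-mode browsers; each can dump a rendered page as plain text, so check the man pages for the dump options.)

If you'd rather stay inside Python, here is a rough sketch using only the standard library's html.parser, which is a different approach from the text browsers above. It tries to cover both requirements: it does not raise on unclosed tags, it skips anything inside <script> and <style>, and it adds no asterisks, brackets or underscores. The names PlainTextExtractor and html_to_ascii are made up for this example, and dropping non-ASCII characters outright (rather than transliterating them) is an assumption about what "ASCII" should mean here.

```python
from html.parser import HTMLParser


class PlainTextExtractor(HTMLParser):
    """Collects text nodes, ignoring anything inside <script> or <style>."""

    SKIPPED = {"script", "style"}
    # Tags treated as block-level: they become whitespace so that
    # adjacent paragraphs or list items do not run together.
    BLOCK = {"p", "div", "br", "li", "tr", "h1", "h2", "h3", "h4", "h5", "h6"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []
        self.skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self.skip_depth += 1
        elif tag in self.BLOCK:
            self.parts.append("\n")

    def handle_endtag(self, tag):
        if tag in self.SKIPPED:
            if self.skip_depth > 0:
                self.skip_depth -= 1
        elif tag in self.BLOCK:
            self.parts.append("\n")

    def handle_data(self, data):
        # Text inside <script>/<style> arrives here too, so filter it out.
        if self.skip_depth == 0:
            self.parts.append(data)


def html_to_ascii(html):
    """Return the visible text of `html` as a plain ASCII string."""
    parser = PlainTextExtractor()
    parser.feed(html)
    parser.close()
    # Collapse all runs of whitespace into single spaces.
    text = " ".join("".join(parser.parts).split())
    # Assumption: characters outside ASCII are simply dropped.
    return text.encode("ascii", errors="ignore").decode("ascii")


if __name__ == "__main__":
    sample = "<p>Hello <b>world</b><script>var x = 1;</script></p><p>unclosed <i>tag"
    print(html_to_ascii(sample))
```

This flattens everything to a single line of text; if you need paragraph breaks preserved, the whitespace-collapsing step is the place to change. Very broken markup may still produce odd spacing, so treat it as a starting point rather than a finished converter.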
 
