to_plain_text() in hpricot doesn't do what you'd think it'd do...

  • Thread starter Just Another Victim of the Ambient Morality

Just Another Victim of the Ambient Morality

You'd think that the to_plain_text() method would convert the HTML to
whatever text a browser would render. That means collapsing all the
extraneous whitespace, turning <br> and <p> tags into the appropriate
line breaks, and so on. Not only does it not do that, but links exhibit
exceptionally weird behaviour: the URL of each link gets appended in square
brackets. What's up with that?
Can anyone explain the motivation behind the to_plain_text() method in
hpricot? Does anyone agree with my opinion on what it should do?
Thank you...
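
To make the expectation concrete, here is a rough sketch of the kind of conversion described above: collapse whitespace, turn <br> and <p> into line breaks, and drop link markup without appending the URL. The browser_like_text helper is a made-up name for illustration and is not part of hpricot. It uses only the Ruby standard library and a few regular expressions, so it only copes with simple fragments and is no substitute for a real HTML renderer.

```ruby
require "cgi"

# Hypothetical helper, NOT an hpricot method: approximates the text a
# browser would show for a small, simple HTML fragment.
def browser_like_text(html)
  text = html.gsub(/\s+/, " ")               # collapse runs of source whitespace
  text = text.gsub(/<br\s*\/?>/i, "\n")      # <br> becomes a line break
  text = text.gsub(/<\/p\s*>/i, "\n\n")      # end of a paragraph becomes a blank line
  text = text.gsub(/<[^>]+>/, "")            # drop remaining tags, including <a href=...>
  text = CGI.unescapeHTML(text)              # decode entities such as &amp;
  text = text.gsub(/[ \t]*\n[ \t]*/, "\n")   # trim spaces around line breaks
  text = text.gsub(/\n{3,}/, "\n\n")         # never more than one blank line in a row
  text.strip
end

html = "<p>Hello,\n   world!<br>See <a href=\"http://example.com\">this link</a>.</p>\n" \
       "<p>Tom &amp; Jerry</p>"
puts browser_like_text(html)
```

With this input the link text survives but its URL does not, and the two paragraphs end up separated by a single blank line.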