extracting HTML fragments and counting words

Ksenia Marasanova

Hi,

I want to show previews of several HTML-formatted news items on one
page, keeping the markup (and images) intact, but showing no more
than the first X _readable_ words of each item. Is anyone aware of a
Python library that makes this easy to program? I have already started
writing it with Beautiful Soup, but maybe there is an easier way...

Thanks!
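
For illustration, here is a minimal sketch of one way to do this. It uses the standard library's html.parser rather than Beautiful Soup, so it is not the code mentioned above. The names PreviewBuilder and html_preview are hypothetical, "readable words" is assumed to mean whitespace-separated tokens of text outside tags, and the "..." marker is an arbitrary choice. It copies tags through unchanged, stops after the word limit, and closes whatever elements are still open.

```python
from html import escape
from html.parser import HTMLParser

# Elements that never take a closing tag, so they are not tracked as open.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class PreviewBuilder(HTMLParser):
    """Hypothetical helper: copy markup through, stop after max_words words."""

    def __init__(self, max_words):
        super().__init__(convert_charrefs=True)
        self.max_words = max_words
        self.words_seen = 0
        self.open_tags = []   # stack of elements opened but not yet closed
        self.out = []         # output fragments, joined at the end
        self.done = False     # True once text had to be cut off

    def handle_starttag(self, tag, attrs):
        if self.done:
            return
        self.out.append(self.get_starttag_text())  # original tag text, attributes included
        if tag not in VOID_TAGS:
            self.open_tags.append(tag)

    def handle_startendtag(self, tag, attrs):
        # Self-closing form such as <img ... />
        if not self.done:
            self.out.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        if self.done or tag not in self.open_tags:
            return
        # Close elements down to the matching one (tolerates sloppy nesting).
        while self.open_tags:
            top = self.open_tags.pop()
            self.out.append(f"</{top}>")
            if top == tag:
                break

    def handle_data(self, data):
        if self.done:
            return
        words = data.split()
        remaining = self.max_words - self.words_seen
        if len(words) <= remaining:
            # Whole text run fits: keep it, re-escaping &, < and >.
            self.out.append(escape(data, quote=False))
            self.words_seen += len(words)
            return
        kept = " ".join(words[:remaining])
        if kept and data[0].isspace():
            kept = " " + kept
        self.out.append(escape(kept, quote=False))
        self.words_seen = self.max_words
        self.done = True


def html_preview(html, max_words, ellipsis="..."):
    """Return html cut to at most max_words words of text, tags balanced."""
    builder = PreviewBuilder(max_words)
    builder.feed(html)
    builder.close()
    if builder.done:
        builder.out.append(ellipsis)
    # Close whatever is still open so the fragment stays well-formed.
    for tag in reversed(builder.open_tags):
        builder.out.append(f"</{tag}>")
    return "".join(builder.out)


item = '<p>One <b>two three</b> four five</p><p>six</p>'
print(html_preview(item, 2))
```

Images pass through untouched and do not count toward the limit, because only text between tags is counted. The sketch also counts text inside script or style elements and drops comments, so it is only suitable for simple news-item fragments.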
 
