Rubyful Soup v0.8

L

Leonard Richardson

I've created a Ruby port of Beautiful Soup, my Python module for HTML
screen-scraping. The goal is to make it trivial to get the data you
need out of complex and/or poorly-formed *ML.

A beta release is available at:
http://www.crummy.com/software/RubyfulSoup/

Please let me know what you think of this library.

Leonard
 
J

Josh Charles

How good is this for scraping google search results (web)? I have an
application I'm working on that needs this functionality. I've
currently tried writing a C# library to do this, but it worked
terribly. The google web service API is not going to work because I
need to not be limited by the number of searches I can do in a day
(there could be thousands of users doing searches).
 
J

James Edward Gray II

How good is this for scraping google search results (web)?

I would expect Google to have fairly good pages, but that's a random
guess, not me speaking from experience.

However, be careful to examine the legal issues here. I seriously
doubt this is allowed.

James Edward Gray II
 
T

TLOlczyk

I've created a Ruby port of Beautiful Soup, my Python module for HTML
screen-scraping. The goal is to make it trivial to get the data you
need out of complex and/or poorly-formed *ML.

A beta release is available at:
http://www.crummy.com/software/RubyfulSoup/

Please let me know what you think of this library.

Leonard
Hmm. Being new to these packages, I wouldn't understand
exactly what the differences are, but wouldn't you be better
off building it on top of htmltools?

Or at lest using their SGML parser ( which I understand is a port
or the Python SGML parser ).



The reply-to email address is (e-mail address removed).
This is an address I ignore.
To reply via email, remove 2002 and change yahoo to
interaccess,

**
Thaddeus L. Olczyk, PhD

There is a difference between
*thinking* you know something,
and *knowing* you know something.
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

Forum statistics

Threads
473,769
Messages
2,569,581
Members
45,056
Latest member
GlycogenSupporthealth

Latest Threads

Top