HTTP headers and source

H

Haze Noc

Ok guys, lets say i wanted to grab the source for google.com or
something... it wont allow if unless i send the correct headers to spoof
the program.. Can anyone give me a working example of how to send
headers and download a webpage source?

I tried looking through all of the docs and coming up with something but
i failed...

Thanks for any reply's
 
R

rking

Here's my suggestion:

Firefox + LiveHTTPHeaders - http://livehttpheaders.mozdev.org/installation.html

LHH shows all HTTP chatter, so there's nothing that a server can see
that you can't. From there it's just a matter of imitating the headers
with Net::HTTP.

Remember, though, that you have some vague sort of obligation to
maintain netiquette. If a server rejects automated requests, they may
have a good reason to, and you're going against their wishes to mimic
a real browser. I doubt the Feds are going to come kicking your door
in over it, but it's still worth trying to be respectful.

Google, for example, has an API that they encourage for automated
usage. Here are some details: http://code.google.com/apis/soapsearch/api_terms.html

-rking
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

Forum statistics

Threads
474,431
Messages
2,571,679
Members
48,796
Latest member
Greg L.

Latest Threads

Top