How can one get the Hpricot DOM document from Mechanize?

Just Another Victim of the Ambient Morality · Sep 13, 2008

I was wondering if there were some way of getting the Hpricot DOM (for
lack of a better term) from a Mechanize page. For example:

agent = WWW:Mechanize.new
page = agent.get(http://www.website.com)

# I am currently doing this
doc = Hpricot(page.body)

# I would like to do this
doc = page.get_hpricot_dom

The idea is that since Mechanize apparently uses Hpricot and it's surely
using it to parse the HTML begotten from the agent.get method, it would be
nice if I didn't have to repeat that work.
Is there a way to get this Hpricot document? ...or am I just totally
wrong about how Mechanize uses Hpricot?
Thank you...

Lex Williams · Sep 13, 2008

perhaps it's only me , but would you please detail what is it you want
to accomplish? maybe , with an example perhaps ?

Matthias Reitinger · Sep 13, 2008

Just said:
# I would like to do this
doc = page.get_hpricot_dom

Try page.parser or page.root (they're eqivalent).

Regards,
Matthias

Aaron Patterson · Sep 18, 2008

I was wondering if there were some way of getting the Hpricot DOM (for
lack of a better term) from a Mechanize page. For example:

agent = WWW:Mechanize.new
page = agent.get(http://www.website.com)

# I am currently doing this
doc = Hpricot(page.body)

# I would like to do this
doc = page.get_hpricot_dom

The idea is that since Mechanize apparently uses Hpricot and it's surely
using it to parse the HTML begotten from the agent.get method, it would be
nice if I didn't have to repeat that work.
Is there a way to get this Hpricot document? ...or am I just totally
wrong about how Mechanize uses Hpricot?

You can get at the Hpricot document by using the "parser" accessor on
WWW::Mechanize:

age. Page also responds to "search", "/", and "at",
which just delegate to the Hpricot document.

So you can just do:

(agent.get('http://tenderlovemaking.com')/'tr').each do |tr|
...
end

Mechanize	2	Dec 17, 2007
Mechanize	0	Jun 20, 2009
Using Mechanize and hpricot to get property taxes	6	May 18, 2008
[ANN] Mechanize 2.0.pre.2	0	Apr 18, 2011
Is it possible to get some informations from a document in Google Docs and show it on my website ?	0	Nov 19, 2022
problems with mechanize and inheritance	1	Mar 3, 2010
Mechanize/Nokogiri from file	0	Sep 16, 2009
[ANN] hpricot 0.7	23	Mar 17, 2009

How can one get the Hpricot DOM document from Mechanize?

Just Another Victim of the Ambient Morality

Lex Williams

Matthias Reitinger

Aaron Patterson

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads