Problems with scRUBYt

Cs Webgrl · Dec 30, 2008

Hi.

I am currently scraping a page with scRUBYt and am not getting the
results as expected.

Instead of the correctly formatted xml document I'm getting the
following.

<record>
<1>book a</1>
<1>book b</1>
<1>book c</1>
<2>chapter aa</2>
<2>chapter bb</2>
<2>chapter cc</2>
<3>verse aaa</3>
<3>verse bbb</3>
<3>verse ccc</3>
</record>

My code looks like this:

listing "//a[@id*='volume'>" do
book "//a[@class='1']"
chapter "//span[@class='2']"
verse "//a[@id*='3']"
end

Any ideas?

Sorry for the sample data, but hopefully someone has seen this before
and can help.

Aaron Patterson · Dec 31, 2008

Hi.

I am currently scraping a page with scRUBYt and am not getting the
results as expected.

Instead of the correctly formatted xml document I'm getting the
following.

<record>
<1>book a</1>
<1>book b</1>
<1>book c</1>
<2>chapter aa</2>
<2>chapter bb</2>
<2>chapter cc</2>
<3>verse aaa</3>
<3>verse bbb</3>
<3>verse ccc</3>
</record>

This is a correctly formatted XML document. You just have numbers for
tag names.

My code looks like this:

listing "//a[@id*='volume'>" do
book "//a[@class='1']"
chapter "//span[@class='2']"
verse "//a[@id*='3']"
end

Any ideas?

Have you tried something like this:

book "//2[@id='whatevs']"

That should get you access to the tags.

Hope that helps!

Cs Webgrl · Dec 31, 2008

Aaron said:
Have you tried something like this:

book "//2[@id='whatevs']"

That should get you access to the tags.

This gives me a ton of data, but now I have lost the specific pieces of
information that I'm looking for. Instead it looks like the output of
all of the sourced code on that page. Was I to change something else in
the code to get the specific piece of data that I need?

scrubyt scraper help	0	Oct 1, 2010
Data extraction using Scrubyt	3	Dec 5, 2008
selecting text in scrubyt	0	Oct 31, 2008
Problem with xpath in scrubyt.	2	Jul 15, 2009
scRUBYt! 0.3.1 released	0	May 29, 2007
Sort by number of characters	1	Nov 2, 2023
Problems with using event handlers for button and textarea input	1	Nov 29, 2021
Working on mobile css menu with plenty of frustration!	2	Dec 29, 2022

Problems with scRUBYt

Cs Webgrl

Aaron Patterson

Cs Webgrl

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads