Data extraction using Scrubyt

Vipin Vm · Dec 5, 2008

Hi All,

I need to fetch some information from http://www.ebay.in.
The fields I need are: the name of the product, the image, the price
and the link to that product.

I am able to get the data using this method:
require 'rubygems'
require 'scrubyt'

google_data = Scrubyt::Extractor.define do
  fetch 'http://www.ebay.in'
  fill_textfield 'satitle', 'ipod shuffle'
  submit

  # One record per result row, located by an absolute XPath from the root
  record "/html/body/div[2]/div[4]/div[2]/div/div/div[2]/div[2]/div/div/div[3]/div/div/table/tr" do
    name "/td[2]/div/a"
    price "/td[5]"
    image "/td/a/img" do
      url "src", :type => :attribute
    end
    link "/td[2]/div/a" do
      url "href", :type => :attribute
    end
  end
end

google_data.to_xml.write($stdout, 1)

But my problem is that for some products it does not work properly
(the div structure may be different). Is there a better solution for this?

Thanks in advance,
Vipin

Peter Szinek · Dec 5, 2008

You need to create smarter XPaths, relying on CSS id/class attributes
or other properties rather than a full XPath from the root - for
example:

require 'rubygems'
require 'scrubyt'

ebay_data = Scrubyt::Extractor.define do
  fetch 'http://www.ebay.in/'
  fill_textfield 'satitle', 'ipod'
  submit

  record "//table[@class='nol']" do
    name "//td[@class='details']/div/a"
  end
end

puts ebay_data.to_xml

etc.

This way your scraper will be more robust and less prone to breaking when the page changes.
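To illustrate why the attribute-based XPath holds up better, here is a minimal sketch that uses Ruby's REXML library instead of scRUBYt, so it can run offline. The markup is hypothetical: only the 'nol' and 'details' class names come from the example above, while the 'price' class, the URLs and the helper name extract are assumptions made for illustration. The same result table is embedded in two pages that differ only in how deeply it is nested.

```ruby
require 'rexml/document'

# Hypothetical result table. Only the 'nol' and 'details' class names come
# from the thread; the 'price' class and all values are invented.
ROW = <<~HTML
  <table class="nol">
    <tr>
      <td><a href="/itm/1"><img src="/img/1.jpg"/></a></td>
      <td class="details"><div><a href="/itm/1">iPod shuffle 1GB</a></div></td>
      <td class="price">Rs. 2,500</td>
    </tr>
  </table>
HTML

# Same table, but PAGE_B wraps it in one extra <div>.
PAGE_A = "<html><body><div>#{ROW}</div></body></html>"
PAGE_B = "<html><body><div><div>#{ROW}</div></div></body></html>"

# Absolute path from the root: matches PAGE_A only.
ABSOLUTE = "/html/body/div/table/tr/td/div/a"

# Attribute-based extraction: find the table by class, then look up each
# field relative to that table.
def extract(html)
  doc = REXML::Document.new(html)
  REXML::XPath.match(doc, "//table[@class='nol']").map do |table|
    link  = REXML::XPath.first(table, ".//td[@class='details']/div/a")
    img   = REXML::XPath.first(table, ".//img")
    price = REXML::XPath.first(table, ".//td[@class='price']")
    {
      name:  link.text,
      link:  link.attributes['href'],
      image: img.attributes['src'],
      price: price.text
    }
  end
end

p extract(PAGE_A)
p extract(PAGE_B)
p REXML::XPath.first(REXML::Document.new(PAGE_B), ABSOLUTE)
```

In this sketch the absolute path finds nothing once the extra div is added, while the class-based lookup returns the same record for both pages. The real eBay markup will differ, so the class names need to be checked against the live page.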

HTH,
Peter
___
http://www.rubyrailways.com
http://scrubyt.org

Vipin Vm · Dec 6, 2008

Hi Peter,

Thanks for the help... it's working fine.

Vipin

Peter Szinek · Dec 6, 2008

Glad that I could help. I am just working on a new release btw, so
stay tuned!

Cheers,
Peter
