selecting text in scrubyt

Vinay Gowda

Hi,

I am using scRUBYt, and below is my code to scrape Google.

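require 'rubygems'
require 'scrubyt'

Scrubyt.logger = Scrubyt::Logger.new
google_data = Scrubyt::Extractor.define do
  # Perform the action(s): open Google and search for 'ruby'
  fetch 'http://www.google.com/'
  fill_textfield 'q', 'ruby'
  submit

  # Construct the wrapper: one 'link' record per search result
  link "//div[3]/div/ol/li" do
    head "/h3[@class='r']"   # result title
    des "/div[@class='s']"   # result description (currently includes child-node text)
  end

  # Follow the "Next" link, at most 2 times
  next_page "Next", :limit => 2
end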


This outputs something like the following:

# Ruby Programming Language
# A dynamic, interpreted, open source programming language with a focus
on simplicity and productivity. Site includes news, downloads,
documentation, ...www.ruby-lang.org/ - 12k - Cached - Similar
pagesDownloadsDocumentationin Twenty MinutesWhat's RubyDownload
RubyLibrariesAbout RubySecurityMore results from ruby-lang.org »
# Ruby (programming language) - Wikipedia, the free encyclopedia
# Ruby is a dynamic, reflective, general purpose object-oriented
programming language that combines syntax inspired by Perl with
Smalltalk-like features.
...en.wikipedia.org/wiki/Ruby_(programming_language) - 118k - Cached -
Similar pages

Since <div class='s'> contains both text and child nodes, I am getting all
the text of <div class='s'> as well as the text of its child nodes.

How can I filter this so that I get only the div's own text and not the
child nodes' text? Can anybody help with this? What procedure should I follow?
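To make the problem concrete, here is a minimal sketch outside scRUBYt, using REXML on a simplified, made-up fragment modelled on one search result (the markup, the `span`/`a` children and the variable names are assumptions, not Google's real HTML). It contrasts "all descendant text" with the XPath `text()` node test, which matches only the text nodes that are direct children of the div. Whether scRUBYt accepts a trailing `/text()` in its patterns is not something this thread confirms.

```ruby
require 'rexml/document'

# Hypothetical, simplified markup modelled on a single search result
html = <<~XML
  <li>
    <h3 class="r">Ruby Programming Language</h3>
    <div class="s">A dynamic, interpreted, open source programming language.<span class="f">www.ruby-lang.org/ - 12k</span><a href="#">Cached</a></div>
  </li>
XML

doc = REXML::Document.new(html)
div = REXML::XPath.first(doc, "//div[@class='s']")

# Every text node below the div, children included
# (this resembles the unwanted output above)
all_text = REXML::XPath.match(div, ".//text()").map(&:value).join

# Only the text nodes that are direct children of the div
own_text = REXML::XPath.match(div, "text()").map(&:value).join.strip

puts all_text
puts own_text
```

If the pattern language cannot express this directly, the same filtering could be applied as a post-processing step on the fetched HTML.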
 
