[ANN] nokogiri 1.0.5 Released

Discussion in 'Ruby' started by Aaron Patterson, Nov 13, 2008.

  1. nokogiri version 1.0.5 has been released!

    * <http://nokogiri.rubyforge.org/>
    * <http://github.com/tenderlove/nokogiri/wikis>
    * <http://github.com/tenderlove/nokogiri/tree/master>
    * <http://rubyforge.org/mailman/listinfo/nokogiri-talk>
    * <http://nokogiri.lighthouseapp.com/projects/19607-nokogiri/overview>

    Nokogiri (鋸) is an HTML, XML, SAX, and Reader parser with CSS3 and
    XPath search support.

    Changes:

    ### 1.0.5

    * Bugfixes

    * Added mailing list and ticket tracking information to the README.txt
    * Sets ENV['PATH'] on windows if it doesn't exist
    * Caching results of NodeSet#[] on Document

    == FEATURES:

    * XPath support for document searching
    * CSS3 selector support for document searching
    * XML/HTML builder
    * Drop in replacement for Hpricot (though not bug for bug)

    Nokogiri parses and searches XML/HTML very quickly, and also has
    correctly implemented CSS3 selector support as well as XPath support.

    Here is a speed test:

    * http://gist.github.com/22176

    Nokogiri also features an Hpricot compatibility layer to help ease the change
    to using correct CSS and XPath.

    == SUPPORT:

    The Nokogiri mailing list is available here:

    * http://rubyforge.org/mailman/listinfo/nokogiri-talk

    The bug tracker is available here:

    * http://nokogiri.lighthouseapp.com/projects/19607-nokogiri/overview

    == SYNOPSIS:

    require 'nokogiri'
    require 'open-uri'

    doc = Nokogiri::HTML(open('http://www.google.com/search?q=tenderlove'))

    ####
    # Search for nodes by css
    doc.css('h3.r a.l').each do |link|
    puts link.content
    end

    ####
    # Search for nodes by xpath
    doc.xpath('//h3/a[@class="l"]').each do |link|
    puts link.content
    end

    ####
    # Or mix and match.
    doc.search('h3.r a.l', '//h3/a[@class="l"]').each do |link|
    puts link.content
    end

    * <http://nokogiri.rubyforge.org/>
    * <http://github.com/tenderlove/nokogiri/wikis>
    * <http://github.com/tenderlove/nokogiri/tree/master>
    * <http://rubyforge.org/mailman/listinfo/nokogiri-talk>
    * <http://nokogiri.lighthouseapp.com/projects/19607-nokogiri/overview>
     
    Aaron Patterson, Nov 13, 2008
    #1
    1. Advertisements

  2. Aaron Patterson

    Dingding Ye Guest

    [Note: parts of this message were removed to make it a legal post.]

    hi.

    Is nested search supported?

    For example,

    xml =
    "<root><body><subbody>subbody1</subbody></body><body><subbody>subbody2</subbody></body></root>"
    doc.xpath("//body").each do |body|
    puts body.at("subbody").content
    end

    I expected to see

    subbody1
    subbody2

    but seems it always use the root element.

     
    Dingding Ye, Nov 13, 2008
    #2
    1. Advertisements

  3. Yes. Try using an xpath search as such:

    xml =
    "<root><body><subbody>subbody1</subbody></body><body><subbody>subbody2</subbody></body></root>"

    Nokogiri::XML(xml).xpath("//body").each do |body|
    puts body.at(".//subbody").content
    end
     
    Aaron Patterson, Nov 13, 2008
    #3
  4. Aaron Patterson

    Mark Thomas Guest

    Let me try to clear up the confusion.

    at() is from Hpricot, and it searches from the root node. It is
    typically used as doc.at('element')

    search() is also from Hpricot, and it searches from the current node.
    This is perhaps the behavior you were expecting. An alias is '/', as
    in doc/"html/body"

    xpath() is like find() from libxml and you can nest those calls too.
    Note that the Hpricot xpath(), which returns the path to the node, is
    available in Nokogiri as path().

    -- Mark.
     
    Mark Thomas, Nov 13, 2008
    #4
  5. Great work!
     
    Marcin Raczkowski, Nov 13, 2008
    #5
    1. Advertisements

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments (here). After that, you can post your question and our members will help you out.