REXML element reading <br /> error

John Butler · Aug 31, 2007

When reading in the Site element from my XML file using REXML, it seems
to be chopping the rest of the text off after the first <br />.

The value in the XML file is below
<Site>123 street<br />amstown<br />amserland</Site>

element = REXML::XPath.first(doc, '//Site')

puts element.text #shows 123 Street

How can I get the full data? Once I have it I can remove the <br />.
I can't find any information on this.

JB

Keith Fahlgren · Aug 31, 2007

When reading in the Site element from my XML file using REXML, it seems
to be chopping the rest of the text off after the first <br />.

The value in the XML file is below
<Site>123 street<br />amstown<br />amserland</Site>

element = REXML::XPath.first(doc, '//Site')

I'd suggest using a bit more XPath, both text() and an each {} to
iterate through the text nodes (which are distinct):

$ irb -r rexml/document --prompt xmp
a = REXML::Document.new("<Site>123 street<br />amstown<br />amserland</Site>")
# => <UNDEFINED> ... </>
REXML::XPath.first(a, '//Site').text
# => "123 street"
REXML::XPath.first(a, '//Site/text()').to_s
# => "123 street"
REXML::XPath.each(a, '//Site/text()') {|el| puts el}
123 street
amstown
amserland
# => ["123 street", "amstown", "amserland"]

HTH,
Keith
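
The XPath approach above can also be run as a plain script rather than in irb. The following is a minimal sketch, assuming the thread's sample element with its two <br /> children. It uses REXML::XPath.match to collect the text nodes into an array instead of printing them one by one; the variable names are only illustrative.

```ruby
require "rexml/document"

# Sample element from the thread (assumed to contain two <br /> children).
doc = REXML::Document.new("<Site>123 street<br />amstown<br />amserland</Site>")

# Element#text returns only the first text child of the element.
first_only = REXML::XPath.first(doc, "//Site").text

# XPath.match returns every node selected by the expression as an array,
# so each text node between the br elements is picked up.
parts = REXML::XPath.match(doc, "//Site/text()").map(&:to_s)

puts first_only
puts parts.join(", ")
```

Joining the array with a space or a comma then gives a single string without any br markup.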

Nobuyoshi Nakada · Sep 1, 2007

Hi,

At Sat, 1 Sep 2007 05:18:48 +0900,
Keith Fahlgren wrote in [ruby-talk:266990]:

I'd suggest using a bit more XPath, both text() and an each {} to
iterate through the text nodes (which are distinct):

$ irb -r rexml/document --prompt xmp
a = REXML::Document.new("<Site>123 street<br />amstown<br />amserland</Site>")
# => <UNDEFINED> ... </>
REXML::XPath.first(a, '//Site').text
# => "123 street"

It seems that just REXML::XPath.first(a, '//Site').to_s
returns the whole content.
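
Note that to_s serializes the element, so the result carries the Site tags and the br markup rather than text alone. A rough sketch, assuming the same sample element, that serializes only the children and then swaps each br tag for a space; the regular expression is an illustration, not something REXML provides.

```ruby
require "rexml/document"

doc = REXML::Document.new("<Site>123 street<br />amstown<br />amserland</Site>")
site = REXML::XPath.first(doc, "//Site")

# Serialized form of the whole element, tags included.
markup = site.to_s

# Serialize only the child nodes, which leaves out the enclosing Site tags.
inner = site.children.map(&:to_s).join

# Replace each br tag (with or without a space before the slash) by a space.
plain = inner.gsub(%r{<br\s*/?>}, " ")

puts markup
puts plain
```

This only suits simple content like the address here; for anything with nested elements, working on the text nodes directly is likely to be more robust than editing the serialized string.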

not · Sep 1, 2007

When reading in the Site element from my XML file using REXML, it seems
to be chopping the rest of the text off after the first <br />.

Not quite. It gives you the *first* text node.

The value in the XML file is below
<Site>123 street<br />amstown<br />amserland</Site>

element = REXML::XPath.first(doc, '//Site')

puts element.text #shows 123 Street

How can I get the full data? Once I have it I can remove the <br />.
I can't find any information on this.

You can't find any specific info because there isn't anything specific.
You have an XML element that contains a text node, an empty element named
br, another text node, another empty element named br and another text
node. In the XML world, <br /> is a node like any other.

The REXML::Element#texts method is what you are looking for:

$ irb
irb(main):001:0> require "rexml/document"
=> true

irb(main):002:0> doc=REXML::Document.new("<Site>123 street<br />amstown<br />amserland</Site>")

=> <UNDEFINED> ... </>

irb(main):003:0> doc.root.texts
=> ["123 street", "amstown", "amserland"]

irb(main):004:0> doc.root.texts.join " "
=> "123 street amstown amserland"

Enjoy!
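
The texts approach can be wrapped in a small helper if it is needed in more than one place. This is only a sketch: joined_text is a hypothetical name, not part of REXML, and it assumes that only the direct text children of the element matter. It uses Text#value to get the unescaped string and drops whitespace-only nodes.

```ruby
require "rexml/document"

# Hypothetical helper (not part of REXML): join the direct text children
# of an element, skipping nodes that contain only whitespace.
def joined_text(element, separator = " ")
  element.texts
         .map { |text_node| text_node.value.strip }
         .reject(&:empty?)
         .join(separator)
end

doc = REXML::Document.new("<Site>123 street<br />amstown<br />amserland</Site>")
address = joined_text(doc.root)

puts address
```

Passing a different separator, for example joined_text(doc.root, ", "), gives a comma-separated address instead.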
