How do I get the value out of a DOM Element

kj7ny · Sep 27, 2007

I have been able to get xml.dom.minidom.parse('somefile.xml') and then
dom.getElementsByTagName('LLobjectID') to work to the point where I
get something like: [<DOM Element: LLobjectID at 0x13cba08>] which I
can get down to <DOM Element: LLobjectID at 0x13cba08> but then I
can't find any way to just get the value out from the thing!

.toxml() returns something like:
u'<LLobjectID><![CDATA[1871203]]></LLobjectID>'.

How do I just get the 1871203 out of the DOM Element?

Thanks,

Stefan Behnel · Sep 27, 2007

kj7ny said:
I have been able to get xml.dom.minidom.parse('somefile.xml') and then
dom.getElementsByTagName('LLobjectID') to work to the point where I
get something like: [<DOM Element: LLobjectID at 0x13cba08>] which I
can get down to <DOM Element: LLobjectID at 0x13cba08> but then I
can't find any way to just get the value out from the thing!

.toxml() returns something like:
u'<LLobjectID><![CDATA[1871203]]></LLobjectID>'.

How do I just get the 1871203 out of the DOM Element?

It contains a CDATA section node, which holds the text (in minidom it is a
kind of Text node, AFAIR), so you have to walk through the children to get
what you want.
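
A minimal sketch of that walk with minidom. The inline document is an assumption modeled on the toxml() output quoted above, not the real somefile.xml. As far as I can tell, the CDATA section shows up as the element's direct child and carries the text in its data attribute:

```python
from xml.dom import minidom

# Inline stand-in for somefile.xml (assumed content, based on the toxml() output).
dom = minidom.parseString(
    "<root><LLobjectID><![CDATA[1871203]]></LLobjectID></root>"
)
element = dom.getElementsByTagName("LLobjectID")[0]

# Look at each child: here there is a single CDATA section node holding the text.
for child in element.childNodes:
    print(child.nodeType == child.CDATA_SECTION_NODE, child.data)

value = element.firstChild.data
print(value)
```

If the element can hold several text or CDATA children, firstChild alone is not enough and the pieces have to be joined.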

Alternatively, try an XML API that makes it easy to handle XML, like
ElementTree (part of the stdlib in Python 2.5) or lxml, both of which have
compatible APIs. The code would look like this:

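from xml.etree import ElementTree as etree  # or: from lxml import etree

tree = etree.parse("some_file.xml")
# ".//" searches all descendants of the root element
obj_id = tree.find(".//LLobjectID")
print(obj_id.text)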
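
To check that the CDATA wrapper does not get in the way, here is a self-contained sketch using the stdlib ElementTree. The inline document is an assumption standing in for the real file. The parser folds the CDATA section into the element's text:

```python
import xml.etree.ElementTree as etree

# Inline stand-in for some_file.xml (assumed content).
root = etree.fromstring(
    "<root><LLobjectID><![CDATA[1871203]]></LLobjectID></root>"
)

# ".//LLobjectID" searches all descendants of the root element.
node = root.find(".//LLobjectID")
object_id = node.text
print(object_id)
```

Note that find() returns None when nothing matches, so real code should probably check for that before touching .text.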

Stefan

Paul Boddie · Sep 27, 2007

kj7ny said:
I have been able to get xml.dom.minidom.parse('somefile.xml') and then
dom.getElementsByTagName('LLobjectID') to work to the point where I
get something like: [<DOM Element: LLobjectID at 0x13cba08>] which I
can get down to <DOM Element: LLobjectID at 0x13cba08> but then I
can't find any way to just get the value out from the thing!

.toxml() returns something like:
u'<LLobjectID><![CDATA[1871203]]></LLobjectID>'.

How do I just get the 1871203 out of the DOM Element?

DOM Level 3 provides the textContent property:

http://www.w3.org/TR/DOM-Level-3-Core/core.html#Node3-textContent

You'll find this in libxml2dom and possibly some other packages such
as pxdom. For the above case with minidom specifically (at least with
versions I've used), you need to iterate over the childNodes of the
element, obtaining the nodeValue for each node and joining them
together. Something like this might do it:

"".join([n.nodeValue for n in element.childNodes])

It's not pretty, but encapsulating stuff like this is what functions
are good for.
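
One possible way to wrap this up, as a rough sketch: text_content is a hypothetical helper name, not part of minidom, and the sample document is made up for illustration. Unlike the one-liner, it recurses into nested elements and skips nodes such as comments, whose text should not be included:

```python
from xml.dom import minidom

def text_content(node):
    """Concatenate the text of all Text and CDATA nodes below node."""
    parts = []
    for child in node.childNodes:
        if child.nodeType in (child.TEXT_NODE, child.CDATA_SECTION_NODE):
            parts.append(child.data)
        elif child.nodeType == child.ELEMENT_NODE:
            parts.append(text_content(child))  # recurse into nested elements
    return "".join(parts)

# Made-up sample document (an assumption, not the poster's file).
dom = minidom.parseString(
    "<root><LLobjectID><![CDATA[1871203]]></LLobjectID>"
    "<a>x<b>y</b><!-- note -->z</a></root>"
)
print(text_content(dom.getElementsByTagName("LLobjectID")[0]))
print(text_content(dom.getElementsByTagName("a")[0]))
```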

Paul

kj7ny · Sep 27, 2007

Forgot to mention I'm using Python 2.4.3.

Stefan Behnel · Sep 27, 2007

kj7ny said:
Forgot to mention I'm using Python 2.4.3.

You can install both lxml and ET on Python 2.4 (and 2.3). It's just that ET
went into the stdlib from 2.5 on.
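
A common way to write code that runs with whichever of these is installed is a chain of fallback imports. This is only a sketch: the order of preference is my assumption, and the inline document is made up for illustration.

```python
try:
    # stdlib location, Python 2.5 and later
    from xml.etree import ElementTree as etree
except ImportError:
    try:
        # standalone ElementTree package, e.g. on Python 2.4
        from elementtree import ElementTree as etree
    except ImportError:
        # lxml offers a compatible API
        from lxml import etree

# Made-up sample document (an assumption).
root = etree.fromstring("<root><LLobjectID>1871203</LLobjectID></root>")
found_text = root.find(".//LLobjectID").text
print(found_text)
```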

Stefan
