Quick question on the presence of CDATA

Dilip · Oct 25, 2006

I have been out of the XML world for a while and have sort of forgotten
the exact difference between:

<Symbol><![CDATA[IBM]]></Symbol>

and just:

<Symbol>IBM</Symbol>

Can anyone tell me why one is preferred over the other?

thanks!

Joseph Kesselman · Oct 25, 2006

Followup to the Microsoft list doesn't work through my servers, so
answering here...

<Symbol><![CDATA[IBM]]></Symbol>
<Symbol>IBM</Symbol>

Identical meaning, since there aren't any special characters in the value.

<![CDATA[...]]> sections are an alternative to character-by-character
escaping of characters that would otherwise confuse XML syntax (such as
"<" and "&"). A CDATA section escapes its entire contents -- with the
exception of any ]]> sequences, which require special handling because
]]> is what ends the section.
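To make the ]]> caveat concrete, here is a minimal Python sketch. The helper name `cdata` and the sample string are made up for illustration; the split-into-two-sections trick is one common workaround, not the only one. It checks that a CDATA-wrapped value and a character-escaped value parse to the same text.

```python
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape

def cdata(text):
    # Hypothetical helper: wrap text in a CDATA section. Any "]]>" in the
    # text is split across two adjacent sections so the terminator never
    # appears inside a single section.
    return "<![CDATA[" + text.replace("]]>", "]]]]><![CDATA[>") + "]]>"

# Sample value (made up) containing "<", "&" and a "]]>" sequence.
raw = "a < b && c[d[0]]> 1"

via_cdata = ET.fromstring("<Expr>" + cdata(raw) + "</Expr>").text
via_escape = ET.fromstring("<Expr>" + escape(raw) + "</Expr>").text

# Both forms should round-trip to the original string.
same = via_cdata == via_escape == raw
```

An XML-aware serializer normally makes this choice for you, which is why it rarely matters outside hand-edited files.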

Generally the only time you care about this is when you're hand-editing
XML, want to drop non-XML text into the value of an XML element (note
that you can't use this kluge for attribute values), and are too lazy to
fix it up by hand. If you build your XML using any XML-aware tool, it
should take care of the escaping for you, and you don't have to care
whether it escapes individual characters or uses <![CDATA[...]]>.

Dilip · Oct 25, 2006

Just so I have this straight: from the standpoint of the XML parser,
do the two forms of the element make a difference? I mean, if I use
XPath to locate that element and retrieve its value, will I get back IBM
or something else?

Sorry if the question sounds stupid. I remember what CDATA is about
but I have forgotten what happens when a parser encounters it. (It
probably just treats whatever is inside as plain text, right?)

Joseph Kesselman · Oct 25, 2006

Dilip said:
Just so that I got this straight, from the standpoint of the XML parser
does the 2 forms of elements make a difference? I mean, if I use XPath
to locate that element to retrieve its value, will I get back IBM or
something else?

XPath doesn't distinguish the two; both yield IBM.
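As a quick check, here is a small Python sketch using the limited XPath subset in the standard library's ElementTree. The enclosing `Quote` element is an assumption added so there is something to search from. It looks up `Symbol` in both forms and compares the results.

```python
import xml.etree.ElementTree as ET

# Two documents that differ only in how the Symbol value is written.
# The Quote wrapper is hypothetical; it is not part of the original example.
plain = ET.fromstring("<Quote><Symbol>IBM</Symbol></Quote>")
wrapped = ET.fromstring("<Quote><Symbol><![CDATA[IBM]]></Symbol></Quote>")

# ElementTree's findtext accepts a simple XPath-style location path.
plain_value = plain.findtext("./Symbol")
wrapped_value = wrapped.findtext("./Symbol")

print(plain_value == wrapped_value)
```

A full XPath engine should behave the same way here, since the XPath data model has no separate node type for CDATA sections.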

Parsers *CAN* distinguish the two, for the convenience of editors and
other tools which want to be able to display syntax as well as semantics
-- but aren't required to and often don't unless you ask them to.

Dilip said:
(It probably just treats whatever is inside as plain text, right?)

Modulo the difference in how escaping is handled, yes, pretty much. A
SAX parser may tell the application that it's now inside the bounds of a
CDATA section; the app needs to decide whether to listen for lexical
events and whether it cares about this one. A DOM (depending on how the
builder is configured) may represent the data using a CDATASection Node
rather than a Text Node, but the former is a subclass of the latter, so
again that doesn't matter unless the application cares about the difference.
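The DOM half of this can be seen with Python's `xml.dom.minidom`, used here only as one example of a DOM builder; other builders may be configured to merge CDATA sections into ordinary text nodes. The element names `Quote`, `A`, and `B` are made up for this sketch.

```python
from xml.dom import Node, minidom

# A uses a CDATA section, B uses plain text; both carry the same characters.
doc = minidom.parseString(
    "<Quote><A><![CDATA[IBM]]></A><B>IBM</B></Quote>"
)
a_child = doc.getElementsByTagName("A")[0].firstChild
b_child = doc.getElementsByTagName("B")[0].firstChild

# minidom keeps the lexical distinction in the node type...
a_is_cdata = a_child.nodeType == Node.CDATA_SECTION_NODE
b_is_text = b_child.nodeType == Node.TEXT_NODE

# ...but both nodes are Text instances and hold identical character data.
both_text = isinstance(a_child, minidom.Text) and isinstance(b_child, minidom.Text)
same_data = a_child.data == b_child.data
```

Code that only reads `.data` (or walks nodes as Text) never needs to know which form the author typed.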

As far as the XML Infoset is concerned, <![CDATA[&a<]]> is just a
representation of the character sequence &a< and is identical to
&amp;a&lt; or &#38;a&lt; or &amp;a&#60; or any of the other possible
combinations. The Infoset considers the differences between these to be
No Difference.
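A rough way to see this from application code, sketched in Python with ElementTree (the element name `v` and the particular set of variants are chosen for illustration): parse several spellings of the same three characters and collect the distinct results.

```python
import xml.etree.ElementTree as ET

# Different lexical spellings of the same character sequence: & a <
variants = [
    "<v><![CDATA[&a<]]></v>",   # CDATA section
    "<v>&amp;a&lt;</v>",        # predefined entity references
    "<v>&#38;a&#60;</v>",       # decimal character references
    "<v>&amp;a&#x3C;</v>",      # a mix of entity and hex character reference
]

# Collect the parsed text of each variant into a set; identical values collapse.
values = {ET.fromstring(v).text for v in variants}
print(len(values))
```

If the spellings really are equivalent, the set ends up holding a single string.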
