XML parsing with Java

vk02720 · Dec 13, 2008

Egad, no! I should have reiterated Arne's caveat. OTOH, the result is
not entirely unexpected and parallels Arne's (considerable) experience.

Yes, this is confounding; I was sticking with the developer version
numbers:

Indeed, such numbers are almost meaningless, yet strangely fascinating.
Cf <http://www.google.com/intl/en/press/zeitgeist2008/index.html>

Here's a very rough measure of features/version from skimming the Java
1.5 API documentation (J2SE 5.0):

<code>
#!/bin/sh
DIR=/Developer/Documentation/Java/docs
ECHO=/bin/echo
for ((i=0; i<=6; i++)) ; do
${ECHO} -n "Since 1.${i}: "
grep -R "<DD>1.${i}" $DIR/* | wc -l
done
</code>

<console>
$ ./since.sh
Since 1.0: 26
Since 1.1: 89
Since 1.2: 965
Since 1.3: 550
Since 1.4: 1384
Since 1.5: 1321
Since 1.6: 0
</console>

A lot has to do with 1.5 being adopted very late by some popular app
server vendors due to which the customers never adopted 1.5 (even for
their non app server needs). Many companies are also slow to adopt
because of plain ignorance on part of some decision makers who had
"significant investments" building the system and there is "no budget
to take chances and move to newer version" even though risk may not be
that high. Even recompiling and taking advantage of improvements in
JVM for higher releases is still worth it but it is tough to convince
some people.
The distribution % almost looks correct - atleast in the sense that
nearly 50% are using 1.4(or less).
What would be interesting to know is that even for 1.4 how many users
are really using 1.4 new features like regular expressions, NIO,
exception chaining etc, JAXP etc. Some places even the developers are
either not aware or care about newer features.

Arne Vajhøj · Dec 15, 2008

John said:
Google - millions of hits:

java 1.1 - 22.8
java 1.2 - 16.1
java 1.3 - 12.0
java 1.4 - 12.6
java 1.5 - 38.1
java 1.6 - 10.2
java 1.7 - 5.2

Bimodal!?

Stuff posted 10 years ago is not a good indication of
current usage. And the measurement will favor versions
with many new changes.

But it is still more objective than my guess.

Arne

Arne Vajhøj · Dec 15, 2008

A lot has to do with 1.5 being adopted very late by some popular app
server vendors due to which the customers never adopted 1.5 (even for
their non app server needs). Many companies are also slow to adopt
because of plain ignorance on part of some decision makers who had
"significant investments" building the system and there is "no budget
to take chances and move to newer version" even though risk may not be
that high. Even recompiling and taking advantage of improvements in
JVM for higher releases is still worth it but it is tough to convince
some people.

The testing/certification cost can be huge.

What would be interesting to know is that even for 1.4 how many users
are really using 1.4 new features like regular expressions, NIO,
exception chaining etc, JAXP etc. Some places even the developers are
either not aware or care about newer features.

I think today the 1.4 new features are widely used.

When the first apps were deployed on 1.4 much less features
were used.

But it was a time where the number of Java EE apps grew
fast, so it was picked for a lot of new apps.

Arne

Mike Schilling · Jan 17, 2009

Lew said:
Java 1.4 has been completely retired for a few weeks now, and
obsolescent for quite some time.

It is Xerces.

I thought we'd been through this recently? In 1.4, the default parser is
(ack! pthwt!) Crimson. In 1.5, the default becomes Xerces.

Yes. JAXP will find the parser definition in the classpath and use it.

Mike Schilling · Jan 17, 2009

Mike said:
I thought we'd been through this recently?

And so we had. Never mind.

Lew · Jan 17, 2009

Mike said:
I thought we'd been through this recently? In 1.4, the default parser is
(ack! pthwt!) Crimson. In 1.5, the default becomes Xerces.

I thought for sure time would have covered my embarrassment at having been
mistaken about this. Now my face is crimson again, never, it seems, to get
surcease.

John B. Matthews · Jan 17, 2009

[informative discussion]

surcease

From Old French sursis, past participle of Old French surseoir â€˜refrain,
delay,â€™ from Latin supersedere â€˜desistâ€™ (see supersede). In an odd
syzygy, there's that word again.

Daniel Pitts · Jan 17, 2009

John said:
[informative discussion]

surcease

Click to expand...

From Old French sursis, past participle of Old French surseoir â€˜refrain,
delay,â€™ from Latin supersedere â€˜desistâ€™ (see supersede). In an odd
syzygy, there's that word again.

Please check your encoding, That looks like garbage to me.

Mike Schilling · Jan 17, 2009

Lew said:
I thought for sure time would have covered my embarrassment at
having
been mistaken about this. Now my face is crimson again, never, it
seems, to get surcease.

Nicely done. Sorry about replying to the ancient post; for some
reason, it showed up as unread.

Lew · Jan 17, 2009

Daniel said:
John said:

[informative discussion]

surcease

Click to expand...

From Old French sursis, past participle of Old French surseoir
Ã¢â‚¬Ëœrefrain, delay,Ã¢â‚¬â„¢ from Latin supersedere Ã¢â‚¬ËœdesistÃ¢â‚¬â„¢ (see
supersede). In an odd syzygy, there's that word again.

Click to expand...

Please check your encoding, That looks like garbage to me.

Came through clearly here. You must be forcing a different encoding from John's.

"Here" being Thunderbird picking up news from news.albasani.net. I can't see
what encoding John used, which must mean that T-bird assumed UTF-8.

Your message was encoded in windows-1252, which is notoriously incomplete.

Lew · Jan 17, 2009

Peter said:
Why it is that Thunderbird 2.0.0.19 doesn't interpret the post correctly
but 2.0.0.18 does I can't say. Maybe there's some user setting you have
set but Daniel doesn't that forces an encoding for posts that arrive
without encoding specified.

"Account settings" (a.k.a. "properties") / "Server settings" / "Default
Character Encoding:" "Unicode (UTF-8)"

John B. Matthews · Jan 17, 2009

[QUOTE="Lew said:
Why it is that Thunderbird 2.0.0.19 doesn't interpret the post correctly
but 2.0.0.18 does I can't say. Maybe there's some user setting you have
set but Daniel doesn't that forces an encoding for posts that arrive
without encoding specified.

"Account settings" (a.k.a. "properties") / "Server settings" / "Default
Character Encoding:" "Unicode (UTF-8)"[/QUOTE]

Ah, thank you Daniel, Lew & Peter. I appreciate your feedback. My nntp
client indicated that it was using UTF-8, but it wasn't adding an
explicit Content-Type:

Content-Type: text/plain; charset=UTF-8; format=flowed

â€˜single-quoteâ€™
â€œdouble-quoteâ€
â€¹single-angleâ€º
Â«double-angleÂ»

John B. Matthews · Jan 17, 2009

"Peter Duniho said:
[...]
Content-Type: text/plain; charset=UTF-8; format=flowed

â€˜single-quoteâ€™
â€œdouble-quoteâ€
â€¹single-angleâ€º
Â«double-angleÂ»

Click to expand...

Much better.

Of course, it begs the question, why not just use the "normal" ASCII
characters, rather than something that could create a character encoding
issue?

Ordinarily, I wouldn't; they slipped in with a cut and paste. Now to
convince my new reader to decode my own posts correctly outside of
alt.test!

Roedy Green · Jan 18, 2009

see http://mindprod.com/jgloss/xml.html
for an overview of the features. You can use what is build into Java
or a third party package.
--
Roedy Green Canadian Mind Products
http://mindprod.com

We are almost certainly going to miss our [global warming] deadline.
We cannot get the 10 lost years back, and by the time a new global agreement to
replace the Kyoto accord is negotiated and put into effect, there will probably
not be enough time left to stop the warming short of the point where we must not
go. ~ Gwynne Dyer

Java with Netbeans	2	Apr 12, 2022
Parsing XML with Java 1.4.2 own tools?	13	Sep 12, 2008
Parsing Soap Response in java	10	Apr 4, 2014
How to implement a html parser in java?	1	Dec 28, 2023
Detect XML document encodings with SAX	42	Nov 21, 2012
How to save textBox values into a xml-file(with naming an choosing directory)?	1	Aug 23, 2022
Eclipse to Java command line	3	Apr 29, 2010
Parsing XML documents behind a firewall; java makes a connect to theactual DTD?	1	Apr 18, 2008

XML parsing with Java

vk02720

Arne Vajhøj

Arne Vajhøj

Mike Schilling

Mike Schilling

Lew

John B. Matthews

Daniel Pitts

Mike Schilling

Lew

Lew

John B. Matthews

John B. Matthews

Roedy Green

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads