XML Parse and UTF-8 Problem

N

Nick Hoddinott

Hi,

I'm using the org.w3c.dom.* libraries to process an xml file which has, for
example, an umlaut character in (ö). After using the parse() method of
Document, the umlaut code seems to get decoded (which is logical I suppose).
However, I need to get a document with the character codes left in their
encoded &...; form.

I've tried to do this many different ways, but each time I get either a
document with an umlaut or garbage characters. Is there a correct way to do
this? Apologies for my ignorance of the correct terminology for this stuff.

Thanks,

Nick H
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Members online

No members online now.

Forum statistics

Threads
473,770
Messages
2,569,584
Members
45,075
Latest member
MakersCBDBloodSupport

Latest Threads

Top