Xerces parse aborted on IOError

Jim Cobban

Due to a problem that I am discussing in another thread, the UTF-8 text in
my XML file is being corrupted.

The problem this leaves me with is that as soon as the Xerces parser hits
the bad UTF-8 byte sequence, it throws:
java.io.UTFDataFormatException: invalid byte 2 of 3-byte UTF-8 sequence

and the parse is aborted.

This seems overkill.

Short of going in and modifying my copy of Xerces, is there any way to get it
to keep parsing the XML file past this error? Since this is an IOException,
not a SAXParseException, it is not reported to the ErrorHandler interface.
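
One possibility, short of patching Xerces, might be to do the UTF-8 decoding
outside the parser and hand it a character stream instead of a byte stream.
The sketch below is only that, a sketch: the class and method names
(LenientUtf8Parse, lenientUtf8Reader, parseText) are made up for illustration,
and it goes through the standard JAXP SAXParserFactory on the assumption that
whichever Xerces is installed is picked up that way. Malformed bytes are
replaced with U+FFFD, so the corrupted characters are still lost; it only
keeps the parse from aborting.

```java
import java.io.ByteArrayInputStream;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

class LenientUtf8Parse {

    // Hypothetical helper: decode UTF-8 ourselves and substitute U+FFFD for
    // any malformed byte sequence instead of throwing.
    static Reader lenientUtf8Reader(InputStream in) {
        CharsetDecoder decoder = StandardCharsets.UTF_8.newDecoder()
                .onMalformedInput(CodingErrorAction.REPLACE)
                .onUnmappableCharacter(CodingErrorAction.REPLACE);
        return new InputStreamReader(in, decoder);
    }

    // Hypothetical demo: parse the bytes and return all character data seen.
    static String parseText(byte[] xmlBytes) throws Exception {
        StringBuilder text = new StringBuilder();
        DefaultHandler handler = new DefaultHandler() {
            @Override
            public void characters(char[] ch, int start, int length) {
                text.append(ch, start, length);
            }
        };
        // Because the parser receives a Reader, it never decodes bytes itself
        // and so never sees the malformed sequence.
        InputSource source =
                new InputSource(lenientUtf8Reader(new ByteArrayInputStream(xmlBytes)));
        SAXParserFactory.newInstance().newSAXParser().parse(source, handler);
        return text.toString();
    }
}
```

With a character stream the parser does not use the encoding named in the XML
declaration, so this would only be appropriate when the file is known to be
UTF-8.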

--
Jim Cobban (e-mail address removed)
34 Palomino Dr.
Kanata, ON, CANADA
K2M 1M1
+1-613-592-9438
 
