Any parser can handle 2.1GB+ file?

Craig Petty · Sep 13, 2003

Well, I tried to use SAX to process a large document. However, we are
hitting an unfortunate limit in the xerces sax parser. I wish they
had used longs or unsigned ints (see below, which is throwing a
runtime exception) to keep track of the position in the document.
Atleast I'm guessing thats whats happening here.

Any ideas?

(from a utils class is xerces)
221 public int addString(int offset, int length) {
222 int chunk = offset >> CHUNK_SHIFT;
223 if (chunk != fChunk) {
224 if (fPreviousChunk == null)
225 throw new RuntimeException(new
ImplementationMessages().createMessage(null,
ImplementationMessages.INT_PCN, 0, null));
226 return fPreviousChunk.addString(offset, length);
227 }
228 int lastChunk = (offset + length - 1) >> CHUNK_SHIFT;
229 if (chunk == lastChunk) {
230 addRef();
231 return fStringPool.addString(this, offset &
CHUNK_MASK, length);
232 }
233 String str = toString(offset & CHUNK_MASK, length);
234 return fStringPool.addString(str);
235 }

here's the java stack trace...

java.lang.RuntimeException: Internal Error: fPreviousChunk == NULL
at org.apache.xerces.framework.XMLParser.parse(Unknown Source)
at org.apache.xerces.framework.XMLParser.parse(Unknown Source)
at Test.main(Test.java:177)

Encoding, "extended ansi", and unicode in 1.9	2	Jun 16, 2010
XML file parsing/validating with xerces-j	2	Mar 16, 2005
writing on file not until the end	8	May 24, 2009
Why not success write the data?	1	Aug 8, 2008
integer sqrt() table implementation	4	Mar 11, 2005
How can I make a better program from the following one	1	Jun 14, 2008
atan2 weirdness	3	Jul 20, 2008
Expected constructor, destructor or type conversion before...	2	Jul 19, 2007

Any parser can handle 2.1GB+ file?

Craig Petty

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads