Frank LaRosa
Hi,
What's the recommended way to parse an arbitrary HTML document which
may or may not conform to strict XML syntax requirements?
I tried using a DocumentBuilder, but immediately got the exception
message "Value must be quoted". The exception is thrown out of the
parse method so I have no way to ignore it.
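For reference, the failure can be reproduced with a minimal sketch along these lines. The class name `HtmlParseRepro`, the helper `tryParse`, and the sample markup are hypothetical and chosen only for illustration. It assumes the default JAXP parser on the classpath, and the exact wording of the error message depends on the parser implementation.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.xml.sax.SAXParseException;

public class HtmlParseRepro {
    // Hypothetical helper: returns the parser's error message,
    // or null if the markup parsed as well-formed XML.
    static String tryParse(String markup) throws Exception {
        DocumentBuilder builder =
                DocumentBuilderFactory.newInstance().newDocumentBuilder();
        try {
            builder.parse(new ByteArrayInputStream(
                    markup.getBytes(StandardCharsets.UTF_8)));
            return null;
        } catch (SAXParseException e) {
            // Thrown from parse() itself, so no Document is ever returned.
            return e.getMessage();
        }
    }

    public static void main(String[] args) throws Exception {
        // Unquoted attribute value: common in HTML, not well-formed XML.
        System.out.println(tryParse("<html><body bgcolor=white></body></html>"));
        // The same markup with the value quoted parses without error.
        System.out.println(tryParse("<html><body bgcolor=\"white\"></body></html>"));
    }
}
```

The first call reports an error for the unquoted `bgcolor` value, while the second returns null, which is the behavior described above: the XML parser rejects the document outright instead of recovering.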
Would a SAXParser be a better idea?
I don't have the option to fix the errors in the source documents.