Former Member
Feb 25, 2009 at 04:45 PM

Problem in SAX Java mapping

Hi,

I'm using a SAX Java mapping in one scenario. The problem is that when I get some Croatian characters, like Đ or Š (U+0160), the output XML is not valid. XML Spy complains, IE complains, and so on. The customer is sure that the data (XML in a CLOB field in an Oracle DB) is UTF-8. What could be the problem?

What I'm doing is reading the entire XML into a string with the help of a BufferedReader, doing some manipulation, and then writing the String into a byte array with:

			byte[] bytes = file.toString().getBytes("UTF-8");
			saxParser.parse(new ByteArrayInputStream(bytes), handler);

and then, of course, parsing the XML. The readLine method reads the data, and the problematic character is "Ä�" (0xC490).

XML Spy doesn't complain about this character, and neither does IE. After the conversion, however, the character looks like "Ä?" (0xC43F), which is no longer valid. Why?
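
One thing that may be worth checking is which charset the reader uses when the bytes are turned into a String. 0xC4 0x90 is the UTF-8 encoding of "Đ" (U+0110), and 0x3F is "?", which typically shows up when a character passed through a charset that cannot represent it. The following is only a minimal sketch, assuming the input arrives as an InputStream and that the reader was previously created without an explicit charset. The class name `Utf8RoundTrip` and the helper `readAsUtf8` are made up for illustration and are not part of the mapping API.

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical helper class, for illustration only.
class Utf8RoundTrip {

    // Reads the whole stream as text, decoding the bytes explicitly as UTF-8
    // instead of relying on the JVM's platform default charset.
    public static String readAsUtf8(InputStream in) throws IOException {
        StringBuilder sb = new StringBuilder();
        try (BufferedReader reader = new BufferedReader(
                new InputStreamReader(in, StandardCharsets.UTF_8))) {
            char[] buf = new char[4096];
            int n;
            while ((n = reader.read(buf)) != -1) {
                sb.append(buf, 0, n);
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) throws IOException {
        // "Đ" (U+0110) is the two bytes 0xC4 0x90 in UTF-8.
        byte[] original = {'<', 'a', '>', (byte) 0xC4, (byte) 0x90, '<', '/', 'a', '>'};

        String text = readAsUtf8(new ByteArrayInputStream(original));
        byte[] roundTripped = text.getBytes(StandardCharsets.UTF_8);

        // With the same charset on both sides, the bytes survive unchanged.
        System.out.println(Arrays.equals(original, roundTripped));
    }
}
```

If the String manipulation is not strictly needed, handing the original InputStream straight to `saxParser.parse(...)` avoids the decode and re-encode step altogether and lets the parser pick the encoding from the XML declaration.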