Hi,
I'm using SAX Java mapping in one scenario. Problem is when I get some Croatina characters, like Đ or u0160,
output XML is not valid. XML Spy complains, IE complains and so on. Customer is sure that data ( XML in CLOB field in Oracle DB) is UTF-8? What could be a problem?
What I'm doing is reading entire XML into string with help of BufferedReader, then do some manipulation and write String into byte array with:
byte[] bytes = file.toString().getBytes("UTF-8");
saxParser.parse(new ByteArrayInputStream(bytes), handler);
and then of course parse XML. readLine method reads data and problematic is "�" - 0 - 0xC490.
For this character XML Spy doesn't complain, IE also. After conversion, this character looks like "Ä?" - 0xC43F, and this is not good any more. Why?