I am trying to parse an XML document that appears to be correct in syntax, but give this error. I am parsing by tag name getElementsByTagName(tag). This has worked fine until now.
Any thoughts.
Kind regards,
Andy
=================================================
Exception: org.xml.sax.SAXParseException: An invalid XML character (Unicode: 0x1
ab960) was found in the element content of the document.
================================================
This is the document
================================================
170348Octreotide (SANDOSTATIN)Schiedermaier P</Auth
or>G
One of the Document authors, Göke, has an accented character in his name. It looks as if the creator of your xml encoded that character into a single byte, so I think you need to specify
<?xml version="1.0" encoding="iso-8859-1"?>
in the xml header for your xml.
I missed that.
Thank you very much.
Andy