Nick Head
We've got some Word RTF docs that we are saving as HTML and then transforming
to IP-compatible XML for editing. The problem occurs when extended characters are
found in the XML file.
For example, the character ≥ is exported by MS Word as &ge; when you save as
filtered HTML. Then when trying to open any document with a character like
this in IP I get the error:
"The form contains schema validation errors - Reference to undefined entity
'ge'."
'Fair enough!' I thought and so added a DTD character entity reference to
the document so that it knew how to handle the character. My resulting XML
looks like this (with IP processing instructions and namespace references
removed for clarity):
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE myFields [
<!ENTITY ge "≥" >
]>
<my:myFields>
<my:legacyContent>
3 &ge; 2
</my:legacyContent>
</my:myFields>
However, I still get the same issue. I tried opening this XML file in IE to
test it, and sure enough it displays perfectly with no validation errors.
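For anyone who wants to repeat that check outside IE, here is a minimal sketch using Python's standard xml.etree.ElementTree parser. It is only an illustration under some assumptions: the my: prefix and the IP processing instructions are left out so the snippet parses on its own, the entity value is written as the numeric reference &#8805;, and the helper names legacy_text and parses are made up for this example. It says nothing about how IP itself loads the file.

```python
import xml.etree.ElementTree as ET

# Simplified stand-in for the form XML (assumption: no my: prefix and no
# IP processing instructions, so the document is self-contained).
WITH_DTD = """<!DOCTYPE myFields [
<!ENTITY ge "&#8805;">
]>
<myFields>
<legacyContent>3 &ge; 2</legacyContent>
</myFields>"""

# Same content, but without the internal DTD subset that declares &ge;.
WITHOUT_DTD = "<myFields><legacyContent>3 &ge; 2</legacyContent></myFields>"


def legacy_text(xml_source):
    """Parse the document and return the text of legacyContent."""
    root = ET.fromstring(xml_source)
    return root.find("legacyContent").text


def parses(xml_source):
    """Return True if the parser accepts the document, False otherwise."""
    try:
        ET.fromstring(xml_source)
        return True
    except ET.ParseError:
        return False


if __name__ == "__main__":
    print(legacy_text(WITH_DTD))   # entity resolved from the internal subset
    print(parses(WITHOUT_DTD))     # undeclared &ge; is rejected
```

If a general-purpose parser accepts the first document and rejects the second, that at least suggests the entity declaration itself is fine and the difference lies in how IP handles the DTD.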
Has anyone managed to successfully do this in IP before? Or does IP just not
handle DTD character entity references?
TIA
Nick