Word 2003 XML

S

Steve Hodgson

I have just moved to Office 2003 on my Windows PC at work and was
playing around with the option to save as XML rather than doc format.
There were a few queries sprang to mind.

1. Is this compatable with Word 2004 on the Mac?

2. Are there any advantages/disadvantages to adopting the XML format -
I notice that file sizes seem to be reduced.
--
Cheers,

Steve

The reply-to email address is a spam trap.
Email steve 'at' shodgson 'dot' org 'dot' uk
 
J

Jim Gordon

Hi Steve,

Office 2004 is unable to understand the new XML file format for Word and
PowerPoint at this moment. Excel 2004 is able to understand the new
format (I think - haven't had a chance to try it yet).

The advantage of XML is that it is human readable, at least to those
humans who read XML code. XML is an open format meaning that anyone can
create XML code and it will be interchangeable with other documents. It
is not proprietary.

The appearance of file sizes being smaller is sleight of hand. The old
..doc format is a binary format. Binary formats are more efficient than
text, but are not open standards based. XML is text, and hence requires
large file sizes. The sleight of hand is because the XML files are
automatically zipped (compressed) before they are saved they seem to be
smaller. For a fair comparison, zip a .doc format document and compare
that size to the same XML document.

-Jim Gordon
Mac MVP
 
C

Chris Ridd

Hi Steve,

Office 2004 is unable to understand the new XML file format for Word
and PowerPoint at this moment. Excel 2004 is able to understand the new
format (I think - haven't had a chance to try it yet).

The advantage of XML is that it is human readable, at least to those
humans who read XML code. XML is an open format meaning that anyone can
create XML code and it will be interchangeable with other documents. It
is not proprietary.

That's the Microsoft line, but it avoids mentioning the possibility
that proprietary stuff gets encoded inside the XML files, making the
content more or less unusable and non-interchangeable. Example: an XML
document that contains the text in the clear (good) but all the styling
and formatting in a secret proprietary blob (not good).

But this isn't the place for *that* particular discussion. I just
wanted to note that the issue is sadly not quite as simple as Jim's
making out.

Cheers,

Chris
 
S

Steve Hodgson

Office 2004 is unable to understand the new XML file format for Word
and PowerPoint at this moment. Excel 2004 is able to understand the new
format (I think - haven't had a chance to try it yet).

Thanks. I think it sounds as though it is best avoided for the moment.
I basically work in a Windows world so it is useful to be able to move
doc files over to the Mac for homework.
The advantage of XML is that it is human readable, at least to those
humans who read XML code. XML is an open format meaning that anyone can
create XML code and it will be interchangeable with other documents. It
is not proprietary.

The appearance of file sizes being smaller is sleight of hand. The old
.doc format is a binary format. Binary formats are more efficient than
text, but are not open standards based. XML is text, and hence requires
large file sizes. The sleight of hand is because the XML files are
automatically zipped (compressed) before they are saved they seem to be
smaller. For a fair comparison, zip a .doc format document and compare
that size to the same XML document.

I have to say it didn't strike me as looking like a zip file when I
opened it in a text editor. When I opened an ODF file in this way the
PKZIP reference at the top of the file was immediately striking.

ODF does seem to be a very much more /open/ format. Having played with
it in the past I really like that one can open the zip file, edit the
XML that is the text, change the PNG images then open the file in
OpenOffice with all those changes in place.
--
Cheers,

Steve

The reply-to email address is a spam trap.
Email steve 'at' shodgson 'dot' org 'dot' uk
 
P

Paul Berkowitz

Thanks. I think it sounds as though it is best avoided for the moment.
I basically work in a Windows world so it is useful to be able to move
doc files over to the Mac for homework.

Microsoft has said that when Office 2007 is released, updaters for earlier
Win Office versions will also be released that can read the new XML file
formats. I'd be very surprised if they didn't also read the 2003 XML files
too. (I'm not up on this enough to know if these are basically the same
thing, but I think they should be close enough. The 2007 files come with new
extensions - .docx, .xlsx. Do the 2003 XML files also do so?) I'd guess that
previous Mac Office versions will also get similar updates in order to
maintain compatibility with all the new XML-format files coming over from
2007.

--
Paul Berkowitz
MVP MacOffice
Entourage FAQ Page: <http://www.entourage.mvps.org/faq/index.html>
AppleScripts for Entourage: <http://macscripter.net/scriptbuilders/>

Please "Reply To Newsgroup" to reply to this message. Emails will be
ignored.

PLEASE always state which version of Microsoft Office you are using -
**2004**, X or 2001. It's often impossible to answer your questions
otherwise.
 
J

John McGhie [MVP - Word and Word Macintosh]

Hi Steve:

The Office 2003 version of XML is *not* a zip file. The Office 2007 version
is (in other words, they both make an XML file, but Office 2007 compresses
it in Zip format).

Other than that, they're the same. However, the difference is in the
Document Type Definition.

Because this is not the place for it, I won't go into the details, but
Google for "SGML and Document Type Definition" for an explanation. XML is
similar to SGML: the concept is exactly the same.

The difference is that Word 2003 will work with XML provided that you supply
and attach a DTD and a Formatting Specification. In Word 2007, those are
built-in.

So in a large corporation that has DTDs and Transforms already written, Word
2003 can use their XML. In Word 2007, those things are provided so that
Word can render *any* kind of file into XML.

And "NO", the current version of Mac Word can't handle the resulting files.
The next version can/will. In the next Mac version, the file formats will
all switch to XML the same as they will on the PC.

Just to set Chris' mind at rest: the formatting in the Office 12 file
formats is all in a clear-text Cascading Style Sheet. No Secret Binary
Blobs for formatting. It is possible you may find Secret Binary Blobs in a
Microsoft Office 12 file, of course, as there could be in any version of
XML: but their content will be things that cannot be described in "text",
such as embedded applications.

Hope this helps

Thanks. I think it sounds as though it is best avoided for the moment.
I basically work in a Windows world so it is useful to be able to move
doc files over to the Mac for homework.


I have to say it didn't strike me as looking like a zip file when I
opened it in a text editor. When I opened an ODF file in this way the
PKZIP reference at the top of the file was immediately striking.

ODF does seem to be a very much more /open/ format. Having played with
it in the past I really like that one can open the zip file, edit the
XML that is the text, change the PNG images then open the file in
OpenOffice with all those changes in place.

--

Please reply to the newsgroup to maintain the thread. Please do not email
me unless I ask you to.

John McGhie <[email protected]>
Microsoft MVP, Word and Word for Macintosh. Consultant Technical Writer
Sydney, Australia +61 (0) 4 1209 1410
 
C

Chris Ridd

Just to set Chris' mind at rest: the formatting in the Office 12 file
formats is all in a clear-text Cascading Style Sheet. No Secret Binary
Blobs for formatting. It is possible you may find Secret Binary Blobs in a
Microsoft Office 12 file, of course, as there could be in any version of
XML: but their content will be things that cannot be described in "text",
such as embedded applications.

I shall sleep soundly tonight, thanks John ;-)

Cheers,

Chris
 
J

Jim Gordon

Hi Steve,

The nail in the coffin of ODF was laid when the European open standards
group chose the XML format that Microsoft adopted over the open format
that Sun Microsystem's OpenOffice group was hoping for. The short
oversimplified story is that the Microsoft XML format is "more open" and
"more standard" than Sun corporation's Open Document Format.

-Jim Gordon
Mac MVP
 
C

Chris Ridd

Hi Steve,

The nail in the coffin of ODF was laid when the European open standards
group chose the XML format that Microsoft adopted over the open format
that

Ah, you're talking about ECMA. They've certainly chosen Microsoft's XML
document format in their committee working to establish a standard, but
they haven't finished their work and it is not a standard.

I'll raise your standards body with one with a slightly better
reputation: ISO. OpenDocument *is* now a standard: ISO/IEC 26300.

Cheers,

Chris
 
J

John McGhie [MVP - Word and Word Macintosh]

Hi Chris:

Yeah: The key question is "Does 'xyz application' support XML?

Let's talk through the implications of this a bit, for the benefit of others
reading here who may be new to the concepts fundamental to "markup
languages" such as XML ...

If an application *really* supports XML, it doesn't matter WHOSE XML you're
looking at, it will handle it. The whole point of XML (and the defining
difference between XML and SGML...) is that while BOTH are "open" standards,
XML is, by its very design "extensible".

The coming versions of Microsoft Office will cheerfully read and write
OpenDoc format, on both the Mac and the PC. We will need to obtain and
install the OpenDoc DTD and some transforms for it. I guess Sun has them
available for download?

The new versions of Microsoft Office are XML applications: they will read
and write ANY document in XML, provided that it is "well formed". "Well
Formed" is a technical term that means the internal coding of the file
follows the XML syntax requirements.

Now, *any* of the markup language applications supports (relies upon, in
some cases...) the "EMPTY" property of a tag to incorporate content that is
not part of the coding specification. "Empty" in this case means that the
tag is 'empty' of markup language: it's does indeed have content, it's just
not markup language that the XML parser needs to read. The XML parser will
simply turn a deaf ear to whatever it finds in an EMPTY tag, handing it off
to an external application.

You can't have XML without the EMPTY tag, it's a required part of the
specification. What you put in it is up to you, and it's your
responsibility to ensure that your receiving application knows how to decode
and use the content.

This is where the rot sets in, and why XML and all its look-alikes are only
a "partial" answer to world hunger :)

When any Microsoft Applications come across XML "empty" tags, they will read
and encapsulate the binary data within them. It's as though the file
contained a reference to an external picture (sometimes, that's exactly what
it *is*). The XML application (let's say it's Word...) then says "OK, I
have this binary content. Do I know what to do with it?" It then looks
among its converters and filters to see if it has one that advertises the
ability to handle the resulting binary. If it has (let's say it's a GIF or
a PNG it found...) Word will then call its internal converter and display
the content.

If Word does not contain a filter, it then looks outside itself to the
Office Suite to see if there is a handling application there. If not, it
calls the Operating System: "Hey, Fred, do YOU know what to do with this
thing?"

Hopefully it does :) If not, it will look amongst its installed
applications for something that claims expertise with this format. Let's
assume we're looking at a RAM format: the OS will find, and start, the Real
Audio player, then hand off whatever Word dug out of the document to the RAM
player.

The way Microsoft is setting up its XML applications, this process will work
999 times out of a thousand. They are keeping and enhancing the mechanism
currently seen in Windows that says "Windows could not determine what to do
with this content, would you like to connect to the Internet to see if we
can find a suitable application?"

On the Mac, this will call Apple's equivalent mechanism. It makes a call to
an internet database that suggests a suitable application for the file type.
Hint: When working in XML, get your file extensions correct, because this
mechanism relies on them, and if you don't, some hapless recipient of your
document won't be able to see one of the pictures in it :)

But Microsoft is not the United Nations: it can't PREVENT anyone from
embedding some strange binary blob in its files. In fact, in Microsoft
Office, they're very proud of their ability to enable embedding of almost
any kind of content. So someone, somewhere, sometime, is going to send us a
document containing "something" we can't display, and can't find a converter
for.

It's the way of the world :)

Cheers

Ah, you're talking about ECMA. They've certainly chosen Microsoft's XML
document format in their committee working to establish a standard, but
they haven't finished their work and it is not a standard.

I'll raise your standards body with one with a slightly better
reputation: ISO. OpenDocument *is* now a standard: ISO/IEC 26300.

Cheers,

Chris

--

Please reply to the newsgroup to maintain the thread. Please do not email
me unless I ask you to.

John McGhie <[email protected]>
Microsoft MVP, Word and Word for Macintosh. Consultant Technical Writer
Sydney, Australia +61 (0) 4 1209 1410
 
P

Phillip M. Jones, CE.T.

MicroSoft and "Open Format" seem an oxymoron to me.

They have made their life's work out of being secretive, and Fighting
using any standards.

Look at the World Wide Web Consortium as an example.
Microsoft is one of the Major signatories. Yet Their web browser makes
use of stuff such as Active-X, and self-healing so That a monkey can
write code for a website that will work in IE. But will not work in any
other browser.

The main reason for MS joining was to find out what code The W3C was
agreeing to settle on as official standards, and then figure out ways to
break it so that certain sites can only use IE to view it. Use a captive
audience.

Maybe When Billy leaves office and lets other run the show Maybe they
can grow business by using "honest" competition. Instead of do it my way
or not at all , that we have seen since MS inception.

Jim said:
Hi Steve,

The nail in the coffin of ODF was laid when the European open standards
group chose the XML format that Microsoft adopted over the open format
that Sun Microsystem's OpenOffice group was hoping for. The short
oversimplified story is that the Microsoft XML format is "more open" and
"more standard" than Sun corporation's Open Document Format.

-Jim Gordon
Mac MVP


--
------------------------------------------------------------------------
Phillip M. Jones, CET |LIFE MEMBER: VPEA ETA-I, NESDA, ISCET, Sterling
616 Liberty Street |Who's Who. PHONE:276-632-5045, FAX:276-632-0868
Martinsville Va 24112 |[email protected], ICQ11269732, AIM pjonescet
------------------------------------------------------------------------

If it's "fixed", don't "break it"!

mailto:p[email protected]

<http://www.kimbanet.com/~pjones/default.htm>
<http://www.kimbanet.com/~pjones/90th_Birthday/index.htm>
<http://www.kimbanet.com/~pjones/Fulcher/default.html>
<http://www.kimbanet.com/~pjones/Harris/default.htm>
<http://www.kimbanet.com/~pjones/Jones/default.htm>

<http://www.vpea.org>
 
J

John McGhie [MVP - Word and Word Macintosh]

Hi Phillip:

Thanks -- I needed a good giggle :) But surely not even you believe this
stuff?

If Microsoft really had made a life's work out of not using any standards, I
suspect the constant references to the RFCs in their knowledgebase would not
be there :)

Bill Gates has announced that he is stepping down from day-to-day
involvement with the management of Microsoft. But he will remain chairman
for the foreseeable future.

And I suspect we ought to hope that he does. You see, if Bill left and
Microsoft became totally shareholder-profit-driven, ALL decisions would be
made by the Pension Funds and HMOs that own the bulk of the shares. And we
all know what nice people THEY are :)

Cheers


MicroSoft and "Open Format" seem an oxymoron to me.

They have made their life's work out of being secretive, and Fighting
using any standards.

Look at the World Wide Web Consortium as an example.
Microsoft is one of the Major signatories. Yet Their web browser makes
use of stuff such as Active-X, and self-healing so That a monkey can
write code for a website that will work in IE. But will not work in any
other browser.

The main reason for MS joining was to find out what code The W3C was
agreeing to settle on as official standards, and then figure out ways to
break it so that certain sites can only use IE to view it. Use a captive
audience.

Maybe When Billy leaves office and lets other run the show Maybe they
can grow business by using "honest" competition. Instead of do it my way
or not at all , that we have seen since MS inception.

--

Please reply to the newsgroup to maintain the thread. Please do not email
me unless I ask you to.

John McGhie <[email protected]>
Microsoft MVP, Word and Word for Macintosh. Consultant Technical Writer
Sydney, Australia +61 (0) 4 1209 1410
 
C

Chris Ridd

Hi Chris:

Yeah: The key question is "Does 'xyz application' support XML?

Let's talk through the implications of this a bit, for the benefit of others
reading here who may be new to the concepts fundamental to "markup
languages" such as XML ...

If an application *really* supports XML, it doesn't matter WHOSE XML you're
looking at, it will handle it. The whole point of XML (and the defining
difference between XML and SGML...) is that while BOTH are "open" standards,
XML is, by its very design "extensible".

The coming versions of Microsoft Office will cheerfully read and write
OpenDoc format, on both the Mac and the PC. We will need to obtain and
install the OpenDoc DTD and some transforms for it. I guess Sun has them
available for download?

DTDs are very old school I'm afraid - the cool kids have been using XML
schemas now for a while.

Yes, there is an openly available and usable (no strings attached)
schema for Open Document.
The new versions of Microsoft Office are XML applications: they will read
and write ANY document in XML, provided that it is "well formed". "Well
Formed" is a technical term that means the internal coding of the file
follows the XML syntax requirements.

Now, *any* of the markup language applications supports (relies upon, in
some cases...) the "EMPTY" property of a tag to incorporate content that is
not part of the coding specification. "Empty" in this case means that the

Which bit of XML spec talks about these?

Cheers,

Chris
 
C

Chris Ridd

Hi Phillip:

Thanks -- I needed a good giggle :) But surely not even you believe this
stuff?

If Microsoft really had made a life's work out of not using any standards, I
suspect the constant references to the RFCs in their knowledgebase would not
be there :)

One simple example: although Active Directory *uses* the LDAP RFCs, it
violates them by forcing some protocol fields to contain non-standard
values.

In most people's eyes the word "uses" doesn't include "abuses" :)

Cheers,

Chris
 
P

Phillip M. Jones, CE.T.

Try a test on your vacation for laughs.

Locate some Strictly MS oriented websites, one that use MS FrontPage to
design.

View them in IE.

Then Try them in Opera, or FireFox or any other non Ms Web Browser.

Run the sites through the W3C web page analyzers. See how many standards
defects are there.

Even web pages designed and saved from Word has issues. Just run the
page through MacroMedia's Dreamweaver's command Fix Word HTML defects.

See how many HTML Defects it finds.

Web Pages should be the in the same form the same form regardless of the
Browser used. But Ms doesn't want that. They want pages designed in
such a way That no one can read or view them except with IE. And On The
Mac that puts us out because they have discontinued IE for Mac because
Jobs added Safari, just so people could have another choice. So there
are many sites That are written that have information Mac users could
use, but can't get at it because FronTPage was used to design it, And
therefore Only IE users can view it.
Hi Phillip:

Thanks -- I needed a good giggle :) But surely not even you believe this
stuff?

If Microsoft really had made a life's work out of not using any standards, I
suspect the constant references to the RFCs in their knowledgebase would not
be there :)

Bill Gates has announced that he is stepping down from day-to-day
involvement with the management of Microsoft. But he will remain chairman
for the foreseeable future.

And I suspect we ought to hope that he does. You see, if Bill left and
Microsoft became totally shareholder-profit-driven, ALL decisions would be
made by the Pension Funds and HMOs that own the bulk of the shares. And we
all know what nice people THEY are :)

Cheers


--
------------------------------------------------------------------------
Phillip M. Jones, CET |LIFE MEMBER: VPEA ETA-I, NESDA, ISCET, Sterling
616 Liberty Street |Who's Who. PHONE:276-632-5045, FAX:276-632-0868
Martinsville Va 24112 |[email protected], ICQ11269732, AIM pjonescet
------------------------------------------------------------------------

If it's "fixed", don't "break it"!

mailto:p[email protected]

<http://www.kimbanet.com/~pjones/default.htm>
<http://www.kimbanet.com/~pjones/90th_Birthday/index.htm>
<http://www.kimbanet.com/~pjones/Fulcher/default.html>
<http://www.kimbanet.com/~pjones/Harris/default.htm>
<http://www.kimbanet.com/~pjones/Jones/default.htm>

<http://www.vpea.org>
 
J

John McGhie [MVP - Word and Word Macintosh]

You mean... Like www.word.mvps.org?

Phillip, I'm the webmaster for it. I know all about trying to get stuff
working across five different browsers. (Or rather: I know how to waste
*days* failing to do so...)

Apple Safari used to be the worst of the bunch. Now, the worst of them is
IE 6. IE 7 is practically civilised in comparison...

Cheers


Try a test on your vacation for laughs.

Locate some Strictly MS oriented websites, one that use MS FrontPage to
design.

View them in IE.

Then Try them in Opera, or FireFox or any other non Ms Web Browser.

Run the sites through the W3C web page analyzers. See how many standards
defects are there.

Even web pages designed and saved from Word has issues. Just run the
page through MacroMedia's Dreamweaver's command Fix Word HTML defects.

See how many HTML Defects it finds.

Web Pages should be the in the same form the same form regardless of the
Browser used. But Ms doesn't want that. They want pages designed in
such a way That no one can read or view them except with IE. And On The
Mac that puts us out because they have discontinued IE for Mac because
Jobs added Safari, just so people could have another choice. So there
are many sites That are written that have information Mac users could
use, but can't get at it because FronTPage was used to design it, And
therefore Only IE users can view it.

--

Please reply to the newsgroup to maintain the thread. Please do not email
me unless I ask you to.

John McGhie <[email protected]>
Microsoft MVP, Word and Word for Macintosh. Consultant Technical Writer
Sydney, Australia +61 (0) 4 1209 1410
 
J

John McGhie [MVP - Word and Word Macintosh]

Sure, and "Recall this Message" has never *officially* been supported by the
Mail RFCs (regardless of how much I occasionally wish it were...)

But you can say the same thing about ANY software company. None of them are
perfect (I don't think *any* of them are even "good"). Because Microsoft
produces more software than many companies, it produces more mistakes than
many companies.

But if you really want an exercise in frustration, try getting IBM or Apple
or Sun to observe their own standards :)

One simple example: although Active Directory *uses* the LDAP RFCs, it
violates them by forcing some protocol fields to contain non-standard
values.

In most people's eyes the word "uses" doesn't include "abuses" :)

Cheers,

Chris

--

Please reply to the newsgroup to maintain the thread. Please do not email
me unless I ask you to.

John McGhie <[email protected]>
Microsoft MVP, Word and Word for Macintosh. Consultant Technical Writer
Sydney, Australia +61 (0) 4 1209 1410
 
J

John McGhie [MVP - Word and Word Macintosh]

Yeah, I know DTDs are ancient technology. Their one saving grace is that
they "always" work. I don't trust Schemas 100 per cent yet. They're lovely
when they work, and Office 2007 supports (and uses) them.

However, if I were working "cross platform" trying to make Office do the
right thing with Open Doc, I would rather bet my life on doing it with a DTD
and a FOSI than with a Scheme.

EMPTY properties, see http://www.w3.org/TR/rdf-syntax-grammar/

Cheers

DTDs are very old school I'm afraid - the cool kids have been using XML
schemas now for a while.

Yes, there is an openly available and usable (no strings attached)
schema for Open Document.


Which bit of XML spec talks about these?

Cheers,

Chris

--

Please reply to the newsgroup to maintain the thread. Please do not email
me unless I ask you to.

John McGhie <[email protected]>
Microsoft MVP, Word and Word for Macintosh. Consultant Technical Writer
Sydney, Australia +61 (0) 4 1209 1410
 
C

Chris Ridd

Yeah, I know DTDs are ancient technology. Their one saving grace is that
they "always" work. I don't trust Schemas 100 per cent yet. They're lovely
when they work, and Office 2007 supports (and uses) them.

I'm sure you're right. We can probably say "bah" and "humbug" at this point!
However, if I were working "cross platform" trying to make Office do the
right thing with Open Doc, I would rather bet my life on doing it with a DTD
and a FOSI than with a Scheme.

EMPTY properties, see http://www.w3.org/TR/rdf-syntax-grammar/

I think that might be specific to RDF, and not XML in general.

My very limited understanding of XML extensibility is that it is made
clear to the parser/application which elements are foreign by declaring
namespaces in the root element. I don't think any general spec says
what an application's meant to do when alien elements or attributes are
discovered in a document.

If you have foreign stuff in a document, validation through a DTD
becomes impossible. I've a feeling that you may be able to validate
such a document using a schema?

I fully anticipate being corrected on some or all of the above!

Cheers,

Chris
 
P

Peter Jamieson

My very limited understanding of XML extensibility is that it is made
clear to the parser/application which elements are foreign by declaring
namespaces in the root element.

I would see it slightly differently.

In essence the parser determines whether the XML provided to it, i.e.
a. is "well-formed" (i.e. conforms to XML syntax rules, e.g. that elements
begin with "<" and end with ">")
b. is "valid" (i.e. conforms to any XML Schemas that the parser is obliged
to honour)

The parser isn't expected to know what any of the XML "means." and it has no
way of knowing what the application "understands" or what the application
sees as "native" or "foreign."

It's up to the application what it makes of the XML coming from the parser.
I would agree that there is no general-purposes XML mechanism that tells an
application what to do with XML it doesn't recognise, but that does not
prevent an application from trying to do something sensible with that XML.
It could do that by (for example)
a. being provided with enough information in XML elements it /does/
understand to locate and invoke an application (cf. an olde worlde Microsoft
Office applet) that knows what to do eith the "alien" elements. That's more
or less what Word does with embedded objects in .doc format documents, and
there's no reason why the OLE/ActiveX mechanism could not be modelled using
XML.
b. trying to guess what application to use based on the XML content.
Hazardous, no doubt!
c. providing a standardised "I don't understand this content" response,
e.g. by displaying a placeholder, or simply displaying "I couldn't
understand the following XML" and quoting the XML, or some such.

It may also be worth pointing out that in the general case, a parser may not
have much info. about the schemas associated with an XML document. The
schemas, or rather the associated namespasces, may be named in the document
(e.g. using attributes that define alias names), but the names are just
names, primarily intended to ensure uniqueness in the event of name clashes
(which is what namespaces are all about). Even where the names look like
URLs they do not necessarily point to any additional information about the
schema. For example, one of the first snippets in a WordProcessingML
document defines an alias "w" as follows:

<w:wordDocument
xmlns:w="http://schemas.microsoft.com/office/word/2003/wordml"

If you open the http addres in your browser, there is nothing there - in
fact I get a "page does not exist" error. There is another mecahnism for
providing information about the location of schema information to a parser,
but as far as the XML standards are concerned, the mechanism merely provides
a /hint/ - the parser can choose to go and find the schema, or ignore the
hint. If an application wants a parser to honour the hint, it had better use
a parser that can be relied on to do so.

Peter Jamieson
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Similar Threads


Top