J
John McGhie [MVP - Word and Word Macintosh]
All:
We're going to get a lot of questions about the new Office File Formats
coming down the pike, so let's get the ball rolling.
* Microsoft Office is changing to new file formats in the next release.
* The new formats are nothing like the old formats. Not even "related".
* Yes, this is a very good idea. They held off as long as they could to
avoid disruption, but it's a move that has been long overdue.
* Word, Excel and PowerPoint will move first, in Office 2007/Mac Office 12,
the coming iteration. Everything else will change over next time.
The old formats are NOT text, they are binary structures in which
"containers" of data of each type are linked by binary pointers that
indicate the off-set of each container from the end of the file.
The new formats are Open (published) compressed XML, very similar to HTML.
Each document is a mini-web site, compressed with the standard Zip
compression algorithm. It may contain binary objects for things such as
graphics, but usually it won't.
Just a tip for young players: when Word changes file formats then don't
up-convert HEAVILY customised templates, re-make them.
While up-conversion works fine for documents, templates have a more complex
file structure and may have more complex widgets in them. Whenever you
convert from one file format to another, you may be asking the converter to
re-express relational binary objects (List Templates and Style Tables come
to mind).
If you are, what you get depends upon whether the new file format has a
direct equivalent of the object. In a document, you can usually take it as
read that up-conversion will be totally safe. In a template, not
necessarily so: you need to test it.
DOWN conversion, on the other hand, is another name for suicide. If you
convert a document UP, keep the old one if you need the old format. If you
need the old version of the application to read the new content, you can
save back, but you may lose data doing so, so keep the new version.
Don't even THINK about round-tripping between file formats. You WILL lose
data, and almost certainly gain corruption.
In the forthcoming conversion, up-conversion from .doc to .docx is
guaranteed to work. Up conversion from .html to .docx will work.
Up conversion from .dot to .doct is probably OK. You'll get a file that
should work without problems.
Up-conversion from .dot to .docm, or down-conversion from .docm to .doc
simply won't work. According to my reading of what is supposed to happen, I
suspect the application will actually disallow it.
You have been warned
Cheers
--
Please reply to the newsgroup to maintain the thread. Please do not email
me unless I ask you to.
John McGhie <[email protected]>
Microsoft MVP, Word and Word for Macintosh. Consultant Technical Writer
Sydney, Australia +61 (0) 4 1209 1410
We're going to get a lot of questions about the new Office File Formats
coming down the pike, so let's get the ball rolling.
* Microsoft Office is changing to new file formats in the next release.
* The new formats are nothing like the old formats. Not even "related".
* Yes, this is a very good idea. They held off as long as they could to
avoid disruption, but it's a move that has been long overdue.
* Word, Excel and PowerPoint will move first, in Office 2007/Mac Office 12,
the coming iteration. Everything else will change over next time.
The old formats are NOT text, they are binary structures in which
"containers" of data of each type are linked by binary pointers that
indicate the off-set of each container from the end of the file.
The new formats are Open (published) compressed XML, very similar to HTML.
Each document is a mini-web site, compressed with the standard Zip
compression algorithm. It may contain binary objects for things such as
graphics, but usually it won't.
Just a tip for young players: when Word changes file formats then don't
up-convert HEAVILY customised templates, re-make them.
While up-conversion works fine for documents, templates have a more complex
file structure and may have more complex widgets in them. Whenever you
convert from one file format to another, you may be asking the converter to
re-express relational binary objects (List Templates and Style Tables come
to mind).
If you are, what you get depends upon whether the new file format has a
direct equivalent of the object. In a document, you can usually take it as
read that up-conversion will be totally safe. In a template, not
necessarily so: you need to test it.
DOWN conversion, on the other hand, is another name for suicide. If you
convert a document UP, keep the old one if you need the old format. If you
need the old version of the application to read the new content, you can
save back, but you may lose data doing so, so keep the new version.
Don't even THINK about round-tripping between file formats. You WILL lose
data, and almost certainly gain corruption.
In the forthcoming conversion, up-conversion from .doc to .docx is
guaranteed to work. Up conversion from .html to .docx will work.
Up conversion from .dot to .doct is probably OK. You'll get a file that
should work without problems.
Up-conversion from .dot to .docm, or down-conversion from .docm to .doc
simply won't work. According to my reading of what is supposed to happen, I
suspect the application will actually disallow it.
You have been warned
Cheers
--
Please reply to the newsgroup to maintain the thread. Please do not email
me unless I ask you to.
John McGhie <[email protected]>
Microsoft MVP, Word and Word for Macintosh. Consultant Technical Writer
Sydney, Australia +61 (0) 4 1209 1410