If you save as "Web Page, Filtered" the resulting code is cleaned up pretty
good.
I guess "pretty good" is a relative term. I created a simple test
document containing three headers, some text, an unordered list, and
an ordered list. I converted it to html using the Save As as you
suggested and then using
http://word2cleanhtml.com. You tell me which
is "cleaned up".
Here's the output from word2cleanhtml.com:
<div>
<h1>
Head 1
</h1>
<p>
Sample document to compare converting to html using Save As
and <a
href="
http://word2cleanhtml.com/">
http://word2cleanhtml.com/</a>.
</p>
<h2>
Head 2
</h2>
<p>
Some text…
</p>
<h3>
Head 3
</h3>
<p>
Some text…
</p>
<p>
Unordered list:
</p>
<ul>
<li>
Item 1
</li>
<li>
Item 2
</li>
<li>
Item 3
</li>
</ul>
<p>
Ordered list:
</p>
<p>
1. Item 1
</p>
<p>
2. Item 2
</p>
</div>
Here's the Word output:
<html>
<head>
<meta http-equiv=Content-Type content="text/html;
charset=windows-1252">
<meta name=Generator content="Microsoft Word 12 (filtered)">
<title>January 31, 2002</title>
<style>
<!--
/* Font Definitions */
@font-face
{font-family:Wingdings;
panose-1:5 0 0 0 0 0 0 0 0 0;}
@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:"Arial Unicode MS";
panose-1:2 11 6 4 2 2 2 2 2 4;}
@font-face
{font-family:Cambria;
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:Tahoma;
panose-1:2 11 6 4 3 5 4 4 2 4;}
@font-face
{font-family:"EngraversGothic BT";
panose-1:2 11 5 7 2 2 3 2 2 4;}
@font-face
{font-family:Verdana;
panose-1:2 11 6 4 3 5 4 4 2 4;}
@font-face
{font-family:"\@Arial Unicode MS";
panose-1:2 11 6 4 2 2 2 2 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Times New Roman","serif";
color:windowtext;}
h1
{margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
page-break-after:avoid;
font-size:14.0pt;
font-family:"Arial","sans-serif";
color:#548DD4;
font-weight:bold;}
h2
{margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
page-break-after:avoid;
font-size:14.0pt;
font-family:"Arial","sans-serif";
color:#548DD4;
font-weight:bold;}
h3
{mso-style-link:"Heading 3 Char";
margin-top:10.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
page-break-after:avoid;
font-size:12.0pt;
font-family:"Cambria","serif";
color:#4F81BD;
font-weight:bold;}
p.MsoFootnoteText, li.MsoFootnoteText, div.MsoFootnoteText
{margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
font-size:10.0pt;
font-family:"Times New Roman","serif";
color:windowtext;}
p.MsoHeader, li.MsoHeader, div.MsoHeader
{margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Times New Roman","serif";
color:windowtext;}
p.MsoFooter, li.MsoFooter, div.MsoFooter
{margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Times New Roman","serif";
color:windowtext;}
span.MsoFootnoteReference
{vertical-align:super;}
p.MsoBodyText, li.MsoBodyText, div.MsoBodyText
{margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Times New Roman","serif";
color:red;}
p.MsoBodyTextIndent, li.MsoBodyTextIndent, div.MsoBodyTextIndent
{margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:.25in;
margin-bottom:.0001pt;
font-size:11.0pt;
font-family:"Times New Roman","serif";
color:windowtext;
font-weight:bold;
font-style:italic;}
a:link, span.MsoHyperlink
{color:blue;
text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
{color
urple;
text-decoration:underline;}
p.MsoDocumentMap, li.MsoDocumentMap, div.MsoDocumentMap
{mso-style-link:"Document Map Char";
margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
font-size:8.0pt;
font-family:"Tahoma","sans-serif";
color:windowtext;}
p
{margin-right:0in;
margin-left:0in;
font-size:10.0pt;
font-family:"Verdana","sans-serif";
color:black;}
pre
{margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
font-size:10.0pt;
font-family:"Arial Unicode MS","sans-serif";
color:windowtext;}
p.MsoListParagraph, li.MsoListParagraph, div.MsoListParagraph
{margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:.5in;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Times New Roman","serif";
color:windowtext;}
p.MsoListParagraphCxSpFirst, li.MsoListParagraphCxSpFirst,
div.MsoListParagraphCxSpFirst
{margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:.5in;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Times New Roman","serif";
color:windowtext;}
p.MsoListParagraphCxSpMiddle, li.MsoListParagraphCxSpMiddle,
div.MsoListParagraphCxSpMiddle
{margin-top:0in;
margin-right:0in;
margin-bottom:0in;
margin-left:.5in;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Times New Roman","serif";
color:windowtext;}
p.MsoListParagraphCxSpLast, li.MsoListParagraphCxSpLast,
div.MsoListParagraphCxSpLast
{margin-top:0in;
margin-right:0in;
margin-bottom:0in;
margin-left:.5in;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Times New Roman","serif";
color:windowtext;}
p.BizLtrHdHdrSm, li.BizLtrHdHdrSm, div.BizLtrHdHdrSm
{mso-style-name:$BizLtrHdHdrSm;
margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
font-size:9.0pt;
font-family:"EngraversGothic BT","sans-serif";
color:windowtext;}
p.BizLtrHdHdrLg, li.BizLtrHdHdrLg, div.BizLtrHdHdrLg
{mso-style-name:$BizLtrHdHdrLg;
margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
text-align:center;
font-size:18.0pt;
font-family:"EngraversGothic BT","sans-serif";
color:windowtext;}
p.BizLtrHdHdrMd, li.BizLtrHdHdrMd, div.BizLtrHdHdrMd
{mso-style-name:$BizLtrHdHdrMd;
margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
text-align:center;
font-size:12.0pt;
font-family:"EngraversGothic BT","sans-serif";
color:windowtext;}
span.DocumentMapChar
{mso-style-name:"Document Map Char";
mso-style-link:"Document Map";
font-family:"Tahoma","sans-serif";}
p.Title, li.Title, div.Title
{mso-style-name:$Title;
mso-style-link:"$Title Char";
margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
font-size:14.0pt;
font-family:"Times New Roman","serif";
color:#548DD4;
font-weight:bold;}
p.Instructions, li.Instructions, div.Instructions
{mso-style-name:$Instructions;
mso-style-link:"$Instructions Char";
margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Arial","sans-serif";
color:blue;
font-weight:bold;}
span.TitleChar
{mso-style-name:"$Title Char";
mso-style-link:$Title;
color:#548DD4;
font-weight:bold;}
span.InstructionsChar
{mso-style-name:"$Instructions Char";
mso-style-link:$Instructions;
font-family:"Arial","sans-serif";
color:blue;
font-weight:bold;}
span.Heading3Char
{mso-style-name:"Heading 3 Char";
mso-style-link:"Heading 3";
font-family:"Cambria","serif";
color:#4F81BD;
font-weight:bold;}
..MsoChpDefault
{font-size:10.0pt;}
/* Page Definitions */
@page Section1
{size:8.5in 11.0in;
margin:.5in .5in .3in .5in;}
div.Section1
{page:Section1;}
/* List Definitions */
ol
{margin-bottom:0in;}
ul
{margin-bottom:0in;}
-->
</style>
</head>
<body lang=EN-US link=blue vlink=purple>
<div class=Section1>
<h1>Head 1</h1>
<p class=MsoNormal>Sample document to compare converting to html using
Save As
and <a
href="
http://word2cleanhtml.com/">
http://word2cleanhtml.com/</a>.</p>
<h2>Head 2</h2>
<p class=MsoNormal>Some text…</p>
<h3>Head 3</h3>
<p class=MsoNormal>Some text…</p>
<p class=MsoNormal>Unordered list:</p>
<p class=MsoListParagraphCxSpFirst style='text-indent:-.25in'><span
style='font-family:Symbol'>·<span style='font:7.0pt "Times New
Roman"'>
</span></span>Item 1</p>
<p class=MsoListParagraphCxSpMiddle style='text-indent:-.25in'><span
style='font-family:Symbol'>·<span style='font:7.0pt "Times New
Roman"'>
</span></span>Item 2</p>
<p class=MsoListParagraphCxSpLast style='text-indent:-.25in'><span
style='font-family:Symbol'>·<span style='font:7.0pt "Times New
Roman"'>
</span></span>Item 3</p>
<p class=MsoNormal>Ordered list:</p>
<p class=MsoListParagraphCxSpFirst
style='margin-left:.45in;text-indent:-.25in'>1.<span
style='font:7.0pt "Times New Roman"'>
</span>Item
1</p>
<p class=MsoListParagraphCxSpLast
style='margin-left:.45in;text-indent:-.25in'>2.<span
style='font:7.0pt "Times New Roman"'>
</span>Item
2</p>
</div>
</body>
</html>