Word to HTML for Craigslist ad

S

Sesquipedalian Sam

I need to write a few ads for Craigslist. If I write them in Word and
paste them into the ad, much of the formatting is lost. If I use Word
just for the text and manually code the html tags, the document is
pretty much unreadable.

I see that there are a number of tools that will convert a Word
document to html. Can anyone recommend one over the others?

All I need is simple tags for an ad: bold, italics, underline, head1
and head2, and lists (mostly unordered).

It doesn't need to be free.
 
Y

Yves Dhondt

Just use the 'Other Formats' option from the 'Save As' menu in Word and
select 'Webpage, Filtered', as your type. That should keep things like bold,
italics, ...

Yves
 
S

Sesquipedalian Sam

Just use the 'Other Formats' option from the 'Save As' menu in Word and
select 'Webpage, Filtered', as your type. That should keep things like bold,
italics, ...

When I did that, I got a new document with filetype .htm that looks
like the original .docx document. I don't see any html markup.

What I need is a text document with html markup that I can paste into
the Craigslist ad.

Is there a way to get the markup?
 
Y

Yves Dhondt

Open the .htm file from within Notepad or Wordpad, that should give you the
html source. Alternatively, open the htm file in your favourite browser and
look for the 'Show source' function that almost every browser has.

Yves
 
S

Sesquipedalian Sam

Open the .htm file from within Notepad or Wordpad, that should give you the
html source. Alternatively, open the htm file in your favourite browser and
look for the 'Show source' function that almost every browser has.

OK. That worked, sort of. The source code has a lot of stuff in there.
After it was processed by Craig's List, it mostly looked good. Some of
the bulleted lists were a little off.

There are a lot of websites selling utilities to "clean up" the html
output from Word. The general feeling seems to be that Word does not
get html and really doesn't care. They want you to buy Front Page.

I found a website that does the conversion online:

http://word2cleanhtml.com/

I tried the same document there. It came out a lot simpler and the
formatted ad after processing by Craig's List looked a lot better.
 
Y

Yves Dhondt

If you save as "Web Page, Filtered" the resulting code is cleaned up pretty
good. But you have to realize that Word was never made to create webpages in
the first place, there exist other programs for that.

FrontPage doesn't exist anymore, so it seems unlikely that they want you to
buy it. FrontPage has been replaced by several other tools from Microsoft
depending on what you want to do. For creating ordinary webpages, try Visual
Web Developer
(http://www.microsoft.com/express/downloads/#2008-Visual-Web-Developer). It
is a free website creator from Microsoft. The only limitation is that you
can't use it for commercial purposes.

Yves
 
S

Sesquipedalian Sam

If you save as "Web Page, Filtered" the resulting code is cleaned up pretty
good.

I guess "pretty good" is a relative term. I created a simple test
document containing three headers, some text, an unordered list, and
an ordered list. I converted it to html using the Save As as you
suggested and then using http://word2cleanhtml.com. You tell me which
is "cleaned up".



Here's the output from word2cleanhtml.com:

<div>
<h1>
Head 1
</h1>
<p>
Sample document to compare converting to html using Save As
and <a
href="http://word2cleanhtml.com/">http://word2cleanhtml.com/</a>.
</p>
<h2>
Head 2
</h2>
<p>
Some text…
</p>
<h3>
Head 3
</h3>
<p>
Some text…
</p>
<p>
Unordered list:
</p>
<ul>
<li>
Item 1
</li>
<li>
Item 2
</li>
<li>
Item 3
</li>
</ul>
<p>
Ordered list:
</p>
<p>
1. Item 1
</p>
<p>
2. Item 2
</p>
</div>



Here's the Word output:

<html>

<head>
<meta http-equiv=Content-Type content="text/html;
charset=windows-1252">
<meta name=Generator content="Microsoft Word 12 (filtered)">
<title>January 31, 2002</title>
<style>
<!--
/* Font Definitions */
@font-face
{font-family:Wingdings;
panose-1:5 0 0 0 0 0 0 0 0 0;}
@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:"Arial Unicode MS";
panose-1:2 11 6 4 2 2 2 2 2 4;}
@font-face
{font-family:Cambria;
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:Tahoma;
panose-1:2 11 6 4 3 5 4 4 2 4;}
@font-face
{font-family:"EngraversGothic BT";
panose-1:2 11 5 7 2 2 3 2 2 4;}
@font-face
{font-family:Verdana;
panose-1:2 11 6 4 3 5 4 4 2 4;}
@font-face
{font-family:"\@Arial Unicode MS";
panose-1:2 11 6 4 2 2 2 2 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Times New Roman","serif";
color:windowtext;}
h1
{margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
page-break-after:avoid;
font-size:14.0pt;
font-family:"Arial","sans-serif";
color:#548DD4;
font-weight:bold;}
h2
{margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
page-break-after:avoid;
font-size:14.0pt;
font-family:"Arial","sans-serif";
color:#548DD4;
font-weight:bold;}
h3
{mso-style-link:"Heading 3 Char";
margin-top:10.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
page-break-after:avoid;
font-size:12.0pt;
font-family:"Cambria","serif";
color:#4F81BD;
font-weight:bold;}
p.MsoFootnoteText, li.MsoFootnoteText, div.MsoFootnoteText
{margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
font-size:10.0pt;
font-family:"Times New Roman","serif";
color:windowtext;}
p.MsoHeader, li.MsoHeader, div.MsoHeader
{margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Times New Roman","serif";
color:windowtext;}
p.MsoFooter, li.MsoFooter, div.MsoFooter
{margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Times New Roman","serif";
color:windowtext;}
span.MsoFootnoteReference
{vertical-align:super;}
p.MsoBodyText, li.MsoBodyText, div.MsoBodyText
{margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Times New Roman","serif";
color:red;}
p.MsoBodyTextIndent, li.MsoBodyTextIndent, div.MsoBodyTextIndent
{margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:.25in;
margin-bottom:.0001pt;
font-size:11.0pt;
font-family:"Times New Roman","serif";
color:windowtext;
font-weight:bold;
font-style:italic;}
a:link, span.MsoHyperlink
{color:blue;
text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
{color:purple;
text-decoration:underline;}
p.MsoDocumentMap, li.MsoDocumentMap, div.MsoDocumentMap
{mso-style-link:"Document Map Char";
margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
font-size:8.0pt;
font-family:"Tahoma","sans-serif";
color:windowtext;}
p
{margin-right:0in;
margin-left:0in;
font-size:10.0pt;
font-family:"Verdana","sans-serif";
color:black;}
pre
{margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
font-size:10.0pt;
font-family:"Arial Unicode MS","sans-serif";
color:windowtext;}
p.MsoListParagraph, li.MsoListParagraph, div.MsoListParagraph
{margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:.5in;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Times New Roman","serif";
color:windowtext;}
p.MsoListParagraphCxSpFirst, li.MsoListParagraphCxSpFirst,
div.MsoListParagraphCxSpFirst
{margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:.5in;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Times New Roman","serif";
color:windowtext;}
p.MsoListParagraphCxSpMiddle, li.MsoListParagraphCxSpMiddle,
div.MsoListParagraphCxSpMiddle
{margin-top:0in;
margin-right:0in;
margin-bottom:0in;
margin-left:.5in;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Times New Roman","serif";
color:windowtext;}
p.MsoListParagraphCxSpLast, li.MsoListParagraphCxSpLast,
div.MsoListParagraphCxSpLast
{margin-top:0in;
margin-right:0in;
margin-bottom:0in;
margin-left:.5in;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Times New Roman","serif";
color:windowtext;}
p.BizLtrHdHdrSm, li.BizLtrHdHdrSm, div.BizLtrHdHdrSm
{mso-style-name:$BizLtrHdHdrSm;
margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
font-size:9.0pt;
font-family:"EngraversGothic BT","sans-serif";
color:windowtext;}
p.BizLtrHdHdrLg, li.BizLtrHdHdrLg, div.BizLtrHdHdrLg
{mso-style-name:$BizLtrHdHdrLg;
margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
text-align:center;
font-size:18.0pt;
font-family:"EngraversGothic BT","sans-serif";
color:windowtext;}
p.BizLtrHdHdrMd, li.BizLtrHdHdrMd, div.BizLtrHdHdrMd
{mso-style-name:$BizLtrHdHdrMd;
margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
text-align:center;
font-size:12.0pt;
font-family:"EngraversGothic BT","sans-serif";
color:windowtext;}
span.DocumentMapChar
{mso-style-name:"Document Map Char";
mso-style-link:"Document Map";
font-family:"Tahoma","sans-serif";}
p.Title, li.Title, div.Title
{mso-style-name:$Title;
mso-style-link:"$Title Char";
margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
font-size:14.0pt;
font-family:"Times New Roman","serif";
color:#548DD4;
font-weight:bold;}
p.Instructions, li.Instructions, div.Instructions
{mso-style-name:$Instructions;
mso-style-link:"$Instructions Char";
margin-top:6.0pt;
margin-right:0in;
margin-bottom:0in;
margin-left:0in;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Arial","sans-serif";
color:blue;
font-weight:bold;}
span.TitleChar
{mso-style-name:"$Title Char";
mso-style-link:$Title;
color:#548DD4;
font-weight:bold;}
span.InstructionsChar
{mso-style-name:"$Instructions Char";
mso-style-link:$Instructions;
font-family:"Arial","sans-serif";
color:blue;
font-weight:bold;}
span.Heading3Char
{mso-style-name:"Heading 3 Char";
mso-style-link:"Heading 3";
font-family:"Cambria","serif";
color:#4F81BD;
font-weight:bold;}
..MsoChpDefault
{font-size:10.0pt;}
/* Page Definitions */
@page Section1
{size:8.5in 11.0in;
margin:.5in .5in .3in .5in;}
div.Section1
{page:Section1;}
/* List Definitions */
ol
{margin-bottom:0in;}
ul
{margin-bottom:0in;}
-->
</style>

</head>

<body lang=EN-US link=blue vlink=purple>

<div class=Section1>

<h1>Head 1</h1>

<p class=MsoNormal>Sample document to compare converting to html using
Save As
and <a
href="http://word2cleanhtml.com/">http://word2cleanhtml.com/</a>.</p>

<h2>Head 2</h2>

<p class=MsoNormal>Some text…</p>

<h3>Head 3</h3>

<p class=MsoNormal>Some text…</p>

<p class=MsoNormal>Unordered list:</p>

<p class=MsoListParagraphCxSpFirst style='text-indent:-.25in'><span
style='font-family:Symbol'>·<span style='font:7.0pt "Times New
Roman"'>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;
</span></span>Item 1</p>

<p class=MsoListParagraphCxSpMiddle style='text-indent:-.25in'><span
style='font-family:Symbol'>·<span style='font:7.0pt "Times New
Roman"'>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;
</span></span>Item 2</p>

<p class=MsoListParagraphCxSpLast style='text-indent:-.25in'><span
style='font-family:Symbol'>·<span style='font:7.0pt "Times New
Roman"'>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;
</span></span>Item 3</p>

<p class=MsoNormal>Ordered list:</p>

<p class=MsoListParagraphCxSpFirst
style='margin-left:.45in;text-indent:-.25in'>1.<span
style='font:7.0pt "Times New Roman"'>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;
</span>Item
1</p>

<p class=MsoListParagraphCxSpLast
style='margin-left:.45in;text-indent:-.25in'>2.<span
style='font:7.0pt "Times New Roman"'>&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;
</span>Item
2</p>

</div>

</body>

</html>
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Top