Changing charset results in ? instead of å, ä and ö

WhiplashInfo.se

I have published my site using, among other encodings, Windows-1252 on many pages.
At present I use FP2003, but I started the site using FP-98.

I have more than 3,000 pages and need a way to change to UTF-8 that
doesn't result in losing all the Swedish characters å, ä, ö, Å, Ä and Ö.

Is there maybe an add-in or other solution that spares me from manually
changing every affected page?

Thanks
 
Thomas A. Rowe

If UTF-8 doesn't support the Swedish characters å,ä,ö, Å,Ä and Ö, then you can't use it.


--
==============================================
Thomas A. Rowe
Microsoft MVP - FrontPage

http://www.Ecom-Data.com
==============================================
 
WhiplashInfo.se

I expressed myself unclearly, I'm afraid. UTF-8 supports Swedish characters;
that's not my problem.

The problem occurs when I change the charset in a page from Windows-1252 to
UTF-8 or ISO-8859. Then all Swedish characters disappear and a ? shows
instead.

Tomas
 
Ronx

Are you viewing the page from a server? If so, it could be that the
server is adding a charset header that overrules the charset your page
declares. You can check this with Firefox and the developer toolbar
add-in, or post a link and I will check for you.

--
Ron Symonds - Microsoft MVP (FrontPage)
Reply only to group - emails will be deleted unread.

http://www.rxs-enterprises.org/fp

FrontPage Support: http://www.frontpagemvps.com/
 
Ronx

Your server is not adding any charset header. All your Windows-1252
pages appear OK in IE7 and FireFox - but that was not the problem.

However, your Blog server is adding the header
Content-Type: text/html; charset=UTF-8, which over-rules any encoding in
the page.
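
For anyone who wants to check such a header without a browser add-in, here is a minimal Python sketch. The function names charset_from_content_type and server_charset are made up for this illustration, and server_charset needs a real URL and network access, so it is only defined here and not called.

```python
from email.message import Message
from urllib.request import urlopen


def charset_from_content_type(header_value):
    """Return the lowercased charset parameter of a Content-Type value, or None."""
    msg = Message()
    msg["Content-Type"] = header_value
    return msg.get_content_charset()


def server_charset(url):
    """Fetch a page and report the charset declared in the HTTP response header."""
    with urlopen(url) as response:
        return response.headers.get_content_charset()


# The header quoted above carries a charset; a bare media type does not.
print(charset_from_content_type("text/html; charset=UTF-8"))
print(charset_from_content_type("text/html"))
```

If the server reports a charset, browsers generally trust it over the meta tag inside the page, which matches the behaviour described above.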

Changing all pages to UTF-8 should not cause any problems, but including
a UTF-8 encoded page in a Windows-1252 or ISO-8859 encoded page will
present the problem you have.


Using a copy of your home page with FrontPage 2003, Expression Web, IE7
and FireFox 2 on disc based and server based webs, I changed the page to
UTF-8 without problems.

Try changing the default language setting for your website - Tools->Site
Settings - Language Tab
Set the Default Page encoding to UTF-8 etc.

Then change the pages to UTF-8.
In my tests I kept the default setting as Windows 1252.
 
WhiplashInfo.se

I tried that, with the result that other included pages showed garbage
characters.

Could the problem be related to the fact that I at the same time added a <!DOCTYPE HTML
PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"> declaration to all pages on
the site?

What about ASP pages that also use HTML? Should I declare the charset and
DOCTYPE in these pages, or...?

Tomas
 
Ronx

The problem is that you are including pages that have a different
character encoding to your main page. When this happens some characters
do not display correctly.

Having said that, your home page displays correctly in all my browsers.
Changing the character encoding to UTF-8 makes no difference.

If you change ALL the pages to UTF-8, including all the included files,
there should be no problems.
This has nothing to do with the <!doctype in the page(s). All pages
need a charset declaration, and need to be saved with the declared
charset encoding.
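
Since the original question asked for a way to avoid editing 3,000 pages by hand, here is a rough Python sketch of a batch conversion done outside FrontPage. It assumes the files that are not already valid UTF-8 are Windows-1252, and that the charset is declared in a meta tag as charset=windows-1252 or charset=iso-8859-1. The names to_utf8 and convert_tree are invented for this example, and it should only be run on a backup copy of the site.

```python
import re
from pathlib import Path

# Matches the charset value in a meta tag, e.g. charset=windows-1252
META_CHARSET = re.compile(
    r"(charset\s*=\s*[\"']?)(windows-1252|iso-8859-1)", re.IGNORECASE
)


def to_utf8(raw):
    """Return the page bytes re-encoded as UTF-8 with the declared charset updated."""
    try:
        # Plain ASCII and already-converted pages decode cleanly as UTF-8.
        text = raw.decode("utf-8")
    except UnicodeDecodeError:
        # Assumption: anything else was saved as Windows-1252.
        text = raw.decode("cp1252")
    text = META_CHARSET.sub(r"\g<1>utf-8", text)
    return text.encode("utf-8")


def convert_tree(root, patterns=("*.htm", "*.html", "*.asp")):
    """Rewrite every matching file under root in place (work on a copy)."""
    for pattern in patterns:
        for path in Path(root).rglob(pattern):
            path.write_bytes(to_utf8(path.read_bytes()))


sample = (
    '<meta http-equiv="Content-Type" '
    'content="text/html; charset=windows-1252"><p>åäö ÅÄÖ</p>'
).encode("cp1252")
print(to_utf8(sample).decode("utf-8"))
```

The point is that the declaration and the saved bytes are changed together, so the two always agree. Running it a second time on the same file leaves it unchanged.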
 
WhiplashInfo.se

OK Ron, but what I still can’t understand is why the Swedish characters
disappear and a ? character is displayed instead in an HTML page when I
change the charset in that same page.

I understand that the declared charset instructs the browser how to display
text, but how does the declared charset interact with the page in question so
that it removes the Swedish characters?

Tomas
 
Ronx

As I said, I cannot duplicate the problem, and with the pages on your
site I cannot see the problem. All the accented characters have accents
- there are no out of place ? marks.

I have taken your Home page and edited it in FrontPage 2003, changing
the charset to UTF-8 - no visible change when previewed in Browser. No
change when published to my server.

The only way to get the ? marks is to save the page with one charset,
and display it with another.
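
A small Python sketch of that mismatch, purely as an illustration (the variable names are invented here, and this shows byte-level decoding, not what FrontPage does internally). The characters are not removed from the file; the same bytes are simply read under the wrong rules.

```python
text = "å, ä, ö"

# Saved as Windows-1252 but displayed as UTF-8: each Swedish letter is a
# single byte that is not valid UTF-8, so it becomes U+FFFD, the replacement
# character, which browsers usually draw as a question-mark glyph.
saved_1252 = text.encode("cp1252")
shown_as_utf8 = saved_1252.decode("utf-8", errors="replace")
print(shown_as_utf8)

# Saved as UTF-8 but displayed as Windows-1252: each letter is two bytes,
# and each byte is shown as its own character, giving "garbage" pairs.
saved_utf8 = text.encode("utf-8")
shown_as_1252 = saved_utf8.decode("cp1252")
print(shown_as_1252)

# When the bytes and the declared charset agree, the text survives intact.
print(saved_utf8.decode("utf-8"))
```

The first case matches the ? marks reported in this thread, the second matches the garbage characters seen in the included pages.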

Try changing the default language setting in Tools->Site Settings -
Language tab.
Then change the charset in the pages and save the pages from FrontPage -
This means opening each page and saving it, not using the Change and
Replace dialogue which may not affect the charset the page is saved in.

Finally, are the ? marks appearing in the whole page or just the
included sections? Where do the included sections come from? I have
noticed a section being included by means of JavaScript from a different
server - perhaps this server is changing the charset for the included
section.

 
