Wrong charsets picked up

Thiago Macieira thiago.macieira at kdemail.net
Fri Apr 9 22:58:07 BST 2004


Hello,

I have done some tests in Konqueror regarding charsets and what I have 
come up with seems to be buggy behaviour.

Working case: HTML page on an HTTP server
	Konqueror honours the Content-Type parameter.

Broken case: HTML page on local disk, Konqueror on Auto-detect | Western 
european mode
	Konqueror doesn't try the locale's encoding. The result is that my 
UTF-8 pages get shown in Latin-1.
	Setting it to Auto-detect | Semi-automatic makes the detection work, 
but it'll break buggy sites that don't send charset: they get displayed 
in UTF-8, when they are not.

Broken case: external CSS stylesheet on an HTTP server
	Konqueror does not honour the Content-Type parameter.
	This probably affects JavaScript pages as well.

Broken case: external CSS stylesheet on local disk
	No attempt at guessing the encoding is done: Konqueror defaults to 
Latin-1 in all cases.

In the last two cases, adding a charset="utf-8" to the <link> line makes 
the encoding work as expected, but <style> and <script> external 
references are still broken.

Also for the last two cases, I've come up with a fix. It's attached to 
bug #79065. I want to commit it if possible.

As for the remaining case, I'd suggest that local file's default 
encoding be the locale encoding. Any thoughts?

-- 
  Thiago Macieira  -  Registered Linux user #65028
   thiago (AT) macieira (DOT) info
    ICQ UIN: 1967141   PGP/GPG: 0x6EF45358; fingerprint:
    E067 918B B660 DBD1 105C  966C 33F5 F005 6EF4 5358
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
URL: <https://mail.kde.org/mailman/private/kfm-devel/attachments/20040409/804e7d1d/attachment.sig>


More information about the kfm-devel mailing list