HTML entities in href links

Aaron J. Seigo aseigo at kde.org
Fri Oct 17 23:44:59 BST 2003


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

hi...

there's been a recent serious regression in KHTML and it's related to revision 
1.275 of html/htmltokenizer.cpp:

- ----------------------------
revision 1.275
date: 2003/10/09 14:39:41;  author: mueller;  state: Exp;  lines: +11 -8
improve tokenizing of unterminated entities
- ---------------------------

now, if you have a link like this:

<a href="http://somewhere.com/foo.php?param=1&section=2">

it'll become:

<a href="http://somewhere.com/foo.php?param=1%A7ion=2">

this is because § is an HTML entity for ASCII character %A7, and it 
renders the link broken. it seems that this either needs to be reverted/fixed 
or links should be treated specially (though i doubt the tokenizer knows 
enough about what it's tokenizing to do so...)

i'm rebuilding html now w/out this commit to see if it fixes it

- -- 
Aaron J. Seigo
GPG Fingerprint: 8B8B 2209 0C6F 7C47 B1EA  EE75 D6B7 2EB1 A7F1 DB43
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.2-rc1-SuSE (GNU/Linux)

iD8DBQE/kHDu1rcusafx20MRAmZhAKCtIGoQiDufeCgVMy+T2KYUil4ZSACfe/4+
f016Ny2VmUNnbNIXrhV5mwk=
=jXI5
-----END PGP SIGNATURE-----




More information about the kfm-devel mailing list