[kde] [Bug 463848] KDE Text Encoding for Korean (applies to KWrite and SubtitleComposer in Flatpaks)

Sat Jan 7 13:18:41 GMT 2023

https://bugs.kde.org/show_bug.cgi?id=463848

--- Comment #2 from Shinjo Park <kde at peremen.name> ---
As I didn't stored any Korean text in CP949 and made everything UTF-8 since
decades ago, even I was not fully aware of the situation...

> This price was worth it to ensure that a typo character (`낥` instead of `날`) would not be lost.
Not only for typo or character in composition, but also for some proper names
and newly invented words.

> It only knows "ms949", "windows-949", and "windows-949-2000", which seem to all refer to the same codec?
According to https://icu4c-demos.unicode.org/icu-bin/convexp?conv=windows-949
it seems true. To make thing more worse, one of the alias (KSC_5601) is shared
between CP949 and EUC-KR internally in ICU (see
https://icu4c-demos.unicode.org/icu-bin/convexp?conv=euc-kr). As CP949 is a
superset of EUC-KR, most Korean implementations interchangeably use them, this
is somewhat reflected even in the WHATWG
(https://encoding.spec.whatwg.org/#names-and-labels). So I think it is just a
matter of naming.

-- 
You are receiving this mail because:
You are the assignee for the bug.