[konsole] [Bug 498808] Special character in output cuts rest of output

Matan Ziv-Av bugzilla_noreply at kde.org
Fri Jan 17 22:29:10 GMT 2025


https://bugs.kde.org/show_bug.cgi?id=498808

Matan Ziv-Av <matan at svgalib.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |matan at svgalib.org

--- Comment #2 from Matan Ziv-Av <matan at svgalib.org> ---
(In reply to g111 from comment #0)
> Created attachment 177465 [details]
> Example text with a character that breaks the output
> 
> SUMMARY
> I am using  e.g. "tail -f" to watch a logfile. If there is a special
> character in it (see the attached example file with the problematic char in
> the street field) the output stops with this character and nothing more is
> printed. The same happens when printing the file with "cat".
> 
> The konsole profile encoding is set to utf-8.
> 
> The special character originally is utf-8, but somehow, maybe when writing
> to the log file, utf-8 has become encoded wrong and the string is messed up.
> Nevertheless the output should be printed in the konsole.
> 
> As a workaround you can pipe the output to "cat -v". Or you can use xterm
> instead of konsole. Here the whole file is printed as expected.
> 
> STEPS TO REPRODUCE
> 1. save the attached file test-output.txt
> 2. open konsole and do "cat test-output.txt" or "tail -100 test-output.txt"
> 
> OBSERVED RESULT
> The output stops after "Meine Straƃ"
> 
> EXPECTED RESULT
> The whole file should be printed
> 
> SOFTWARE/OS VERSIONS
> Windows: 
> macOS: 
> (available in the Info Center app, or by running `kinfo` in a terminal
> window)
> Linux/KDE Plasma: 
> KDE Plasma Version: 
> KDE Frameworks Version: 
> Qt Version: 
> 
> ADDITIONAL INFORMATION

I am not sure this is a bug. This seems to be doubly encoded data: you want
U+00DF (ß), which in UTF-8 is the byte sequence 0xc3 0x9f.

Instead you have the byte sequence 0xc3 0x83 0xc2 0x9f, which decodes to U+00C3
U+009F.
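
As a rough illustration, the double encoding can be reproduced in a few lines
of Python. This is only a sketch: it assumes the intermediate step misread the
UTF-8 bytes as Latin-1 (ISO 8859-1), which the report does not confirm, and
the variable names are made up for the example.

```python
# Sketch: encode U+00DF as UTF-8, misread the bytes as Latin-1 (assumption),
# then encode the misread text as UTF-8 a second time.
intended = "\u00df"                  # the character the log should contain
once = intended.encode("utf-8")      # correct single encoding
misread = once.decode("latin-1")     # each byte becomes one code point
twice = misread.encode("utf-8")      # the doubly encoded byte sequence

print(once.hex(" "))
print(twice.hex(" "))
# Code points a UTF-8 terminal sees when it decodes the doubly encoded bytes
print([f"U+{ord(c):04X}" for c in twice.decode("utf-8")])
```

Decoding the doubly encoded bytes as UTF-8 yields two code points, the second
of which falls in the C1 control range.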

U+00C3 is Ã (A with tilde).

U+009F is a C1 control character (APC, Application Program Command), so
everything after it is treated as part of the control sequence rather than as
characters to be printed.
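
To check whether a log line contains such characters before sending it to a
terminal, something like the following Python sketch may help. The function
name show_c1_controls and the sample string are hypothetical, and the
<U+XXXX> notation is just one possible way to make the characters visible,
similar in spirit to the "cat -v" workaround from the report.

```python
import unicodedata


def show_c1_controls(text: str) -> str:
    """Replace C1 control characters (U+0080..U+009F) with a visible tag."""
    return "".join(
        f"<U+{ord(ch):04X}>" if 0x80 <= ord(ch) <= 0x9F else ch
        for ch in text
    )


# Hypothetical sample line containing the doubly encoded character
line = "Meine Stra\u00c3\u009fe 1"

# U+009F is in Unicode general category Cc (control)
print(unicodedata.category("\u009f"))
print(show_c1_controls(line))
```

With the control character replaced by a printable tag, the rest of the line
is no longer swallowed by the terminal's control sequence parser.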

Please note that xterm and xfce4-terminal behave like konsole.


