[Okular-devel] Review Request: Fix word detection for DVI documents

Luigi Toscano luigi.toscano at tiscali.it
Fri Nov 23 00:44:08 UTC 2012


-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
http://git.reviewboard.kde.org/r/107429/
-----------------------------------------------------------

Review request for Okular.


Description
-------

This patch attempts to restore functionality broken by some earlier changes (possibly the text column selection code).

It uses some heuristics to identify the end of a word and merges the boxes which enclose each character of a word (so that char_x.right==char_{x+1}.left). Once a space has been found, it also tries to recognize whether a newline follows ("after_space").
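
As a rough illustration of the merging idea (not the code in the patch), the sketch below joins per-character boxes into word boxes. The names Box, Glyph, Word and mergeGlyphs are hypothetical and do not come from dviRenderer_draw.cpp. It assumes that the glyphs of one word touch exactly (right edge equal to the next left edge) and that a space always ends a word. The "after_space" newline check is left out.

```cpp
#include <algorithm>
#include <string>
#include <vector>

// Hypothetical types for illustration only; the real generator uses its own.
struct Box {
    int left, top, right, bottom;
};

struct Glyph {
    char ch;
    Box box;
};

struct Word {
    std::string text;
    Box box;
};

// Merge per-character boxes into word boxes.
// Assumption: a word ends at a space, or when the next glyph does not
// start exactly where the previous one ended.
std::vector<Word> mergeGlyphs(const std::vector<Glyph> &glyphs)
{
    std::vector<Word> words;
    bool open = false; // true while the last word can still grow

    for (const Glyph &g : glyphs) {
        if (g.ch == ' ') {
            open = false; // a space closes the current word
            continue;
        }
        if (open && words.back().box.right == g.box.left) {
            // Adjacent glyph: extend the current word and its box.
            Word &w = words.back();
            w.text += g.ch;
            w.box.right = g.box.right;
            w.box.top = std::min(w.box.top, g.box.top);
            w.box.bottom = std::max(w.box.bottom, g.box.bottom);
        } else {
            // Gap or first glyph: start a new word.
            words.push_back({std::string(1, g.ch), g.box});
            open = true;
        }
    }
    return words;
}
```

With this sketch, two touching glyphs "a" and "b" followed by a space and a glyph "c" would come out as the two words "ab" and "c". A real implementation would probably need some tolerance in the edge comparison instead of strict equality.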


This addresses bug 302004.
    http://bugs.kde.org/show_bug.cgi?id=302004


Diffs
-----

  generators/dvi/dviRenderer_draw.cpp b816353 

Diff: http://git.reviewboard.kde.org/r/107429/diff/


Testing
-------

Tested the faulty document from the bug, and the usual dvistd0.dvi and KDVI-features.dvi.

Tested against KDE/4.9; the patch applies to master, but I didn't try it there (the dvi code did not change at all, though).

What is still not working?
- an isolated comma or dot is seen as being on a separate line (because its height does not match the rest of the line). I think that this is a problem with the algorithm which recognizes the lines, as (with this patch) the information about blocks of characters is correctly passed by the dvi generator.


Thanks,

Luigi Toscano
