[Okular-devel] [Bug 161324] recognise columns in the text of a page

Robert Knight robertknight at gmail.com
Sat Jan 10 16:11:52 CET 2009


http://bugs.kde.org/show_bug.cgi?id=161324


Robert Knight robertknight gmail com changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |robertknight at gmail.com




--- Comment #13 from Robert Knight <robertknight gmail com>  2009-01-10 16:11:51 ---
> Pattern recognition of what is a column and what is not, based on the
> coordinates of each character, is something your brain can do very easily,
> but programming an algorithm that does that is not trivial by far,
> so I guess the answer is no

It is certainly possible, though not trivial. Ocropus provides a free software
C++ implementation of algorithms to do this, if you're interested.  The basic
approach is to find the largest columns of whitespace on the page and divide
the text into columns based on that.
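
To make the idea concrete, here is a rough sketch of the whitespace approach.
It is not Ocropus code and not anything Okular does today; it is a simplified
one-dimensional version that only looks at horizontal coverage. The box layout
(x0, y0, x1, y1), the function names and the min_gap threshold are all
assumptions made up for illustration.

```python
from typing import List, Tuple

# Hypothetical character box: (x0, y0, x1, y1) in page units.
Box = Tuple[float, float, float, float]


def find_column_gaps(boxes: List[Box], page_width: int,
                     min_gap: int = 20) -> List[Tuple[int, int]]:
    """Return (start, end) x-ranges that no character box overlaps.

    Only gaps at least min_gap wide are kept, and the left and right
    page margins are ignored.
    """
    covered = [False] * page_width
    for x0, _, x1, _ in boxes:
        # Mark every integer x position touched by this character.
        for x in range(max(0, int(x0)), min(page_width, int(x1) + 1)):
            covered[x] = True

    gaps = []
    start = None
    for x, filled in enumerate(covered):
        if not filled and start is None:
            start = x                      # a run of whitespace begins
        elif filled and start is not None:
            if x - start >= min_gap:
                gaps.append((start, x))    # run ended and is wide enough
            start = None
    # A run still open here is the right margin, so it is dropped.
    # A run starting at x == 0 is the left margin, so it is dropped too.
    return [g for g in gaps if g[0] > 0]


def split_into_columns(boxes: List[Box], page_width: int,
                       min_gap: int = 20) -> List[List[Box]]:
    """Group boxes into columns, cutting at the middle of each gap."""
    cuts = [(a + b) / 2 for a, b in find_column_gaps(boxes, page_width, min_gap)]
    columns: List[List[Box]] = [[] for _ in range(len(cuts) + 1)]
    for box in boxes:
        centre = (box[0] + box[2]) / 2
        index = sum(1 for c in cuts if centre > c)  # cuts left of the box
        columns[index].append(box)
    return columns
```

A real implementation would have to look for whitespace rectangles in two
dimensions, since headings, figures and footers often span several columns and
would close the gap in a plain horizontal projection like this one.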


-- 
Configure bugmail: http://bugs.kde.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.

