Questions on POLE library (and HancomWord support)

Park Shinjo kde at peremen.name
Mon Jan 7 00:30:52 GMT 2013


So, as far as I understand, I can take the pole.(cpp|h) code (possibly
synchronize often with libmso) and need to build my own library for
handling HWP document. Thumbnailing and metadata extraction only needs to
read some part of entire document, I can tune for that kind of processing
in external program too.

Hope I can revive HWP backend for Calligra :)

Regards,
Park Shinjo


2013/1/7 <matus.uzak at gmail.com>

> Hi,
>
> compared to the latest version found at [1], the version of POLE inside
> libmso includes a number of bug fixes and code to detect invalid and
> corrupt files in the Windows Compound Binary File Format.  It's not
> maintained as a separate library, It's simply part of libmso.  And you
> should know it's fine tuned to files produced by MS Office, so it might be
> required to disable a bunch of blocks of code.
>
> In Calligra, libgsf has been superseded by libmso.
>
> Check the powerpoint or msword-odf filter for proper usage.  Look for the
> readStream function in the mswordodfimport.cpp file, which reads a stream
> from a specific storage into a buffer.
>
> [1] http://pole.berlios.de/
>
> br,
>
> -matus
>
> On Sun, Jan 6, 2013 at 5:32 PM, Park Shinjo <kde at peremen.name> wrote:
>
>> Hello, This is my first message to this list.
>>
>> I am working on Hancom Word (mostly known as HWP) support on KDE
>> environment. Haansoft has released specifications of its HWP file format in
>> late 2010, making HWP support outside of Hancom Word possible.
>>
>> The HWP document is based on Microsoft OLE compound document format, and
>> currently Calligra Words uses POLE (as far as I know) for parsing this
>> format. Since its internal structure is different from any other kind of
>> files, and I want to reuse code as much as possible, I want to ask some
>> questions for POLE.
>>
>> 1) How can I use POLE library outside of Calligra? I am currently working
>> on thumbnailer and metadata indexer of HWP format, and it currently uses
>> libgsf. For further integration with Calligra, I want to switch codebase to
>> what Calligra currently uses. Rendering of document is not necessarily
>> required here, since HWP file contains preview image and text. I'm also
>> interested in implementing viewer function inside Okular too.
>>
>> 2) Is POLE library inside libmso considered as "upstream"? I have found
>> implementation of POLE outside of Calligra, but it seems that code is not
>> exactly same as Calligra's code.
>>
>> I am not "yet" interested for restoring hancomword importer (and maintain
>> it) in Calligra words, as there's no real code now. Choosing right codebase
>> will help integration, so I'm asking how I can handle OLE compound document
>> via POLE.
>>
>> Regards,
>> Park Shinjo
>>
>> _______________________________________________
>> calligra-devel mailing list
>> calligra-devel at kde.org
>> https://mail.kde.org/mailman/listinfo/calligra-devel
>>
>>
>
> _______________________________________________
> calligra-devel mailing list
> calligra-devel at kde.org
> https://mail.kde.org/mailman/listinfo/calligra-devel
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.kde.org/pipermail/calligra-devel/attachments/20130107/7f85e8f7/attachment.htm>


More information about the calligra-devel mailing list