Hey everyone<br><br>I'm currently working on improving KDE File Indexing infrastructure.<br><br>One of the areas where we are lacking is proper support for Open Document Formats and Microsoft Document Formats. It occurred to me that maybe I could use the calligra libraries to do so. I even looked at the code base ( a little bit ) and extracting the basic metadata is really simple (KoDocumentInfo).<br>

<br>I also looked at the Calligra Converter code, which seems to be using a print job to convert the formats. It can convert the file to a pdf, which I can then easily parse, but that seems like a bit too much effort. Not to mention that it's probably very slow.<br>

<br>So my question is - Is it possible to use Calligra to quickly extract the plain text from the file?<br><br>Also, what kind of dependencies an I looking at? Just calligra-libs or something else?<br><br>-- <br><span style="color:rgb(192,192,192)">Vishesh Handa</span><br>

<br>PS: Please keep me cced. I'm not on the mailing list.<br>