[Nepomuk] Review Request: Extend popplerextractor with firstpage parsing

Jörg Ehrichs joerg.ehrichs at gmx.de
Sun Dec 23 12:45:28 UTC 2012


-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
http://git.reviewboard.kde.org/r/107870/
-----------------------------------------------------------

Review request for Nepomuk and Vishesh Handa.


Description
-------

Extend popplerextractor with firstpage parsing

PDF metadata is often missing, or the title field contains wrong data
(for example the name of the PDF exporter instead of the document title).

This patch adds the option to parse the first page for a possible
title. The candidate title is the connected run of text with the
largest font size that is longer than one character.
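
The heuristic described above could be sketched roughly as follows. This is
not the code from the patch: the TextFragment struct and the guessTitle
function are hypothetical names, the sketch does not use the Poppler API, and
it assumes that "connected text" means consecutive fragments that share the
same font size.

```cpp
#include <string>
#include <vector>

// Hypothetical stand-in for one piece of text found on the first page,
// in reading order, together with the font size it was rendered in.
struct TextFragment {
    std::string text;
    double fontSize;
};

// Returns the run of consecutive same-size fragments with the largest font
// size, ignoring runs of one character or less. Returns an empty string if
// no run qualifies. On a tie, the earliest run wins.
// Assumption: length is measured in bytes, so a single multi-byte UTF-8
// character would count as more than one character here.
std::string guessTitle(const std::vector<TextFragment>& fragments)
{
    std::string best;
    double bestSize = 0.0;

    std::string run;      // text of the run currently being collected
    double runSize = 0.0; // font size of that run

    // Closes the current run and keeps it if it beats the best so far.
    auto flush = [&]() {
        if (run.size() > 1 && runSize > bestSize) {
            best = run;
            bestSize = runSize;
        }
        run.clear();
    };

    for (const TextFragment& f : fragments) {
        if (f.text.empty()) {
            continue;
        }
        // A change in font size ends the current run.
        if (!run.empty() && f.fontSize != runSize) {
            flush();
        }
        if (run.empty()) {
            runSize = f.fontSize;
        } else {
            run += ' ';
        }
        run += f.text;
    }
    flush();
    return best;
}
```

With fragments "A" (size 30), "Semantic" (18), "Desktop" (18) and "Jane Doe"
(10), this sketch skips the single large initial and returns
"Semantic Desktop".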


Diffs
-----

  services/fileindexer/indexer/popplerextractor.h c7dfa50 
  services/fileindexer/indexer/popplerextractor.cpp 7015195 

Diff: http://git.reviewboard.kde.org/r/107870/diff/


Testing
-------

Tested with various PDF files; the title is added correctly whenever one could be found.


Thanks,

Jörg Ehrichs
