[Nepomuk] Review Request: Extend popplerextractor with firstpage parsing

Vishesh Handa me at vhanda.in
Sun Dec 23 12:50:01 UTC 2012


-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
http://git.reviewboard.kde.org/r/107870/#review23896
-----------------------------------------------------------

Ship it!


Apart from the minor nitpicks below, this looks good.

Perhaps you want to add your copyright at the top of the file? It is a sizable contribution.


services/fileindexer/indexer/popplerextractor.cpp
<http://git.reviewboard.kde.org/r/107870/#comment18228>

    Could you please change this to 
    
    QString title = pdfDoc->info(QLatin1String("title")).trimmed();



services/fileindexer/indexer/popplerextractor.cpp
<http://git.reviewboard.kde.org/r/107870/#comment18229>

    This line is no longer required, since the title has already been trimmed.



services/fileindexer/indexer/popplerextractor.cpp
<http://git.reviewboard.kde.org/r/107870/#comment18230>

    Maybe this should be case-insensitive?


- Vishesh Handa


On Dec. 23, 2012, 12:45 p.m., Jörg Ehrichs wrote:
> 
> -----------------------------------------------------------
> This is an automatically generated e-mail. To reply, visit:
> http://git.reviewboard.kde.org/r/107870/
> -----------------------------------------------------------
> 
> (Updated Dec. 23, 2012, 12:45 p.m.)
> 
> 
> Review request for Nepomuk and Vishesh Handa.
> 
> 
> Description
> -------
> 
> Extend popplerextractor with firstpage parsing
> 
> Often the PDF metadata is not available, or the wrong data is stored
> in the title field (the PDF exporter's name instead of the title).
> 
> This patch adds the possibility to parse the first page for a possible
> title. A possible title is the connected run of text with the biggest
> font that is more than one character long.
> 
> 
> Diffs
> -----
> 
>   services/fileindexer/indexer/popplerextractor.h c7dfa50 
>   services/fileindexer/indexer/popplerextractor.cpp 7015195 
> 
> Diff: http://git.reviewboard.kde.org/r/107870/diff/
> 
> 
> Testing
> -------
> 
> Tested various PDF files; the title is added correctly whenever it was possible to find one.
> 
> 
> Thanks,
> 
> Jörg Ehrichs
> 
>
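
The heuristic from the description (take the connected text with the biggest font, ignoring anything of only one character) could be sketched roughly as below. This is not the code under review, which is not shown in this message and may differ: `TextFragment` and `guessTitle` are hypothetical names, the struct is a plain stand-in for whatever text boxes the extractor reads from the first page, and "connected" is assumed here to mean consecutive fragments with the same font size.

```cpp
#include <string>
#include <vector>

// Hypothetical stand-in for one piece of text found on the first page.
struct TextFragment {
    std::string text;
    double fontSize;  // assumed to be comparable across fragments
};

// Returns the run of consecutive same-sized fragments with the largest
// font size, skipping runs of one character or less. Returns an empty
// string if no candidate is found.
std::string guessTitle(const std::vector<TextFragment>& fragments)
{
    std::string best;
    double bestSize = 0.0;
    std::string run;
    double runSize = 0.0;

    // Close the current run and keep it if it beats the best so far.
    // Note: size() counts bytes, a simplification for non-ASCII text.
    auto flush = [&]() {
        if (run.size() > 1 && runSize > bestSize) {
            best = run;
            bestSize = runSize;
        }
        run.clear();
    };

    for (const TextFragment& f : fragments) {
        if (f.text.empty())
            continue;
        // A change in font size ends the current run (exact comparison,
        // which is a simplification for this sketch).
        if (!run.empty() && f.fontSize != runSize)
            flush();
        if (!run.empty())
            run += ' ';
        run += f.text;
        runSize = f.fontSize;
    }
    flush();
    return best;
}
```

With fragments such as a single large drop-cap letter, two title words in a slightly smaller font, and an author line in a small font, this sketch skips the drop cap and joins the two title words. When two runs share the largest size, the earlier one wins.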
