Review Request 117789: Optimize word count in PlainTextExtractor.
Vishesh Handa
me at vhanda.in
Tue Apr 29 14:33:42 BST 2014
-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://git.reviewboard.kde.org/r/117789/#review56900
-----------------------------------------------------------
Ship it!
Thanks! :)
- Vishesh Handa
On April 26, 2014, 1:15 p.m., Milian Wolff wrote:
>
> -----------------------------------------------------------
> This is an automatically generated e-mail. To reply, visit:
> https://git.reviewboard.kde.org/r/117789/
> -----------------------------------------------------------
>
> (Updated April 26, 2014, 1:15 p.m.)
>
>
> Review request for kdelibs and Vishesh Handa.
>
>
> Repository: kfilemetadata
>
>
> Description
> -------
>
> Optimize word count in PlainTextExtractor.
>
> Regular expressions are notoriously slow. Implementing a simple
> word-count directly in C++ is much faster, as shown by the benchmark:
>
> Before:
> 702.0 msecs per iteration (total: 7,020, iterations: 10)
> After:
> 125.5 msecs per iteration (total: 1,256, iterations: 10)
>
> Make the plaintext extractor benchmark more meaningful.
>
> It now operates on a larger file and uses QBENCHMARK to actually get some data.
>
>
> Diffs
> -----
>
> autotests/indexerextractortests.cpp 1cb8e65da7d764eab1923054659ae5841104de2d
> src/extractors/plaintextextractor.cpp 536e02d843f24dbbc19035029896b9e696e8b302
>
> Diff: https://git.reviewboard.kde.org/r/117789/diff/
>
>
> Testing
> -------
>
>
> Thanks,
>
> Milian Wolff
>
>
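For readers without access to the diff, here is a minimal sketch of the kind of hand-written word count the description refers to. It is not the code from the patch: the function name countWords, the use of std::string instead of Qt string types, and the whitespace-only definition of a word boundary are all assumptions made for illustration.

```cpp
#include <cctype>
#include <cstddef>
#include <string>

// Hypothetical helper, NOT the implementation from review request 117789.
// Counts whitespace-separated words in a single pass over the text,
// without using a regular expression.
std::size_t countWords(const std::string& text)
{
    std::size_t words = 0;
    bool inWord = false;  // true while the scan is inside a word

    for (unsigned char c : text) {
        if (std::isspace(c)) {
            // Whitespace ends the current word, if any.
            inWord = false;
        } else if (!inWord) {
            // First non-whitespace character after a gap starts a new word.
            inWord = true;
            ++words;
        }
    }
    return words;
}
```

Calling countWords on a line of text returns the number of whitespace-delimited tokens in it. The loop visits each character once and allocates nothing, which is the general reason a scan like this can outperform a regex-based split; the figures quoted in the request come from the actual patch, not from this sketch.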