D16235: [Extractor] Exclude Postscript from being indexed
Stefan BrĂ¼ns
noreply at phabricator.kde.org
Tue Oct 16 01:11:07 BST 2018
bruns created this revision.
Herald added projects: Frameworks, Baloo.
Herald added subscribers: Baloo, kde-frameworks-devel.
bruns requested review of this revision.
REVISION SUMMARY
The postscript file contents are mostly commands and base64 or hex encoded
data. Text content is in general not available without removing
justification whitespace, i.e. extraction requires a postscript
interpreter.
TEST PLAN
run `balooshow -x <ps-file>` before
-> lots of garbage terms
`balooctl clear <ps-file>; touch <ps-file>`
run `balooshow -x <ps-file>` again
-> the terms are empty now, save the mimetype ("Mapplication Mpostscript")
REPOSITORY
R293 Baloo
BRANCH
oob
REVISION DETAIL
https://phabricator.kde.org/D16235
AFFECTED FILES
src/file/extractor/app.cpp
To: bruns
Cc: kde-frameworks-devel, #baloo, ashaposhnikov, michaelh, astippich, spoorun, ngraham, bruns, abrahams
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.kde.org/pipermail/kde-frameworks-devel/attachments/20181016/6b2e8c70/attachment.html>
More information about the Kde-frameworks-devel
mailing list