D16235: [Extractor] Exclude Postscript from being indexed

Stefan BrĂ¼ns noreply at phabricator.kde.org
Tue Oct 16 01:11:07 BST 2018


bruns created this revision.
Herald added projects: Frameworks, Baloo.
Herald added subscribers: Baloo, kde-frameworks-devel.
bruns requested review of this revision.

REVISION SUMMARY
  The postscript file contents are mostly commands and base64 or hex encoded
  data. Text content is in general not available without removing
  justification whitespace, i.e. extraction requires a postscript
  interpreter.

TEST PLAN
  run `balooshow -x <ps-file>` before
  -> lots of garbage terms
  `balooctl clear <ps-file>; touch <ps-file>`
  run `balooshow -x <ps-file>` again
  -> the terms are empty now, save the mimetype ("Mapplication Mpostscript")

REPOSITORY
  R293 Baloo

BRANCH
  oob

REVISION DETAIL
  https://phabricator.kde.org/D16235

AFFECTED FILES
  src/file/extractor/app.cpp

To: bruns
Cc: kde-frameworks-devel, #baloo, ashaposhnikov, michaelh, astippich, spoorun, ngraham, bruns, abrahams
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.kde.org/pipermail/kde-frameworks-devel/attachments/20181016/6b2e8c70/attachment.html>


More information about the Kde-frameworks-devel mailing list