Baloo - Not Indexing everything by default

Todd Rme toddrme2178 at gmail.com
Fri Oct 17 12:48:13 UTC 2014


On Thu, Oct 16, 2014 at 2:15 PM, Martin Gräßlin <mgraesslin at kde.org> wrote:
> On Thursday 16 October 2014 13:20:57 Vishesh Handa wrote:
>> Hey guys
>>
>> While Baloo performs better than Nepomuk. It does have its share of
>> problems - mostly large text files, and high IO usage. Additionally, users
>> on linux often seem to have the craziest files. Currently, we do not index
>> plain text files which do not have a `.txt` extension, because otherwise we
>> land up indexing genome data and other strange files. (Actual bugs)
>
> the txt being genome data doesn't surprise me[1], but I find it sad that now
> txt is disabled by default (I use them quite a lot for blog posts). As genome
> data is really huge wouldn't it make sense to go rather for file size or abort
> the indexing if it's obvious random gibberish?

Or skip it if it looks like csv or is mostly numbers?


More information about the Plasma-devel mailing list