<p><br>

15.08.2011 1:04 пользователь "Rainer Dorsch" <<a href="mailto:ml@bokomoko.de">ml@bokomoko.de</a>> написал:<br>

><br>

> Artem,<br>

><br>

> many thanks for your quick and very useful reply.<br>

><br>

> Am Monday, 8. August 2011 schrieben Sie:<br>

> > > Can I get a list of files, which is in my strigi index? I am aware of<br>

> > > ~/.kde/share/config/nepomukstrigirc but I would rather want to see a log<br>

> ><br>

> > or<br>

> ><br>

> > > index dump which confirms what really went in? Also is a way to determine<br>

> ><br>

> > how<br>

> ><br>

> > > much data from a certain part of the file system went into the index?<br>

> > > This would be useful for me to understand, where it is worth to finetune<br>

> > > the<br>

> ><br>

> > search<br>

> ><br>

> > > patterns in ~/.kde/share/config/nepomukstrigirc ...<br>

> ><br>

> >  One possible way is cmd: nepomukcmd query "select ?r ?u where { ?r nie:url<br>

> > ?u }". It will give you all resources with URL. You need only file:// urls.<br>

> > Most likely all of them added by strigi. There exists more exact query if<br>

> > you need them.<br>

><br>

> I did not have nepomukcmd defined on my system, but found on<br>

><br>

> <a href="http://techbase.kde.org/Development/Tutorials/Metadata/Nepomuk/TipsAndTricks#nepomukcmd">http://techbase.kde.org/Development/Tutorials/Metadata/Nepomuk/TipsAndTricks#nepomukcmd</a><br>

><br>

> a definition<br>

><br>

> alias nepomukcmd="sopranocmd --socket `kde4-config --path socket`nepomuk-socket<br>

> --model main --nrl"<br>

><br>

> which worked for me.<br>

><br>

> Indeed I got a list of 24k files<br>

><br>

> rd@blackbox:~$ nepomukcmd query "select ?r ?u where { ?r nie:url?u }"|grep<br>

> file:| wc -l<br>

> Total results: 36227<br>

> Execution time: 00:00:00.2<br>

> 24022<br>

> rd@blackbox:~$<br>

><br>

> In system settings, Desktop Search displays that more than 30k files are<br>

> indexed. Does anybody know where the difference >30k vs 24k files comes from?<br>

I dont. May be it is files inside archvies, mail attachements. No real idea though.<br>

><br>

> I just found the tutorials on<br>

><br>

> <a href="http://nepomuk.kde.org/node/2">http://nepomuk.kde.org/node/2</a><br>

><br>

> very nice!<br>

><br>

> Is<br>

> <a href="http://techbase.kde.org/index.php?title=Development/Tutorials/Metadata/Nepomuk/AdvancedQueries">http://techbase.kde.org/index.php?title=Development/Tutorials/Metadata/Nepomuk/AdvancedQueries</a><br>

> the best description to start understanding the query langugage?<br>

Well, I would recommend reading W3C SPARQL recomendation first, paragraph2-prgrph5.  <br>

><br>

> > What does 'how much data' mean ? What is the mesurement unit - RDF<br>

> > statement<br>

><br>

> System Settings tells me, that I have a Nepomuk Store Size of 1.2 GB. That<br>

> seems too much to me... I would be interested to see how these 1.2 GB break<br>

> down. <br>

I haven't heard about such utility. Try asking on IRC. If there isn't one, you may write it yourself :) It will be usefull, I suppose.<br>

> E.g. I know that I have many images indexed, maybe I should take them<br>

> out of the index (the tags probably come from the digikam database(?) not even<br>

> the files themselves).<br>

Nope, you should not :) 1.2GB is really small on modern HDD and we are really trying to make it be 1.2 GB of win, not waste. <br>

><br>

> Many thanks,<br>

> Rainer<br>

><br>

> --<br>

> Rainer Dorsch<br>

> <a href="http://bokomoko.de/">http://bokomoko.de/</a><br>

> _______________________________________________<br>

> Nepomuk mailing list<br>

> <a href="mailto:Nepomuk@kde.org">Nepomuk@kde.org</a><br>

> <a href="https://mail.kde.org/mailman/listinfo/nepomuk">https://mail.kde.org/mailman/listinfo/nepomuk</a><br>

</p>