[Nepomuk] Web Metadata Extractor GSoC idea

Evgeny Egorochkin phreedom.stdin at gmail.com
Sun Feb 21 10:17:56 CET 2010


Added one idea: 
http://community.kde.org/GSoC/2010/Ideas#Web_Metadata_Extractor_Framework_and_Service

It doesn't look very hard on the surface (mostly grunt work with configuration 
and such) and is very useful.

One more thing left out is a "slow indexing" mode. At the moment, to speed up 
indexing, we don't calculate hashes for files or MusicBrainz IDs for music. 
Maybe it's worth either letting the user enable these features (possibly for a 
subset of dirs) or implementing a second crawling pass to handle the heavy 
lifting.
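
To make the second-pass option concrete, here is a rough Python sketch. It is 
not Nepomuk or Strigi code: the function names, the in-memory dict used as an 
index, and the choice of SHA-1 are all assumptions made for illustration.

```python
import hashlib
import os
from pathlib import Path


def fast_pass(root):
    """First pass: record only cheap filesystem metadata for every file."""
    index = {}
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = Path(dirpath) / name
            # The hash is left empty so the fast pass never reads file contents.
            index[str(path)] = {"size": path.stat().st_size, "sha1": None}
    return index


def slow_pass(index, slow_dirs):
    """Second pass: hash only the files under the user-selected directories."""
    slow_dirs = [Path(d).resolve() for d in slow_dirs]
    for path_str, entry in index.items():
        path = Path(path_str).resolve()
        # Skip files that are not inside any of the opted-in directories.
        if not any(d in path.parents for d in slow_dirs):
            continue
        digest = hashlib.sha1()  # assumed hash; any digest would do here
        with open(path, "rb") as fh:
            # Read in 1 MiB chunks so large files are not loaded into memory.
            for chunk in iter(lambda: fh.read(1 << 20), b""):
                digest.update(chunk)
        entry["sha1"] = digest.hexdigest()
    return index
```

The idea is that fast_pass() runs over everything as it does today, while 
slow_pass() can be scheduled later and limited to whatever dirs the user picks.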

Basically, if you have a plain mp3 with no tags, the "slow" crawler could 
calculate a MusicBrainz ID, and the Web Metadata Extractor would fetch the 
rest of the metadata.
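
Roughly, that hand-off could look like the Python sketch below. The three 
callables (read_tags, compute_mbid, fetch_metadata) are hypothetical 
placeholders for the tag reader, the "slow" crawler step and the Web Metadata 
Extractor; none of them is an existing API.

```python
def enrich_track(path, read_tags, compute_mbid, fetch_metadata):
    """Return metadata for a track, falling back to a web lookup if untagged.

    All three callables are hypothetical and supplied by the caller:
      read_tags(path)      -> dict of embedded tags (empty if none)
      compute_mbid(path)   -> MusicBrainz ID string, or None if unknown
      fetch_metadata(mbid) -> dict of metadata fetched from the web
    """
    tags = dict(read_tags(path))
    if tags:
        # The file is already tagged, so the expensive steps are skipped.
        return tags
    mbid = compute_mbid(path)  # the "slow" crawler step
    if mbid is None:
        return {}
    metadata = dict(fetch_metadata(mbid))  # the Web Metadata Extractor step
    metadata["musicbrainz_id"] = mbid
    return metadata
```

Passing the steps in as parameters keeps the sketch independent of whichever 
fingerprinting and lookup services end up being used.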

-- 
Evgeny

