[KPhotoAlbum] Quick search proof of concept patch

Jesper K. Pedersen blackie at blackie.dk
Sun May 13 09:41:09 BST 2007


The Makefile.am talks about a directory call Parser, which is not in your 
patch.
You may want to use option --new-file to diff.

Cheers
Jesper.

On Sunday 13 May 2007, Shawn Willden wrote:
| A lot of ideas related to my "quick search" notion were kicked around, but
| I felt like I needed to simplify it some to test out some ideas.  Attached
| is the not-so-quick-but-quite-dirty result.
|
| I'd like to get feedback and performance results (the code is instrumented
| and prints some timing data to stdout).
|
| One of my biggest concerns for the idea was performance.  Would it be
| possible to make it fast enough for interactive search-as-you-type usage,
| so I decided to focus on that first.  Besides, I had some ideas for a nifty
| approach to searching that would be fun to play with ;-)
|
| This patch implements the basic idea with the following constraints:
|
| 1.  It only works for XML DB.
| 2.  It doesn't really manage the UI correctly; some stuff acts funny.
| 3.  It is slow in the "home" view, though performance seems good to me in
| the thumbnail view.
| 4.  It doesn't support quoting for searching for keywords with embedded
| spaces 5.  It doesn't support "OR" -- all search terms are ANDed together
| 6.  It does integrate with "drilldown" searches and date bar searches, by
| the simple expedient of applying the quick search first and then testing
| the remaining images against the drilldown and data bar criteria.
| 7.  It doesn't search on date or EXIF fields, only 'tags'.  It does,
| however, search all tags, including supercategories, case insensitively.
| 8.  It searches only by tag "prefix".  That is "foo" will
| match "foo", "foobar" and "footh", but not "myfoo".
|
| In addition to all of that, the code is somewhat ugly, doesn't follow
| kphotoalbum style standards and generally needs a lot of work.
|
| That said, my goal was to test the performance of my in-memory approach,
| and I'm quite happy with the results, at least with my database on my
| system, and I think it will also do well even on larger databases and
| slower machines.
|
| In my case (Core Duo 2.2 GHz, 12.6K images, 1109 unique tags) building the
| ternary search tree used for the lookups takes about 700 ms, and the
| worst-case searches take under 40 ms.  Most searches take under 2 ms.
|
| Please give it a try, especially those of you with big image databases and
| lots of tags.  To stress it a little, pick some single-letter searches that
| have large result sets.  "i" matches "Image" from the media type category
| (it probably doesn't make sense to index that one), so that's a big result
| set. Then put them together.  For example, I tried "i j c m e".  Those
| other letters are the first letters of my kids' names, so they have large
| result sets.
|
| Oh, and do all this from the thumbnail view, with your full database
| showing. The search tree works just as well in the "home" view, but the
| process used to recalculate the tags per category is slow.
|
| Thanks,
|
| 	Shawn.



-- 
Having trouble finding a given image in your collection containing
thousands of images?

http://www.kphotoalbum.org might be the answer.



More information about the Kphotoalbum mailing list