Thinking about auto-tags (was: Re: Feature request for Digikam)

Kjetil Kjernsmo kjetil at kjernsmo.net
Tue Feb 27 00:35:41 GMT 2024


Hi all!

I found auto-tagging very interesting, but departing from the original 
subject, so I figured it was best changing the subject:

On mandag 26. februar 2024 16:48:49 CET Gilles Caulier wrote:
> About the auto-tags project, take a look to the student blog :
> 
> https://community.kde.org/GSoc/2023/StatusReports/QuocHungTran#Add_Automatic
> _Tags_Assignment_Tools_and_Improve_Face_Recognition_Engine_for_digiKam

I've spent a lot of time on tagging over the years, I currently have some 
17000 tagged images in my library, and I would be happy to help training 
models with these images and tags, if the original images didn't need to go 
anywhere, in other words, if federated training was supported, and if the 
model had an acceptable license (I would suppose that's taken of since Digikam 
uses it).

Of course, I also have tags of family members that I wouldn't want to expose, 
so Digikam would need to separate between tags for public consumption and my 
private tags. However, my private tags would also be useful. I've used face 
tagging a bit, but it hasn't replaced my manual tagging of people, simply 
because I want to know who's in the picture, also when the face is invisible. 
I can usually make this out manually based on what clothes they are wearing, 
and so it seems to me that this solution could enable person recognition 
beyond facial recognition. If this enabled me to just tag a few images and 
then have a model that figures it out based on recognition of something like 
clothing, it would save me a lot of time ;-)

This direction also seems to create demand for offloading processing to other 
machines, as ML tends to be pretty heavy. My laptop is light and silent, but I 
have a bigger box in my basement. Its disks actually hosts my pictures, I use 
NFS and a remote MySQL database, so if I could run the AI stuff on that box 
directly from my laptop, that would be very nice. Any chance that for example 
the WorkerObject could be extended to run on a remote machine, given, say a 
shared database and an NFS or SMB share?

So, it seems to be a lot of potential here, in terms of crowd sourcing model 
training in a privacy-preserving manner so that the capabilities of Digikam 
improves for everyone.

Kind regards,

Kjetil





More information about the Digikam-users mailing list