Auto tagging--is there a list of possible "objects" and "scenes" somewhere?

Sat Apr 12 05:12:20 BST 2025

 Hi,

I've been trying out the auto-tagging feature since yesterday (so far just the EfficientNet B7 option), but the documentation is rather vague. It says one model detects "1,000 different objects and scenes", while the others "detect 80 different objects". Ok...but what exactly are the objects and scenes these models can detect? The tags have to come from somewhere, so there must be a list somewhere, right? I tried running a few searches online but so far found none. As it is I have no clear idea of what the models are intended to look for or what exactly is different between them.

https://docs.digikam.org/en/left_sidebar/tags_view.html#auto-tagging-images

"The default model is EfficientNet B7. The EfficientNet B7 model is a general-purpose model that can detect 1,000 different objects and scenes. The YOLOv11-Nano model is faster and uses less memory than the EfficientNet B7 model. The YOLOv11-Nano model is recommended for users with limited memory or slower processors, and YOLOv11-XLarge is recommended for users with more memory and faster processors. Both YOLOv11 models are trained to detect 80 different objects based on the COCO dataset."

I'd appreciate whatever additional info you can provide.
--Billy  
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.kde.org/pipermail/digikam-users/attachments/20250412/a7685055/attachment-0001.htm>