[digikam] [Bug 369051] New: Too low similarity threshold in fuzzy/duplicate search bloats the results with potentially unwished high-similarity results

Mario Frank via KDE Bugzilla bugzilla_noreply at kde.org
Mon Sep 19 15:26:27 BST 2016


https://bugs.kde.org/show_bug.cgi?id=369051

            Bug ID: 369051
           Summary: Too low similarity threshold in fuzzy/duplicate search
                    bloats the results with potentially unwished
                    high-similarity results
           Product: digikam
           Version: 5.1.0
          Platform: Archlinux Packages
                OS: Linux
            Status: UNCONFIRMED
          Severity: wishlist
          Priority: NOR
         Component: Searches-Fuzzy
          Assignee: digikam-devel at kde.org
          Reporter: mario.frank at uni-potsdam.de

When having many pictures, including variants of one picture with different
quality, e.g. due to resizing, conversion and Collage creation, the
lower-quality pictures may be found only with low similarity threshold (e.g. 45
%). But the result set will contain all pictures with a similarity between 45 %
and 100 %. This can make the search for low-quality variants frustrating.
Having the possibility to specify the maximum similarity may solve the problem.

Reproducible: Always

Steps to Reproduce:
1.Have many series pictures you want to keep and some lower-quality variants
you want to get rid off.
2. Start a duplicate search with, let's say 40 %

Actual Results:  
You will get all pictures with a similarity above 40 %

Expected Results:  
It is designed to do that. But having an option to specify a maximum similarity
could be more convenient.

I implemented and tested that. Also, I can provide a patch file against the
master branch.
Here is the local commit message describing the implementation:
"Extended the findduplicatesview and fuzzysearchview with an
 additional QSpinBox which denotes the maximum similarity. The new QSpinBox
 has a minimum value that is the current value of the minimal similarity
 threshold. When the minimum threshold is altered, the range of the new
 QSpinBox is updated. If the minimum threshold is increased beyond the current
 value of the new QSpinBox, the value of the new QSpinBox is increased
 automatically. In the fuzzysearchview, altering the maximum similarity also
 triggers the reuild of the similar images album. The extension can be highly
 valuable if you knowingly want to ignore almost identical images but want to
 find images that have a similarity of, let's say 50-60%, due to resizing,
 cropping or something similar, without bloating your image pane."

-- 
You are receiving this mail because:
You are the assignee for the bug.



More information about the Digikam-devel mailing list