<!DOCTYPE html>
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
</head>
<body><div style="font-family: sans-serif;"><div class="plaintext" style="white-space: normal;"><p dir="auto">On my 5-year-old M1 MacBook Air, one image takes 10 to 20 seconds. I suspect parallel processing could speed things up somewhat, but a few thousand images will still take a long time.</p>
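<p dir="auto">To put those numbers in perspective, here is a rough back-of-the-envelope sketch. The figure of 3000 images stands in for "a few thousand" and is an assumption, as are the worker count and the function name <code>estimated_hours</code>. It also assumes an ideal linear speed-up from parallel workers, which a single laptop GPU is unlikely to deliver, so the parallel figures are a best case.</p>
<pre>
```python
import math


def estimated_hours(num_images, seconds_per_image, workers=1):
    """Wall-clock hours, assuming ideal linear speed-up across workers."""
    batches = math.ceil(num_images / workers)
    return batches * seconds_per_image / 3600


# 10 to 20 s per image comes from the message above; 3000 images and
# 4 workers are illustrative assumptions, not measurements.
for workers in (1, 4):
    best = estimated_hours(3000, 10, workers)
    worst = estimated_hours(3000, 20, workers)
    print(f"{workers} worker(s): {best:.1f} to {worst:.1f} hours")
```
</pre>
<p dir="auto">Under these assumptions a single worker needs roughly 8 to 17 hours for 3000 images, so even an optimistic four-way split still means a multi-hour batch job.</p>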
<p dir="auto">Regards,
<br>
Bill</p>
<br><p dir="auto">On 29 Mar 2025, at 17:00, Michael Miller wrote:</p>
</div><blockquote class="embedded" style="margin: 0 0 5px; padding-left: 5px; border-left: 2px solid #777777; color: #777777;"><div id="542D1F4E-7213-4B11-90BE-FD1DF3E3AE26">
<div style="overflow-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;">
Hi Bill,
<div>I’m curious: how long did it take to process a single image? I’m wondering whether this could scale. The other issue is converting the granite model into something digiKam can use.</div>
<div><br></div>
<div>I’m looking into this now. Thanks for the idea!</div>
<div><br></div>
<div>Cheers,</div>
<div>Mike</div>
<div><br id="lineBreakAtBeginningOfMessage">
<div><br>
<blockquote type="cite">
<div>On Mar 29, 2025, at 4:40 PM, William Allen <dk@ballen.fastmail.fm> wrote:</div>
<br class="Apple-interchange-newline">
<div><span style="caret-color: rgb(0, 0, 0); font-family: Helvetica; font-size: 12px; font-style: normal; font-variant-caps: normal; font-weight: 400; letter-spacing: normal; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none; float: none; display: inline !important;">I have used Ollama with the granite3.2-vision model to do this locally, with pretty good preliminary results. Obviously it’s not integrated with digiKam, but I think that with some prompt tweaking one could get it to produce usable, searchable content about images without exposing them to the world.</span><br style="caret-color: rgb(0, 0, 0); font-family: Helvetica; font-size: 12px; font-style: normal; font-variant-caps: normal; font-weight: 400; letter-spacing: normal; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none;">
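<p dir="auto">For readers who want to try something similar, a minimal sketch of one way to query a local Ollama server about a single image follows. It is not the script used above. It assumes Ollama is listening on its default port 11434 with the granite3.2-vision model already pulled. The prompt text and the helper names <code>build_payload</code> and <code>describe_image</code> are hypothetical, and nothing here touches digiKam.</p>
<pre>
```python
import base64
import json
import urllib.request

# Default local Ollama endpoint (assumption: server runs on this machine).
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_payload(image_bytes, prompt, model="granite3.2-vision"):
    """Build the JSON body for a single, non-streaming vision request."""
    return {
        "model": model,
        "prompt": prompt,
        # The generate endpoint expects images as base64-encoded strings.
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        "stream": False,
    }


def describe_image(path, prompt="List keywords describing this photo."):
    """Send one image to the local server and return the generated text."""
    with open(path, "rb") as handle:
        payload = build_payload(handle.read(), prompt)
    request = urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    # Supplying data makes urllib issue a POST request.
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]
```
</pre>
<p dir="auto">Calling <code>describe_image("photo.jpg")</code> with a real file path would return the model's text, which could then be stored as a caption or keywords. The image never leaves the local machine.</p>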
<br style="caret-color: rgb(0, 0, 0); font-family: Helvetica; font-size: 12px; font-style: normal; font-variant-caps: normal; font-weight: 400; letter-spacing: normal; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none;">
<span style="caret-color: rgb(0, 0, 0); font-family: Helvetica; font-size: 12px; font-style: normal; font-variant-caps: normal; font-weight: 400; letter-spacing: normal; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none; float: none; display: inline !important;">Bill</span><br style="caret-color: rgb(0, 0, 0); font-family: Helvetica; font-size: 12px; font-style: normal; font-variant-caps: normal; font-weight: 400; letter-spacing: normal; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none;">
<br style="caret-color: rgb(0, 0, 0); font-family: Helvetica; font-size: 12px; font-style: normal; font-variant-caps: normal; font-weight: 400; letter-spacing: normal; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none;">
<span style="caret-color: rgb(0, 0, 0); font-family: Helvetica; font-size: 12px; font-style: normal; font-variant-caps: normal; font-weight: 400; letter-spacing: normal; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none; float: none; display: inline !important;">On 29 Mar 2025, at 14:26, Daniel Bauer wrote:</span><br style="caret-color: rgb(0, 0, 0); font-family: Helvetica; font-size: 12px; font-style: normal; font-variant-caps: normal; font-weight: 400; letter-spacing: normal; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none;">
<br style="caret-color: rgb(0, 0, 0); font-family: Helvetica; font-size: 12px; font-style: normal; font-variant-caps: normal; font-weight: 400; letter-spacing: normal; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none;">
<blockquote type="cite" style="font-family: Helvetica; font-size: 12px; font-style: normal; font-variant-caps: normal; font-weight: 400; letter-spacing: normal; orphans: auto; text-align: start; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration: none;">Hi,<br>
<br>
Now and then I read here about face recognition. Personally, I don't find it interesting, as my archive is organized by person.<br>
<br>
But what would be really fantastic would be "general image content analysis" using *local* AI.<br>
<br>
Like, for example, searching for images that contain<br>
- a group of persons<br>
- mountains in the background<br>
- a person with a teddy bear<br>
- somebody wearing a red shirt<br>
- somebody sleeping/laughing/sitting/running<br>
etc.<br>
<br>
Of course, Google, fakebook, X, etc. can analyze the content of images this way, and very precisely, but the (poor) tools available to the public only work with uploaded images, which, besides all the data protection questions, means "giving them away".<br>
<br>
A local AI (without uploading any data to anywhere) with such capabilities would be simply mind-blowing.<br>
<br>
I have no idea if something like this is even possible, or if it is already planned for the future of digiKam, but maybe my post here gives somebody the idea, and who knows?<br>
<br>
Have a nice weekend.<br>
<br>
Daniel<br>
<br>
<br>
<br>
<br>
--<span class="Apple-converted-space"> </span><br>
Daniel Bauer photographer Basel Málaga<br>
Twitter: @Marsfotografo (often explicit nudes)<br>
<a href="https://www.patreon.com/danielbauer">https://www.patreon.com/danielbauer</a><br>
<a href="https://www.daniel-bauer.com/">https://www.daniel-bauer.com/</a><span class="Apple-converted-space"> </span>(nudes)</blockquote>
</div>
</blockquote>
</div>
<br></div>
</div></div></blockquote>
<div class="plaintext" style="white-space: normal;">
</div>
</div>
</body>
</html>