[Digikam-devel] Threading for face detection
Aditya Bhatt
adityabhatt1991 at gmail.com
Wed Jul 21 17:18:49 BST 2010
Hi,
I had some small fragmented discussions with Gilles, and Johannes about
using OpenMP to speed up the batch face detection.
Here is part of the conversation I just had:
"
Aditya Bhatt wrote:
> In my latest bunch of commits, I have added a face scanner to digiKam,
which
> detects faces in all images in every album. The relevant code is in
> utilities/batch/batchfacedetector.cpp.
>
> It works, but due to the nature of the algorithm, the CPU usage is very
high
> and digiKam UI slows down to a crawl. (Press the "rescan all images"
button
> at the top of the people sidebar at the left).
This is definitely a bug. Sounds like you are running the detection in
the main user interface thread. Instead it must be executed in (a)
separate thread(s).
> Gilles says that using OpenMP would be a good idea for this.
> I don't know anything about OpenMP, so I'd like it if you can have a look
at
> the code and suggest how to parallelize it.
OpenMP is one way to parallelize things but it's something that was
designed to work deeply in the algorithm code, e.g. by parallelizing
single loops etc. For your task I'd suspect that a more higher level
approach is better suitable because you simply can run the algorithm as
it is on several faces in parallel. This is normally done with a
task-based approach. You first create list of tasks, which would be one
face to scan here, and then give these tasks to an executor. I don't
know if Qt provides a thread pool, but this would be the most classical
versions to create such a service. You have a synchronized queue of
tasks that is filled from the one end with new faces to recognize and
that is dispatched by the executor to several parallel working threads
that perform the recognition algorithm.
For C++ there is one especially notable library that implements a task
pattern in a very versatile way: Intel's Threading Building Blocks. We
don't use them yet in digikam but I think this is definitely the way to
go if there is nothing similar in Qt.
The main statement should be: don't manage threads on your own whenever
you can. Instead focus on tasks that can be executed in parallel. If you
want to improve a single algorithm, then OpenMP would be a solution but
not for these high-level tasks. Also, Intel's TBB include a solution to
parallelize loops in a way that's more or less equal to OpenMP.
If you want to have more insight on task-based approaches you can have a
look at the java.util.concurrent package from standard platform.
Executor and ExecutorService etc. are a very well designed implementation.
"
Also, I'm not very knowledgeable about TBB or even OpenMP for that matter.
So I've started the thread so that Alex and Marcel can join in...
Aditya
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mail.kde.org/pipermail/digikam-devel/attachments/20100721/b1d76f36/attachment.html>
More information about the Digikam-devel
mailing list