Amarok 3 architecture brainstorm (was: GSoC considerations)
Matěj Laitl
matej at laitl.cz
Fri Apr 12 14:29:52 UTC 2013
On 12. 4. 2013 Edward Toroshchin wrote:
> My personal view on this point being: the engine, media database and UI
> (at least) must be separate processes, so that:
>
> + at least the first two can be made rock-solid, valgrinded,
> unit-tested, etc.,
> + the data structures and flows are strictly defined,
> + no bullshit a la "we can't use class XXX from YYY thread if ZZZ",
That won't go away, unless you disable threading entirely (which you probably
don't want, moving all our async code to separate processes could be a lot of
work). But see below.
> + theoretically, any component should be completely replaceable.
>
> I'll try to describe the benefits (as I see them) of these changes
> below (if you bear with me).
>
> * which parts of the existing code can be spared and reused, and which
> would rather be killed with fire;
> * and so on.
I think I share 100% of the problem statement with you. In my words, our
"components" are way too much interdependent and therefore fragile to
seemingly unrelated changes to the extent that it makes development tricky.
I also share that our components (and I mean components like "playback",
"playback queue model", "collections model", "QWidget gui") must be made more
independent, replaceable and behave according to more well-defined semantics.
Our heavy usage of the singleton pattern makes it only worse.
I also agree that we should have a technical measure to actually ensure the
independence, not just encourage it and hope it happens.
OTOH I don't share the idea that multi-threading is inherently bad. For me,
multithreading *is* the fun part and classes like ThreadWeaver::Job make it
easy. The main source of problems we may currently have with multithreading is
IMO that many parts of Amarok design and implementation didn't anticipate it
at all, and were perhaps written by developers that didn't know much about it.
However I think our code-base got a lot better in this regard compared to
let's say 2 years ago.
> Now, as promised, my view on the process-based architecture.
>
> Although switching to IPC may seem over-complicated, I think it would be
> actually much easier to live with than our current thread zoo.
> Currently, before any tiny bit of code can be changed, a lot of things
> have to be considered, which raises the entry difficulty for developers
> a lot (and we're not in the best of situations there ATM).
>
> So here are the benefits, as I see them (in a loose order of decreasing
> importance):
>
> * it would be easier to debug and develop upon individual components
> (hence, improving the quality, userbase, and developer community),
I think this is tied to more general idea of better independence of the
components rather than to IPC suggestion specifically. When it comes to
debugging, I think this is very subjective and varies case-by-case, for
example debugging out-of-process kio slaves in gdb sense is all but pleasant
[1].
> * database would be easy to replace (mysql, nepomuk, whatever), it
> might even become plausible once more to maintain several backends
> (although a sql and nepomuk would probably be the favorites),
Again not specific to IPC, but of course a goal we want to achieve and we
hopefully are already marching towards that direction. The direction is that
SqlCollection will be a plugin equal to all others (you would be able to
disable it, etc), esp. to NepomukCollection in future.
> * the playback engine would do only playback, do it well, and not fuck
> up anything else (I believe currently the first question that's being
> asked in a bug report is 'have you tried changing the playback
> backend'),
Yup, EngineController should be made a proper component with and Abstract Base
Class declaration and defined semantics. And I think a lot of its current logic
is reversed (i.e. EngineController asks Playback Queue model to feed it with
the next track. Hello? Why does it even know that Playback Queue exists? It
should just signal "I'm about to finish the current track")
The comment about bug is true, however that question is sometimes asked
needlessly.
> * UI might be replaceable. We might want to have a variant of an UI for
> low-performance systems. Web UI (someone mentioned recently)? Easy!
Exactly. I'd love if our UI was just a component (although a big one) perhaps
called "QWidget MainWindow". One day we might have "QML MainWindow" or
whatever.
> * the environment and resource usage could be contained within each of
> the processes. Think MySQL (all the exit()'s, all the *argv, all the
> pain and misery and 500-megabyte caches for no reason).
Hmm, right, but what would be an advantage of that? I think that summary
memory usage of the IPC solution would be higher - a little duplication of
otherwise shared data is IMO inevitable.
> Of course, this may not require splitting the components into processes
> per se.
Yes, this is exactly my point.
> But I still think the split is important, because it requires a
> good fundamental design, and confines the development to it. Which is a
> good thing.
I agree 99%, I just suggest that "some split" may not necessary be IPC design.
See also [2] for some background info on IPC design and why it may not be the
best thing under sun, article by our lovely Aaron Seigo.
I consider rude just criticising without offering and alternative, so here's my
suggestion, something I've been toying with for 6 months. Please spend a few
moments thinking about it.
Everything Generic is a Component
===================
* code would be (re)split into "core" and everything else, where everything
else would be an implementation of a component (or a part of it); core
would contain just:
* well documented declarations for components (abstract base classes for
e.g. Logger (done; subject to rename to Notifier), CollectionManager,
EngineController, PlayBackQueueModel, mainwindow ui, config ui, tray
icon, OSD, ScriptManager)
* declarations for common abstract classes like: Meta::Track & friends,
Collection, CollectionLocation, Playlist, Observer ...
* a sort of "application controller", let's call it Amarok::Components
that would manage the components
* each component would have 2 accessors: strong and weak. strong accessor
would create the component if it is not already alive. It would be the
only way to instantiate a component.
* A component is not a singleton. It is memory-managed (using exclusively
Q{Shared,Weak}Pointers), created on demand and destroyed when not in use.
Components can depend on other components in 2 ways:
* strong dependency: it calls the strong accessor in own constructor and
holds a QSharedPointer. For example PlaybackQueue would depend strongly
on EngineController. Used with caution; circular strong dependency is a
fatal error.
* weak dependency: the master component may call the weak accessor,
getting may-be-null QWeakPointer, may store the weak pointer. It
temporarily promotes the weak pointer to the strong one when it needs to
call slave component methods. The master component can by notified about
slave component creation/destruction via Amarok::Components.
* In fact, nothing is a singleton. Amarok::Components is created in main.cpp
and is parented to QApplication. Only utility singletons internal to
a single component are allowed.
* things like SqlCollection are not components. This one is considered a part
of the "CollectionManager" component, though implemented in an extra shared
library. I.e. there should be only one component of a given *type* at a
time, but perhaps multiple implementations of a type, and you don't know
which one you've got.
* in other words, "core" would be our API (with no stability guarantee) and
everything else will use/implement it. How this could be enforced? By
linking
* currently, everything links at least to amarokcore and amaroklib and
amaroklib contains virtually everything including the GUI. This is very
prone to adding interdependencies.
* in this proposal, amaroklib will cease to exist. Much of it will be moved
to "QWidget MainWindow", some declarations to core and some
implementations to separate modules.
* amarokcore will be the only Amarok thing to link to, therefore ensuring
that only well-defined public APIs are used.
Of course, this is a very long road, but at the end, we should get the
following for free:
* when you minimize Amarok to tray, the whole mainwindow component could be
deleted; starting Amarok to tray should be blazing-fast as the bulk of gui
won't be loaded until you open it.
* when you stop playing, the enginecontroller and perhaps all phonon thingies
could be destroyed. (questionable)
* porting Amarok to let's say smartphones should be as easy as providing
alternative implementation for "GUI" and perhaps "EngineController"
components.
The single most important advantage if this approach is that it could be done
*gradually*. Other advantages of modularization mentioned by Edward should
still hold.
What do you think? This is of course a rough draft and I have omitted some
details I've thought about.
Regards,
Matěj
[1] http://techbase.kde.org/Development/Tutorials/Debugging/Debugging_IOSlaves
[2] http://aseigo.blogspot.cz/2013/02/process-separation.html
More information about the Amarok-devel
mailing list