[Kde-accessibility] [Accessibility] Re: [Accessibility-atspi] D-Bus AT-SPI - The way forward
Havoc Pennington
hp at redhat.com
Wed Dec 12 22:08:09 CET 2007
Hi,
Rob Taylor wrote:
> CC'ing the D-Bus mailing list as there's lots of interesting stuff here.
OK, this is about 400 topics in one email, cross-posted to 5 mailing
lists ;-)
I do think the dbus list could be really helpful on a lot of this, and
that in general people extending or having trouble with dbus have been
much too slow to mail the list.
Can I propose that if people are seriously working on, or seriously
having a problem with, any of these issues that they post a single-topic
thread to the dbus list per-issue.
The vast majority of these issues have been discussed before, and there
is frequently even a "plan of record" and we just lack someone
implementing it. A few of these issues are already solved, even, and
simply asking might save someone some trouble.
I'll comment in a random way on a few of the points in here below, but I
dread a thread with many topics on 5 different lists, so can I ask that
further followup be single-topic, change the subject line, and drop the
non-dbus lists? (assuming the followup is about dbus generally and not a11y)
OK, some quick comments on specific points.
Performance
===
D-Bus performance is already well-known and pretty well understood; I
got the same results as the AtSpiDbusInvestigation page in 2004:
http://lists.freedesktop.org/pipermail/dbus/2004-November/001779.html
and updated information and suggestions related to this here:
http://lists.freedesktop.org/archives/dbus/2007-October/008822.html
The basic fact is that for a round-trip blocking method call there is a
constant-factor overhead of around 3x vs. raw sockets. (ORBit is not a
lot slower than raw sockets, iirc.) If you use the bus daemon, this
inherently doubles, since there are double the number of "hops." Looking
at the causes of this overhead, it is difficult to change fundamentally;
I'm sure with effort you could get it down to 2x, and perhaps a bit
better by writing a libdbus replacement with less flexibility. But you
won't get to 1x.
The constant factor falls out of a number of design decisions, some of
them in the protocol, and others in libdbus. This was intentional. My
opinion is that these decisions were largely correct, but opinions vary.
(Some of the design decisions are easy to explore changing, such as
whether to validate the data; see the archive links above.)
Given that fact, I don't think there is much more to say. There is no
way application design should vary based on whether 2x or 3x was
measured. Unless you intend to hack on libdbus itself and want to drill
down into hotspots to fix them, what you as an app developer should have
in your mind is "there's a single-digit constant-factor overhead vs. raw
sockets" and "avoid round trips!!!!"
For AT-SPI I bet the bottom line is that you guys have got to reduce the
amount of traffic and round trips, probably by changing or extending the
API.
If you need something raw-socket-like, then stick to CORBA, or use D-Bus
for discovery only then set up a custom socket channel, or whatever.
There is no reason to drop CORBA when it is suitable. The reason for
dbus is that (in my opinion) CORBA was not suitable for many desktop
use-cases. That does not mean CORBA is not suitable for AT-SPI.
Richer Introspection Data
===
Re: struct names in the introspection, etc. I feel sure there are old
threads on this but am too lazy to dig them up.
Other than digging those up and posting the archive links, I guess the
first step would be to write up the motivation and proposal on the list.
In general it seems like a reasonable idea.
Interface Repository
===
We have certainly discussed on the list simply installing XML files so
static languages can access them at compile time. Not sure of the status
of this, but it should amount to a spec patch, maybe even we already
have a patch. There are definitely old threads and it's worth doing.
A runtime IR service, I'm not sure what it would be for. The dbus
approach is that each app provides its own introspection data.
This has pros and cons vs. keeping a global repository of types, but I'm
skeptical that doing it *both* the central IR way and the introspection
way at the same time makes sense.
IDL vs. XML
===
Remember that writing the XML files by hand is not intended. It would
not be needed if we had reasonable tools.
The original vision, which I still think would be best, is that you
implement the object in some language. The XML is then generated by
scanning that language - pulling docs from the language's native inline
docs, pulling interfaces from the language's native interfaces, etc.
This extracted XML can then be installed for use by static languages,
for example. Definitely this is how the GLib bindings were intended to
work; we did NOT want people to have to write an XML file then generate
code from it. (For the "server side" or object implementation, that is.)
A CORBA-like IDL fits into this same concept. The idea is that if you
wanted to hand-write your IDL, maybe a nice CORBA-style syntax would be
preferred. No problem. You use libIDL to write a little tool to convert
your nice syntax to the XML format.
In other words, the XML format is a lingua franca. That's part of why
XML is used, because you can write quick tools and scripts in Python or
Perl or whatever that manipulate the XML.
Passing Types With Objects/Structs
===
The major point here that I agree with: as Michael says, the
introspection calls probably add up to a lot of round trips, especially
for dynamic languages.
However, there are surely some good ways to optimize that which are
backward compatible and *simple* - i.e. that do not imply that dbus has
a global-across-all-processes type system or type repository, because it
doesn't and imo shouldn't.
<background digression>
A principle of the dbus design is that dbus is NOT a type system, it's a
marshaling system.
That was part of the whole point of dbus vs. CORBA: you are supposed to
use the type system *of your programming language* or *of your component
system*. dbus is a *marshaling* mechanism, not a universe of types. For
dbus, "struct A { int }" and "struct B { int }" are the same thing.
Unlike at least some of the old theories about how to use CORBA, dbus is
NOT intended to be a component system; it is NOT intended to be a way to
define cross-language objects or types. It *can* be used to *remote* a
component. See D-Bus FAQ for more on IPC vs. components.
The out-of-band introspection data is intended to provide optional hints
for how to generate a language binding. But it's also intended that the
introspection data can be ignored; you can just treat the dbus messages
as raw structured data.
</background digression>
So, how would I approach the introspection round trips and/or bandwidth?
I would think some combination of standard strategies, such as batch
calls to introspect multiple objects at once. Another possible approach
would be to have an implicit type repository *per application* -
something like an extended Introspect() call that lets you specify an
introspection context, and in an introspection context the app would
send you each interface only once, and refer to it by reference the
second and subsequent times.
Probably step 1 would be to profile the bottlenecks for the dynamic
bindings that use introspection, with some common apps they might be
introspecting. We should not add a bunch of complexity on performance
speculation, only on performance data.
For the solution, I think it's important to keep the layering that the
introspection data is an *optional hint* that can be used to interpret
dbus messages.
Object Paths
===
> In terms of attaching objects to a connection, it'd be really nice to
> have the attach method take not only a object path, but also a
> possible
> function for parsing the remaining components of a path whose prefix
> matches the given object path.
If I understand this correctly, this is already allowed. You can
register a handler for an entire subtree of the object path namespace.
The intended usage of that is to allow you to do your own path to
handler mapping, e.g. the example I usually give is that you could
register to handle "/documents" and then do your own interpretation of
"/documents/0", "/documents/1", etc.
Object References
===
There has been past discussion, possibly worth digging up, about some
standard format for a complete "IOR" - which would include a server
address, optionally a bus name, and an object path. i.e. the info to
allow you to create a DBusConnection and then create an object proxy.
The "shared connections" feature is intended to support this.
i.e. if you resolved this "IOR" you would have to get the DBusConnection
from the server address, then create your proxy. Without shared
connections, you would create a DBusConnection per proxy, which would be
absurdly, horrifyingly inefficient.
Anyway, I think shared connections are the only hard part about this
feature, and that part is already implemented.
I think the reason there's no "IOR" feature so far is because very few
people have needed it. It's rare to want to pass an object reference
that is "location independent" (not known to be on some specific bus or
provided by some other specific predefined program). When you're talking
to another program, and thus getting an object reference, you would
normally already know what bus that other program is on.
But, if someone thinks this feature through and codes it, it makes
sense, as I said the shared connections feature is already there and
intended to support it.
Binary Introspection Data
===
I don't see how this is worth the enormous pain of reimplementing all
kinds of stuff. It is not even clearly better than XML; there are plenty
of contexts where XML is more convenient. And there is no proven
performance problem, or proof that binary would be dramatically better,
or at least nobody has posted the proof where I've seen it.
I don't see a massive deprecation and reimplementation effort spanning
quite a few projects, justified purely by subjective aesthetics.
In any case, I bet performance would be better addressed via the
mechanisms discussed earlier - batching up the introspection data, or
allowing it to be passed "by reference" when the same app gets the same
interface a second time. Certainly that seems like the thing to try first.
Perspective
===
Let me say again. I know it's a lot of fun to screw with component
systems and type systems and IPC systems. (Obviously I've done it
myself.) However, we should not delude ourselves that this is especially
*worthwhile* in most cases.
Where are apps having the most trouble, doing the most things wrong,
etc.? Arcane improvements to the IPC system are not the answer.
Most of the problems are on a higher level. e.g. lack of convenience API
for stuff like this:
http://svn.mugshot.org/dumbhippo/trunk/client/linux/src/hippo-dbus-helper.h
Or in GNOME, we aren't even using dbus for the baseline, simple
functionality it already provides; e.g. there's still no single-instance
support in gtk. Why would we be adding all kinds of new stuff to dbus,
when we're still sucking at using the functionality we have?
Let's remember that DCOP was implemented in a very short period of time,
and was dead simple - MUCH less complex than dbus is - and people used
it heavily and successfully for lots of real functionality.
Havoc
More information about the kde-accessibility
mailing list