[Kde-pim] Problems with infrastructure

Tue Dec 16 22:58:02 GMT 2014

On Wed, Dec 17, 2014 at 10:23 AM, Jan Kundrát <jkt at kde.org> wrote:
> Hi Ben,

Hi Jan,

>
>> It isn't just
>> the tool itself which has to be maintained: we have commit hooks,
>> integration with other bits of infrastructure and so forth which also
>> needs to both be implemented and maintained.
>
>
> In case of Gerrit, there is no need for custom hooks as they stay on
> git.kde.org, and therefore I believe this point is not relevant to its
> adoption. The whole setup has been designed and implemented in a way
> suitable for long-term parallel operation alongside KDE's git.

I was referring to the audits here - commits can't make it into Gerrit
which could never be replicated to git.kde.org.

>
> As for the integration bits, they're done now. The tool just talks to LDAP,
> and maintenance of that connection is effectively zero, unless a dramatic
> revamp of our LDAP is planned. The repo mirroring was a matter of setting up
> a single user account and configuring proper ACLs, and these are also
> finished already.

If Gerrit were to become primary infrastructure then there would be an
expectation it retrieves SSH keys from LDAP in some way.
This is how Subversion and Git work from the developers point of view.

>
> I can understand the general reasons for limitting the number of services
> which we offer and support. However, I would appreciate if we mentioned how
> big these costs are, as there's some room of misinterpretation otherwise.

The cost varies depending on the tool. For some things - like Drupal -
the cost is quite minimal, especially when well known and used modules
are used.
The more integration which is required, the higher the cost basically.

Anything around source code management requires higher level
integration as a general rule. People expect hooks to close things, CI
integration and the like.

>
>> The more custom work we have, the harder it is to upgrade things.
>
>
> While true in general, I fail to see how it is relevant to Gerrit. What
> custom bits are involved here?

Depends on what functionality it is missing for becoming a full blown
KDE tool. If we keep it strictly as a code review tool, you still need
to integrate the audit hooks.
Plus it needs to work with whatever our CI system is - and that is
currently Jenkins.

If we were to replace Jenkins, you have indicated that custom work
would be required to get reports for tests and tools like cppcheck
generated and published.
That has a maintenance cost as well.

>
>> We'll confuse newcomers if
>> projects A, B and C are reviewed on tool X while projects D, E and F
>> are reviewed on tool Y.
>
>
> I haven't received a single complaint from the Trojita GCI students about
> any hardness in this. They did struggle with making an actual change to the
> code, with proper coding style, with commit message formatting, with git in
> general, they even failed to understand the written text about CCBUG and BUG
> keywords in the our wiki, but nope, I haven't seen them struggle with a need
> to use Gerrit or its differences to RB. YMMV, of course.
>
> Because the majority of complaints actually came from people who are
> well-versed with ReviewBoard, my best guess is that there's muscle memory at
> play here. This is supported by an anecdote -- when I was demoing the RB
> interface to a colleague who maintains Gerrit at $differentOrg, we both
> struggled with finding buttons for managing a list of issues within RB. It's
> been some time since I worked with RB, and it showed.
>
> I remember having hard time grokking the relation between a "review" and
> "attaching/updating a file" on RB. I didn't read the docs, and it showed.
>
>> A single tool would be best here. Let me make
>> clear that it is not a case of Reviewboard vs. Gerrit here - as other
>> options need to be evaluated too.
>
>
> I understand that people would like to avoid migrating to Gerrit if a
> migration to a $better-tool was coming. Each migration hurts, and it makes a
> lot of sense to reduce the number of hops.
>
> However, what I'm slightly worried about is postponing Gerrit indefinitely
> until all future replacements are evaluated. I don't see people putting
> significant time into any alternative for code review right now. Do we have
> any chances of these people making themselves known in close future? How
> long would be a reasonable waiting time for a testing deployment of
> alternate tools? When are we going to reduce our candidates just to the
> contenders which have been deployed and tested by some projects?

We don't postpone things indefinitely. This change around code review
and project management has been brewing for a long time.

>
>> In regards to the difficulty of Gerrit - I tend to agree with this
>> argument (it took me at least a minute to find the comment button, and
>> I didn't even get to reviewing a diff).
>
>
> The documentation, however, explains the functionality in a pretty clean
> manner, see
> https://gerrit.vesnicky.cesnet.cz/r/Documentation/user-review-ui.html .
>
> We also aren't the first project trying to work with Gerrit, so there's
> plenty of tooling available right now, not "to be written". There's
> text-mode interface, the "gertty" project, there's integration in QtCreator,
> there are pure-CLI tools for creating reviews, another web UIs in
> development, there are even Android clients.
>
>> Plus there are major concerns
>> with integration into our infrastructure, such as pulling SSH keys
>> from LDAP for instance (no, you can't have the tool maintain them
>> itself - as mainline Git and Subversion need the keys too).
>
>
> Yes, SSH-keys-in-LDAP is a PITA, but given that one needs a patched OpenSSH
> to look up keys from LDAP anyway, I don't think this is a blocker issue. The
> situation is exactly the same with the Gitolite setup which currently runs
> on git.k.o though, as that doesn't talk to LDAP either. As you mentioned
> during our IRC chat, there's a Python daemon which polls for changes in
> LDAP, and propagates these into Gitolite's config backend in a non-public
> git repo. Why wouldn't this be enough for Gerrit, then?

Because Gitolite doesn't offer people a UI to change their SSH keys.
Modification would still be needed to Gerrit to switch this off - and
to put a message telling them to head to KDE Identity.

>
> Gerrit has both SSH-authenticated API and a REST HTTPS API for adding and
> removing of SSH keys by an admin account. *If* this is needed, I'll be happy
> to make it work, it's simply a matter of calling two trivial scripts. Would
> you see any problems with hooking into the identity webapp or its backend,
> if there's any, for this? An edge trigger would be cool.

There is nothing publicly exposed. I would prefer to avoid tieins to
the web app itself, as it needs replacement as well.
As for the backend, you can use standard syncrepl to watch for
changes. OpenLDAP doesn't support anything else.

>
> Are there any other concerns?
>
>> Please note that any discussion of tools should be on the merits of
>> the tools themselves. Things like CI integration are addons, which
>> both Reviewboard and Gerrit are capable of.
>
>
> With varying levels of ease, I should add. In the end, everything is
> achievable, and you can write tools which automatically pull patches sent
> through mailing lists and build that, but the question is who is going to do
> the work, and when it's going to be ready. I know that the CI+Gerrit thing
> is now done and solved, and I also know that I won't be spending my time in
> redoing the same for ReviewBoard. Nobody bothered to do this pre-merge CI
> for RB in the past years. Do we have some volunteers now?

As a general rule, despite a few people hinting that it would be nice,
nobody has indicated they would be willing to work on it.
I certainly haven't seen any research on it.

>
>> The only reason we don't
>> have Reviewboard integration yet is a combination of technical issues
>> (lack of SNI support in Java 6)
>
>
> And nobody caring enough to do the work, I suppose. AFAIK Jenkins runs on
> Java 7 just fine, but apparently nobody found time for such an upgrade.
> There's nothing bad with this, of course, but it doesn't suggest that
> suddenly these people will have time for setting up early CI with RB.

Don't know if you are talking about us or upstream here.
For us - Jenkins runs on a Debian Squeeze-LTS system at the moment,
which locks it to Java 6.

>
>> and resource ones (some projects take
>> a long time to complete, and i'm concerned we don't have the
>> processing power).
>
>
> This is not really an all-or-nothing question. If there's a project which
> exceeds our current HW possibilities (you mentioned Calligra before, right?)
> and we cannot easily get more HW (did someone ask the foundation's
> treasurers for funds for HW rental, or approached some of the obvious
> candidates such as RedHat or SuSE asking for HW access?), perhaps that
> project can simply be omitted from these pre-merge CI runs.

When I mention "don't have the power" I mean "we don't have it at the moment".
I always assume more resources aren't available when considering
things - as in the current climate it is best not to over expect
things.

>
> We've chatted a couple of times about the limits of the current CI setup,
> about its inability to perform checks in parallel. Is the existing
> architecture going to scale with these pre-merge CI runs without substantial
> changes?

Define "perform checks in parallel" please.

>
> Again, this is something which is solved now with the CI setup that is
> behind Gerrit. The changes were into the glue code, the code which schedules
> the builds, distributes the jobs and which decides when to build what. It's
> still using the KDE CI's Python scripts for managing library deps and for
> actually launching the build (I sent the necessary patches your way).

Jenkins itself can handle that fine - the scalability issues I see are
hardware limitations.
We only have 3 systems at the moment - and only one of those is of
high CPU calibre.

>
>> In terms of a modern and consistent project tool - I agree here. A
>> long term todo item of sysadmin is to replace projects.kde.org. The
>> question is of course - with what. Chiliproject is now unmaintained,
>> so we do have to migrate off to another solution at some point. If the
>> new tool happens to be more integrated in terms of code review, that
>> is a bonus from my point of view (as it means the integration will be
>> better, and there is one less piece of infrastructure to maintain).
>
>
> See above for my view of mixing the quest for finding a decent project
> management tool and for finding a good code review system. To go a bit to
> the meta side, IMHO, a universal tool which does plenty of things in a
> not-excellent manner is worse than using diverse set of tools which do one
> thing each, and do it well. That's why I see a Chilliproject replacement to
> be an orthogonal topic to the choice of a code review tool.

The problem is many of the options we have seen thus far do both.
Phabricator for instance.

>
>> @Jan: could you please outline what you consider to be the key
>> advantages? At the moment I understand that you are after:
>>
>> 1) CI integration to pre-validate the change before it gets reviewed
>
>
> - With the CI actually testing not just the change in isolation (as a result
> of it on top of what was at the repo at the time the change was made), but
> the result of the change as applied to the current state of the repo at
> merge time, and with user-visible possibility of retruggering a check job.
>
> - Being able to do "trunk gating" for projects which care enough. That is,
> there's a tool which makes sure that there are no regressions, and which
> won't let in commits which break the build or cause tests to fail. (And yes,
> I know that there'll always be an option of direct pushes, I am not pushing
> against that, it appears to be a point which people require. OK with me.)
>
> - Being able to do cross-project verification, i.e. "does this change of kio
> break plasma-framework?"
>
> - Performing builds on various base OSes, against the Qt version provided by
> system (Trojita aims at supporting Qt 4.6-4.8 and 5.2+), using ancient
> compilers (yes, C++11 and GCC 4.4 are so much fun when taken together) and
> different mixes of optional dependencies and features to be built. In short,
> testing in the env people will be using, including Windows and Mac OS X.

Most of this is the capability of the CI system itself.

>
>> 2) Ability to directly "git pull" the patch (which Phabricator's arc
>> tool would meet I believe?)
>
>
> With Gerrit, one has an access to the full history of each change, including
> accessing them in offline mode. This is opt-in, so people who don't care
> will not have their clones "polluted by this nonsense", while people who do
> care have them "enhanced by this valuable data". This happens with no extra
> tooling. I'm sure I could come up with e.g. local scripts that build these
> git refs from the (history of) patches on RB, Phabricator, GitHub, Gitorious
> or whatever, but native support trumps scripting each time.
>
> According to the docs, `arc` is something which just pushes patches around.
> Working with patches is different from having a git ref to work on.
>
> Have you ever used OpenSUSE's Build Service and its CLI client, the `osc`?
> That's what happens when one tries to reimplement a SCM when working on a
> tool whose primary focus is something else. My favorite misfeature is the
> "expand and unexpand linked packages" thingy, and the associated hiding of
> merge conflicts when a source package changes. That's not fun to debug, and
> it won't ever happen with plain git because git comes with excellent support
> for merges. That support is a result of many years of extremely heavy use of
> git by a ton of developers. I don't expect Facebook to be able to cope with
> *that* regardless of their engineering size, and therefore I expect that
> `arc` will fail when people use it for non-obvious stuff.
>
> I suppose most of our developers are either already familiar with git, or
> they have to learn it anyhow to be able to participate in our community in
> an efficient manner. Introducing another patch manager to the mix doesn't
> help, IMHO. This is just to illustrate my experience with tools that
> behave-quite-like-a-SCM but do not actually implement full SCM
> functionality. It works well for quick demos, but it sucks when you start
> using them seriously.
>
> I would encourage anyone who evaluates these tools to pay attention to these
> not-so-obvious usage issues. Having a CLI tool that can fetch a patch and
> apply it to a local checkout is not equivalent to native git refs. It's an
> important building block, but not a finished tool.
>
> I know that I won't be in a business of building these tools. I'm quite
> happy with having them out-of-box with Gerrit.

So to cut a long story short: you want native Git refs and nothing less will do?

>
> Cheers,
>
> Jan

Regards,
Ben

>
> --
> Trojitá, a fast Qt IMAP e-mail client -- http://trojita.flaska.net/