Entry tags:
Life beyond Distributed Transactions
[Another one for the programmers. This time, we're going to be talking about architecture, so it's mainly aimed at the folks who are hardcore, and want to understand how modern super-gigantic systems can work.]
A few weeks ago, I got a pointer to the paper Life beyond Distributed Transactions: an Apostate's Opinion, and I just finished reading it. It's a position paper by Pat Helland, written while he was at Amazon, and it isn't new (it dates to 2007), but it is *enormously* important -- it turns out to establish a lot of the reasoning behind Akka, the platform that Querki is built upon.
The paper basically looks at the problem of building nigh-infinitely-scalable systems. Historically, most folks simply assumed that their program was going to run on a single machine, or on such a tight cluster of machines as to make little difference. A world like that leads you to certain assumptions -- in particular, the assumption that you can structure your application's data more or less arbitrarily, and let database transactions paper over the complexities. But in the modern web-based world, where a successful program is going to run on anywhere from dozens to thousands of nodes, often scattered geographically, that just plain doesn't *work* any more. Worse yet, in the modern world you often have to shard your database, so the whole concept of distributed transactions falls apart. So the paper asserts that distributed transactions have proven to be a scalability failure in practice, and that successful systems work quite differently.
(I discovered the paper when someone on the Akka list asked how you support distributed transactions inside Akka, was told "don't do that", and was pointed to this paper to explain why. Indeed, Akka actually *had* support for in-memory transactions between Actors, and has consciously dropped that on the grounds that it can't scale. Akka is increasingly insistent that any serious program should be scale-agnostic from the beginning, so they're deprecating elements that can't scale arbitrarily.)
The paper winds up outlining an architectural approach built around Entities that is quite similar to an Actor architecture. Moreover, it describes some of the techniques necessary to make this architecture robust, and how the business logic should be structured if you want to make it work. As far as I can tell, this paper is strongly influencing the direction of Akka, which has moved beyond simply duplicating Erlang in a better language, and is rapidly turning into a much more serious and complete platform for building large-scale applications. The newest release of Akka actually contains built-in support for a number of the ideas suggested here, and the end result is *wildly* different from a conventional program architecture, especially in how the data gets managed.
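To make the Entity idea concrete, here's a minimal sketch in Scala, using classic Akka actors. (Everything here -- AccountEntity, the message names, all of it -- is my own invention for illustration; it isn't code from the paper, from the Akka libraries, or from Querki.) The point is that each Entity is a single Actor that owns all of the state inside one transactional boundary:

    import akka.actor.{Actor, ActorSystem, Props}

    // Messages understood by one kind of Entity (hypothetical names).
    case class Deposit(amount: BigDecimal)
    case class Withdraw(amount: BigDecimal)
    case object GetBalance

    // Each Entity is an Actor, identified by a key, that owns all of the
    // state inside one transactional boundary. Only this actor touches
    // `balance`, so updates are serialized with no distributed locking.
    class AccountEntity(accountId: String) extends Actor {
      private var balance: BigDecimal = 0

      def receive: Receive = {
        case Deposit(amount)  => balance += amount
        case Withdraw(amount) if amount <= balance => balance -= amount
        case Withdraw(_)      => sender() ! "insufficient funds"
        case GetBalance       => sender() ! balance
      }
    }

    object EntityDemo extends App {
      val system = ActorSystem("entities")
      // Callers hold only a reference; where the entity actually lives
      // is an infrastructure decision, not a business-logic one.
      val account = system.actorOf(Props(new AccountEntity("acct-42")), "acct-42")
      account ! Deposit(100)
      account ! Withdraw(30)
    }

Anything that spans two Entities -- transferring money between two accounts, say -- can't be a shared transaction; it has to be a conversation of messages between them.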
I'm gratified to see that the paper's suggested approach is fairly close to the way I designed Querki, and for many of the same reasons. (I don't mind reinventing the wheel, so long as it's round.) That said, it provides some very important food for thought that I'm going to need to build in -- for instance, it argues that you need to focus on at-least-once messaging, and be tolerant of duplication at the business-logic level. (Akka is naturally at-most-once messaging, but is beginning to add support for at-least-once now.) I don't agree with every detail of the paper -- in particular, its author goes just a bit too far in asserting that the individual Entities must be completely location-agnostic (Querki makes some extremely intentional assumptions about how Actors are grouped together, for efficiency) -- but by and large, it's pretty sound stuff.
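What does "tolerant of duplication at the business-logic level" actually look like? The usual trick (again, a hand-rolled sketch with invented names, not real Akka machinery) is to tag each message with a unique id, and have the receiving Entity remember which ids it has already applied:

    import akka.actor.Actor

    // Hypothetical message carrying a unique id assigned by the sender.
    case class Transfer(msgId: Long, amount: BigDecimal)

    class DedupingAccount extends Actor {
      private var balance: BigDecimal = 0
      private var seen = Set.empty[Long]  // in real life: bounded, and persisted

      def receive: Receive = {
        case Transfer(msgId, amount) =>
          if (!seen(msgId)) {  // first delivery: actually apply the effect
            seen += msgId
            balance += amount
          }
          // A redelivered duplicate falls through harmlessly, but we still
          // acknowledge it, so the sender can stop retrying.
          sender() ! s"ack-$msgId"
      }
    }

Once the effect is idempotent like this, at-least-once delivery becomes safe: a retry either does the work or gets quietly swallowed.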
Anyway, go read it. It's dense stuff, but it's not very long, and it sets out some very sensible arguments that are grounded in the reality of what has been found to actually work. If you have any interest in the construction of truly large-scale systems, it is totally worth understanding...
no subject
I also have two books on Distributed Algorithms and Distributed Systems that I recommend.
I'm not convinced that Distributed Transactions are necessarily impossible, just that they have a significant timing problem. The first few chapters of Jim Gray's "Transactions" book were very influential on my thinking.
(Oversummarizing: transactions are possible when any operation can be reversed, and when you can track in a realistic way whether your aggregated operation was a success or a failure. I think one can do that, to a limited degree, in highly scalable and distributed systems, but the time-window when things do not have a coherent state or worldview can be quite long, and determining whether all the actors have completed or returned to the original state can be difficult. It CAN be known, but it cannot ALWAYS be known.)
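To put that sketch in code (plain Scala, all names mine): a "transaction" built from reversible steps can be rolled back on failure, but between the first failure and the last undo there is no coherent worldview -- exactly the window I mean:

    // A step that knows both how to perform its effect and how to reverse it.
    case class Step(name: String, run: () => Boolean, undo: () => Unit)

    // Apply steps in order; on the first failure, reverse everything that
    // completed, in LIFO order. Returns true only if all steps succeeded.
    def runReversibly(steps: List[Step]): Boolean = {
      var done: List[Step] = Nil  // most recently completed step first
      for (step <- steps) {
        if (step.run()) done = step :: done
        else {
          // From here until the undos finish, other observers may see a
          // partially-applied, incoherent state.
          done.foreach(_.undo())
          return false
        }
      }
      true
    }

And the caveat shows up immediately: if an undo() itself fails, or a step simply never reports back, you can no longer always determine whether you've returned to the original state.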
no subject
The approach espoused here (which I've come to agree with pretty strongly) is that it makes more sense in the modern world to instead think in terms of data boundaries, and architect around the notion that you only have transactionality within those firm boundaries. In practice, it's a different way of tackling those inconsistencies, acknowledging that you usually have to get the business logic a bit involved to deal with them well.
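For instance (a hand-waved sketch with invented names -- not real Querki or Akka code): moving something between two Entities isn't one transaction, it's a conversation, and the business logic explicitly owns the in-between states:

    import akka.actor.{Actor, ActorRef}

    case class Debit(amount: BigDecimal)
    case class Credit(amount: BigDecimal)
    case object Debited
    case object DebitFailed

    // A workflow actor that coordinates two Entities purely with messages.
    class TransferWorkflow(from: ActorRef, to: ActorRef, amount: BigDecimal)
        extends Actor {
      from ! Debit(amount)  // step 1: entirely inside `from`'s boundary

      def receive: Receive = {
        case Debited =>
          to ! Credit(amount)  // step 2: entirely inside `to`'s boundary
          context.stop(self)
        case DebitFailed =>
          // No rollback machinery to invoke: deciding what a failed
          // transfer *means* is a business-logic question.
          context.stop(self)
      }
    }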
no subject
When you build world-scale systems, certain assumptions that were always flawed begin to show their cracks.
To my mind, there is a sort of philosophy here as well of "better prevention of problems than detection of problems". Transactions prevent (or minimize) certain problems. But they don't scale. Detection is all that is left.
no subject
For this particular problem, it isn't really possible to solve it in a completely general way, so the approach is to instead stick the problem right in your face, so that it's hard to ignore. To that end, the new Akka systems make the point about at-least-once delivery over and over again, emphasizing that you need to build it in as an assumption. (Don't know if they have automatic support for hammering it in the Akka TestKit yet, but I suspect somebody'll add a standard "deliver some duplicates" test mode before long.)
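Such a test mode wouldn't be hard to sketch by hand. None of what follows is real Akka or TestKit API -- just a hypothetical forwarding actor that randomly re-sends messages, to flush out handlers that aren't duplicate-tolerant:

    import akka.actor.{Actor, ActorRef, Props}
    import scala.util.Random

    // Forwards every message to `target`, and with probability `dupProb`
    // forwards it a second time, simulating an at-least-once redelivery.
    class DuplicatingProxy(target: ActorRef, dupProb: Double) extends Actor {
      def receive: Receive = {
        case msg =>
          target forward msg
          if (Random.nextDouble() < dupProb)
            target forward msg
      }
    }

    object DuplicatingProxy {
      def props(target: ActorRef, dupProb: Double = 0.2): Props =
        Props(new DuplicatingProxy(target, dupProb))
    }

Point the test's messages at the proxy instead of the real actor, and any handler that isn't idempotent starts failing intermittently -- which is the point.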
I confess, it's kind of neat watching this evolution. The Akka support for this model is new (just released a few weeks ago), and explicitly experimental, but seems to be fairly robust and infrastructure is filling in fast. My guess is that, within a few years, we're going to have a nicely thorough environment for quickly building massively-scalable systems using the JVM stack...
no subject
So, Akka is getting away from pi calculus; interesting. It would also be interesting to know whether there's an appropriate calculus for this model, like a transactional pi or something.
no subject
This subject comes up now and then on the Akka list -- folks want to model high-level process flow, and find that Akka is a bit too close to the metal for what they want. (This usually starts off with someone complaining about the fact that Actors don't compose.)
My general sense (and I get the impression that others feel the same) is that Akka is a toolkit that you could build that sort of higher-level view *in*, but it isn't that itself. At least, not yet: they basically started down at the low levels of the stack, and it's gradually evolving up the semantic layers. (E.g., there *is* a dataflow-expression DSL in Akka, but I don't think it's getting tons of usage yet.)