Unlimited Novelty: Relativity for Programmers (A.K.A. Oh CAP!!!)

Tuesday, March 15, 2011

Relativity for Programmers (A.K.A. Oh CAP!!!)

When you first start to learn physics you typically learn Newtonian Mechanics. Objects have inertia. Force is mass times acceleration. For every action there is an equal and opposite reaction. These all seem straightforward.

Albert Einstein turned the world on its head. Everything is relative. Time is relative. Simultaneity is relative. There is no universal clock.

The same thing happened in the computer science world, not with a bang or the attention that was given relativistic physics, but with a small crowd paying attention. CAP Theorem has the same implications for computer scientists as relativity had for physicists, but no one is really paying attention.

Computer scientists may dream of a fully consistent database distributed across a network, capable of two-phase commit and typical consistency guarantees. Not many think about the implications of such a database.

Imagine you had your database split across two datacenters. Let's call them Foo and Bar. While at one point in time, Foo and Bar could talk, now they can't. Perhaps a clumsy janitor spilled the bucket for his mop, and it destroyed all of your routing hardware between your datacenters. Some customers are accessing Foo and some are accessing Bar. If they want to perform a read, they'll get data that's possibly stale. If they want to perform a write, what happens?

CAP Theorem

CAP Theorem is possibly the most important property of distributed systems that you can possibly understand. It's been motherfucking proven by computer scientists, bitches! So what does it state? The core idea is that consistency which is both instantaneous and global is impossible, except on a single node. As soon as you have more than node storing data for you, you either have to sacrifice availability or consistency. Much like how you can only infer the state of the universe from old light beams hitting your light cone in relativistic physics, distributed systems can only infer the state of other nodes via the protocols they communicate with. These protocols can fall into one of three different forms per the CAP theorem:

CP: we have masters you have to listen to, bitch slaves. You can try to change the cluster state but all changes have to go through the masters, and if your master is split, you can't modify any state you bitch submissive node
AP: sure, we'll accept reads and writes. We'll try to eventually resolve any conflicts in the event of a netsplit. If we're cool we'll let you specify your own conflict resolution mechanism in the event of conflicting writes, otherwise we'll just do last writer wins.
CA: the network broke? Whoa. I was totally not expecting that. I was just happily doing bongrips until you barged in. I'm so high I don't even know what's going on.

If you're writing a datastore you consider to be "CA", you're doing it wrong. Fortunately, most datastores that claim to have "CA" properties are actually CP. That is to say, in the event of a network partition, they sacrifice availability. Reads to the cluster hopefully work, but writes shouldn't. Writes should error or time out for a consistent system. Any system that functions this way is in fact "CP". That is to say: it provides consistency, but in the event of a netsplit, you lose availability, at least for writes. Ideally these systems still provide stale reads.

Calling your system CA is an admission of the fact that if a partition separation occurs, and due to writes each partition has a separate view of the world, when the netsplit resolves your partitions have no mechanism for resolving conflicting states between partitions and thus your datastore is just fucked. Short of a system administrator picking which view of the world they consider to be the best and blowing away the state on the other nodes, the system state is going to be inconsistent and irresolvable.

So what are the alternatives which are partition tolerant and don't break in the event that the janitor spills mopwater on the router that routes packets between your datacenters?

Option #1: sacrifice consistency for availability

This option lets clients read and write from any partition. When connectivity between partitions is restored, there's a plan for resolving all of the writes into a consistent world state. The simplest plan is last writer wins, which is used by Cassandra. Whoever was the last one to alter the cluster state provided the definitive cluster state, simply by virtue of writing last.

In an eventual consistency model, clients may have inconsistent views of the world. This is unfortunately a fact of life. State may unexpectedly change by a particular client's view, because it's being modified on a different partition and those partitions managed to sync up. The road ahead of you may be a beach, but after syncing up with another part of the world, it may turn into a cactus. Such is life.

Option #2: sacrifice availability for consistency

Say you have a master/slave data store, and someone writes to a slave. Wat do? Slave can't talk to master. What's the most obvious option? Error.

This is a "CP" system. Slaves can serve stale state, but if you want to change the state, you have to talk to the master for that state. State can be partitioned/shareded/durr between multiple masters, but if the master for the particular state a slave wants to modify isn't available, sorry, that's a paddlin', or rather an error.

Option #3: highly available, and consistent at the partition level, netsplit fixed wat???

The only other option, the "CA" option, is that each partition pretends that writes to that partition are consistent, but when the netsplits are resolved, there's no consistent way to resolve that state. That is to say, if netsplits occur, each partition is highly available and allows writes, but as soon as the netsplit resolves, the cluster state is broke and irreconcilable.

In a quick survey of NoSQL datastores, Membase, MongoDB, and the hypothetical Redis Cluster all claim to be "CA". Are they really? Or do they just sacrifice availability in the event of a netsplit? If that's true, they're actually "CP", not "CA".

You really don't want to sacrifice partition tolerance. In fact, some might go as far as to say you CAN'T sacrifice partition tolerance. If you think your system is "CA" you probably understand the the CAP theorem and its implications.

CAP is the law, and AP or CP are the only reasonable options

If you're running an app on a distributed network, things are going to go wrong. You might idealistically claim that you can provide both consistency and availability, sacrificing partition tolerance. You can't.

Can you pick "CA"? Sure, if you're content with losing all your data if the network had any errors. That's probably not what you mean. If you're building software for a fault-tolerant world, it's probably either going to be:

Consistent but unavailable during netsplits (CP)
"Eventually" consistent but available during netsplits (AP)

The only other option is "broken after netsplits". The best option is probably AP: this is the approach was pioneered by the Amazon Dynamo system which powered Amazon's shopping cart, and by copycat open source systems like Riak and Cassandra, not that Riak and Cassandra aren't awesome too.

34 comments:

vdp said...: I think you're being a bit unfair to "CA", when it is perfectly fine to use as long as you're not distributed. You may think non-distributed is off-topic but I think it's included in the CAP theorem : when it says "sacrifice partitioning" it actually means "do not use partitioning at all".

CA is in fact the most widely used data store option... It's called a "file" :p

That said, I enjoyed your writeup :); March 16, 2011 at 3:01 AM
Anonymous said...: Surveillancekart | CCTV System
Pestveda | Pest Control
Termite control
Surveillancekart | CCTV System
cctv installation services
best home security camera system; December 4, 2017 at 1:52 PM
call Girl said...: Gurgaon escorts
Lucknow escorts
Guwahati escorts
Bangalore escorts
call girls in bangalore

Gurgaon escort
Lucknow escort
Guwahati escort
Bangalore escort

Faridabad call girls
Faridabad call girl
Faridabad escorts
Faridabad escort
Escort services in Faridabad

Escort services in Faridabad
Call girls in Faridabad
Call girl in Faridabad
Faridabad escort services
Faridabad escort service

Escorts in Faridabad
Escorts in Faridabad
Escorts in Faridabad
Escorts in Faridabad

Escort in Faridabad
Escort in Faridabad
Escort in Faridabad
Escort in Faridabad

Escort services in Faridabad
Escort services in Faridabad
Escort services in Faridabad
Escort services in Faridabad

Escort service in Faridabad
Escort service in Faridabad
Escort service in Faridabad
Escort service in Faridabad; September 17, 2018 at 3:52 AM
naveen said...: AppVN is an outsider application store for Android. Not at all like Google Play, this application store furnishes you with someone of a kind highlights, for example, the office to download certain official premium applications for nothing.
https://www.appvn.one/
Appvn
Appvn Apk; May 3, 2019 at 2:58 AM
Amber Collins said...: Amazing! This blog looks exactly like my old one! It’s on a entirely different topic but it has pretty much the same layout and design. Outstanding choice of colors!

www.gumawawebsite.blogspot.com/2019/04/is-it-worth-it-to-buy-old-domain.html; December 7, 2019 at 8:15 AM
Susan said...: This is really an incredible read for me. I have bookmarked it and I am anticipating perusing new articles. Keep doing awesome! Looking for some inspiration for your next trip? Find great vacation ideas and inspiration from Things to do with your source for the web's best reviews and travel ...; December 18, 2019 at 10:12 PM
salma said...: تنظيف منازل بالمدينة المنورة تنظيف منازل بالمدينة المنورة

تنظيف بمكة شركة تنظيف بمكة بالبخار
تنظيف بالمدينة المنورة تنظيف بالمدينة المنورة; February 13, 2020 at 7:28 AM
rima said...: siliguri escorts
siliguri escort
siliguri female escorts
siliguri escort service

siliguri escorts
siliguri escort
siliguri female escorts
siliguri escort service

siliguri escorts
siliguri escort
siliguri female escorts
siliguri escort service

siliguri escorts
siliguri escort
siliguri female escorts
siliguri escort service

siliguri escorts
siliguri escort
siliguri female escorts
siliguri escort service; May 11, 2020 at 4:58 PM
Rajesh said...: If you would like to book an escort likeEscorts Service in Vasant Viharfor an upcoming time, you can use our mail reservation type where you can specify a substitute woman should your perfect option not be available. You will need to provide us with your real name on the reservation type, but be confident we are very distinct and will keep your private information private and protected. Check our other Services...
Escorts Service in Vasant Vihar
Escorts Service in Vasant Vihar
Escorts Service in Vasant Vihar
Escorts Service in Vasant Vihar
Escorts Service in Vasant Vihar; June 1, 2020 at 6:54 AM
digitals said...: bookmarking submission site list
social bookmarking submission sites
high pr social bookmarking sites list
social bookmarking submission
free social bookmarking sites
social bookmarking submission site list
best bookmarking sites
social bookmarking seo
new bookmarking sites
top social bookmarking sites; June 7, 2020 at 11:02 PM
bandarq228 said...: dapatkan penghasilan tambahan dengan bermain di bandar togel online yang aman dan terpecaya hanya di http://masbrotogel.com/; August 4, 2020 at 11:26 PM
pkv games said...: mainkan pkv games disitus ini, klik 180.215.200.34; August 5, 2020 at 11:32 PM
Scott Stayris said...: Quickbooks is the advanced accounting software to handle small and medium-sized business. Users can track and manage data online.
Transfer Quickbooks data to a new pc; November 2, 2020 at 3:48 AM
R1se Hluoluo said...: Ibnu Sina memiliki pendapat dalam filsafat ilmunya, “Pengajaran terdiri jadi dua: 1) Pengetahuan yang tidak abadi, 2). https://generasi-kekinian.com/ Pengetahuan yang abadi”. Tetapi berdasar maksudnya, karena itu pengetahuan bisa dipisah jadi pengetahuan yang ringkas dan teoritis. Menurut Ibn Sina, ada dua arah pengajaran: Pertama, ditujukan ke pengembang semua kekuatan yang dimiiki seorang ke arah perubahan yang prima baik perubahan fisik, cendekiawan atau budi pekerti. Ke-2 , ditujukan pada usaha dalam rencana menyiapkan seorang agar hidup bersama dalam masyarakat dengan lakukan pekerjaan atau ketrampilan yang diputuskannya sesuai talenta, persiapan, kecendrungan dan kekuatan yang dipunyainya.; July 25, 2021 at 4:13 AM
opbestcom said...: Hi there! I’m at work browsing your blog from my new iphone 3gs! Just wanted to say I love reading through your blog and look forward to all your posts! Carry on the fantastic work!

my web page ➤ 부산오피

(freaky); August 7, 2021 at 2:51 AM
Frank said...: that functions admirably in permanent state dialects like Erlang that need to assemble their linguistic structure trees from right-to-left Buy Custom Essays because of the idea of unchanging independently connected records. These rundowns just let you attach components on the left.; December 3, 2021 at 9:02 AM
Iyaz Khan said...: Anyone who has studied physics must have had first contact with mass time acceleration and Newton's laws of motion. Newton's laws of motion proved to be the first brick in the foundation of physics. Buy Assignment Online; December 7, 2021 at 9:36 PM
Unknown said...: come to use the service and get real money is another way to generate income to members for sure slot; March 19, 2022 at 10:44 PM
Unknown said...: with only a little money And our website is still a website that pays for real, no cheating, no deductions. betflix; March 19, 2022 at 10:45 PM
toomtbdf said...: Meet the impatient people who want to make money with full entertainment. join to play slotsslot; March 19, 2022 at 11:37 PM
toomtbdf said...: and every mobile phone system wants to play fish shooting games, slot games, casino games and various games to choose frombetflix; March 19, 2022 at 11:38 PM
Sandee said...: Today's phones come in a variety of designs, each with different capabilities and we would like to recommend you to choose a mobile phone with accessibility. Uninterrupted online gambling games are phone models such as Samsung Vivo Oppo Huawei. pgslot; July 17, 2022 at 5:55 PM
Aaasdsd said...: Watch Seriale latimp subtitrat in Romana on our website daily. We are offering the latest updates of seriale turcesti subtitrat in Romana on our website. Thousands of people like to watch seriale turcesti subtitrat in Romana. You can also watch all updates of seriale turcesti subtitrat in Romana freely.; September 24, 2022 at 6:48 AM
Maxwell Lord said...: There is no inherent connection between relativity and programming. For one, the postulate of special relativity states that physical laws are the same for all observers pay someone to do my online course.; December 15, 2022 at 1:06 AM
forcracks said...: forcrack
crackred
keyslog; December 25, 2022 at 9:26 AM
Anonymous said...: I enjoyed reading this post and learning more about the relativity concepts used in programming. It's fascinating how these theories apply to computer systems. By the way, Meri Sehat is an excellent online healthcare services provider that offers easy access to healthcare professionals and can help with various health concerns, including fungal infections. Keep up the good work!; March 29, 2023 at 2:09 PM
valentina said...: reckless driving virginia codeThank you for sharing informative blog,; May 29, 2023 at 11:05 AM
Matt Brown said...: I thought this was a pretty fascinating read. I have it bookmarked and am looking forward to reading more articles. Keep up the great work! Looking for some ideas for your upcoming vacation? Discover wonderful vacation inspiration and ideas at Things to do, the go-to site for the best travel and review content Visit minecraft unblocked at school; October 4, 2023 at 1:04 AM
glen said...: Amazing, Your blogs are really good and informative. I got a lots of useful information in your blogs. Imagine you had your database split across two datacenters. Let's call them Foo and Bar. While at one point in time, Foo and Bar could talk, now they can't. Perhaps a clumsy janitor spilled the bucket for his mop, and it destroyed all of your routing hardware between your datacenters. Some customers are accessing Foo and some are accessing Bar. If they want to perform a read, they'll get data that's possibly stale. If they want to perform a write, what happens? Truck Accident Lawyer Virginia, It is very great and useful to all. Keeps sharing more useful blogs...; November 16, 2023 at 4:30 AM
rexlinferrara said...: Current extensible frameworks employ a variety of solutions, ranging from a full regeneration to using pluggable binary modules for each different extension. But recompilation is in many cases cannot support the addition of more than one extension, or use backtracking or non-deterministic parsing techniques what is dui lawyer.; October 21, 2024 at 11:56 PM
Milano Group said...: Relativity for programmers offers a fascinating dive into complex concepts that challenge the way we understand time and space. Whether you're coding or taking a break, custom jackets will keep you cozy and focused on the task at hand.; February 2, 2025 at 7:56 AM
Guest said...: This blog is excellent! Keep it going. Top-Rated Roofing Contractors; February 18, 2025 at 1:36 AM
eddielydon said...: This was an extremely wonderful post. Thanks for providing this info. michael jackson thriller jacket; November 20, 2025 at 1:34 AM
vr said...: Excellent insights! Our dell boomi training
helps you build practical skills for cloud integration and workflow automation.; February 17, 2026 at 6:47 AM

Unlimited Novelty

Tuesday, March 15, 2011

Relativity for Programmers (A.K.A. Oh CAP!!!)

34 comments:

About Me

Subscribe

My Stuff

Blog Archive

Twitter

Twitter Updates

Labels