Read Uncommitted

Read uncommitted is a consistency model which prohibits dirty writes, where two transactions modify the same object concurrently before committing. In the ANSI SQL specification, read uncommitted is presumed to be the default, most permissive consistency model, allowing all behaviors; however, Berenson et al argued that it should, in fact, prohibit dirty writes.

<aside> 💡 [ANSI 中声称 Read Uncommitted 不提供任何约束，这是错误的，实际上至少提供了 P0。](https://laisky.notion.site/ANSI-P0-71572cc221a644aca547e0bb3d95e310)

</aside>

Read uncommitted is a transactional model: operations (usually termed “transactions”) can involve several primitive sub-operations performed in order. It is also a multi-object property: operations can act on multiple objects in the system.

Read uncommitted can be totally available: in the presence of network partitions, every node can make progress. Without sacrificing availability, you can also ensure that transactions do not read uncommitted state by choosing the stronger read committed.

Note that read uncommitted does not impose any real-time constraints. If process A completes write w, then process B begins a read r, r is not necessarily guaranteed to observe w. For a transactional model that provides real-time constraints, consider strict serializability.

Moreover, read uncommitted does not require a per-process order between transactions. A process can observe a write, then fail to observe that same write in a subsequent transaction. In fact, a process can fail to observe its own prior writes, if those writes occurred in different transactions.

<aside> 💡 not requre a per-process order between transactions

即使在同一个 process 内也不保证任何跨事务的顺序，一个 process 的后一个事务的读可能看不见上一个事务的写。

</aside>

Like Serializability, read uncommitted allows Pathological Ordering. For instance, a read uncommmitted database can always return the empty state for any reads, by appearing to execute those reads at time 0. It can also discard write-only transactions by reordering them to execute at the very end of the history, after any reads. Operations like increments can also be discarded, assuming the result of the increment is never observed. Luckily, most implementations don’t seem to take advantage of these optimization opportunities.

Formally

The ANSI SQL 1999 spec places essentially no constraints on the behavior of read uncommitted. Any and all weird behavior is fair game.

However, as Berenson, Bernstein, et al observed, the ANSI specification allows multiple intepretations, and one of those interpretations (the "anomaly interpretation) still admits nonserializable histories for “serializable” systems. Instead, we prefer Adya’s formalization of transactional isolation levels, which provides a concise definition of the preventative interpretation. In this model, read uncommitted prohibits:

$P0(Dirty Write): w_1(x)...w_2(x)$：两个 tx 同时修改了某个数据，然后其中一个 tx 执行回滚，无法确认该回滚到什么值。（P0 Dirty Write）

but allows:

$P1(Dirty Read): w_1(x)...r_2(x)$：读未提交
$P2(Fuzzy Read): r_1(x)...w_2(x)$：两次读到的值不一致（不可重复读）
$P3(Phantom): r_1(P)...w_2(y\ in\ P)$：幻读，两次读到的数据量不一致。

Here w denotes a write, r denotes a read, and subscripts indicate the transaction which executed that operation. The notation “…” indicates a series of micro-operations except for a commit or abort. P indicates a predicate.