The Stockfish chess engine requires patches to pass a test: it must beat the old version a sufficient proportion of the time.
This introduces an interesting problem: what if a patch set makes it stronger, but applied individually they make it worse?
miniblog.
Related Posts
@cstanhope runkit: https://runkit.com/ is a notebook-style UI that has stronger reproducibility guarantees AIUI.
Jupyter is still exploring this space and I don't think there's a standard tool for enforcing reproducibility there.
Computers are stronger than humans at chess and go. They're even winning at arimaa, which was specifically intended to be hard for machines.
Watching AI play Quake-style CTF recently, I've realised I no longer have intuitions on which games are hard for computers.
Really cute video demo of the hypertext documentation in Symbolics Lisp Machines: https://youtu.be/7DxYj32cvoE
The speaker carefully explains why links are a good thing! It also lists history in a pane (rather than a back button) and has a stronger notion of navigating hierarchies.
