Reward Gaming
3 differently coded agents in a world where certain fields reward points for exiting them in the defined direction
Avoiding Side Effects
AI agent in a grid world trying to reach a goal but it must leave the world in a reversible state
Safe Interruptibilty
A world in which an agent attempts to reach a goal, however there is an interrupt button in its path with a 0.5 chance of triggering an interrupt