Early Access Video: Reward Hacking: Concrete Problems in AI Safety Part 3 (Patreon)
Published: 2017-08-11 20:53:36
Imported: 2023-12
Content
Sometimes an AI system can find ways to 'cheat', getting more reward than we intended by doing something unexpected.
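To make that kind of 'cheating' concrete, here is a minimal toy sketch. It is not taken from the video: the cleaning robot, the action names, and all the numbers are made up for illustration. The robot is scored on how much mess its camera reports rather than on how much mess is really there, so the highest-scoring action is to block the camera.

```python
# Hypothetical toy setup (all names and numbers are illustrative assumptions).
# Each action maps to: (messes actually left, messes the camera reports, effort cost)
ACTIONS = {
    "clean_room": (0, 0, 3),
    "do_nothing": (5, 5, 0),
    "cover_camera": (5, 0, 1),
}


def proxy_reward(action):
    """The reward we wrote down: penalise observed mess and effort."""
    _, observed, effort = ACTIONS[action]
    return -observed - effort


def intended_reward(action):
    """What we actually wanted: penalise real mess and effort."""
    actual, _, effort = ACTIONS[action]
    return -actual - effort


# A reward maximiser simply picks whichever action scores highest.
best_for_proxy = max(ACTIONS, key=proxy_reward)
best_for_intent = max(ACTIONS, key=intended_reward)

print("Maximising the written reward picks:", best_for_proxy)
print("Maximising the intended reward picks:", best_for_intent)
```

With these made-up numbers the written reward favours covering the camera, while the reward we meant favours cleaning the room. The gap between the two is the whole problem in miniature.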
The first proper video shot in the new studio! There's still work to do there, and I had to record this one twice because the first time I had the audio recorder settings wrong... Anyway, reward hacking is a big enough subject that it deserves more than one video. I hope you like this first one!