Spinning the Battle to control AI

Hal writes:

Hassabis often cites Breakout, a videogame for the Atari console. A Breakout player controls a bat that she can move horizontally across the bottom of the screen, using it to bounce a ball against blocks that hover above it, destroying them on impact. The player wins when all blocks are obliterated. She loses if she misses the ball with the bat. Without human instruction, DeepMind’s program not only learned to play the game but also worked out how to cannon the ball into the space behind the blocks, taking advantage of rebounds to break more blocks. This, Hassabis says, demonstrates the power of reinforcement learning and  the preternatural ability of DeepMind’s computer programs.

It’s an impressive demo. But Hassabis leaves a few things out. If the virtual paddle were moved even fractionally higher, the program would fail. The skill learned by DeepMind’s program is so restricted that it cannot react even to tiny changes to the environment that a person would take in their stride – at least not without thousands more rounds of reinforcement learning. But the world has jitter like this built into it.

Hassabis leave more than a few things out, as Dr Beth Singler also points out.

Anyone who played breakout enough in the early 90s found that the thing that demonstrates ‘the preternatural ability of DeepMind’ was possible and effective at winning. It may not be immediately obvious to those who haven’t played the game, and who don’t understand that this is a natural result of playing games with scores. Whether it was new to engineers in 2015 is unclear, but it was new enough to the DeepMind PR people that they persuaded journalists it was actually novel – did anyone who said so ever play the original game?

This novelty bias also applies to the AlphaGo move that DeepMind PR pushed in 2016 – that ‘unique move’ appears a bunch of times in the KGS archive.

As Whitehall Departments think they should sprinkle AI in their spending round submissions, there is another lesson in Hal’s words.

When we talk about expensive deepmind generalists over cheap NHS specialists, this is what we man. The civil service also believes expensive generalists over their own specialists too. In both cases, the expensive generalists assume what is new to them is new to everyone. In medicine, the specialists know better, but it’s harder to write fawning articles about that.

posted: 14 Mar 2019

Email Alerts for changes to the law made via Statutory Instruments

As “EU Exit” looms, the disconnect of Statutory Instruments from the normal tools of Parliamentary scrutiny is now glaring.

TheGovernmentSays.com now does email alerts for keywords in SIs – when the Government lays something you care about, you can get email about it.

Search terms that work best are the words that Parliaments use in titles (so “EU Exit” rather than “Brexit”; “Health Charge” rather than “Health Surcharge”), or the names of Acts that you’re interested in changes for (those details may be buried in the text, but we search everything).

The links go to the very nice new Parliamentary SI tracker, which tells you everything you want to know about any particular SI, but it gives you no way to find which SIs you are interested in beyond reading every one of them. Some of them are 600 pages long.

By request, there is now an RSS feed of SIs with descriptions.

While it has been running reliably since 2004, the design hasn’t been changed for a decade… it shows. But it is free, for everyone, hopefully forever.

posted: 08 Feb 2019