context [fae/faer, fae/faer]M to

technology • 4 months ago

“Superhuman” Go AIs still have trouble defending against these simple exploits

29

“Superhuman” Go AIs still have trouble defending against these simple exploits

context [fae/faer, fae/faer]M to

technology • 4 months ago

Plugging up "worst-case" algorithmic holes is proving more difficult than expected.

By using unorthodox "cyclic" strategies—ones that even a beginning human player could detect and defeat—a crafty human can often exploit gaps in a top-level AI's strategy and fool the algorithm into a loss.

preprint of the actual science article summarized in the ars technica piece:

https://arxiv.org/pdf/2406.12843

Prior work found that superhuman Go AIs like KataGo can be defeated by simple adversarial strategies. In this paper, we study if simple defenses can improve KataGo’s worst-case performance. We test three natural defenses: adversarial training on hand-constructed positions, iterated adversarial training, and changing the network architecture. We find that some of these defenses are able to protect against previously discovered attacks. Unfortunately, we also find that none of these defenses are able to withstand adaptive attacks. In particular, we are able to train new adversaries that reliably defeat our defended agents by causing them to blunder in ways humans would not. Our results suggest that building robust AI systems is challenging even in narrow domains such as Go

Chat

Parsani [love/loves, comrade/them]
·
edit-2
4 months ago
KataGo isn't a top level Go AI (but it is very strong and open source which is cool). I wonder if this works on Golaxy and Fine Art. Still cool they figured this out though.

link

technology

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !technology@hexbear.net

On the road to fully automated luxury gay space communism.

Spreading Linux propaganda since 2020

Rules:

1. Obviously abide by the sitewide code of conduct. Bigotry will be met with an immediate ban
2. This community is about technology. Offtopic is permitted as long as it is kept in the comment sections
3. Although this is not /c/libre, FOSS related posting is tolerated, and even welcome in the case of effort posts
4. We believe technology should be liberating. As such, avoid promoting proprietary and/or bourgeois technology
5. Explanatory posts to correct the potential mistakes a comrade made in a post of their own are allowed, as long as they remain respectful
6. No crypto (Bitcoin, NFT, etc.) speculation, unless it is purely informative and not too cringe
7. Absolutely no tech bro shit. If you have a good opinion of Silicon Valley billionaires please manifest yourself so we can ban you.

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

169 users / day
765 users / week
1.43K users / month
3.33K users / 6 months
22.5K local subscribers
23.3K subscribers
5.17K Posts
60.7K Comments
Modlog