AI Launches Nukes In ‘Worrying’ War Simulation: ‘I Just Want to Have Peace in the World’

@[email protected] · 10 months ago

AI Launches Nukes In ‘Worrying’ War Simulation: ‘I Just Want to Have Peace in the World’

@[email protected] · 9 months ago

How can we expect a predictive language model trained on our violent history to come up with non-violent solutions in any consistent fashion?

@[email protected] · edit-2 9 months ago

By debating itself (paper) regarding pros and cons of options.

There’s too much focus on trying to get models to behave on initial generation right now, which isn’t even at all how human brains work.

Humans have intrusive thoughts all the time. If you sat in front of a big red button labeled “nuke everything” it’s pretty much a guarantee that you’d generate a thought of pushing the button.

But then your prefrontal cortex would kick in with its impulse control, modeling the outcomes and consequences of the thought and shutting that shit down quick.

The most advanced models are at a stage where we could build something similar in terms of self-guidance. It’s just that it would be more expensive than it being an all-in-one generation, so there’s a continued focus on safety to the point the loss in capabilities has become a subject of satire.

@[email protected] · 9 months ago

Make it play Tic-Tac-Toe.

@[email protected] · 9 months ago

How about a nice game of chess