Lugh@futurology.today (M) to Futurology@futurology.today · English · 1 year ago

Two-faced AI language models learn to hide deception - ‘Sleeper agents’ seem benign during testing but behave differently once deployed. And methods to stop them aren’t working.




sbv
1 year ago
So they’re saying AI is software?

Maybe Volkswagen will start using it in their emissions control systems.