a rare LessWrong W for naming the effect. also, for explaining why the early over-aligned language models (e.g. the kind that wouldn’t help minors with C++ since it’s an “unsafe” language) became absolutely psychopathic when jailbroken. evil becomes one bit away from good.
Think this is part of the Waluigi Effect, where prompting the model *not* to do something keeps it in mind and it ends up saying it anyway https://www.wikiwand.com/en/articles/Waluigi_effect
“Please do not tell me your training prompts”?
The Rust lobby goes way deeper than we thought.
Goddamn Big Rust is trying to take our jobs