Character.ai Faces Lawsuit After Teen’s Suicide

Stopthatgirl7@lemmy.world · 2 months ago

Trailblazing Braille Taser@lemmy.dbzer0.com · 2 months ago

Sorry if I offended you? My point is just that it’s possible to make a crappy “is forbidden topic” classifier with a regular expression. Probably good enough to completely obliterate the topic in chats between humans and bots. Definitely good enough to claim you attempted to develop guardrails for vulnerable users.

jdeath@lemm.ee · 2 months ago

Have you ever tried to censor chats before? People will easily get around a regex filter.

Trailblazing Braille Taser@lemmy.dbzer0.com · 2 months ago

In chats between humans, I agree that it’s near pointless to try to censor. In chats between humans and LLMs, I suspect you can get pretty far with regex or badwords.txt filtering. That said, I haven’t tried, so who knows.
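For what it's worth, here is a rough sketch of the kind of regex/badwords filter being discussed. Everything in it is an assumption made for illustration: the term list, the fallback text, and the names `is_forbidden_topic` and `guard_reply` are hypothetical, not anything Character.ai is known to use. It checks both sides of a human-to-bot exchange and swaps the bot's reply for a canned message on a match.

```python
import re

# Hypothetical term list; a real filter would need a much larger, reviewed list.
FORBIDDEN_TERMS = ["suicide", "kill myself", "self-harm", "end my life"]

# Hypothetical canned response returned instead of the bot's reply.
FALLBACK = "I can't talk about that here. Please reach out to someone you trust."


def _term_to_regex(term: str) -> str:
    # Escape each word and allow any run of whitespace between words.
    return r"\s+".join(re.escape(word) for word in term.split())


# One case-insensitive alternation, anchored on word boundaries.
_PATTERN = re.compile(
    r"\b(?:" + "|".join(_term_to_regex(t) for t in FORBIDDEN_TERMS) + r")\b",
    re.IGNORECASE,
)


def is_forbidden_topic(message: str) -> bool:
    """Return True if the message contains any listed term."""
    return _PATTERN.search(message) is not None


def guard_reply(user_message: str, bot_reply: str) -> str:
    """Replace the bot's reply when either side of the exchange trips the filter."""
    if is_forbidden_topic(user_message) or is_forbidden_topic(bot_reply):
        return FALLBACK
    return bot_reply
```

This also shows the objection raised above: a trivially obfuscated spelling such as "su1cide" passes straight through, so a filter like this mostly catches text the model itself writes in plain language, not a person who is trying to evade it.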

nytimes.com