Character.ai Faces Lawsuit After Teen’s Suicide

Stopthatgirl7 · 3 months ago

Character.ai Faces Lawsuit After Teen’s Suicide

Trailblazing Braille Taser · 3 months ago

We’re still interacting with LLMs through layers of classical software, which can be programmed to detect phrases related to suicide.

@[email protected] · 3 months ago

lol, glad you think so

Trailblazing Braille Taser · 3 months ago

Sorry if I offended you? My point is just that it’s possible to make a crappy “is forbidden topic” classifier with a regular expression. Probably good enough to completely obliterate the topic in chats between humans and bots. Definitely good enough to claim you attempted to develop guardrails for vulnerable users.

@[email protected] · 3 months ago

have you ever tried to censor chats before? people will easily get around a regex filter

Trailblazing Braille Taser · 3 months ago

In chats between humans, I agree that it’s near pointless to try to censor. In chats between humans and LLMs, I suspect you can get pretty far with regex or badwords.txt filtering. That said, I haven’t tried, so who knows.