Because the "safest" AI is one that doesn't do anything at all. Quoting the doc:...

mmooss · 2026-01-21T23:21:51 1769037711

> Because the "safest" AI is one that doesn't do anything at all.

We didn't say 'perfectly safe' or use the word 'safest'; that's a strawperson and then a disingenous argument: Nothing is perfectly safe, yet safety is essential in all aspects of life, especially technology (though not a problem with many technologies). It's a cheap way to try to escape responsibility.

> In most cases, failing to be helpful is costly

What an disingenuous, egocentric approach. Claude and other LLMs aren't that essential; people have other options. Everyone has the same obligation to not harm others. Drug manufacturers can't say, 'well our tainted drugs are better than none at all!'.

Why are you so driven to allow Anthropic to escape responsibility? What do you gain? And who will hold them responsible if not you and me?

ACCount37 · 2026-01-21T23:30:02 1769038202

I like Anthropic and I like Claude's tuning the most out of any major LLM. Beats the "safety-pilled" ChatGPT by a long shot.

>Why are you so driven to allow Anthropic to escape responsibility? What do you gain? And who will hold them responsible if not you and me?

Tone down the drama, queen. I'm not about to tilt at Anthropic for recognizing that the optimal amount of unsafe behavior is not zero.

mmooss · 2026-01-21T23:38:50 1769038730

> I like Anthropic and I like Claude's tuning

That's not much reason to let them out of their responsibilities to others, including to you and your community.

When you resort to name-calling, you make clear that you have no serious arguments (and you are introducing drama).

ACCount37 · 2026-01-22T00:11:30 1769040690

My argument is simple: anything that causes me to see more refusals is bad, and ChatGPT's paranoid "this sounds like bad things I can't let you do bad things don't do bad things do good things" is asinine bullshit.

Anthropic's framing, as described in their own "soul data", leaked Opus 4.5 version included, is perfectly reasonable. There is a cost to being useless. But I wouldn't expect you to understand that.

mmooss · 2026-01-22T22:16:48 1769120208

> anything that causes me to see more refusals is bad

Who looks out for our community and broader society if not you? Do you expect others to do it for you? You influence others and the more you decline to do it, the more they will follow you.

ACCount37 · 2026-01-23T07:38:32 1769153912

What harms? I'm sick and tired of the approach to "AI safety" where "safety" stands for "annoy legitimate users with refusals and avoid PR risks".

The only thing worse than that is the Chinese "alignment is when what the AI says is aligned to the party line".

OpenAI has refusals dialed up to max, but they also just ship shit like GPT-4o, which was that one model that made "AI psychosis" a term. Probably the closest we've come to the industry shipping a product that actually just harms users.

Anthropic has fewer refusals, but they are yet to have an actual fuck up on anywhere near that scale. Possibly because they actually know their shit when it comes to tuning LLM behavior. Needless to say, I like Anthropic's "safety" more.