Detecting and blocking jailbreak tactics has long been challenging, making this advancement particularly valuable for ...
The evaluation, performed by LatticeFlow AI, reveals that DeepSeek's distilled models lag behind proprietary models in cybersecurity and bias while excelling in toxicity prevention. COMPL-AI, the first ...
Even companies' most permissive AI models have sensitive topics their creators would rather they not talk about. Think weapons of ...
Anthropic is hosting a temporary live demo version of a Constitutional Classifiers system to let users test its capabilities.
The new Claude safeguards have technically already been broken, but Anthropic says this was due to a glitch and is encouraging users to try again.
In late January, DeepSeek claimed that it built its own foundation model for less than $6 million. The response from ...
AI firm Anthropic has developed a new line of defense against a common kind of attack called a jailbreak. A jailbreak tricks ...
Claude model maker Anthropic has released a new system, Constitutional Classifiers, that it says can "filter the ...
Venture capitalists plowed money into AI startups like OpenAI and Anthropic, but the rise of the Chinese AI startup DeepSeek ...
Paid subscribers to ChatGPT nearly tripled to 15.5 million last year from 5.8 million a year earlier, OpenAI recently told ...