Detecting and blocking jailbreak tactics has long been challenging, making this advancement particularly valuable for ...
The new Claude safeguards have, technically, already been broken, but Anthropic says this was due to a glitch and invites users to try again.
Claude model maker Anthropic has released a new system of Constitutional Classifiers that it says can "filter the ...
AI firm Anthropic has developed a new line of defense against a common kind of attack called a jailbreak. A jailbreak tricks ...
Anthropic is hosting a temporary live demo version of a Constitutional Classifiers system to let users test its capabilities.
Thomson Reuters integrates Anthropic's Claude AI into its legal and tax platforms, enhancing CoCounsel with AI-powered tools that process professional content through secure Amazon cloud ...
The massive AI infrastructure project announced by US President Donald Trump raised questions about overspending. The launch ...
COMPL-AI, the first evaluation framework for Generative AI models under the EU AI Act, has flagged critical compliance gaps in DeepSeek's distilled models.
The evaluation, performed by LatticeFlow AI, reveals DeepSeek distilled models lag behind proprietary models in cybersecurity and bias, while excelling in toxicity prevention.
The framework is structured around a three-stage process: anticipating risks, evaluating and mitigating risks, and deciding ...
Also in today’s newsletter, Samsung chair cleared of fraud and stock manipulation, and the latest blow against USAID ...
Even companies' most permissive AI models have sensitive topics their creators would rather not talk about. Think weapons of ...