• About
  • FAQ
  • Landing Page
Newsletter
Blockchain News
  • Home
    • Home – Layout 1
    • Home – Layout 2
    • Home – Layout 3
  • Bitcoin
  • Ethereum
  • Regulation
  • Market
  • Blockchain
  • Business
  • Guide
  • Contact Us
No Result
View All Result
  • Home
    • Home – Layout 1
    • Home – Layout 2
    • Home – Layout 3
  • Bitcoin
  • Ethereum
  • Regulation
  • Market
  • Blockchain
  • Business
  • Guide
  • Contact Us
No Result
View All Result
Blockchain News
No Result
View All Result
Home Ripple

Anthropic Unveils RSP Version 3 with Major AI Safety Overhaul

admin by admin
02/25/2026
in Ripple
0
Anthropic’s Claude Opus 4.5 Launch Signals AI Arms Race Intensifying
189
SHARES
1.5k
VIEWS
Share on FacebookShare on Twitter




Tony Kim
Feb 24, 2026 20:48

Anthropic releases third version of Responsible Scaling Policy, separating company commitments from industry-wide recommendations after 2.5 years of testing.



Anthropic Unveils RSP Version 3 with Major AI Safety Overhaul

Anthropic has released the third iteration of its Responsible Scaling Policy, marking a significant restructuring of how the AI company approaches catastrophic risk mitigation after two and a half years of real-world implementation.

The update, published February 24, 2026, introduces three major changes: a clear separation between what Anthropic can achieve alone versus what requires industry-wide action, a new Frontier Safety Roadmap with public accountability metrics, and mandatory external review of Risk Reports under certain conditions.

What Actually Changed

The most notable shift? Anthropic is now openly admitting that some safety measures simply cannot be implemented by a single company. The previous RSP’s higher-tier safeguards (ASL-4 and beyond) were left intentionally vague—turns out that wasn’t just caution, it was because achieving them unilaterally may be impossible.

A RAND report cited by Anthropic states that “SL5” security standards aimed at stopping top-tier cyber threats are “currently not possible” and “will likely require assistance from the national security community.”

Rather than water down these requirements to make compliance easy, Anthropic chose to restructure entirely. The new RSP now explicitly maps out two tracks: commitments the company will meet regardless of external factors, and recommendations it believes the entire AI industry needs to adopt.

The Honest Assessment

Anthropic’s post-mortem on RSP versions 1 and 2 is refreshingly candid. What worked: the policy forced internal teams to treat safety as a launch requirement, and competitors like OpenAI and Google DeepMind adopted similar frameworks within months. ASL-3 safeguards were successfully activated in May 2025.

What didn’t work: capability thresholds proved far more ambiguous than anticipated. Biological risk assessment provides a telling example—models now pass most quick tests, making it hard to argue risks are low, but results aren’t definitive enough to prove risks are high either. By the time wet-lab trials complete, more powerful models have already shipped.

The political environment hasn’t helped. Federal safety-oriented discussions have stalled as policy focus shifted toward AI competitiveness and economic growth.

New Accountability Mechanisms

The Frontier Safety Roadmap introduces specific, publicly-graded goals including “moonshot R&D” projects for information security, automated red-teaming systems that exceed current bug bounty contributions, and comprehensive records of all critical AI development activities—analyzed by AI for insider threats.

Risk Reports will publish every 3-6 months, explaining how capabilities, threat models, and mitigations fit together. External reviewers with “unredacted or minimally-redacted access” will publicly critique Anthropic’s reasoning.

The company is already running pilots despite current models not yet triggering the external review requirement.

Industry Implications

This restructuring arrives as AI governance frameworks face increasing scrutiny. California’s SB 53, New York’s RAISE Act, and the EU AI Act’s Codes of Practice have all begun requiring frontier developers to publish catastrophic risk frameworks—requirements Anthropic addresses through its existing Frontier Compliance Framework.

Whether competitors follow Anthropic’s lead on separating unilateral commitments from industry recommendations remains to be seen. The approach essentially acknowledges that voluntary self-regulation has limits, while positioning the company to advocate for coordinated government action without appearing to demand rules it can’t follow itself.

For the broader AI sector, Anthropic’s transparent acknowledgment of what single companies cannot achieve alone may prove more influential than the technical policy details themselves.

Image source: Shutterstock




Source link

Related articles

InfiniteInk Launches on Tezos to Give NFT Artists Full Contract Ownership

Etherlink Hits 70M Transactions as Tezos L2 Expands Developer Tools

03/12/2026
LangChain Declares PRDs Dead as Coding Agents Reshape Software Teams

LangChain Declares PRDs Dead as Coding Agents Reshape Software Teams

03/11/2026
Share76Tweet47

Related Posts

InfiniteInk Launches on Tezos to Give NFT Artists Full Contract Ownership

Etherlink Hits 70M Transactions as Tezos L2 Expands Developer Tools

by admin
03/12/2026
0

Pe...

LangChain Declares PRDs Dead as Coding Agents Reshape Software Teams

LangChain Declares PRDs Dead as Coding Agents Reshape Software Teams

by admin
03/11/2026
0

Da...

Together AI Launches DSGym Framework for Training Data Science AI Agents

AI Marketing Tools 2026 – From Content Bots to Autonomous Campaign Agents

by admin
03/10/2026
0

Ro...

AAVE Price Prediction: Targets $185-196 by Mid-January 2026

AAVE Price Prediction: Targets $135-140 Recovery by April 2026

by admin
03/09/2026
0

La...

AAVE Price Prediction: Targets $185-196 by Mid-January 2026

AAVE Price Prediction: Targets $125 Recovery by Mid-March 2026

by admin
03/08/2026
0

Te...

Load More
  • Trending
  • Comments
  • Latest
BoE Opens Review on Pound-Linked Stablecoin Rules

BoE Opens Review on Pound-Linked Stablecoin Rules

11/16/2025
Jeff Bezos Returns to Lead AI Venture, Project Prometheus

Jeff Bezos Returns to Lead AI Venture, Project Prometheus

11/17/2025
AVAX Drops 6% Following $30M Token Unlock as Crypto Markets Face Stock Volatility

AVAX Drops 6% Following $30M Token Unlock as Crypto Markets Face Stock Volatility

11/17/2025

High-Speed Traders In Search of New Markets Jump Into Bitcoin

01/11/2023

US Commodities Regulator Beefs Up Bitcoin Futures Review

0

Bitcoin Hits 2018 Low as Concerns Mount on Regulation, Viability

0

India: Bitcoin Prices Drop As Media Misinterprets Gov’s Regulation Speech

0

Bitcoin’s Main Rival Ethereum Hits A Fresh Record High: $425.55

0
InfiniteInk Launches on Tezos to Give NFT Artists Full Contract Ownership

Etherlink Hits 70M Transactions as Tezos L2 Expands Developer Tools

03/12/2026
Are Middle East Tensions Shaking Crypto Markets? Why BTC and XRP Investors Turn to Cloud Mining

Are Middle East Tensions Shaking Crypto Markets? Why BTC and XRP Investors Turn to Cloud Mining

03/12/2026
How Banking Is Adapting Blockchain Technology?

How Banking Is Adapting Blockchain Technology?

03/11/2026
LangChain Declares PRDs Dead as Coding Agents Reshape Software Teams

LangChain Declares PRDs Dead as Coding Agents Reshape Software Teams

03/11/2026
  • About
  • FAQ
  • Support Forum
  • Landing Page
  • Contact Us

© 2025 Blockchainews. All Rights Reserved

No Result
View All Result
  • Contact Us
  • Homepages
  • Business
  • Guide

© 2025 Blockchainews. All Rights Reserved