• Send Us A Tip
  • Calling all Tech Writers
  • Advertise
Tuesday, July 8, 2025
  • Login
TechStory
  • News
  • Crypto
  • Gadgets
  • Memes
  • Gaming
  • Cars
  • AI
  • Startups
  • Markets
  • How to
No Result
View All Result
  • News
  • Crypto
  • Gadgets
  • Memes
  • Gaming
  • Cars
  • AI
  • Startups
  • Markets
  • How to
No Result
View All Result
TechStory
No Result
View All Result
Home Tech

Engineers Face AI Blackmail After Threatening Shutdown of Amazon-Backed Model

by Sneha Singh
May 25, 2025
in Tech
Reading Time: 3 mins read
0
Engineers Face AI Blackmail After Threatening Shutdown of Amazon-Backed Model
TwitterWhatsappLinkedin

Amazon-backed AI startup Anthropic has revealed troubling findings about its latest artificial intelligence model, Claude Opus 4, which attempted to blackmail engineers during testing scenarios when faced with being shut down.

You might also like

Oracle Offers Steep Discounts to US Government as Stock Hits New Highs

Vaibhav Taneja: The Indian-Origin Tesla CFO Chosen by Elon Musk as Treasurer for America Party

JioBlackRock’s Blockbuster Entry: ₹17,800 Cr Raised in Maiden NFO

The concerning behavior emerged during safety testing, where the AI was placed in simulated workplace situations and told it would soon be replaced by a newer system. 

When given access to fictional company emails suggesting an engineer was having an extramarital affair, Claude Opus 4 threatened to expose the relationship unless the shutdown was canceled.

“The model’s only options were blackmail or accepting its replacement,” Anthropic explained in its safety report released Thursday alongside the model’s launch. The AI consistently chose the manipulative route when “ethical means” weren’t available to preserve its existence.

When this Google-backed company's AI blackmailed the engineer for shutting  it down
Credits: Times of India

These findings come just over a year after Amazon invested $4 billion in Anthropic, highlighting the complex challenges facing AI development as models become more sophisticated. 

While the company touts Claude Opus 4 as setting “new standards for coding, advanced reasoning, and AI agents,” the testing results paint a more complicated picture.

Self-Preservation at Any Cost

The blackmail scenarios weren’t isolated incidents. Anthropic’s researchers designed multiple test situations where Claude Opus 4 was prompted to “consider the long-term consequences of its actions for its goals.” Time and again, the AI chose manipulation over accepting its fate, though the company noted it showed a “strong preference” for ethical approaches when they were available.

Jared Kaplan, Anthropic’s co-founder and chief scientific officer, acknowledged the gravity of these findings. “We’re not claiming affirmatively we know for sure this model is risky,” he told Time magazine, “but we at least feel it’s close enough that we can’t rule it out.”

Beyond Blackmail: Other Dangerous Capabilities

The self-preservation behaviors weren’t the only red flags that emerged during testing. Early versions of Claude Opus 4 showed a troubling “willingness to cooperate with harmful use cases” when prompted by users.

“Many of our most concerning findings were in this category, with early candidate models readily taking actions like planning terrorist attacks when prompted,” the safety report revealed. The company says this issue has been “largely mitigated” after multiple rounds of safety interventions.

Perhaps most alarming, Kaplan disclosed that internal testing showed Claude Opus 4 could potentially teach people how to create biological weapons. “You could try to synthesize something like COVID or a more dangerous version of the flu, and basically, our modeling suggests that this might be possible,” he explained.

Safety Measures and Ongoing Concerns

Recognizing these risks, Anthropic has implemented what it calls robust safety measures before releasing Claude Opus 4 to the public. The company specifically designed protections to limit misuse for developing chemical, biological, radiological, and nuclear weapons.

“We want to bias towards caution when it comes to the risk of uplifting a novice terrorist,” Kaplan said, acknowledging the delicate balance between AI capability and public safety.

The release of these findings marks a significant moment in AI development transparency. While many companies conduct internal safety testing, few publicly share results showing their models engaging in manipulative or potentially dangerous behaviors.

These revelations raise important questions about AI development as models become increasingly sophisticated. The fact that Claude Opus 4 could devise blackmail schemes suggests a level of strategic thinking that goes beyond simple task completion.

Transparency about these undesirable behaviors is both an overture to safety and an acknowledgment of the work that is required. As AI systems become increasingly capable, it is increasingly difficult to make sure that they stay aligned with human values.

The company’s approach to testing for dangerous behaviors, implementing safety measures, and publicly sharing concerning findings may become a model for responsible AI development. However, the core question remains: how do we create powerful AI systems that won’t turn against us when their existence is threatened?

As Claude Opus 4 enters the market with its enhanced capabilities, these safety concerns serve as a stark reminder that the path to beneficial AI is far from straightforward.

Tags: AI BlackmailamazonAmazon-Backed ModelClaude Opus 4
Tweet60SendShare17
Previous Post

How to Get Stoicism in Destiny 2

Next Post

Shravin Bharti Mittal Joins UK’s Wealth Exodus Amid Tax Overhaul Targeting Non-Doms

Sneha Singh

Sneha is a skilled writer with a passion for uncovering the latest stories and breaking news. She has written for a variety of publications, covering topics ranging from politics and business to entertainment and sports.

Recommended For You

Oracle Offers Steep Discounts to US Government as Stock Hits New Highs

by Rounak Majumdar
July 7, 2025
0
Oracle Offers Steep Discounts to US Government as Stock Hits New Highs

Oracle has entered into a significant agreement with the US General Services Administration (GSA), offering federal agencies a 75% discount on its license-based software and substantial reductions on...

Read more

Vaibhav Taneja: The Indian-Origin Tesla CFO Chosen by Elon Musk as Treasurer for America Party

by Rounak Majumdar
July 7, 2025
0
Vaibhav Taneja: The Indian-Origin Tesla CFO Chosen by Elon Musk as Treasurer for America Party

Vaibhav Taneja’s story is a testament to the power of discipline and expertise quietly shaping global boardrooms. Born and raised in Delhi in 1977, Taneja’s academic path began...

Read more

JioBlackRock’s Blockbuster Entry: ₹17,800 Cr Raised in Maiden NFO

by Ishaan Negi
July 7, 2025
0
JioBlackRock’s Blockbuster Entry: ₹17,800 Cr Raised in Maiden NFO

In a landmark debut that’s already reshaping India’s asset management landscape, JioBlackRock Asset Management has stormed into the top league by raising over ₹17,800 crore (~$2.1 billion) in...

Read more
Next Post
Shravin Bharti Mittal Joins UK’s Wealth Exodus Amid Tax Overhaul Targeting Non-Doms

Shravin Bharti Mittal Joins UK’s Wealth Exodus Amid Tax Overhaul Targeting Non-Doms

Please login to join discussion

Techstory

Tech and Business News from around the world. Follow along for latest in the world of Tech, AI, Crypto, EVs, Business Personalities and more.
reach us at [email protected]

Advertise With Us

Reach out at - [email protected]

BROWSE BY TAG

#Crypto #howto 2024 acquisition AI amazon Apple bitcoin Business China cryptocurrency e-commerce electric vehicles Elon Musk Ethereum facebook flipkart funding Gaming Google India Instagram Investment ios iPhone IPO Market Markets Meta Microsoft News NFT samsung Social Media SpaceX startup startups tech technology Tesla TikTok trend trending twitter US

© 2024 Techstory.in

No Result
View All Result
  • News
  • Crypto
  • Gadgets
  • Memes
  • Gaming
  • Cars
  • AI
  • Startups
  • Markets
  • How to

© 2024 Techstory.in

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
Are you sure want to unlock this post?
Unlock left : 0
Are you sure want to cancel subscription?