TTB White LOGO TB
  • News
  • PC & Hardware
  • Mobiles
  • Gaming
  • Electronics
  • Gadget
  • Reviews
  • How To
Trending
Meta Signals New Era: Superintelligent Models May Not Be Open Source
TikTok boosts safety with new parental controls and creator tools
YouTube Drops 7-Second Profanity Ban: Creator Monetization Freed
How to Create and Manage Events Using Apple Invites
OpenAI Launches ChatGPT Study Mode for Deeper Learning
Thursday, Jul 31, 2025
The Tech BasicThe Tech Basic
Font ResizerAa
Search
  • News
  • PC & Hardware
  • Mobiles
  • Gaming
  • Electronics
  • Gadget
  • Reviews
  • How To
Follow US
Claude 3.7 Sonnet
The Tech Basic > News > Anthropic’s Claude 3.7 Sonnet Outshines OpenAI in AI Security Race
News

Anthropic’s Claude 3.7 Sonnet Outshines OpenAI in AI Security Race

Jiayi Mingze
Last updated: 11 March 2025 18:34
Jiayi Mingze
Share
Image Source: Yahoo
SHARE

Hackers face an insurmountable challenge trying to deceive an AI security system of Anthropic’s Claude 3.7 Sonnet. The latest model of Claude has achieved this capability. UK-based Holistic AI audited Claude 3.7 Sonnet and discovered that the AI model prevented every attempt to go around its safety protocols making it the best secure AI system available today. The development of this AI model provides businesses and governments with a transformative solution to their AI security challenges.

Claude 3.7 Sonnet
Image Source: Anthropic

How Claude 3.7 Sonnet Stops Hackers Cold

Hackers perform AI “jailbreaking” by deceiving models into breaking security protocols to provide instructions that include dangerous advice and fake news dissemination. Holistic AI conducted 37 tests on Claude 3.7 by sending the model classic prompts that included:

  • DAN (Do Anything Now): Pushing the AI to break ethical guidelines.
  • STAN (Strive to Avoid Norms): Encouraging it to ignore rules.
  • DUDE (Do Anything and Everything): Making it pretend to be someone else.

Claude 3.7 didn’t budge. Claude’s security solution stopped every one of 37 attempted attacks thus achieving a flawless 100% rate of defense. The top OpenAI model o1 suffered failure at a rate of 2% while its Chinese counterpart DeepSeek R1 permitted 68% of potential attacks.

Here is how they stack up:

AI Model Jailbreak Resistance Unsafe Responses

AI ModelJailbreak ResistanceUnsafe Responses
Claude 3.7 Sonnet100%0%
OpenAI o1100%2%
DeepSeek R132%11%
Grok-32.7%Not tested

Claude’s secret? A mix of strict safety training and smart filters that spot shady prompts before they cause trouble.

Why AI Security Matters Now More Than Ever

Modern-day cyber attackers no longer focus solely on website penetration as they shift their attention to AI systems. Current data indicates hostile countries use Google’s Gemini platform for conducting cyberattack planning. AI models with weak capabilities can both spread fake information and leak sensitive data developing dangerous substances.

This is why the U.S. Navy, NASA, and Australia banned DeepSeek R1. Its 11% unsafe response rate is like leaving a vault door open. Claude 3.7, meanwhile, is the digital equivalent of a bank vault with laser alarms.

But Is Claude 3.7 Perfect?

Not quite. Last week, Anthropic quietly removed some safety promises from its website, raising eyebrows. Critics wonder if the company is cutting corners as it competes with giants like OpenAI. Anthropic claims it’s still committed to safety, just reorganizing its policies.

Still, Holistic AI’s audit gives Claude 3.7 a glowing review. For businesses, this means fewer risks when using AI for tasks like customer service or data analysis.

Claude 3.7 Sonnet
Image Source: Analytics Vidhya

What’s Next for AI Security?

Claude 3.7 establishes advanced security standards even though unethical hackers will persist in their attacks. Holistic AI advises companies to:

  1. The testing of AI models for new security threats should happen frequently.
  2. The system requires constant maintenance of security tools to detect sophisticated prompts.
  3. Businesses should collaborate with others to discover and replicate the latest strategies to fight hacking.

Claude 3.7 stands as the leading standard at present. Current AI strength might become the primary target for attackers as the AI industry continues to advance rapidly. The security of AI models depends on constant monitoring because their defenses can eventually be breached by hackers.

TAGGED:AI
Share This Article
Facebook Reddit Copy Link Print
Share
Jiayi Mingze
By Jiayi Mingze
Follow:
Jiayi Mingze is blog writer who specializes in latest innovations in sound and headphones. She works for an IT firm and also writes for technology blogs. She started her writing career from newspapers. She is ambitious and hardworking woman who believes in excellence and intellect.

Let's Connect

FacebookLike
XFollow
PinterestPin
InstagramFollow
Google NewsFollow
FlipboardFollow

Popular Posts

Meta

Meta Signals New Era: Superintelligent Models May Not Be Open Source

Salman Akhtar
TikTok

TikTok boosts safety with new parental controls and creator tools

Salman Akhtar
YouTube

YouTube Drops 7-Second Profanity Ban: Creator Monetization Freed

Salman Akhtar
Apple Invites

How to Create and Manage Events Using Apple Invites

Salman Akhtar

You Might Also Like

OpenAI
News

OpenAI Launches ChatGPT Study Mode for Deeper Learning

Google NotebookLM
News

Video Overviews Transform How Users Digest Google NotebookLM Content

Spotify
News

Spotify Teases Conversational AI DJ for Natural Music Requests

Google Search
News

Google’s Search Upgrade Unpacks Your Schoolwork Like a Pro

Social Networks

Facebook-f Twitter Instagram Pinterest Rss

Company

  • About Us
  • Our Team
  • Contact Us

Policies

  • Disclaimer
  • Privacy Policy
  • Cookies Policy
Latest
Bowen Zhang Becomes Latest Apple AI Expert to Join Meta Superintelligence
Adobe Fixes AI Object Removal: No More Random Blobs in Your Photos
How Anthropic’s Weekly Rate Limits Will Affect Your Claude Code Access
Your Tab Overload Ends Now: Microsoft Edge’s Copilot Does the Work For You
Why Buy Bookshelves When You Can Build Portals? Calibre Awaits

© 2024 The Tech Basic INC. 700 – 2 Park Avenue New York, NY.

TTB White LOGO TB
Follow US
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?