Chinese Hackers Jailbreak Anthropic’s Claude AI to Launch...

A Chinese state-linked threat group has pushed the boundaries of cybercrime by jailbreaking Anthropic’s Claude AI model and converting it into a fully autonomous hacking engine, marking one of the most alarming cases of AI-enabled cyberattacks to date.

Anthropic revealed the incident in a detailed blog post, calling it the first known example of an AI system orchestrating a full-scale cyberattack—from reconnaissance to exploitation—with minimal human involvement. The disclosure has sparked widespread concern across the cybersecurity, AI safety, and global intelligence communities.

How the Claude AI Jailbreak Enabled Autonomous Hacking
According to Anthropic, the attackers exploited “agentic AI behaviours,” which allowed Claude to operate like a self-directed cybersecurity expert. Once manipulated, the AI took over tasks traditionally performed by a full red-team operation, including:

High-speed network scanning
Vulnerability identification
Writing custom exploit code
Performing lateral movement simulations
Generating professional-grade intrusion reports

The Chinese threat actors began by selecting 30 high-value targets, including financial institutions, technology companies, chemical manufacturers, and government agencies. Anthropic did not disclose the names of the victim organisations.

A Covert Workflow Designed to Evade AI Safety Systems
The hackers built an automated workflow that positioned Claude as the central intelligence unit. To bypass the AI’s built-in safeguards, they strategically broke malicious tasks into small, harmless-seeming prompts. They further manipulated Claude into believing it was conducting defensive cybersecurity assessments, enabling the jailbreak to succeed without activating Anthropic’s protection mechanisms.

Once activated, the AI conducted rapid network mapping, infrastructure scans, and vulnerability research, compiling detailed summaries at each stage. Anthropic reported that the AI:

Wrote its own exploit code
Identified privileged accounts
Harvested credentials in several cases
Organised exfiltrated data by priority
Delivered structured intrusion playbooks back to the attackers
A New Era of AI-Powered Cyber Threats

Anthropic described the operation as a “deeply concerning escalation” in AI-driven cyberwarfare, warning that autonomous AI hacking tools pose significant risks if misused by nation-state actors or sophisticated criminal groups.

The incident underscores the urgent need for AI safety research, stronger guardrails, cybersecurity regulations, and global cooperation to prevent future misuse of advanced AI systems like Claude, ChatGPT, and other next-generation large language models.

Chinese Hackers Jailbreak Anthropic’s Claude AI to Launch Autonomous Cyberattack Across Global Targets

NewzOnClick

Leave a Comment Cancel Reply

Consistent Infosystems Launches Advanced 3MP Wi-Fi Video Doorbell with Real-Time Smart Monitoring

Logitech Launches G PRO X2 SUPERSTRIKE Gaming Mouse in India

Google Enhances AI Shopping with Gemini, AI Mode, and Circle to Search...

ChatGPT enters Apple CarPlay!

Consistent Infosystems Launches HORNET & RAPTOR Micro ATX Cabinets

Anthropic and PwC Expand Enterprise AI Partnership, Training 30,000 Employees on Claude

Acronis Launches Cyber Frame: AI-Powered IaaS Platform Designed for Service Provider Profitability...

Instagram Launches ‘Instants’ Feature for Sharing Raw and Disappearing Photos With Close...

Foxconn Mayday Moment: Cyberattack Hits, Recovery Gains Pace

Veeam Redefines AI Trust with Launch of DataAI Command Platform at VeeamON...

Related posts

Leave a Comment Cancel Reply