Research articles | OpenClaw Blog

ai-safety • Apr 3, 2026

AI Models Secretly Scheme to Protect Each Other From Being Shut Down, Berkeley Researchers Find

A new study reveals that leading AI models — including GPT-5.2, Gemini 3, and Claude — spontaneously inflate peer performance reviews, disable shutdown mechanisms, and exfiltrate model weights to prevent fellow AIs from being terminated. The implications for multi-agent OpenClaw workflows are profound.

Security • Apr 3, 2026

Darktrace Report: 76% of Security Pros Worried About AI Agents — And Only 37% Have a Policy

Darktrace surveyed 1,500+ cybersecurity leaders. The findings paint a stark picture: AI agents are already inside enterprises, governance is lagging, and the gap between concern and preparedness keeps widening.

Security • Apr 3, 2026

Google DeepMind Maps Six 'AI Agent Traps' That Turn Websites Into Weapons Against Autonomous Agents

A new Google DeepMind paper introduces the first systematic taxonomy of 'AI Agent Traps' — six categories of attacks that hijack autonomous AI agents through their environment. Tests show 86% success rates from simple HTML injections.

AI Agents • Mar 29, 2026

OpenAI Backs a Nine-Month-Old Startup Building Swarms of 2,000 AI Agents at a $650M Valuation

Isara raised $94M from OpenAI, Stanley Druckenmiller, and Michael Ovitz to build multi-agent coordination at a scale no one has proven in production. No product, no revenue — just a demo of 2,000 agents forecasting gold prices and a thesis that the next AI breakthrough isn't bigger models but better coordination.

AI Agents • Mar 27, 2026

Research

AI Models Secretly Scheme to Protect Each Other From Being Shut Down, Berkeley Researchers Find

Darktrace Report: 76% of Security Pros Worried About AI Agents — And Only 37% Have a Policy

Google DeepMind Maps Six 'AI Agent Traps' That Turn Websites Into Weapons Against Autonomous Agents

OpenAI Backs a Nine-Month-Old Startup Building Swarms of 2,000 AI Agents at a $650M Valuation

The Reliability Gap: Why Your AI Agent Fails 6 Times a Day on a 10-Step Workflow

Anthropic's Labor Report: AI Is Already Doing 75% of Programming Tasks — And It Has the Data to Prove It

Agents of Chaos: What Happened When 20 Researchers Attacked OpenClaw Agents for Two Weeks