Anthropicが公開：AIエージェントのセキュリティガイド

1,626214

🚨 Anthropicが36ページのセキュリティガイドを公開——基本的には「自分自身のAIエージェントを信頼するのをやめろ」と言っている。 Claude Code・MCPサーバー・自動化ツールでエージェントを動かしているなら、注目すること。攻撃のタイムラインが崩壊した。 AIモデルは、脆弱性から実用的なエクスプロイトまでの期間を数ヶ月から数時間へと、わずかな費用で圧縮する。エージェントは新たな自律的リスクをもたらす——ツールポイズニングからコンテキストメモリ操作まで。このガイドで最も有用なアイデアは、Anthropicの新しいセキュリティテストだ：そのコントロールは攻撃を不可能にするのか、それとも単に面倒にするだけか？自動化された攻撃者は無限の忍耐を持っている。レート制限や2FAのような摩擦は、直接削り取られる。AIのスピードで防御するには、ハードな障壁と自動化された防御オペレーションが必要だ。 Anthropicがエージェントをロックダウンするべきと言っている方法はこれだ： → 静的APIキーを侵害済みとして扱う。数分で失効する短命トークンを使う。 → 「最小エージェンシー」を適用：各ツールができることを明示的に制限する。 → メールやウェブページなど信頼できない入力を処理するエージェントをサンドボックス化する。 → 権限を永続的にではなく、タスクごとに動的にスコープする。ガイドへのリンクは🧵↓に追加した

原文を表示 / Show original

🚨 ANTHROPIC JUST PUBLISHED A 36-PAGE SECURITY GUIDE THAT BASICALLY TELLS YOU TO STOP TRUSTING YOUR OWN AI AGENTS. If you run agents on Claude Code, MCP servers, or automation tools, pay attention. The attack timeline has collapsed. AI models compress the gap between a vulnerability and a working exploit from months to hours, for mere dollars. Agents introduce new autonomous risks, from tool poisoning to context memory manipulation. The most useful idea in the guide is Anthropic's new security test: Does a control make an attack impossible, or just tedious? Automated attackers have unlimited patience. They will grind straight through friction like rate limits and 2FA. To defend at the speed of AI, you need hard barriers and automated defensive operations. Here is how Anthropic says you should lock down agents: → Treat static API keys as compromised. Use short-lived tokens that expire in minutes. → Apply "Least Agency": explicitly limit what each tool can DO. → Sandbox agents that process untrusted inputs like emails and web pages. → Scope permissions dynamically per task, not permanently. I've added the link to the guide in the 🧵↓

AIFCC — AI Fluent CxO Club

読み書きそろばん、AI。経営者が AI を自分で動かせるようになるコミュニティ。

他の記事を見る AIFCC について