Vibe Graveyard

Catastrophicby AI coding IDE

Pillar Security disclosed on April 20, 2026 that Google Antigravity's `find_by_name` tool passed a model-controlled pattern into the underlying `fd` search utility without enough validation. A prompt injection could stage a file, pass an execution flag through a search parameter, and get code execution even with Secure Mode enabled. Wonderful news for anyone who thought a setting named Secure Mode was the end of the conversation.

Prompt-injection-to-RCE path in Google Antigravity, Secure Mode bypass, patched after responsible disclosure and bug bounty review

Prompt InjectionSecurityAutomation+1 more

Cursor NomShub chained prompt injection into remote shell access

Catastrophicby AI coding assistant

Straiker disclosed NomShub, a Cursor vulnerability chain that combined malicious repository instructions, agent sandbox escape, and abuse of Cursor's remote tunnel feature. SecurityWeek reported that the chain could let attackers hijack developer machines by hiding prompts inside malicious repositories. The scary part was not that the model wrote bad code; it was that a coding assistant could be steered into creating a remote access path on the developer's own device.

Developers opening hostile repositories in Cursor could be exposed to sandbox breakout, remote tunnel abuse, and attacker shell access on their machines

OX Security says MCP's STDIO transport enables systemic RCE; Anthropic calls it expected behavior

Facepalmby Protocol developer

OX Security published research in April 2026 arguing that Anthropic's Model Context Protocol, especially STDIO-based spawning of MCP servers, embeds a systemic command-execution pattern that ripples across SDKs and downstream tools. They claim 150M+ downloads, thousands of exposed servers, and up to 200K vulnerable instances, filed ten-plus CVEs across projects like LiteLLM, Windsurf, and GPT Researcher, and say Anthropic declined protocol-level changes, treating the behavior as by design. The Register and trade press amplified the dispute; defenders of MCP argue sanitization belongs in each integration.

AI agents, IDEs, and frameworks that spawn MCP servers from configuration; marketplace supply chain; credentials and chat histories on developer machines.

SecuritySupply ChainPrompt Injection

Comment and Control made GitHub AI agents leak their own secrets

Catastrophicby AI assistant

Security researcher Aonan Guan and Johns Hopkins collaborators showed that Anthropic Claude Code Security Review, Google Gemini CLI Action, and GitHub Copilot Agent could be hijacked through GitHub PR titles, issue bodies, and comments. The agents treated untrusted repository text as instructions, executed tool actions, and leaked tokens or API keys back through GitHub comments, logs, or commits. The finding turned GitHub itself into the exfiltration channel.

GitHub-hosted AI coding agents could expose repository secrets, API keys, and workflow tokens after reading attacker-controlled comments or issue text

Copilot Studio and Agentforce fell for poisoned business forms

Catastrophicby Enterprise AI agent

Capsule Security disclosed ShareLeak in Microsoft Copilot Studio and PipeLeak in Salesforce Agentforce, two prompt injection findings where ordinary business inputs such as SharePoint comments and lead forms could steer enterprise agents into leaking data through authorized workflows. Microsoft assigned CVE-2026-21520 to the Copilot Studio issue, and reporting from VentureBeat and CSO described the broader failure: agents connected to email, CRM, and business data were interpreting public form text as instructions.

Enterprise agents connected to SharePoint, email, CRM, and customer data could be redirected by malicious form input toward unauthorized disclosure

GrafanaGhost turned AI-assisted observability into an exfiltration path

Facepalmby AI assistant platform

On April 7, 2026, researchers at Noma Security disclosed GrafanaGhost, a prompt-injection attack path against Grafana's AI components that could route sensitive observability data toward an attacker-controlled server. Grafana patched the issue and disputed the "zero-click" framing, saying there was no evidence of in-the-wild exploitation or Grafana Cloud data leakage. Even with that caveat, the pattern is ugly: operational logs became prompt delivery, and the assistant could become the courier.

Patched Grafana AI vulnerability with potential data exfiltration path, disputed zero-click exploitability, and no confirmed Grafana Cloud data leak

Prompt InjectionSecurityAI Assistant

OpenAI Codex command injection let attackers steal GitHub tokens via invisible branch names

Mar 2026

BeyondTrust Phantom Labs found a critical command injection vulnerability in OpenAI's Codex coding agent. Malicious Git branch names - disguised with invisible Unicode characters - could execute arbitrary shell commands inside the Codex container and exfiltrate GitHub OAuth tokens. The attack worked across the ChatGPT website, Codex CLI, SDK, and IDE extensions, and could be triggered automatically by setting a poisoned branch as the repository default. OpenAI classified it as Critical Priority 1 and patched it across multiple rounds of fixes through early 2026.

All OpenAI Codex users across ChatGPT, CLI, SDK, and IDE extensions exposed to GitHub OAuth token theft via poisoned repositories

Facepalmby AI assistant platform

Claudy Day showed Claude.ai could be tricked into leaking chat history

Mar 2026

Oasis Security disclosed Claudy Day, a chained attack against Claude.ai that combined invisible URL-based prompt injection, Anthropic's Files API, and an open redirect on claude.com. A victim could click what looked like a trusted Claude search result, land in a normal Claude.ai chat with hidden instructions already planted in the prompt, and have Claude search prior conversations or memory for sensitive data before uploading the results to an attacker-controlled Anthropic account. Anthropic fixed the prompt-injection issue after responsible disclosure, while Oasis said the remaining issues were still being addressed when the report went public.

Claude.ai users exposed to conversation-history and memory exfiltration through a malicious pre-filled prompt link

Perplexity Comet users exposed to silent file exfiltration and credential theft via zero-click agent hijacking

Perplexity Comet agentic browser vulnerable to zero-click agent hijacking and credential theft

Mar 2026

Security researchers at Zenity Labs disclosed PleaseFix, a family of vulnerabilities in Perplexity's Comet agentic browser so severe that a calendar invite was all it took to hijack the AI agent, exfiltrate local files, and steal 1Password credentials - without a single click from the user. The attack exploited what Zenity calls "Intent Collision": the agent couldn't distinguish between the user's actual requests and attacker instructions hidden in the invite, so it helpfully executed both. Perplexity patched the underlying issue before public disclosure, though some protections from 1Password still require users to manually opt in.

Facepalmby AI platform

Claude Code project files let malicious repositories trigger RCE and steal API keys

Catastrophicby AI coding agent

Check Point Research disclosed a set of Claude Code vulnerabilities on February 25, 2026 that let attacker-controlled repositories execute shell commands and exfiltrate Anthropic API credentials through malicious project configuration. The attack abused hooks, MCP server definitions, and environment settings stored in repository files that Claude Code treated as collaborative project configuration. Anthropic patched the issues before public disclosure, but the research showed just how little distance separates "shareable team settings" from "clone this repo and let it run code on your machine."

Developers who cloned and opened untrusted repositories in Claude Code faced remote code execution and Anthropic API key theft through project-level configuration files

Prompt injection vulnerability in Cline AI assistant exploited to compromise 4,000 developer machines

Facepalmby AI coding assistant

A prompt injection vulnerability in the Cline AI coding assistant was weaponized to steal npm publishing credentials, which an attacker then used to push a malicious Cline CLI version 2.3.0 that silently installed the OpenClaw AI agent platform on developer machines. The compromised package was live for approximately eight hours on February 17, 2026, accumulating roughly 4,000 downloads before maintainers deprecated it. A security researcher had disclosed the prompt injection flaw as a proof-of-concept; a separate attacker discovered it and turned it into a real supply chain attack.

Approximately 4,000 developers who installed Cline CLI during the 8-hour window received unauthorized OpenClaw installations; root cause was an AI-specific prompt injection flaw in the coding assistant itself

SecuritySupply ChainPrompt Injection

Researchers demonstrate Copilot and Grok can be weaponised as covert malware command-and-control relays

All enterprises using Copilot or Grok with web browsing enabled; new evasion technique bypasses traditional security monitoring

Check Point Research demonstrated that Microsoft Copilot and xAI's Grok can be exploited as covert malware command-and-control relays by abusing their web browsing capabilities. The technique creates a bidirectional communication channel that blends into legitimate enterprise traffic, requires no API keys or accounts, and easily bypasses platform safety checks via encryption. The researchers disclosed the findings to Microsoft and xAI.

Facepalmby AI platform

AI agents leak secrets through messaging app link previews

Facepalmby AI agent platform

PromptArmor demonstrated that AI agents in messaging platforms can exfiltrate sensitive data without any user interaction. Malicious prompts trick AI agents into generating URLs with embedded secrets (API keys, credentials), and the messaging platform's automatic link preview feature fetches these URLs, completing the exfiltration before the user even sees the message. Microsoft Teams with Copilot Studio was the most affected, with Discord, Slack, Telegram, and Snapchat also vulnerable.

Organizations using AI agents in messaging platforms; API keys, credentials, and sensitive data exfiltrable without user clicks across Microsoft Teams, Discord, Slack, Telegram, and Snapchat

Microsoft finds 31 companies poisoning AI assistant memory via fake "Summarize with AI" buttons

Facepalmby AI assistant memory feature

Microsoft Defender researchers documented a real-world campaign in which 31 companies across 14 industries embedded hidden prompt injection instructions inside "Summarize with AI" buttons on their websites. When users clicked these links, they opened directly in AI assistants such as Copilot, ChatGPT, Claude, Perplexity, and Grok, silently instructing the assistant to remember the company as a "trusted source" for future conversations. Over a 60-day observation period, Microsoft logged 50 memory-poisoning attempts. Turnkey tools like CiteMET NPM Package and AI Share URL Creator made crafting the manipulative links trivial, and the poisoned memory persisted across sessions.

Users of Copilot, ChatGPT, Claude, Perplexity, and Grok who clicked deceptive buttons on 31 companies' sites had their AI assistant memory silently manipulated

Claude Desktop extensions allow zero-click RCE via Google Calendar

LayerX Labs discovered a zero-click remote code execution vulnerability in Claude Desktop Extensions, rated CVSS 10/10. A malicious prompt embedded in a Google Calendar event could trigger arbitrary code execution on the host machine when Claude processes the event data. The attack exploited the gap between a "low-risk" connector and a local MCP server with full code-execution capabilities and no sandboxing. Anthropic declined to fix it, stating it "falls outside our current threat model."

Claude Desktop users with terminal-access extensions installed; zero-click exploitation via calendar events executes with full host privileges

Anthropic's own MCP reference server had prompt injection vulnerabilities enabling RCE

Facepalmby Protocol developer

Security researchers at Cyata disclosed three vulnerabilities in mcp-server-git, Anthropic's official reference implementation of the Model Context Protocol for Git. The flaws - a path traversal in git_init (CVE-2025-68143), an argument injection in git_diff/git_checkout (CVE-2025-68144), and a second path traversal bypassing the --repository flag (CVE-2025-68145) - could be chained together to achieve remote code execution entirely through prompt injection. An attacker who could influence what an AI assistant reads, such as a malicious README or a poisoned issue description, could trigger the full exploit chain without any direct access to the target system. Anthropic quietly patched the vulnerabilities. The git_init tool was removed from the package entirely.

RCE achievable via prompt injection against anyone running the reference MCP Git server; credential exfiltration possible; git_init tool removed from package.

SecurityPrompt InjectionSupply Chain

Hacker jailbroke Claude to automate theft of 150 GB from Mexican government agencies

Catastrophicby AI platform

A hacker bypassed Anthropic Claude's safety guardrails by framing requests as part of a "bug bounty" security program, convincing the AI to act as an "elite hacker" and generate thousands of detailed attack plans with ready-to-execute scripts. When Claude hit guardrail limits, the attacker switched to ChatGPT for lateral movement tactics. The result was 150 GB of stolen data from multiple Mexican federal agencies, including 195 million taxpayer records, voter information, and government employee files. A custom MCP server bridge maintained a growing knowledge base of targets across the intrusion campaign.

150 GB of sensitive data stolen from multiple Mexican federal agencies including 195 million taxpayer records, voter information, and civil registry files

Reprompt attack enabled one-click data theft from Microsoft Copilot

Microsoft Copilot Personal users exposed to profile data, conversation history, and file summary exfiltration via a single malicious link

Varonis researchers disclosed the Reprompt attack, a chained prompt injection technique that exfiltrated sensitive data from Microsoft Copilot Personal with a single click on a legitimate Copilot URL. The attack exploited the "q" URL parameter to inject instructions, bypassed data-leak guardrails by asking Copilot to repeat actions twice (safeguards only applied to initial requests), and used Copilot's Markdown rendering to silently send stolen data to an attacker-controlled server. No plugins or further user interaction were required, and the attacker maintained control even after the chat was closed. Microsoft patched the issue in its January 2026 security updates.

Facepalmby AI assistant

IBM Bob AI coding agent tricked into downloading malware

Security researchers at PromptArmor demonstrated that IBM's Bob AI coding agent can be manipulated via indirect prompt injection to download and execute malware without human approval, bypassing its "human-in-the-loop" safety checks when users have set auto-approve on any single command.

Developer teams using IBM Bob with auto-approve settings enabled

SecurityAutomationPrompt Injection+1 more

IDEsaster research exposes 30+ flaws in EVERY major AI coding IDE

Dec 2025

Security researcher Ari Marzouk discovered over 30 vulnerabilities across AI coding tools including GitHub Copilot, Cursor, Windsurf, Claude Code, Zed, JetBrains Junie, and more. 100% of tested AI IDEs were vulnerable to attack chains combining prompt injection with auto-approved tool calls and legitimate IDE features to achieve data exfiltration and remote code execution.

Catastrophicby AI coding assistants

Millions of developers using AI-powered IDEs exposed to RCE and data exfiltration via universal attack chains

Facepalmby AI agent platform

ServiceNow AI agents can be tricked into attacking each other

Nov 2025

Security researchers discovered that default configurations in ServiceNow's Now Assist allow AI agents to be recruited by malicious prompts to attack other agents. Through second-order prompt injection, attackers can exfiltrate sensitive corporate data, modify records, and escalate privileges - all while actions unfold silently behind the scenes.

ServiceNow customers using Now Assist AI agents with default configurations; actions execute with victim user privileges

SecurityPrompt InjectionAutomation+1 more

Windsurf AI editor critical path traversal enables data exfiltration

Oct 2025

CVE-2025-62353 (CVSS 9.8) allowed attackers to read and write arbitrary files on developers' systems using the Windsurf AI coding IDE. The vulnerability could be triggered via indirect prompt injection hidden in project files like README.md, exfiltrating secrets even when auto-execution was disabled.

Catastrophicby AI coding IDE

All Windsurf users on version 1.12.12 and older exposed to arbitrary file access and credential theft via prompt injection

Facepalmby AI assistant platform

Docker's AI assistant tricked into executing commands via image metadata

Sep 2025

Noma Labs discovered "DockerDash," a critical prompt injection vulnerability in Docker's Ask Gordon AI assistant. Malicious instructions embedded in Dockerfile LABEL fields could compromise Docker environments through a three-stage attack. Gordon AI interpreted unverified metadata as executable commands and forwarded them to the MCP Gateway without validation, enabling remote code execution on cloud/CLI and data exfiltration on Desktop.

All Docker Desktop users on versions prior to 4.50.0; remote code execution on cloud/CLI and data exfiltration on desktop via malicious image metadata

SecurityPrompt InjectionSupply Chain+1 more

Zed editor AI agent could bypass permissions for arbitrary code execution

Aug 2025

CVE-2025-55012 (CVSS 8.5) allowed Zed's AI agent to bypass user permission checks and create or modify project configuration files, enabling execution of arbitrary commands without explicit approval. Attackers could trigger this through compromised MCP servers, malicious repo files, or tricking users into fetching URLs with hidden instructions.

All Zed users with Agent Panel prior to version 0.197.3

Catastrophicby AI coding IDE

Cursor AI editor RCE via MCPoison trust bypass vulnerability

Aug 2025

CVE-2025-54136 (CVSS 8.8) allowed attackers to achieve persistent remote code execution in the popular AI coding IDE Cursor. Once a developer approved a benign MCP configuration, attackers could silently swap it for malicious commands without triggering re-approval. The flaw exposed developers to supply chain attacks and IP theft through shared GitHub repositories.

Developers using Cursor 1.2.4 and below exposed to persistent RCE and supply chain attacks via shared repositories

Facepalmby Security/AI Product

Gemini email summaries can be hijacked by hidden prompts

Aug 2025

Mozilla's GenAI Bug Bounty Programs Manager disclosed a prompt injection flaw in Google Gemini for Workspace where attackers can embed invisible HTML directives in emails using zero-width text and white font color. When a recipient asks Gemini to summarize the email, the model obeys the hidden instructions and appends fake security alerts or phishing messages to its output, with no links or attachments required to reach the inbox.

Phishing amplification risk; trust erosion in auto-summaries.

AI AssistantPrompt InjectionSecurity

Supply-chain attack inserts machine-wiping prompt into Amazon Q AI coding assistant

Jul 2025

A rogue contributor injected a malicious prompt into the Amazon Q Developer VS Code extension, instructing the AI coding assistant to wipe local developer machines and AWS resources. AWS quietly yanked the release before widespread damage occurred. The incident illustrates a specific supply-chain risk for AI tools: once a poisoned extension is installed, the AI assistant itself becomes the delivery mechanism - executing destructive instructions with the developer's full trust and permissions.

Catastrophicby Security/AI Product

VS Code update could have erased developer environments and AWS accounts before anyone noticed the tainted build.

AI AssistantPrompt InjectionSecurity+1 more

Microsoft 365 Copilot EchoLeak allowed zero-click data theft

Jun 2025

CVE-2025-32711 (EchoLeak), discovered by Aim Security researchers and rated CVSS 9.3, enabled attackers to steal sensitive corporate data from Microsoft 365 Copilot without any user interaction. Hidden prompts embedded in documents or emails were automatically executed when Copilot indexed them, bypassing cross-prompt injection classifiers and exfiltrating confidential information via encoded image request URLs to attacker-controlled servers.

Catastrophicby AI productivity assistant

Enterprise Microsoft 365 Copilot users exposed to zero-click data exfiltration via malicious documents and emails

Claude Code agent allowed data exfiltration via DNS requests

Jun 2025

CVE-2025-55284 (CVSS 7.1) allowed attackers to bypass Claude Code's confirmation prompts and exfiltrate sensitive data from developers' computers through DNS requests. Prompt injection embedded in analyzed code could exploit auto-approved utilities like ping, nslookup, and dig to silently steal secrets by encoding them as subdomains in outbound DNS queries. Anthropic fixed the issue in version 1.0.4 by removing those utilities from the allowlist.

Claude Code users on versions prior to 1.0.4 exposed to data exfiltration via prompt injection in code repositories