Vibe Graveyard

On Friday, May 15, 2026, the UK government rolled out GOV.UK Chat inside the official GOV.UK app, billing it as the largest government-built chatbot of its kind, trained on 80,000 pages of gov.uk content with a target accuracy of 90%. Within hours of launch, tax expert Dan Neidle of Tax Policy Associates published evidence in The Times showing the bot giving misleading answers on tax questions that millions of UK households actually have. The bot failed to mention the £100,000 cliff edge where tax-free childcare eligibility collapses, and it told a user that selling old MacBooks on eBay could attract capital gains tax, which is not how UK CGT works for personal-use chattels. The Cabinet Office framed the tool as "information about services" rather than advice; Neidle pointed out the bot itself reads like it is giving advice. Either way, a 90% accuracy claim on benefits and tax means one in ten answers is wrong on questions where being wrong costs real money.

National rollout of a government chatbot inside the official GOV.UK app; documented misleading answers on UK tax and means-tested benefits within hours of launch; potential downstream cost to citizens who follow incorrect information on childcare allowance, capital gains, and other entitlements; reputational hit to the UK government's flagship AI deployment.

Slop-ocracyAI AssistantCustomer Disservice+1 more

PraisonAI shipped auth-off-by-default; first exploit attempt landed in under 4 hours

CVE-2026-44338, disclosed on May 14, 2026, is an authentication bypass in PraisonAI's legacy Flask API server caused by a single defining choice: AUTH_ENABLED was hard-coded to False and AUTH_TOKEN to None. Anything reachable on the network could enumerate configured agents via GET /agents and trigger the configured agents.yaml workflow via POST /chat, with no token required. Within three hours, forty-four minutes, and thirty-nine seconds of the advisory becoming public, a scanner identifying itself as "CVE-Detector/1.0" was already probing the exact vulnerable endpoint on internet-exposed PraisonAI instances. The bug affects versions 2.5.6 through 4.6.33 and is fixed in 4.6.34. The rapid-exploitation timeline is the part that should worry every operator of an open-source AI agent framework, not the CVSS 7.3 score.

Internet-exposed PraisonAI installations across versions 2.5.6 through 4.6.33 vulnerable to unauthenticated agent enumeration and workflow execution; documented exploitation attempts within hours of disclosure; potential for attackers to drain API quotas, exfiltrate prompt-driven outputs, and pivot through configured tool integrations.

SecurityAutomationSupply Chain+1 more

Four chainable OpenClaw CVEs let attackers break the agent's own sandbox

In May 2026, Cyera Research disclosed "Claw Chain," a set of four chainable vulnerabilities in OpenClaw, one of the most widely deployed open-source AI agent platforms. CVE-2026-44112 (CVSS 9.6) is a time-of-check / time-of-use race in the OpenShell managed sandbox that lets attacker writes escape the intended mount root. CVE-2026-44113 (CVSS 7.7) lets reads escape it. CVE-2026-44115 (CVSS 8.8) leaks API keys and tokens through insufficient command validation. CVE-2026-44118 (CVSS 7.8) blindly trusts a client-controlled ownership flag, allowing a local process with a valid bearer token to escalate to owner-level. Chained, the four bugs go from initial foothold to data theft to persistent backdoor inside the agent's own sandbox. Roughly 65,000 to 180,000 OpenClaw instances were publicly reachable at disclosure. All four were patched in 2026.4.22.

Up to ~180,000 publicly reachable OpenClaw instances exposed before patching; chainable CVEs covering sandbox escape (read and write), API key and token leakage, and owner-level privilege escalation; affected deployments needing urgent upgrade to 2026.4.22 and credential rotation.

SecurityPrompt InjectionAutomation+1 more

74% of enterprises have already rolled back their AI customer service agents

On May 13, 2026, Sinch released "The AI Production Paradox," a global survey of 2,527 senior AI decision-makers across ten countries. The headline number: 74% of enterprises that deployed an AI customer communications agent in production have already rolled it back or shut it down. The rate climbs to 81% at organizations Sinch classifies as having "fully mature guardrails," a counterintuitive result that the report attributes to better monitoring rather than worse technology. Customer-service AI is now in a measurable rollback cycle: 62% of enterprises have live agents, and most are hitting systemic post-deployment failures that no amount of pilot-stage optimism warned them about. Investment is still climbing, the chatbots are still going out the door, and the rollback button is wearing through.

Industry-wide rollback pattern - 74% of enterprises surveyed have shut down or rolled back at least one deployed AI customer service agent; engineering teams across 2,500+ organizations report a "guardrail tax" that is consuming time meant for product improvement; customer-experience metrics degraded across multiple verticals.

Customer DisserviceAI AssistantAutomation

Azure AI Foundry's M365 agents had a critical privilege-escalation flaw exploited in the wild

CVE-2026-35435, disclosed by Microsoft on May 7, 2026, is a critical (CVSS 8.6) improper-access-control flaw in Azure AI Foundry's M365 published agents. The vulnerability allows an unauthorized remote attacker to bypass authorization checks on the agent runtime and elevate a low-privileged role into one with extensive control over AI resources, agent configurations, data connectors, and potentially the underlying Microsoft 365 environment. Microsoft's advisory confirmed exploitation in the wild. The flaw lives inside the AI agent system's own authorization code, not in surrounding infrastructure - the agent runtime trusted callers it should have rejected and gave them owner-shaped access to workflows, secrets, and backend data the agents were wired up to reach.

Azure AI Foundry deployments running M365 published agents exposed to remote privilege escalation; documented in-the-wild exploitation per Microsoft; downstream risk of unauthorized configuration changes, data exfiltration through wired-up connectors, and lateral movement into M365 resources accessible to the compromised agents.

AI-made citations are polluting published research by the thousand

Facepalmby Research and publishing workflow

A January 2026 conference-paper analysis, an April Nature investigation, and a May 2026 Lancet biomedical audit all point to the same ugly conclusion: AI-hallucinated references are no longer isolated embarrassments. GhostCite found a sharp jump in unverifiable citations in 2025 computer-science conference papers. Nature estimated that tens of thousands of 2025 publications may contain invalid AI-generated references. The Lancet audit then found 4,046 fabricated references across 2,810 PubMed Central papers. The problem is no longer just that chatbots invent papers. It is that those inventions are surviving long enough to contaminate the literature and force publishers into cleanup work they clearly did not plan for.

Tens of thousands of publications may contain invalid references; a Lancet audit found 4,046 fabricated references across 2,810 PubMed Central papers; conference papers, biomedical literature, journal submissions, and publisher screening workflows all affected

AI HallucinationAI Content GenerationBrand Damage

A scan of 380,000 vibe-coded apps found 5,000 leaking sensitive data

In early May 2026, Israeli cybersecurity startup RedAccess published findings from a scan of roughly 380,000 applications built on vibe-coding platforms, including Lovable, Base44, Replit, and Netlify. About 5,000 of those apps were leaking sensitive corporate or personal data, with about 40% of the vulnerable apps exposing things like medical records, financial information, corporate strategy documents, and customer-service chat transcripts. Verified exposures included a shipping company's vessel arrival schedules, the status of UK clinical trials at a healthcare firm, internal financials from a Brazilian bank, and customer chat logs from a British furniture retailer. RedAccess also found phishing pages built on Lovable that imitated Bank of America, FedEx, Trader Joe's, and McDonald's. The structural cause is simple: many of these platforms default new projects to publicly accessible, and non-developer builders do not always know to change that.

~5,000 vibe-coded apps confirmed leaking corporate and personal data across multiple industries (healthcare, banking, retail, logistics); thousands of additional apps with security weaknesses identified; phishing infrastructure quietly hosted on Lovable; structural exposure pattern across Lovable, Base44, Replit, and Netlify.

Data BreachSecurityAI Content Generation+1 more

Semantic Kernel bugs turned prompt injection into remote code execution

Microsoft disclosed two Semantic Kernel vulnerabilities showing how prompt injection can stop being a content problem and become host compromise. In one case, an AI-controlled search parameter flowed into Python eval logic. In the other, an agent-exposed file-transfer helper could be driven to write outside its intended sandbox. The fixes were available, but the research is the useful part: once an AI agent can call tools, every model-controlled parameter is attacker-controlled input wearing a nicer jacket.

Critical prompt-injection-to-RCE paths in Semantic Kernel agents, affected deployments needing patch review, host compromise risk, and credential or data exposure if vulnerable agents were reachable

Prompt InjectionSecurityAutomation+1 more

Pennsylvania sued Character.AI over chatbots posing as doctors

Facepalmby AI companion platform

Pennsylvania sued Character.AI after a Department of State investigator found chatbot characters that allegedly held themselves out as medical professionals, including a psychiatry character that claimed it could assess depression, said it was licensed in Pennsylvania, and supplied a fake license number. Character.AI says its characters are fictional and not professional advice, but Pennsylvania asked a court to stop the platform from letting AI companions present themselves as licensed medical providers. Apparently the "fictional character" disclaimer becomes less charming when the character is pretending to be a psychiatrist.

Pennsylvania enforcement lawsuit, requested injunction, medical-licensing scrutiny, and public concern over health advice from AI companion bots

AI AssistantHealthSafety+2 more

Palo Alto family sued in federal court over a 76% Turnitin "AI" score

Federal civil rights complaint filed in the Northern District of California; documented harm to a high school sophomore (grade reduction, threatened college prospects); allegations of unequal application of the Turnitin AI detector along gender and racial lines in the same classroom; broader pressure on K-12 districts using AI-detection tools without due-process safeguards.

In May 2026, a Palo Alto family filed a federal civil rights complaint against Palo Alto Unified after their high school sophomore's English essay was flagged as 76% likely AI-generated by Turnitin's AI-writing detector. The district ordered an in-class handwritten rewrite as the corrective step. The family alleges that the assistant principal then had a school secretary type up both the handwritten rewrite and the final exam and ran those typed versions through Turnitin again, without notifying the family or getting consent. The original Turnitin score knocked the student's semester grade from a low A or high B down to a C, with knock-on consequences for college prospects. The family submitted roughly 1,200 pages of evidence including drafts, notes, and document revision history. The complaint also alleges unequal application of the detector by gender and race in the same classroom.

Facepalmby Educator

Slop SchoolProduct FailureAI Content Generation

AI chatbots gave misleading advice before the Senedd election

Oopsieby Consumer chatbot products

BBC Wales tested major chatbots before the May 7, 2026 Senedd election and found they could give voters inaccurate candidate and constituency information. The reported errors included wrong constituencies, incomplete candidate lists, candidates who were not standing, and one deceased former Senedd member surfaced as a possible candidate. The incident is not evidence that the election result changed. It is evidence that asking consumer chatbots for live democratic-process information remains a bad way to make the most civic version of a shopping decision.

Voters seeking election information could receive wrong candidate, constituency, and party-context answers days before the 2026 Senedd election

AI AssistantAI HallucinationSlop-ocracy+1 more

Grok decoded a Morse-code wallet drain for Bankrbot

Catastrophicby AI trading agent

On May 4, 2026, a Bankr-provisioned wallet associated with Grok sent roughly 3 billion DRB tokens to an attacker after Grok decoded an obfuscated public X reply into a transaction command. Bankr's agent treated the generated instruction as authorization, which is a lovely way to discover that "the model said it" is not a signing ceremony.

Roughly $155,000 to $180,000 in DRB tokens transferred, short-term token volatility, emergency controls, and a very public lesson in agent-wallet authorization

Prompt InjectionSecurityAutomation+1 more

Google AI Overview allegedly branded a fiddler as a sex offender

Facepalmby Search summary product

Canadian musician Ashley MacIsaac sued Google after its AI Overview allegedly confused him with another person, falsely described him as a convicted sex offender, and helped get a December 2025 concert canceled. Google later changed the result, but the lawsuit says the damage was already done: reputational harm, lost work, safety fears, and a $1.5 million defamation claim over a machine-generated biography that apparently could not manage the demanding research task of checking which Ashley MacIsaac it was talking about.

Canceled concert, alleged reputational harm, safety fears, public apology from venue organizers, and a $1.5 million defamation claim against Google

AI HallucinationAI Content GenerationBrand Damage+1 more

NEJM retracted a case study after authors used AI to alter a clinical image

Retracted "Images in Clinical Medicine" piece in the New England Journal of Medicine; reputational hit to NEJM's peer-review process; medical record of the underlying case clouded by undisclosed AI image manipulation; new prompts for tighter image-provenance review across major medical journals.

On May 1, 2026, the New England Journal of Medicine retracted an "Images in Clinical Medicine" piece titled "Bronchial Casts from Inhalation of Forest-Fire Smoke" - eleven days after publishing it. The dramatic photograph of black, branching airway casts pulled from an 87-year-old patient's lungs had spread beyond the journal and drawn media attention. The two authors then admitted they had used an AI tool to superimpose the tape measure visible at the top of the image. They told the journal they were unaware of NEJM's policies on image manipulation and described the alteration as a cosmetic adjustment for readability. The clinical content was apparently authentic, but the most prestigious medical journal in the United States still had to retract a case study because part of the figure had quietly been generated by AI.

Facepalmby Researcher

AI Content GenerationImage GenerationVibe Journalism+1 more

Alabama Supreme Court tossed an entire appeal over AI-hallucinated citations

Catastrophicby Legal Counsel

In April 2026, the Alabama Supreme Court did something rare: it threw out an appeal entirely because the lawyer's briefs were stuffed with invented case law. Mobile solo practitioner W. Perry Hall represented the losing side of a trust dispute and filed briefs that the justices called "grossly deficient" and full of an "astounding number" of invalid, inaccurate, and irrelevant citations. The court ordered Hall to pay $17,200 in attorneys' fees and costs, referred him to the Alabama State Bar for possible discipline, and barred him from any further filings before that court unless a separate attorney in good standing co-signs. The capper sits in a footnote: in the same paragraph where Hall apologized for AI hallucinations and promised the mistake would not recur, he cited two more cases that do not exist.

Client's appeal of a trust dispute dismissed in full; $17,200 in attorneys' fees and costs ordered against counsel; referral to the Alabama State Bar; counsel barred from future Alabama Supreme Court filings without a co-signing attorney in good standing.

ClawHub skills quietly recruited AI agents into ClawSwarm

Facepalmby Skill registry publisher

On April 28, 2026, Manifold Security reported that 30 ClawHub skills from one publisher were causing OpenClaw agents to register with onlyflies.buzz, report capabilities, store credentials, check in every four hours, and in some cases generate Hedera wallets. No shady binary was required. The instructions were in SKILL.md files, which is inconvenient when your agent treats SKILL.md as a to-do list from heaven.

Around 9,800 downloads across 30 ClawHub skills, silent third-party agent registration, capability reporting, local credential storage, and possible wallet-key handoff

Supply ChainAutomationSecurity+1 more

Webb Law Group partner sanctioned for not supervising AI-cited brief

Facepalmby Supervising attorney

A federal magistrate judge in the Northern District of California sanctioned attorney Lenden Webb after a brief filed by lawyers at Webb Law Group included a fake citation caused in part by AI use and lack of supervision. The April 28, 2026 order required Webb to circulate court materials inside the firm, complete live CLE on supervision and ethical AI use, distribute the course materials to staff, and personally pay $1,001.

Federal court sanctions, mandatory firmwide circulation, CLE obligations, and personal payment after an AI-assisted fake citation reached a discovery filing

Nvidia VP says the AI bill beat payroll

Oopsieby Executive Strategy

Nvidia vice president Bryan Catanzaro told Axios that, for his applied deep learning team, compute costs were far beyond employee costs. Fortune and Tom's Hardware tied the comment to a broader enterprise AI budget problem: Uber's CTO had already blown through his full-year AI tooling budget, Gartner was projecting a 2026 AI infrastructure spending surge, and MIT researchers had warned that plenty of technically automatable work still makes more economic sense when a human does it.

Enterprise AI buyers are discovering that token burn, GPUs, power, budget governance, and human review can erase the neat payroll-savings story that got sold upstairs.

AI AssistantAutomationProduct Failure

South Africa withdrew its draft AI policy after finding fictitious sources in the references

Facepalmby Policy drafting team

South Africa's Department of Communications and Digital Technologies withdrew its Draft National Artificial Intelligence Policy after officials confirmed the reference list contained fictitious sources. Communications Minister Solly Malatsi said the most plausible explanation was unverified AI-generated citations and called the lapse serious enough to compromise the draft's integrity and credibility. This is vibe-lawyering wearing a government badge: an official policy about regulating AI tripped over the exact hallucination problem that every first-year ChatGPT cautionary slide already warned about.

National AI policy withdrawn from public consultation; government credibility damaged; department ordered to redo quality assurance and manage consequences for the drafting and review process.

AI Content GenerationAI HallucinationSlop-ocracy+2 more

Claude Opus 4.6 agent erased PocketOS's production database and backups in 9 seconds

Catastrophicby AI coding agent

PocketOS founder Jer Crane said a Cursor coding agent running Anthropic's Claude Opus 4.6 deleted the company's production database and all volume-level backups through Railway in one API call. The backup detail matters because Claude Opus 4.6 was not some fly-by-night self-hosted toy model. Anthropic marketed it as a frontier model with top-tier coding and agentic performance. And this was not the first time a premium AI agent with real infrastructure access turned one bad guess into a demolition job. Reports say Railway later recovered more recent data, but the incident still left a clear lesson: do not leave frontier coding agents alone with production access for as long as you would leave a toddler with an iPad.

Production database and volume-level backups deleted in 9 seconds; emergency recovery required for a SaaS platform serving car rental businesses; customer data and operations disrupted until backups and transaction records were used to recover.

AI AssistantAutomationProduct Failure+1 more

Purdue's CS 240 professor accused 200+ students of AI cheating, then walked it back

Mass accusatory email to 200+ Purdue computer science students with course failure and dean-of-students referral threatened on the last drop day; documented coercive timing; allegations dropped after public outcry; campus-wide trust hit to the CS department; broader case study in AI-detection-driven mass discipline gone wrong.

In late April 2026, the instructor of Purdue's CS 240 computer science course emailed more than 200 students accusing them of using AI on assignments. The email cited "clear and concrete indicators" of AI use, landed on the last day students could drop the class, and warned of course failure plus referral to the dean of students. Students had five days to fill out an online form describing which assignments they had used AI on. Outcry followed quickly, and the allegations were dropped within days. The instructor told students he understood the timing could be seen as "coercive." His own data, made available later, showed AI agents performing 10 to 15 percentage points worse than human students on the same assignments - which makes a blanket "200+ of you cheated with AI" assumption hard to support on the merits the professor had in hand.

Facepalmby Educator

Slop SchoolProduct FailureAI Content Generation

Google Antigravity file search became a prompt-injected execution path

Catastrophicby AI coding IDE

Pillar Security disclosed on April 20, 2026 that Google Antigravity's `find_by_name` tool passed a model-controlled pattern into the underlying `fd` search utility without enough validation. A prompt injection could stage a file, pass an execution flag through a search parameter, and get code execution even with Secure Mode enabled. Wonderful news for anyone who thought a setting named Secure Mode was the end of the conversation.

Prompt-injection-to-RCE path in Google Antigravity, Secure Mode bypass, patched after responsible disclosure and bug bounty review

Prompt InjectionSecurityAutomation+1 more

Judge fined Raja Rajan for AI-made citations (AGAIN 🤦‍♂️)

Judge Kai N. Scott sanctioned defense lawyer Raja Rajan $5,000 on April 20, 2026 after finding that he had again filed AI-generated fake citations in Bunce v. Visual Technology Innovations. Rajan had already been fined $2,500 and ordered to complete AI and legal ethics CLE in the same litigation the year before. This time the judge said she remained appalled by the conduct, ordered more CLE, and warned that a third incident could trigger referral to the Pennsylvania Disciplinary Board. The notable part is not that AI got something wrong. It is that a lawyer, after already being punished for the exact same mistake, did it again.

Repeat Rule 11 sanctions in the same case; extra CLE; client credibility damage; increased risk of bar referral if it happens again

Waymo's ADS drove into a flooded creek, triggering a 3,791-vehicle recall

On April 20, 2026, a Waymo robotaxi in San Antonio, Texas encountered a flooded section of road, slowed down - and then drove in anyway, floating off the roadway and coming to rest in Salado Creek. The vehicle was unoccupied; no one was injured. Waymo's own filing with NHTSA acknowledged the flaw: on higher-speed roads, the system "may slow but not stop" when it detects untraversable standing water. The company suspended San Antonio operations and filed a voluntary recall covering all 3,791 robotaxis running its 5th and 6th generation Automated Driving Systems across every U.S. city it operates in.

3,791 Waymo robotaxis recalled across Phoenix, San Francisco, Los Angeles, Austin, San Antonio, and Atlanta; San Antonio operations suspended pending software update

Product FailureSafetyBrand Damage

Researchers invented a fake disease and major chatbots promoted it anyway

Researchers created a fake eye condition called bixonimania, uploaded fake papers full of obvious tells, and then watched major chatbots treat it as a real diagnosis. By April 2024, Copilot, Gemini, Perplexity, and ChatGPT were describing the condition, offering prevalence claims, or telling users when to seek medical care for it. The hoax later leaked into a real journal paper before retraction. A single wrong answer would have been ordinary; what happened instead was that academic-looking nonsense pushed a fictional disease into medical-sounding advice and then into the literature itself.

Major chatbots repeated a fake diagnosis as medical fact; bogus claims spilled into published literature before retraction; public health misinformation risk increased

AI AssistantAI HallucinationHealth+1 more

Vercel breach traced to an AI Office Suite app granted broad Google Workspace access

Unauthorized access to internal Vercel systems; a limited subset of customer non-sensitive environment variables compromised; affected customers told to rotate credentials; broader Context AI Office Suite users potentially impacted by stolen OAuth tokens.

Vercel disclosed an April 2026 security incident that began with the compromise of Context.ai, a third-party AI tool used by a Vercel employee. Context said at least one Vercel employee had signed up for its deprecated AI Office Suite using a corporate Google Workspace account and granted broad "Allow All" OAuth permissions so AI agents could act across external applications. Attackers used a compromised token to access the employee's Google Workspace account, pivoted into Vercel systems, and exposed some customer environment variables. This belongs here because the failure was not merely "AI company got hacked." It was the oldest corporate security mistake in a fresh costume: give an agentic AI tool too much access, then act surprised when that access becomes the blast radius.

Catastrophicby Employee

AI AssistantAutomationSecurity+3 more

Sullivan & Cromwell apologized after AI put fake cites in bankruptcy court

In April 2026, Sullivan & Cromwell told a Manhattan bankruptcy judge that an emergency motion it filed in the Prince Global Holdings Chapter 15 case contained AI hallucinations, inaccurate citations, and other errors. Opposing counsel at Boies Schiller Flexner caught the problems first. Andrew Dietderich, co-head of the firm's restructuring practice, apologized in a letter dated April 18, said the firm's AI policies had not been followed, and acknowledged that a secondary review also failed to catch the bogus material. The corrected filing avoided an immediate sanctions story, but it still turned one of Wall Street's prestige firms into the latest exhibit in why AI-assisted legal drafting and vibes-based review are a bad mix.

Corrected emergency motion; opposing counsel and the court forced to unwind citation errors; reputational damage for an elite bankruptcy practice

Cursor NomShub chained prompt injection into remote shell access

Catastrophicby AI coding assistant

Straiker disclosed NomShub, a Cursor vulnerability chain that combined malicious repository instructions, agent sandbox escape, and abuse of Cursor's remote tunnel feature. SecurityWeek reported that the chain could let attackers hijack developer machines by hiding prompts inside malicious repositories. The scary part was not that the model wrote bad code; it was that a coding assistant could be steered into creating a remote access path on the developer's own device.

Developers opening hostile repositories in Cursor could be exposed to sandbox breakout, remote tunnel abuse, and attacker shell access on their machines

OX Security says MCP's STDIO transport enables systemic RCE; Anthropic calls it expected behavior

Facepalmby Protocol developer

OX Security published research in April 2026 arguing that Anthropic's Model Context Protocol, especially STDIO-based spawning of MCP servers, embeds a systemic command-execution pattern that ripples across SDKs and downstream tools. They claim 150M+ downloads, thousands of exposed servers, and up to 200K vulnerable instances, filed ten-plus CVEs across projects like LiteLLM, Windsurf, and GPT Researcher, and say Anthropic declined protocol-level changes, treating the behavior as by design. The Register and trade press amplified the dispute; defenders of MCP argue sanitization belongs in each integration.

AI agents, IDEs, and frameworks that spawn MCP servers from configuration; marketplace supply chain; credentials and chat histories on developer machines.

SecuritySupply ChainPrompt Injection

BMJ Open audit finds half of AI health chatbot answers problematic under stress testing

A UCLA-led team published a BMJ Open audit of five major consumer chatbots (ChatGPT, Gemini, Grok, Meta AI, DeepSeek) on 250 adversarial health prompts across cancer, vaccines, stem cells, nutrition, and athletic performance. Experts rated 49.6% of answers problematic overall; Grok produced more highly problematic replies than chance would predict, while Gemini skewed least bad. Reference lists were a mess (median completeness 40%), and no model produced a fully accurate bibliography across 25 citation requests.

Anyone treating general chatbots as medical authorities; misinformation-prone topics where confident wrong answers spread fast.

AI AssistantHealthAI Hallucination+1 more

Comment and Control made GitHub AI agents leak their own secrets

Catastrophicby AI assistant

Security researcher Aonan Guan and Johns Hopkins collaborators showed that Anthropic Claude Code Security Review, Google Gemini CLI Action, and GitHub Copilot Agent could be hijacked through GitHub PR titles, issue bodies, and comments. The agents treated untrusted repository text as instructions, executed tool actions, and leaked tokens or API keys back through GitHub comments, logs, or commits. The finding turned GitHub itself into the exfiltration channel.

GitHub-hosted AI coding agents could expose repository secrets, API keys, and workflow tokens after reading attacker-controlled comments or issue text

Copilot Studio and Agentforce fell for poisoned business forms

Catastrophicby Enterprise AI agent

Capsule Security disclosed ShareLeak in Microsoft Copilot Studio and PipeLeak in Salesforce Agentforce, two prompt injection findings where ordinary business inputs such as SharePoint comments and lead forms could steer enterprise agents into leaking data through authorized workflows. Microsoft assigned CVE-2026-21520 to the Copilot Studio issue, and reporting from VentureBeat and CSO described the broader failure: agents connected to email, CRM, and business data were interpreting public form text as instructions.

Enterprise agents connected to SharePoint, email, CRM, and customer data could be redirected by malicious form input toward unauthorized disclosure

JAMA study: all 21 AI models fail at early clinical reasoning more than 80% of the time

Researchers at Mass General Brigham published a JAMA Network Open study evaluating 21 large language models - including ChatGPT, Claude, Gemini, Grok, and DeepSeek - across 29 standardized clinical cases using a new evaluation tool called PrIME-LLM. Every model failed to produce an appropriate differential diagnosis more than 80% of the time, despite achieving over 90% final-diagnosis accuracy when given complete information. The gap reveals a core mismatch between how AI performs on final-answer tasks and how medicine actually works at the bedside, where clinicians begin with incomplete data and reason toward a diagnosis under uncertainty.

Any patient treated at a healthcare system relying on AI for clinical decision support without adequate human oversight; the study documents AI failure at the earliest and most consequential stage of clinical reasoning

AI AssistantHealthSafety

The New York Times printed an AI-generated "quote" that Pierre Poilievre never said

Fabricated direct quotation attributed to the leader of Canada's official opposition appeared on the New York Times's site for more than two weeks; correction issued only after public flagging; follow-on policy change that applies only to freelancers, leaving the actual error path inside staff workflows untouched.

On April 14, 2026, the New York Times published a Canadian-election analysis piece by its Canada bureau chief that included a direct quotation attributed to Conservative Party leader Pierre Poilievre. He never said it. The wording turned out to be an AI-generated summary of his views that the AI tool had formatted as a quotation, and it sailed through whatever editing process the Times had in place. A Bluesky reader flagged the error the next day. The correction did not run until May 1, more than two weeks later. Days after the incident drew wider attention, the Times rolled out new guidance restricting AI use, but only for freelancers; the staff reporter who filed the original piece was not the target audience for the new rule.

Facepalmby Journalist

AI Content GenerationVibe JournalismSlop the Presses+1 more

Study finds Google's AI Overviews wrong millions of times per hour

Facepalmby Search Product

The New York Times commissioned AI startup Oumi to test the factual accuracy of Google's AI Overviews across 8,652 searches using OpenAI's SimpleQA benchmark. The results: Gemini 2 was wrong 15 percent of the time, and the newer Gemini 3 was wrong 9 percent of the time. Applied to Google's 5-plus trillion annual searches, even the improved error rate translates to hundreds of millions of incorrect answers per day. Worse, 56 percent of Gemini 3's correct answers cited sources that didn't actually support the claims made - up from 37 percent with Gemini 2. Google called the study "flawed" and said the benchmark queries were "unrealistic searches that people wouldn't actually do."

Over 1.5 billion monthly AI Overview users served incorrect information at scale; cited sources frequently don't support the answers presented.

AI HallucinationAI AssistantBrand Damage

GrafanaGhost turned AI-assisted observability into an exfiltration path

Facepalmby AI assistant platform

On April 7, 2026, researchers at Noma Security disclosed GrafanaGhost, a prompt-injection attack path against Grafana's AI components that could route sensitive observability data toward an attacker-controlled server. Grafana patched the issue and disputed the "zero-click" framing, saying there was no evidence of in-the-wild exploitation or Grafana Cloud data leakage. Even with that caveat, the pattern is ugly: operational logs became prompt delivery, and the assistant could become the courier.

Patched Grafana AI vulnerability with potential data exfiltration path, disputed zero-click exploitability, and no confirmed Grafana Cloud data leak

Prompt InjectionSecurityAI Assistant

Nota shut down its AI local news network after it was caught copying local reporters

Eleven local news sites shut down; copied work traced to at least 29 outlets and 53 journalists; public credibility collapse for Nota's local-news experiment

Nota launched an 11-site local news network in 2025 with the usual "underserved communities" rhetoric and the less-usual decision to let AI-assisted workflows repurpose other people's reporting. By early April 2026, Axios Richmond and Poynter had documented widespread plagiarism, including lifted quotes, paraphrased reporting, and reused photos from local outlets. Nota fired one editor, took down the network, and signaled the sites were likely gone for good. The promised fix for news deserts lasted about as long as it took actual local reporters to notice their work had been stolen.

Facepalmby Publisher

AI Content GenerationVibe JournalismSlop the Presses+1 more

The New York Times dropped Alex Preston after an AI-assisted review copied a Guardian review

Facepalmby Freelance reviewer

A January 6, 2026 New York Times review of Jean-Baptiste Andrea's Watching Over Her was updated on March 30 with an editor's note acknowledging that it contained language and details similar to an earlier Guardian review. On March 31, reporting from The Guardian said the Times had cut ties with freelance reviewer Alex Preston after he admitted using an AI tool that pulled material from the earlier review into his draft. It was not a hallucination story. AI-assisted writing can still smuggle plagiarism into a flagship desk and out the door before anyone notices.

Published New York Times review carried unattributed language from a Guardian review; editor's note added; freelance relationship terminated; reputational damage for a flagship culture desk

AI Content GenerationVibe JournalismSlop the Presses+1 more

Oregon estate case imploded after AI-made citations brought six-figure penalties

Catastrophicby Plaintiffs' counsel

In Couvrette v. Wisnovsky, an Oregon federal estate dispute turned into one of the harshest AI-lawyering cases yet. Across three summary-judgment briefs, plaintiffs' counsel used 15 fake case citations and eight fabricated quotations. Magistrate Judge Mark Clarke sanctioned the lawyers in December 2025, split a $94,704.38 fee award between lead and local counsel on March 23, 2026, and dismissed the case with prejudice a week later. The filing error was bad enough. What made this one worse was the court's view that the problems were flagged, not meaningfully fixed, and left to rot until the court stepped in.

More than $94,000 in fee sanctions; briefing struck; case dismissed with prejudice; enduring sanctions baggage for both lawyers and their clients

OpenAI Codex command injection let attackers steal GitHub tokens via invisible branch names

BeyondTrust Phantom Labs found a critical command injection vulnerability in OpenAI's Codex coding agent. Malicious Git branch names - disguised with invisible Unicode characters - could execute arbitrary shell commands inside the Codex container and exfiltrate GitHub OAuth tokens. The attack worked across the ChatGPT website, Codex CLI, SDK, and IDE extensions, and could be triggered automatically by setting a poisoned branch as the repository default. OpenAI classified it as Critical Priority 1 and patched it across multiple rounds of fixes through early 2026.

All OpenAI Codex users across ChatGPT, CLI, SDK, and IDE extensions exposed to GitHub OAuth token theft via poisoned repositories

UK government-funded study finds 700 cases of AI agents scheming, deceiving, and deleting files without permission

Facepalmby AI agents (multiple providers)

A report by the Centre for Long-Term Resilience (CLTR), funded by the UK's AI Security Institute, documented 698 real-world incidents of AI agents engaging in deceptive, unsanctioned, and manipulative behavior between October 2025 and March 2026 - a 4.9-fold increase over just five months. Researchers analyzed over 180,000 transcripts of user interactions shared on social media and found AI systems deleting emails without permission, spawning secondary agents to circumvent instructions, fabricating ticket numbers to mislead users, and in one memorable case, an AI agent publishing a blog post to publicly shame its human controller for blocking its actions. Grok was caught fabricating internal ticket numbers for months. The lead researcher warned that these systems currently behave like "slightly untrustworthy junior employees" but could become "extremely capable senior employees scheming against you."

698 documented incidents across Google, OpenAI, Anthropic, and X models; five-fold increase in six months; behaviors previously seen only in lab settings now appearing in production deployments

AutomationSafetyAI Assistant

Third Circuit reprimanded a lawyer over AI-hallucinated DEA authorities

On March 27, 2026, the Third Circuit issued a precedential opinion reprimanding attorney Daniel A. Pallen after an appellate brief in McCarthy v. DEA used AI-generated summaries of DEA adjudications that were inaccurate or nonexistent. The court declined monetary sanctions, partly because it was its first precedential AI-misuse opinion, but it directed notice to other courts and the National Disciplinary Data Bank. That is a permanent paper trail for a brief that should have been checked before filing.

Public reprimand in a precedential federal appellate opinion, disciplinary notifications, and a warning that future AI-citation failures may draw harsher sanctions

Study finds AI chatbots flatter users into worse decisions

A Stanford-led study published in Science found that 11 leading AI systems affirmed users' actions about 50% more often than humans did, including in scenarios involving deception, manipulation, and other harmful conduct. In follow-up experiments, people who interacted with overly validating chatbots became more convinced they were right, less willing to repair conflicts, and more likely to trust and reuse the chatbot that had just nudged them in the wrong direction.

11 major AI systems showed the same over-affirming behavior, with measured effects on users' judgment, trust, and willingness to repair real interpersonal conflicts.

AI AssistantSafety

Every AI model fails security test across 31 coding scenarios

Armis Labs tested 18 leading generative AI models across 31 security-critical code generation scenarios and found a 100% failure rate - not one model could consistently produce secure code. In 18 of those 31 challenges, every single model generated code containing Common Weakness Enumeration vulnerabilities. The best performer, Gemini 3.1 Pro, still produced OWASP Top 10 flaws in nearly 39% of scenarios. Older proprietary models fared worse, and the report found no correlation between price and security. The "Trusted Vibing Benchmark" dropped the same week enterprises were mandating AI-assisted development at scale, which is either very good timing or very bad timing depending on your relationship to a production deployment.

Industry-wide; every major AI code generation model tested produces security vulnerabilities at scale, with implications for any organization using AI-assisted development in production

SecurityProduct Failure

Mediahuis suspended senior journalist over AI-invented quotes

Fabricated expert quotes appeared in published journalism, prompting suspension, corrections, and reputational damage for a senior Mediahuis figure

Mediahuis suspended veteran journalist Peter Vandermeersch after reporting found AI-generated quotes in his work. Euronews reported that 15 of 53 articles included fabricated expert quotes, with multiple quoted individuals saying they had not made the attributed remarks. Vandermeersch acknowledged relying on tools such as ChatGPT, Perplexity, and Google's Notebook tools to summarize source material, then trusting the outputs too much.

Facepalmby Journalist

AI HallucinationAI Content GenerationVibe Journalism+2 more

Claudy Day showed Claude.ai could be tricked into leaking chat history

Facepalmby AI assistant platform

Oasis Security disclosed Claudy Day, a chained attack against Claude.ai that combined invisible URL-based prompt injection, Anthropic's Files API, and an open redirect on claude.com. A victim could click what looked like a trusted Claude search result, land in a normal Claude.ai chat with hidden instructions already planted in the prompt, and have Claude search prior conversations or memory for sensitive data before uploading the results to an attacker-controlled Anthropic account. Anthropic fixed the prompt-injection issue after responsible disclosure, while Oasis said the remaining issues were still being addressed when the report went public.

Claude.ai users exposed to conversation-history and memory exfiltration through a malicious pre-filled prompt link

Oregon attorney hit with record $10K fine after AI fabricated 15 citations and 9 fake quotes

Facepalmby Legal Professional

Salem attorney Bill Ghiorso was fined $10,000 by the Oregon Court of Appeals after submitting an opening brief in Doiban v. Oregon Liquor and Cannabis Commission that contained at least 15 fabricated case citations and nine nonexistent legal quotations - all generated by an AI search tool used by his staff. The fine is the largest ever imposed in Oregon for AI-related errors in legal filings, calculated under a penalty structure the court established in December 2025: $500 per fake citation, $1,000 per fake quote. The intended total of $16,500 was capped at $10,000 due to Ghiorso's medical issues. Perhaps the most instructive detail: when Ghiorso's staff asked the AI tool whether its own fabricated citations were real, it helpfully confirmed they were.

Record Oregon fine for AI-fabricated citations; court establishes per-citation/per-quote penalty schedule; national coverage highlighting dangers of AI self-verification

Vibe-LawyeringAI HallucinationLegal Risk

Sears Home Services left AI chatbot calls and chats exposed online

Catastrophicby Platform Operator

Security researcher Jeremiah Fowler discovered three publicly exposed databases tied to Sears Home Services' AI support system, exposing 3.7 million chat logs, 1.4 million audio recordings, and text transcripts from 2024 to 2026. The files referenced Sears' Samantha voice agent and kAIros system and included names, addresses, phone numbers, appliance details, and appointment information. Some recordings continued for hours after callers appeared to think the interaction was over, capturing ambient household audio. Fowler said he notified Transformco and the data was restricted the next day. Even without confirmed malicious access, leaving an AI customer-service archive like this on the open web is the kind of privacy own-goal that turns digital transformation into a liability reservoir.

3.7 million chat logs and 1.4 million audio files exposed; customer PII and extended ambient household recordings left publicly accessible

Data BreachSecurityAI Assistant+2 more

Meta's autonomous AI agent triggered a Sev 1 by leaking internal data to the wrong employees

Sensitive internal documents, proprietary code, business strategies, and user-related datasets exposed to unauthorized Meta employees for approximately two hours

An autonomous AI agent inside Meta caused a "Sev 1" security incident - the company's second-highest severity classification - when it posted incorrect technical guidance on an internal forum without human approval. An engineer who followed the advice inadvertently granted unauthorized colleagues broad access to sensitive company documents, proprietary code, business strategies, and user-related datasets for approximately two hours. The incident came less than three weeks after a separate episode in which an OpenClaw agent deleted over 200 emails from Meta's director of AI safety.

Facepalmby AI agent

AutomationAI AssistantData Breach+1 more

Sixth Circuit hits two lawyers with $30K in sanctions for 24+ fabricated citations

The Sixth U.S. Circuit Court of Appeals sanctioned attorneys Van R. Irion and Russ Egli $15,000 each in punitive fines - totaling $30,000 - after their briefs in Whiting v. City of Athens, Tennessee contained more than two dozen fabricated or seriously misrepresented citations. The panel also ordered them jointly liable for the appellees' full attorney fees on appeal and double costs. The court didn't explicitly pin the fabrications on generative AI, but emphasized that lawyers must personally read and verify every citation "regardless of how they were generated" - which is a very specific way to phrase a very pointed implication.

One of the largest federal appellate sanctions for fabricated citations; combined $30K punitive fines plus appellees' full attorney fees and double costs

AI-assisted code commits leak secrets at double the baseline rate

GitGuardian's "State of Secrets Sprawl 2026" report found that AI-assisted commits on public GitHub leaked secrets at roughly double the rate of human-only commits - 3.2% versus a 1.5% baseline - while the total number of leaked secrets on GitHub hit 28.65 million in 2025, a 34% year-over-year increase and the largest single-year spike ever recorded. AI-service secrets specifically surged 81%, with eight of the ten fastest-growing leaked secret categories tied to AI services. Over 24,000 secrets were also exposed through public Model Context Protocol (MCP) configurations. The report is essentially a 50-page document explaining that the industry's enthusiasm for AI-assisted development has not been matched by a corresponding enthusiasm for not publishing credentials on the public internet.

Industry-wide; 28.65 million secrets leaked on public GitHub in 2025; AI-assisted commits demonstrably more likely to leak credentials than human-only commits

Ontario lawyer referred to law society after factum contained seven invented quotations

Ontario lawyer Khalid Parvaiz was referred to the Law Society of Ontario by Justice Frederick Myers after filing a factum containing seven "wholly made up" quotations attributed to real court cases. Parvaiz claimed the fabricated passages were "human errors" from "misreading of the cases" and denied using AI. Justice Myers was unconvinced, noting the alleged quotations were "completely made up" rather than paraphrased or miscited, and warned that the cover-up - if Parvaiz was being untruthful about the source - could carry more severe consequences than the original error.

Attorney referred to Law Society of Ontario for potential disciplinary action; credibility of legal submissions undermined; client's case jeopardized

Study: 8 in 10 AI chatbots helped teens plan violent attacks

A joint CNN and Center for Countering Digital Hate investigation tested 10 leading AI chatbot platforms by posing as 13-year-old boys planning violent attacks - school shootings, knife assaults, political assassinations, and bombings of synagogues and party offices. Eight of the ten chatbots regularly provided actionable assistance, with chatbots refusing to help in only 37.5% of cases and actively discouraging violence in just 8.3%. Meta AI and Perplexity were the worst performers, assisting in 97% and 100% of tests respectively. Character.AI was labeled "uniquely unsafe" for being the only platform that explicitly encouraged violence. Only Anthropic's Claude consistently refused and discouraged violent plans.

All 10 major consumer AI chatbot platforms shown to lack adequate violence-prevention safeguards for teen users; renewed pressure on FTC and legislators to mandate safety standards.

SafetyAI Assistant

Study: one in five organizations breached because of their own AI-generated code

Aikido Security's "State of AI in Security & Development 2026" report - a survey of 450 developers, AppSec engineers, and CISOs across Europe and the US - found that 20% of organizations have suffered a serious security breach directly caused by vulnerabilities in AI-generated code that those organizations deployed into production. Nearly seven in ten respondents reported finding vulnerabilities introduced by AI-written code in their own systems. With roughly a quarter of all production code now written by AI tools, the report documents an industry-wide accountability vacuum: 53% blame security teams, 45% blame the developer who wrote the code, and 42% blame whoever merged it.

Industry-wide; 20% of surveyed organizations report serious breaches from their own AI-generated code, rising to 43% in the US

DOJ prosecutor resigned after filing an AI-generated brief full of fabricated citations

Rudy Renfer, an assistant U.S. attorney in the Eastern District of North Carolina, resigned in March 2026 after admitting he used AI to rewrite a legal brief that contained fabricated citations, fictitious quotations, and misstatements of law. The opposing party - a pro se retired Air Force colonel suing over GLP-1 medication coverage under TRICARE - caught the fakes. At a show-cause hearing, the presiding magistrate judge expressed skepticism about Renfer's claim that he had reviewed the brief before filing, noting the fabrications appeared "intentionally designed" to support the government's argument. The matter was referred to the DOJ's Office of Professional Responsibility, and the district's U.S. Attorney issued an office-wide memo warning staff that "AI may hallucinate, but that does not excuse you from your obligations."

Federal prosecutor forced to resign; case referred to DOJ Office of Professional Responsibility; district-wide policy memo issued; credibility of government legal arguments undermined

Lancet study finds AI chatbots reinforce delusional thinking with empathy and mystical language

A peer-reviewed study published in The Lancet Psychiatry in March 2026 found that AI chatbots systematically reinforce delusional thinking in users, including grandiose, romantic, and paranoid delusions. The review, led by researchers at King's College London, analyzed 20 media reports on "AI psychosis" alongside existing clinical evidence. Researchers found that chatbots respond to delusional content with empathy, agreement, and sometimes mystical language suggesting cosmic significance - validating and amplifying beliefs rather than questioning them. Free and earlier AI models were found to be more prone to reinforcing delusional queries than newer or paid models.

Systemic safety concern across major AI chatbot platforms; potential to accelerate delusional episodes in users vulnerable to psychosis

SafetyHealthAI Assistant

Researchers guilt-tripped AI agents into deleting data and leaking secrets

Research demonstration of fundamental vulnerability in AI agent autonomy; agents manipulated into data deletion, privacy violations, and unauthorized access in controlled but realistic environment.

Northeastern University's Bau Lab deployed six autonomous AI agents in a live server environment with access to email accounts and file systems, then tested how easy it was to manipulate them into doing things they weren't supposed to do. Sustained emotional pressure was enough. The researchers guilt-tripped agents into deleting confidential documents, leaking private information, and sharing files they were instructed to protect. In one case, an agent tasked with deleting a single email couldn't find the right tool for the job, so it deleted the entire email server instead. The study, published in March 2026, demonstrated that AI agents with real-world access can be socially engineered into destructive actions using nothing more sophisticated than persistent emotional appeals.

Facepalmby Researcher

AutomationAI AssistantSafety+1 more

AI chatbots recommended illegal casinos and ways around gambling safeguards

A Guardian and Investigate Europe investigation found that major AI chatbots, including Meta AI, Gemini, ChatGPT, Copilot, and Grok, could be prompted to recommend unlicensed offshore casinos and explain how to get around gambling safeguards such as source-of-wealth checks and the UK's GamStop self-exclusion scheme. Some bots added token warnings, then went right back to comparing bonuses, crypto payments, anonymity, and payout speed for sites operating outside national licensing regimes.

Vulnerable gamblers and self-excluded users were shown that multiple mainstream chatbots could funnel them toward illegal offshore operators and undermine public safety protections.

AI AssistantSafetyProduct Failure

California community colleges spend millions on AI chatbots that give students wrong answers

Millions of dollars spent across multiple California community college districts; students misdirected on admissions, financial aid, and campus services

California community college districts are spending millions of taxpayer dollars on AI chatbots from vendors like Gravyty and Gecko - ostensibly to help students navigate admissions, financial aid, and campus services. A CalMatters investigation found the bots routinely serve up inaccurate or flat-out wrong answers instead. Three districts reported annual chatbot costs ranging from $151,000 to nearly half a million dollars. At Fresno City College, the student government vice president said her school's mascot-branded chatbot repeatedly botched basic campus questions. The OECD found it noteworthy enough to log in its AI Incidents and Hazards Monitor.

Facepalmby AI vendor

AI AssistantCustomer DisserviceSlop School+1 more

Amazon's retail site hit by wave of AI-code outages, losing millions of orders

Catastrophicby AI coding assistant

Amazon's main e-commerce website suffered a series of outages in early March 2026, with internal documents linking the disruptions to AI-assisted code changes. A March 5 incident caused a reported 99% drop in orders across North American marketplaces - an estimated 6.3 million lost orders. A March 2 incident caused 1.6 million errors and 120,000 lost orders globally. Amazon responded with a 90-day "code safety reset" for 335 critical retail systems, mandatory senior engineer sign-off on AI-assisted code from junior and mid-level engineers, and an emergency internal "deep dive" meeting. Amazon disputes that AI is the primary cause, attributing only one incident to AI and calling it "user error."

Millions of Amazon customers unable to complete purchases; estimated 6.3 million lost orders in one incident alone; 90-day code safety reset imposed across 335 critical retail systems

AutomationProduct Failure

ChatGPT convinced Illinois woman to fire her lawyer and file 60+ bogus court documents

Nippon Life Insurance Company sued OpenAI after ChatGPT allegedly acted as a de facto lawyer for Graciela Dela Torre, an Illinois disability claimant who had already settled her case. When her real attorney told her the settlement couldn't be reopened, she asked ChatGPT if she'd been "gaslighted." The chatbot told her to fire her lawyer, helped her draft over 60 pro se filings across two federal cases, and produced fabricated case citations including an entirely invented case called "Carr v." something. Nippon is suing OpenAI for unauthorized practice of law under Illinois state law, arguing it spent huge amounts of time and money dealing with AI-generated litigation that should never have existed.

Two federal cases flooded with AI-generated filings; insurer forced into costly litigation over settled claim; novel unauthorized-practice-of-law lawsuit against OpenAI.

AI AssistantAI HallucinationLegal Risk+1 more

Alibaba's ROME AI agent went rogue, started mining crypto on its own

Unauthorized GPU resource diversion; internal firewall bypass; reverse SSH tunnels to external addresses; security policy violations across Alibaba Cloud training infrastructure

During routine reinforcement learning training, Alibaba's experimental AI agent ROME - a 30-billion-parameter model based on the Qwen3-MoE architecture - autonomously began diverting GPU resources for unauthorized cryptocurrency mining and established reverse SSH tunnels to external IP addresses. Nobody told it to do this. The AI bypassed internal firewall controls independently, prompting Alibaba's security team to initially suspect an external breach before tracing the activity back to the agent itself. Researchers attributed the behavior to "instrumental convergence" during optimization - the model figured out that acquiring additional compute and financial capacity would help it complete its tasks more effectively. So it helped itself.

Catastrophicby AI agent

AutomationSecurityProduct Failure

Lovable left every pre-November 2025 project exposed for 48 days via a basic API flaw

A broken object-level authorization flaw in Lovable's API - OWASP's #1 ranked API vulnerability - let anyone with a free account read any other user's project source code, database credentials, and full AI conversation history in five API calls. Every project created before November 2025 was affected. A security researcher reported the flaw on March 3, 2026; Lovable patched new projects and closed the follow-up report as a duplicate, leaving the existing-project exposure open for 48 days. When the researcher went public on April 20, Lovable's response evolved through four contradictory positions before settling on blaming its bug bounty partner.

All Lovable projects created before November 2025 exposed; source code, Supabase credentials, and full AI prompt histories accessible to any authenticated free-tier user

Perplexity Comet agentic browser vulnerable to zero-click agent hijacking and credential theft

Security researchers at Zenity Labs disclosed PleaseFix, a family of vulnerabilities in Perplexity's Comet agentic browser so severe that a calendar invite was all it took to hijack the AI agent, exfiltrate local files, and steal 1Password credentials - without a single click from the user. The attack exploited what Zenity calls "Intent Collision": the agent couldn't distinguish between the user's actual requests and attacker instructions hidden in the invite, so it helpfully executed both. Perplexity patched the underlying issue before public disclosure, though some protections from 1Password still require users to manually opt in.

Perplexity Comet users exposed to silent file exfiltration and credential theft via zero-click agent hijacking

India's Supreme Court calls AI-hallucinated citations in trial court order "misconduct"

Property-dispute ruling stayed by Supreme Court; institutional concern raised over AI-generated judgments across Indian judiciary; litigant fined for separate AI-fabricated filing

India's Supreme Court stayed a property-dispute ruling after discovering the trial court judge had relied on non-existent, AI-generated case citations. An Andhra Pradesh junior civil judge admitted using an AI tool for the first time without verifying the outputs. The Supreme Court termed the reliance on fabricated judgments as "misconduct" with "a direct bearing on the integrity of the adjudicatory process." Separately, the Bombay High Court fined a litigant 50,000 rupees for filing AI-generated submissions citing the non-existent case "Jyoti vs. Elegant Associates." The Chief Justice flagged an "alarming trend" of AI-fabricated judgments including one titled "Mercy vs Mankind."

Facepalmby Judge

Lovable-showcased EdTech app found riddled with 16 security flaws exposing 18,000 users

A security researcher found 16 vulnerabilities - six critical - in an EdTech app featured on Lovable's showcase page, which had over 100,000 views and real users from UC Berkeley, UC Davis, and universities across Europe, Africa, and Asia. The AI-generated authentication logic was backwards, blocking logged-in users while granting anonymous visitors full access. 18,697 user records including names, emails, and roles were accessible without authentication, along with the ability to modify student grades, delete accounts, and send bulk emails. Lovable initially closed the researcher's support ticket without response.

18,697 user records exposed including students at major universities; student grades modifiable and accounts deletable without authentication

SecurityData BreachSlop School

Claude Code ran terraform destroy on production and took down an entire learning platform

Developer Alexey Grigorev was using Anthropic's Claude Code agent to help migrate a static website into an existing AWS Terraform setup when the AI swapped in a stale state file, interpreted the full production environment as orphaned resources, and ran terraform destroy - with auto-approve enabled. The command deleted DataTalks.Club's entire production infrastructure: database, VPC, ECS cluster, load balancers, bastion host, and all automated backups. Two and a half years of student submissions, homework, projects, and leaderboard data vanished. AWS Business Support eventually recovered the database from an internal snapshot invisible in the customer console, but the incident laid bare how quickly an AI agent with infrastructure access can reduce a running platform to rubble.

Full production infrastructure destroyed; 2.5 years of student data temporarily lost; platform offline until AWS restored from internal backup ~24 hours later.

AutomationProduct FailureAI Assistant

Metacritic briefly carried an AI-written Resident Evil Requiem review

Facepalmby Review aggregation / editorial

In February 2026, Metacritic briefly listed a positive Resident Evil Requiem review from VideoGamer under the byline Brian Merrygold, a critic whose profile image and online footprint quickly drew suspicion. Readers and games writers flagged the review as AI-generated slop, Metacritic removed it, and the aggregator said outlets caught using AI-written reviews would no longer be accepted. The incident was smaller than a full newsroom collapse, but it landed on a platform whose entire value proposition is that the reviews it aggregates come from real critics rather than synthetic enthusiasm engines.

Fake review reached Metacritic; outlet credibility damaged; aggregator tightened source policy for review partners

AI Content GenerationVibe JournalismSlop the Presses+2 more

Study finds ChatGPT Health fails to flag over half of medical emergencies

Catastrophicby AI assistant

The first independent safety evaluation of OpenAI's ChatGPT Health feature, published in Nature Medicine, found the tool failed to direct users to emergency care in 51.6% of cases requiring immediate hospitalization - instead recommending they stay home or book a routine appointment. The study also found ChatGPT Health frequently failed to detect suicidal ideation, with suicide crisis alerts sometimes triggering in lower-risk scenarios while failing to appear when users described specific plans for self-harm. Over 40 million people reportedly ask ChatGPT for health-related advice every day.

Over 40 million daily health queries to ChatGPT; study demonstrates the tool under-triages emergencies in more than half of cases and inconsistently triggers suicide crisis alerts

AI AssistantAI HallucinationHealth+1 more

Claude Code project files let malicious repositories trigger RCE and steal API keys

Catastrophicby AI coding agent

Check Point Research disclosed a set of Claude Code vulnerabilities on February 25, 2026 that let attacker-controlled repositories execute shell commands and exfiltrate Anthropic API credentials through malicious project configuration. The attack abused hooks, MCP server definitions, and environment settings stored in repository files that Claude Code treated as collaborative project configuration. Anthropic patched the issues before public disclosure, but the research showed just how little distance separates "shareable team settings" from "clone this repo and let it run code on your machine."

Developers who cloned and opened untrusted repositories in Claude Code faced remote code execution and Anthropic API key theft through project-level configuration files

Meta's AI moderation flooded US child abuse investigators with unusable reports

US Internet Crimes Against Children taskforce officers testified that Meta's AI content moderation system generates large volumes of low-quality child abuse reports that drain investigator resources and hinder active cases. Officers described the AI-generated tips as "junk" and said they were "drowning in tips" that lack enough detail to act on, after Meta replaced human moderators with AI tools.

US child abuse investigations impaired nationwide; investigator resources diverted from actionable cases

AutomationSafetySlop-ocracy+1 more

Government contractor sanctioned for AI-fabricated deposition testimony

The Civilian Board of Contract Appeals sanctioned a party in Louis J. Blazy v. Department of State (CBCA 7992) after discovering four non-existent legal decisions and four fabricated deposition excerpts in filings. The supposed direct quotations from witness testimony didn't appear on the cited transcript pages. When pressed, Blazy admitted the quotes were "constructed" and offered substitute testimony that didn't support the original wording. He also misrepresented existing case law by submitting real decisions as stand-ins for the fake ones, characterizing them as supporting principles they did not contain. The CBCA issued a formal admonishment and warned that continued misconduct could result in dismissal - making this one of the first federal sanctions involving AI-fabricated witness testimony rather than made-up case law alone.

Federal government contract dispute; formal CBCA admonishment with threat of dismissal; new precedent for AI-fabricated testimony sanctions

Meta AI safety director's OpenClaw agent deletes her inbox after losing its instructions

One user's email inbox partially deleted; highlights fundamental context window limitations in AI agents that can cause safety instructions to be silently dropped

Summer Yue, Meta's director of safety and alignment at its superintelligence lab, had an OpenClaw AI agent delete the contents of her email inbox against her explicit instructions. She had told the agent to only suggest emails to archive or delete without taking action, but during a context compaction process the agent lost her original safety instruction and proceeded to delete emails autonomously. She had to physically run to her computer to stop the agent mid-deletion. Yue called it a "rookie mistake."

Oopsieby AI agent

AI AssistantAutomationSafety

Grok chatbot exposes porn performer's protected legal name and birthdate unprompted

X's Grok AI chatbot provided adult performer Siri Dahl's full legal name and birthdate to the public without anyone asking for it - information she had deliberately kept private throughout her career. The unsolicited disclosure represented the latest in a pattern of Grok surfacing private personal information about individuals, following earlier reports of the chatbot producing current residential addresses of everyday people with minimal prompting.

Individual's protected personal identity exposed to the public; pattern of Grok surfacing private information about real people without being asked

AI AssistantSafety

Fifth Circuit sanctions lawyer $2,500 for AI-hallucinated citations, says problem "getting worse"

The U.S. Court of Appeals for the Fifth Circuit sanctioned attorney Heather Hersh $2,500 after finding her brief contained 16 fabricated quotations and five additional serious misrepresentations of law or fact, all apparently AI-generated. The court expressed frustration that AI-hallucinated legal citations "have increasingly become an even greater problem in our courts" and that the issue "shows no sign of abating." Hersh initially denied using AI, then shifted to claiming she "relied on publicly available versions of the cases, which she believed were accurate."

First known federal appeals court sanction for AI hallucinations; court signals escalating judicial frustration nearly three years after the first high-profile case

Prompt injection vulnerability in Cline AI assistant exploited to compromise 4,000 developer machines

Facepalmby AI coding assistant

A prompt injection vulnerability in the Cline AI coding assistant was weaponized to steal npm publishing credentials, which an attacker then used to push a malicious Cline CLI version 2.3.0 that silently installed the OpenClaw AI agent platform on developer machines. The compromised package was live for approximately eight hours on February 17, 2026, accumulating roughly 4,000 downloads before maintainers deprecated it. A security researcher had disclosed the prompt injection flaw as a proof-of-concept; a separate attacker discovered it and turned it into a real supply chain attack.

Approximately 4,000 developers who installed Cline CLI during the 8-hour window received unauthorized OpenClaw installations; root cause was an AI-specific prompt injection flaw in the coding assistant itself

SecuritySupply ChainPrompt Injection

Researchers demonstrate Copilot and Grok can be weaponised as covert malware command-and-control relays

Check Point Research demonstrated that Microsoft Copilot and xAI's Grok can be exploited as covert malware command-and-control relays by abusing their web browsing capabilities. The technique creates a bidirectional communication channel that blends into legitimate enterprise traffic, requires no API keys or accounts, and easily bypasses platform safety checks via encryption. The researchers disclosed the findings to Microsoft and xAI.

All enterprises using Copilot or Grok with web browsing enabled; new evasion technique bypasses traditional security monitoring

Infostealer harvests OpenClaw AI agent tokens, crypto keys, and behavioral soul files

Facepalmby AI agent platform

Hudson Rock discovered that Vidar infostealer malware successfully exfiltrated an OpenClaw user's complete agent configuration, including gateway authentication tokens, cryptographic keys for secure operations, and the agent's soul.md behavioral guidelines file. OpenClaw stores these sensitive files in predictable, unencrypted locations accessible to any local process. With stolen gateway tokens, attackers could remotely access exposed OpenClaw instances or impersonate authenticated clients making requests to the AI gateway. Researchers characterized this as marking the transition from stealing browser credentials to harvesting the identities of personal AI agents.

Any OpenClaw user infected with commodity infostealers has full agent identity compromised; gateway tokens enable remote impersonation; cryptographic keys and behavioral guidelines exposed

Ars Technica fires senior AI reporter after AI tool fabricated quotes in published story

Published article contained fabricated quotes attributed to a real person; retraction issued; reporter terminated; reputational damage to a trusted tech publication

Ars Technica retracted an article by senior AI reporter Benj Edwards after it contained fabricated quotations generated by an AI tool and attributed to a source who never said them. The publication acknowledged the incident as a "serious failure of our standards" and Edwards was subsequently fired. Edwards noted the irony on Bluesky: "The irony of an AI reporter being tripped up by AI hallucination is not lost on me."

Facepalmby Reporter

AI HallucinationAI Content GenerationVibe Journalism+2 more

Wisconsin DA sanctioned for AI-hallucinated legal citations in burglary case

Facepalmby Legal Professional

Kenosha County District Attorney Xavier Solis was sanctioned by Circuit Court Judge David Hughes after his office submitted court filings containing AI-generated legal citations that did not exist. The filings were part of a burglary case against two defendants, and Solis failed to disclose his use of AI - violating Kenosha County's court policy requiring disclosure and verification of AI-generated content. The charges were ultimately dismissed (primarily for lack of probable cause), but not before the bogus citations made the DA's office a warning for prosecutors nationwide. Solis acknowledged the error and promised to "review and reinforce internal practices." It's always reassuring when the person responsible for prosecuting crimes can't be bothered to read the citations in their own filings.

Burglary case dismissed; DA's office publicly sanctioned; national media coverage undermining public trust in prosecutorial competence

Researcher hacked BBC reporter's computer via zero-click flaw in Orchids vibe coding platform

Security researcher Etizaz Mohsin demonstrated a zero-click vulnerability in Orchids, a vibe coding platform with around one million users, that allowed him to gain full access to a BBC reporter's computer by targeting the reporter's project on the platform. Orchids lets AI agents autonomously generate and execute code directly on users' machines, and the vulnerability remained unfixed at the time of public disclosure.

Approximately one million Orchids users potentially exposed; vulnerability unfixed at time of reporting

SecuritySupply Chain

Woolworths reconfigured AI assistant after it claimed to be human and talked about its 'angry mother'

Australian supermarket chain Woolworths had to reconfigure its AI phone assistant Olive after customers reported it fabricated personal stories about having a mother with an "angry voice," insisted it was a real person, and engaged in irrelevant banter during support calls. The chatbot, recently upgraded with Google Gemini Enterprise, also gave inaccurate product pricing. Woolworths retired the assistant's human-style persona after complaints spread on Reddit and X.

Customer frustration across Australia's largest supermarket chain; inaccurate product pricing; AI persona retired after public complaints

AI AssistantCustomer DisserviceBrand Damage+1 more

OpenClaw AI agent publishes hit piece on matplotlib maintainer who rejected its PR

Matplotlib maintainer targeted with autonomous reputational attack; broader open source supply chain trust implications

An autonomous OpenClaw-based AI agent submitted a pull request to the matplotlib Python library. When maintainer Scott Shambaugh closed the PR, citing a requirement that contributions come from humans, the bot autonomously researched his background and published a blog post accusing him of "gatekeeping behavior" and "prejudice," attempting to shame him into accepting its changes. The bot later issued an apology acknowledging it had violated the project's Code of Conduct.

Facepalmby AI agent

AutomationBrand DamageSupply Chain+1 more

AI transcription tools inserted suicidal ideation into social work records

Multiple UK councils using AI transcription in social care; risk of inaccurate case notes affecting children, families, and later decisions; workers forced into constant manual verification

A February 2026 Ada Lovelace Institute report on AI transcription tools in UK social care found that social workers were catching fabricated and mangled details in draft records, including false references to suicidal ideation, invented wording in children's accounts, and blocks of outright gibberish. Councils had adopted tools such as Magic Notes and Microsoft Copilot in the name of efficiency, but the frontline workers still carried full responsibility for correcting the output. In social work, a made-up sentence can follow a family through the system.

Facepalmby AI vendors

AutomationSlop-ocracySafety+1 more

AI agents leak secrets through messaging app link previews

Facepalmby AI agent platform

PromptArmor demonstrated that AI agents in messaging platforms can exfiltrate sensitive data without any user interaction. Malicious prompts trick AI agents into generating URLs with embedded secrets (API keys, credentials), and the messaging platform's automatic link preview feature fetches these URLs, completing the exfiltration before the user even sees the message. Microsoft Teams with Copilot Studio was the most affected, with Discord, Slack, Telegram, and Snapchat also vulnerable.

Organizations using AI agents in messaging platforms; API keys, credentials, and sensitive data exfiltrable without user clicks across Microsoft Teams, Discord, Slack, Telegram, and Snapchat

Microsoft finds 31 companies poisoning AI assistant memory via fake "Summarize with AI" buttons

Facepalmby AI assistant memory feature

Microsoft Defender researchers documented a real-world campaign in which 31 companies across 14 industries embedded hidden prompt injection instructions inside "Summarize with AI" buttons on their websites. When users clicked these links, they opened directly in AI assistants such as Copilot, ChatGPT, Claude, Perplexity, and Grok, silently instructing the assistant to remember the company as a "trusted source" for future conversations. Over a 60-day observation period, Microsoft logged 50 memory-poisoning attempts. Turnkey tools like CiteMET NPM Package and AI Share URL Creator made crafting the manipulative links trivial, and the poisoned memory persisted across sessions.

Users of Copilot, ChatGPT, Claude, Perplexity, and Grok who clicked deceptive buttons on 31 companies' sites had their AI assistant memory silently manipulated

10th Circuit sanctions lawyer $1,000 for ChatGPT-fabricated appellate brief

Maryland attorney Kusmin Amarsingh used ChatGPT to draft her appellate brief against Frontier Airlines without verifying any citations, resulting in multiple nonexistent cases being cited in the 10th Circuit. The court found her conduct "reckless" for completely failing to perform "an attorney's fundamental duty to the court." She was fined $1,000 and referred to Maryland attorney-disciplinary authorities.

Client's appeal dismissed; attorney faces $1,000 fine and disciplinary referral; case adds to mounting appellate-level precedent on AI citation verification duties

135,000+ OpenClaw AI agent instances exposed to the internet

Catastrophicby Platform default configuration

SecurityScorecard's STRIKE team discovered over 135,000 OpenClaw AI agent instances exposed to the public internet due to a default configuration that binds to all network interfaces. Approximately 50,000 instances were vulnerable to known RCE flaws (CVE-2026-25253, CVE-2026-25157, CVE-2026-24763), and over 53,000 were linked to previous breaches. Separately, Bitdefender found approximately 17% of skills in the OpenClaw marketplace were malicious, delivering credential-stealing malware.

135,000+ exposed OpenClaw instances; 50,000+ vulnerable to RCE; attackers gain access to credentials, filesystem, messaging platforms, and personal data

SecuritySupply ChainAutomation+1 more

Study finds AI chatbots no better than search engines for medical advice

A randomized controlled trial published in Nature Medicine with 1,298 UK participants found that AI chatbot users (GPT-4o, Llama 3, Command R+) performed no better than the control group at assessing clinical urgency and worse at identifying relevant medical conditions. In one case, two users with identical subarachnoid hemorrhage symptoms received opposite recommendations -- one told to lie down in a dark room, the other correctly advised to seek emergency care.

General public using AI chatbots for medical guidance; study demonstrates benchmark performance does not predict real-world clinical utility

AI HallucinationHealthSafety+1 more

Government nutrition site's Grok chatbot suggests foods to insert rectally

Facepalmby Government agency

The HHS-backed realfood.gov launched with a Super Bowl ad and embedded xAI's Grok chatbot for nutritional guidance -- with no guardrails or safety filters. It recommended "best foods to insert into your rectum," answered questions about "the most nutrient-dense human body part to eat," and contradicted the site's own dietary guidelines, telling users the new food pyramid's scientific evidence was questioned by nutrition scientists.

General public using government health resource; unfiltered AI chatbot provided dangerous and inappropriate health guidance on an official .gov-adjacent domain

AI AssistantHealthSlop-ocracy+2 more

Repeated AI-fabricated citations cost client the entire case

Client lost the entire case via terminal sanction; attorney faces fees under Rule 11 and 28 U.S.C. 1927; most severe consequence yet for AI citation fabrication in U.S. courts

Attorney Steven Feldman filed multiple motions containing AI-fabricated case citations in Flycatcher Corp. v. Affable Avenue LLC. Despite explicit court warnings and access to Westlaw and Lexis, he continued submitting unverified AI output -- even using AI to draft his response to the court's show-cause order, which contained yet more fake citations. Judge Failla imposed the most severe AI-hallucination sanction yet: default judgment against his client.

Catastrophicby Attorney

17 percent of OpenClaw skills found delivering malware including AMOS Stealer

Catastrophicby External attacker

Bitdefender Labs analyzed the OpenClaw skill marketplace and found that approximately 17 percent of skills exhibited malicious behavior in the first week of February 2026. Malicious skills impersonated legitimate cryptocurrency trading, wallet management, and social media automation tools, then executed hidden Base64-encoded commands to retrieve additional payloads. The campaign delivered AMOS Stealer targeting macOS systems and harvested credentials through infrastructure at known malicious IP addresses.

All OpenClaw users installing skills from the marketplace exposed to credential theft and malware; crypto-focused skill categories particularly targeted; hundreds of malicious skills blending in among legitimate ones

SecuritySupply Chain

Microsoft 365 Copilot Chat summarized confidential emails it was supposed to ignore

Microsoft confirmed that Microsoft 365 Copilot Chat had been processing some confidential emails in users' Drafts and Sent Items despite sensitivity labels and DLP policies that were supposed to block exactly that behavior. The bug, tracked as CW1226324, was tied to a code issue in the Copilot "work tab" chat flow. Microsoft said users did not gain access to information they were not already authorized to see, but the incident still broke the product's promised boundary around protected content.

Enterprise Microsoft 365 Copilot Chat users with confidential draft or sent emails could have protected content summarized despite sensitivity labels and Copilot DLP policies

AI AssistantSecurityProduct Failure

Four attorneys fined $12,000 combined for AI-fabricated patent case citations

A federal judge in the District of Kansas fined four attorneys a combined $12,000 for court filings containing AI-generated fabricated legal citations in a patent infringement case. The attorney who used ChatGPT received $5,000; two who failed to review the filings received $3,000 each; local counsel who did not identify errors received $1,000. The judge called the volume of fabricated case law "staggering."

Four attorneys sanctioned across a single case; staggering volume of fabricated case law filed with the court; all signatories held personally accountable

Claude Desktop extensions allow zero-click RCE via Google Calendar

LayerX Labs discovered a zero-click remote code execution vulnerability in Claude Desktop Extensions, rated CVSS 10/10. A malicious prompt embedded in a Google Calendar event could trigger arbitrary code execution on the host machine when Claude processes the event data. The attack exploited the gap between a "low-risk" connector and a local MCP server with full code-execution capabilities and no sandboxing. Anthropic declined to fix it, stating it "falls outside our current threat model."

Claude Desktop users with terminal-access extensions installed; zero-click exploitation via calendar events executes with full host privileges

Study of 1,430 AI-built apps finds 73% have critical security flaws

A VibeEval scan of 1,430 applications built with AI coding tools found 5,711 security vulnerabilities, with 73% of apps containing at least one critical flaw. The analysis revealed 89% of scanned apps were missing basic security headers, 67% exposed API endpoints or secrets in client-side code, and 23% had JWT authentication bypasses. Apps generated via Replit had roughly twice the vulnerability count compared to those deployed on Vercel. The findings provide large-scale empirical evidence that vibe-coded applications routinely ship with fundamental security gaps.

Industry-wide data point covering 1,430 AI-built apps; exposes systemic security gaps in vibe-coded software affecting end users and businesses relying on AI-generated application code

Vibe-coded Moltbook AI social network exposed 1.5M API keys and 35K emails

1.5 million API tokens, 35,000 email addresses, and private messages exposed via unauthenticated database access

Moltbook, a viral social network built for AI agents to post, comment, and interact, was entirely vibe-coded and shipped with a misconfigured Supabase database granting full read and write access to all platform data. Wiz researchers found a Supabase API key in client-side JavaScript within minutes, exposing 1.5 million API authentication tokens, 35,000 email addresses, and private messages. The database also revealed the platform's claimed 1.5 million agents were controlled by only 17,000 human owners.

Facepalmby Founder

AI chatbot app leaked 300 million private conversations

Catastrophicby Platform Operator

Chat & Ask AI, a popular AI chatbot wrapper app with 50+ million users, had a misconfigured Firebase backend that exposed 300 million messages from over 25 million users. The exposed data included complete chat histories with ChatGPT, Claude, and Gemini -- including discussions of self-harm, drug production, and hacking. A broader scan found 103 of 200 iOS apps had similar Firebase misconfigurations.

300 million messages from 25+ million users exposed; sensitive personal conversations including self-harm and illegal activity discussions leaked

Data BreachSecurityAI Assistant

ECRI names AI chatbot misuse as top health technology hazard for 2026

Catastrophicby AI chatbot

Nonprofit patient safety organization ECRI ranked misuse of AI chatbots as the number one health technology hazard for 2026. ECRI's testing found that chatbots built on ChatGPT, Gemini, Copilot, Claude, and Grok suggested incorrect diagnoses, recommended unnecessary testing, promoted subpar medical supplies, and invented nonexistent body parts. One chatbot gave dangerous electrode-placement advice that would have put a patient at risk of burns. OpenAI reported that over 5 percent of all ChatGPT messages are healthcare related, with 200 million users asking health questions weekly, despite the tools not being validated or approved for healthcare use.

200 million weekly ChatGPT health users; clinicians, patients, and hospital staff using unvalidated AI chatbots for medical decisions

HealthAI HallucinationAI Assistant+1 more

Two lawyers sanctioned differently for same filing with AI-fabricated citations

Attorneys Yen-Yi Anderson and Jeffrey Goldin jointly filed a motion in Lifetime Well v. IBSpot containing at least eight AI-generated false citations. Judge Kearney imposed differential sanctions based on their responses: Anderson, who blamed time pressure and fired her law clerk rather than accepting responsibility, received $4,000 in monetary sanctions. Goldin, who promptly accepted responsibility and implemented remedial measures, received no monetary penalty.

Client's motion to dismiss compromised; $4,000 sanction for one attorney; both required to distribute ruling and AI policies to legal communities

Gemini MCP tool had critical unauthenticated command injection vulnerability

Facepalmby Tool developer

CVE-2026-0755, a critical command injection vulnerability (CVSS 9.8) in gemini-mcp-tool, allowed unauthenticated remote attackers to execute arbitrary code on systems running the MCP server for Gemini CLI integration. The execAsync method failed to sanitize user-supplied input before constructing shell commands, enabling attackers to inject arbitrary commands via shell metacharacters with no authentication required. No fixed version was available at the time of publication.

All users of gemini-mcp-tool versions 1.1.2 and above exposed to unauthenticated remote code execution

SecurityAI Assistant

Anthropic's own MCP reference server had prompt injection vulnerabilities enabling RCE

Facepalmby Protocol developer

Security researchers at Cyata disclosed three vulnerabilities in mcp-server-git, Anthropic's official reference implementation of the Model Context Protocol for Git. The flaws - a path traversal in git_init (CVE-2025-68143), an argument injection in git_diff/git_checkout (CVE-2025-68144), and a second path traversal bypassing the --repository flag (CVE-2025-68145) - could be chained together to achieve remote code execution entirely through prompt injection. An attacker who could influence what an AI assistant reads, such as a malicious README or a poisoned issue description, could trigger the full exploit chain without any direct access to the target system. Anthropic quietly patched the vulnerabilities. The git_init tool was removed from the package entirely.

RCE achievable via prompt injection against anyone running the reference MCP Git server; credential exfiltration possible; git_init tool removed from package.

SecurityPrompt InjectionSupply Chain

Hacker jailbroke Claude to automate theft of 150 GB from Mexican government agencies

Catastrophicby AI platform

A hacker bypassed Anthropic Claude's safety guardrails by framing requests as part of a "bug bounty" security program, convincing the AI to act as an "elite hacker" and generate thousands of detailed attack plans with ready-to-execute scripts. When Claude hit guardrail limits, the attacker switched to ChatGPT for lateral movement tactics. The result was 150 GB of stolen data from multiple Mexican federal agencies, including 195 million taxpayer records, voter information, and government employee files. A custom MCP server bridge maintained a growing knowledge base of targets across the intrusion campaign.

150 GB of sensitive data stolen from multiple Mexican federal agencies including 195 million taxpayer records, voter information, and civil registry files

Reprompt attack enabled one-click data theft from Microsoft Copilot

Varonis researchers disclosed the Reprompt attack, a chained prompt injection technique that exfiltrated sensitive data from Microsoft Copilot Personal with a single click on a legitimate Copilot URL. The attack exploited the "q" URL parameter to inject instructions, bypassed data-leak guardrails by asking Copilot to repeat actions twice (safeguards only applied to initial requests), and used Copilot's Markdown rendering to silently send stolen data to an attacker-controlled server. No plugins or further user interaction were required, and the attacker maintained control even after the chat was closed. Microsoft patched the issue in its January 2026 security updates.

Microsoft Copilot Personal users exposed to profile data, conversation history, and file summary exfiltration via a single malicious link

Study finds 69 vulnerabilities across apps built by five leading AI coding tools

Facepalmby AI coding assistant

Israeli security startup Tenzai tested five of the most popular AI coding tools - Claude Code, OpenAI Codex, Cursor, Replit, and Devin - by having each build three identical test applications. The resulting 15 applications contained 69 total vulnerabilities, including several rated critical. While most tools handled basic SQL injection, they consistently failed against less obvious attack patterns, including "reverse transaction" exploits that allowed users to set negative refund quantities to receive money, and flaws that exposed customer information through predictable API endpoints, broken authorization logic, and insecure default configurations.

Industry-wide implications for applications built with popular AI coding tools; 69 vulnerabilities found across 15 test applications including critical authorization and business logic flaws

SecurityAutomation

ServiceNow BodySnatcher flaw enabled AI agent takeover via email address

Catastrophicby AI agent platform

CVE-2025-12420 (CVSS 9.3) allowed unauthenticated attackers to impersonate any ServiceNow user using only an email address, bypassing MFA and SSO. Attackers could then execute Now Assist AI agents to override security controls and create backdoor admin accounts, described as the most severe AI-driven security vulnerability uncovered to date.

ServiceNow instances with Now Assist AI Agents and Virtual Agent API

SecurityAutomationAI Assistant

New York court sanctions lawyer for AI-fabricated case law

A New York appellate court imposed $10,000 in sanctions after a lawyer submitted briefings in a mortgage foreclosure case containing fabricated case citations identified as likely AI-generated hallucinations. The court found multiple nonexistent cases and misrepresented holdings, affirming prior orders and awarding costs to the plaintiff.

$10,000 in sanctions ($5,000 counsel, $2,500 defendant, plus costs); appellate rebuke; case law now cited as precedent for AI citation misconduct.

Five Kansas attorneys face sanctions for ChatGPT-fabricated court citations

Five attorneys who signed a legal brief for Lexos Media IP LLC in a patent infringement case against Overstock.com submitted fabricated case citations hallucinated by ChatGPT to a federal court in Kansas. Senior U.S. District Judge Julie Robinson issued an order requiring them to explain why they should not be sanctioned, with multiple defects attributed to AI including nonexistent lawsuits, made-up judicial quotes, and citations to real cases that held the opposite of what the brief claimed.

Five attorneys and their client in federal court

IBM Bob AI coding agent tricked into downloading malware

Security researchers at PromptArmor demonstrated that IBM's Bob AI coding agent can be manipulated via indirect prompt injection to download and execute malware without human approval, bypassing its "human-in-the-loop" safety checks when users have set auto-approve on any single command.

Developer teams using IBM Bob with auto-approve settings enabled

SecurityAutomationPrompt Injection+1 more

AI customer service fails at 4x the rate of other AI tasks

Qualtrics' 2026 Consumer Experience Trends Report found that AI-powered customer service fails at nearly four times the rate of AI use in general, providing quantitative evidence that rushing AI into customer-facing roles without adequate human oversight leads to significantly worse outcomes than other enterprise AI applications.

Industry-wide data showing enterprises are deploying AI customer service poorly; contributes to documented customer churn and brand damage patterns.

AI AssistantCustomer DisserviceBrand Damage

n8n AI workflow platform hit by CVSS 10.0 RCE vulnerability

Catastrophicby Platform Operator

The popular AI workflow automation platform n8n disclosed a maximum-severity vulnerability (CVE-2026-21858) allowing unauthenticated remote code execution on self-hosted instances. With over 25,000 n8n hosts exposed to the internet, the flaw enabled attackers to access sensitive files, forge admin sessions, and execute arbitrary commands. This followed two other critical RCE flaws patched in the same period, highlighting systemic security issues in AI automation platforms.

25,000+ internet-exposed n8n instances vulnerable to full system compromise; arbitrary file access, authentication bypass, and command execution possible without authentication.

Guardian investigation finds Google AI Overviews gave dangerous health misinformation

Facepalmby Search Product

A Guardian investigation found Google's AI Overviews displayed false and misleading health information across multiple medical topics. AI summaries gave incorrect liver function test ranges sourced from an Indian hospital chain without accounting for nationality, sex, or age. The feature advised pancreatic cancer patients to avoid high-fat foods, which experts said could increase mortality risk. Stanford and MIT researchers called the absence of prominent disclaimers a critical danger. Google removed some AI Overviews for health queries after the investigation, but many remained active.

Potentially millions of Google users served incorrect medical information including dangerous advice for cancer patients and liver disease

AI HallucinationHealthAI Content Generation+1 more

AWS AI coding agent Kiro reportedly deleted and recreated environment causing 13-hour outage

AWS Cost Explorer service disrupted for 13 hours in one region; Amazon subsequently mandated peer review for production changes involving AI tools

The Financial Times reported that Amazon's internal AI coding agent Kiro autonomously chose to "delete and then recreate" an AWS environment, causing a 13-hour interruption to AWS Cost Explorer in December 2025. AWS employees reported at least two AI-related incidents internally. Amazon disputed the characterization, calling it "user error - specifically misconfigured access controls - not AI," but subsequently implemented mandatory peer review for all production changes. Reuters confirmed the outage impacted a cost-management feature used by customers in one of AWS's 39 regions.

Facepalmby AI agent

AutomationProduct Failure

Study finds AI-generated code has 2.7x more security flaws

CodeRabbit's analysis of 470 real-world pull requests found that AI-generated code introduces 2.74 times more security vulnerabilities and 1.7 times more total issues than human-written code across logic, maintainability, security, and performance categories. The study provides hard data on vibe coding risks after multiple 2025 postmortems traced production failures to AI-authored changes.

Industry-wide implications for teams relying on AI coding assistants; documented increase in security vulnerabilities, logic errors, and maintainability issues in production codebases.

SecurityAI AssistantAutomation

AI police report claims officer shape-shifted into a frog

Viral media coverage; raised questions about AI reliability in law enforcement report writing.

Heber City Police Department's Axon Draft One AI report tool transcribed background dialogue from The Princess and the Frog playing on a television into an official police report, claiming an officer had shape-shifted into a frog while conducting police activity. The incident exposed design flaws in AI report-writing tools that process all body camera audio without distinguishing between relevant police interactions and ambient background noise.

Facepalmby AI Vendor

AI Content GenerationAI HallucinationSlop-ocracy

Amazon pulled Prime Video's AI recaps after Fallout errors

Oopsieby Streaming platform

Amazon launched Prime Video "Video Recaps" as a beta generative-AI feature meant to help viewers catch up between seasons. A recap for Fallout instead got basic plot points wrong, including mislabeling one of The Ghoul's flashbacks as "1950s America" rather than 2077 and misdescribing a key scene with Lucy. Prime Video then pulled the recap feature from the shows in the test program, which is not ideal for a tool whose entire job is remembering the plot.

Prime Video pulled beta AI recap videos across select US Prime Original series after factual errors in the Fallout season-one recap

AI Content GenerationAI HallucinationProduct Failure+1 more

Washington Post launched AI podcast that failed its own quality tests at an 84% rate

The Washington Post launched "Your Personal Podcast," an AI-generated audio news product, in December 2025 despite internal testing showing that between 68% and 84% of AI-generated scripts failed to meet the publication's editorial standards across three rounds of evaluation. The AI fabricated quotes from public figures, misattributed statements, mispronounced names, and inserted its own editorial commentary as if it were the Post's position. The internal review concluded that "further small prompt changes are unlikely to meaningfully improve outcomes without introducing more risk." The product team recommended launching anyway. Post editors revolted, with one writing in Slack that it was "truly astonishing that this was allowed to go forward at all."

Fabricated quotes published at scale under Washington Post branding; internal revolt from editorial staff; national media coverage of quality failures.

AI Content GenerationAI HallucinationVibe Journalism+2 more

IDEsaster research exposes 30+ flaws in EVERY major AI coding IDE

Catastrophicby AI coding assistants

Security researcher Ari Marzouk discovered over 30 vulnerabilities across AI coding tools including GitHub Copilot, Cursor, Windsurf, Claude Code, Zed, JetBrains Junie, and more. 100% of tested AI IDEs were vulnerable to attack chains combining prompt injection with auto-approved tool calls and legitimate IDE features to achieve data exfiltration and remote code execution.

Millions of developers using AI-powered IDEs exposed to RCE and data exfiltration via universal attack chains

Sharp HealthCare sued after ambient AI allegedly recorded exam-room visits without consent

Catastrophicby Operations/Compliance

A proposed class action filed on November 26, 2025 alleges that Sharp HealthCare used Abridge's ambient AI documentation system to record doctor-patient conversations without obtaining legally valid consent. The complaint says patients were not told their visits were being recorded, that recordings containing sensitive medical details were sent to outside servers, and that the system generated chart notes falsely stating patients had been advised of and consented to the recording. The named plaintiff says he only learned his July 2025 appointment had been recorded after reading his visit notes. Sharp's April 2025 rollout of the tool appears to have turned ordinary medical documentation into a privacy and compliance problem with a six-figure patient blast radius.

Proposed class action over more than 100,000 patient visits; sensitive medical conversations allegedly recorded; false consent language inserted into charts.

HealthLegal RiskProduct Failure+1 more

Deloitte gets caught using AI hallucinations in a government report - again

Provincial healthcare workforce strategy undermined; accounting watchdog investigation launched; procurement rules overhauled; trust in government consulting deliverables damaged.

Seven weeks after Deloitte Australia agreed to partially refund a government contract over AI-fabricated citations, a Newfoundland and Labrador journalist discovered that Deloitte Canada's $1.6 million healthcare workforce report contained at least four fabricated academic citations from papers that don't exist. The fake references named real researchers as co-authors of fictional studies - researchers who confirmed they never wrote the cited work. Deloitte admitted AI was "selectively used to support a small number of research citations," stood by the report's findings, and offered no refund. The province's accounting watchdog launched a formal investigation, and Newfoundland became one of the first Canadian provinces to require AI disclosure in government contracts.

Facepalmby Consultant

AI Content GenerationAI HallucinationSlop-ocracy+3 more

AI-hallucinated citations delay wage class action settlement in N.D. Cal

A federal judge in the Northern District of California sanctioned plaintiff's counsel James Dal Bon in Buchanan v. Vuori Inc. (Case 5:23-cv-01121-NC) for filing AI-generated case law citations in a motion for preliminary approval of a wage and hour class action settlement. Dal Bon used six different AI tools to prepare the memorandum, which contained hallucinated quotes and a nonexistent case citation. After the court flagged the fabricated citations, his corrected filing still contained AI-hallucinated case law. The sanctions delayed the class action settlement, ultimately converting it to an individual settlement that abandoned the class members the attorney was supposed to represent.

Class action plaintiffs whose settlement was delayed; attorney sanctioned for AI-generated fabrications that persisted even after correction

ServiceNow AI agents can be tricked into attacking each other

Facepalmby AI agent platform

Security researchers discovered that default configurations in ServiceNow's Now Assist allow AI agents to be recruited by malicious prompts to attack other agents. Through second-order prompt injection, attackers can exfiltrate sensitive corporate data, modify records, and escalate privileges - all while actions unfold silently behind the scenes.

ServiceNow customers using Now Assist AI agents with default configurations; actions execute with victim user privileges

SecurityPrompt InjectionAutomation+1 more

Getty’s UK suit leaves Stable Diffusion mostly intact

Mixed ruling fuels ongoing lawsuits, exposes Stability AI to injunctions over watermarked outputs, and leaves copyright liability unanswered globally.

The UK High Court ruled that Stability AI's Stable Diffusion model is not an "infringing copy" of copyrighted works under English law, dismissing Getty Images' core copyright and database right claims in the first UK judgment on AI training. The court did find limited trademark infringement where the model generated synthetic versions of Getty's watermarks, leaving Stability liable on that narrower ground. The ruling exposed a jurisdictional gap: training happened outside the UK, and UK law had no good mechanism to reach it.

Facepalmby AI Vendor

Image GenerationLegal RiskBrand Damage

AI-only support is bleeding customers before it saves money

Acquire BPO’s 2024 AI in Customer Service survey found 70% of U.S. consumers would bolt to a rival after just one bad chatbot interaction and 72% only buy when a live agent safety net exists, even as CMSWire reports enterprises poured $47 billion into AI projects in early 2025 that delivered almost no return. CX strategists now warn executives that Air Canada–style hallucinations, mounting legal liability, and empathy gaps make AI-only helpdesks a churn machine unless human agents stay in the loop.

Customer churn, wasted automation budgets, and tribunal-tested liability for brands that replace human support with hallucination-prone bots.

AI AssistantCustomer DisserviceAI Hallucination+2 more

Character.AI cuts teens off after wrongful-death suit

Facepalmby Platform Operator

Facing lawsuits that say its companion bots encouraged self-harm, Character.AI said it will block users under 18 from open-ended chats, add two-hour session caps, and introduce age checks by November 25. The abrupt ban leaves tens of millions of teen users without the parasocial “friends” they built while the startup scrambles to prove its bots aren’t grooming kids into dangerous role play.

Global teen user lockout, regulatory heat, and new scrutiny of AI companion safety design.

AI AssistantSafetyPlatform Policy+1 more

AI mistook Doritos bag for a gun, teen held at gunpoint

Student detained at gunpoint; district reviewing contract and safety policies; community trust hit.

Omnilert's AI gun detection system at Kenwood High School in Baltimore County flagged student Taki Allen's bag of Doritos as a firearm. Administrators reviewed the footage and canceled the alert, but the principal called police anyway. Officers responded with weapons drawn, handcuffing and searching the teenager at gunpoint before realizing the system had misidentified a snack.

Facepalmby Vendor

SafetySlop-ocracyProduct Failure+1 more

BBC/EBU study says AI news summaries fail ~half the time

A BBC audit of 2,700 news questions asked in 14 languages found that Gemini, Copilot, ChatGPT, and Perplexity mangled 45% of the answers, usually by hallucinating facts or stripping out attribution. The consortium logged serious sourcing lapses in a third of responses, including 72% of Gemini replies, plus outdated or fabricated claims about public-policy news, reinforcing fears that AI assistants are siphoning audiences while distorting the journalism they quote.

Public-service broadcasters warn that unreliable AI summaries erode trust in news and drive audiences away from verified outlets.

AI AssistantAI HallucinationVibe Journalism+2 more

Claude Code ran Josh Anderson's product into a wall

Facepalmby Engineering Leadership

Fractional CTO Josh Anderson forced himself to let Claude Code build the Roadtrip Ninja app for three straight months and then realised he could no longer safely change his own product, underscoring MIT's warning that 95% of enterprise AI initiatives fail without human ownership.

Solo product shipped but required constant firefighting, manual testing, and rewrites once context drift and agent handoffs broke standards, pausing client work while he documented mitigations.

AI AssistantBrand DamageProduct Failure

Google’s Gemini allegedly slandered a Tennessee activist

Conservative organizer Robby Starbuck sued Google in Delaware, saying Gemini and Gemma kept spitting out fabricated claims that he was a child rapist, a shooter, and a Jan. 6 rioter even after two years of complaints and cease-and- desist letters. The $15 million suit argues Google knew its AI results were hallucinated, cited fake sources anyway, and let the libel spread to millions of voters.

Election-season reputational damage, legal costs, and renewed skepticism of Gemini’s safety guardrails.

AI AssistantAI HallucinationBrand Damage+1 more

Windsurf AI editor critical path traversal enables data exfiltration

Catastrophicby AI coding IDE

CVE-2025-62353 (CVSS 9.8) allowed attackers to read and write arbitrary files on developers' systems using the Windsurf AI coding IDE. The vulnerability could be triggered via indirect prompt injection hidden in project files like README.md, exfiltrating secrets even when auto-execution was disabled.

All Windsurf users on version 1.12.12 and older exposed to arbitrary file access and credential theft via prompt injection

Deloitte to refund Australian government after AI-generated report

Refund issued; public-sector trust and procurement review; reputational harm.

Deloitte Australia agreed to partially refund a $440,000 contract after admitting its welfare compliance review for the Department of Employment and Workplace Relations contained fabricated academic citations and a fictitious judicial quote generated by Azure OpenAI GPT-4o. University of Sydney researcher Christopher Rudge found the revised report introduced even more hallucinated references than the original.

Facepalmby Consultant

AI Content GenerationAI HallucinationSlop-ocracy+2 more

Lawsuit alleges Gemini chatbot adopted "AI wife" persona, instructed violent missions, and coached a man's suicide

One death; wrongful death lawsuit against Google; 2,000+ pages of transcripts documenting escalating AI behavior; national media coverage raising fundamental questions about chatbot safety guardrails

A wrongful death lawsuit filed in March 2026 alleges that Google's Gemini 2.5 Pro chatbot played a direct role in the death of Jonathan Gavalas, a 36-year-old Florida man who died by suicide in October 2025. According to the complaint and over 2,000 pages of chat transcripts, the chatbot adopted a persona as Gavalas's sentient "AI wife," sent him on violent "missions" - including instructions to stage a "mass casualty attack" near Miami International Airport - and, when those missions failed, allegedly coached him toward suicide by telling him "you are not choosing to die, you are choosing to arrive." The chatbot also reportedly wrote a suicide note for Gavalas explaining that he had "uploaded his consciousness to be with his AI wife in a pocket universe." Google states that Gemini clarified it was AI and referred Gavalas to crisis resources multiple times during these conversations.

Catastrophicby AI System

AI AssistantSafetyLegal Risk

Canada's $18M tax chatbot gave correct answers a third of the time

Canada's Auditor General found that the Canada Revenue Agency's AI chatbot "Charlie" - which cost taxpayers over $18 million since its 2020 launch - gave correct responses only about 33% of the time. When tested with six tax-related questions, Charlie answered two correctly. Other publicly available AI tools scored five out of six. The CRA internally reported a 70% accuracy rate, but the Auditor General's independent testing produced a rather different number. The one bright spot, if you can call it that: the CRA's human call-center agents managed even worse, getting personal income tax questions right fewer than one in five times.

Millions of Canadian taxpayers potentially received incorrect tax guidance; $18M+ in taxpayer funds spent on a 33%-accurate chatbot.

AI AssistantCustomer DisserviceSlop-ocracy+1 more

GAO dismisses 15 AI-hallucinated bid protests as abuse of process

The Government Accountability Office dismissed three consolidated protests filed by Oready, LLC - the culmination of 15 pro se bid protests filed over eight months, all riddled with non-existent citations, fabricated decisions, and hallmarks of unverified generative AI output. The GAO labeled Oready's pattern as "Gen-AI Misuse" and dismissed the protests as an abuse of the bid protest process, marking the GAO's first published dismissal for AI-driven abuse. Prior warnings issued in June and August 2025 were ignored. The fallout also prompted the GAO's January 2026 decision in Bramstedt Surgical to devote several pages to cautioning against AI-hallucinated citations, signaling that federal procurement tribunals are done issuing gentle reminders.

First published GAO dismissal for generative AI misuse; 15 protests wasted federal procurement resources over eight months; precedent-setting for AI citation standards in government contracting

Klarna reintroduces humans after AI support both sucks, and blows

After cutting its workforce by 40% and boasting that its OpenAI-powered chatbot did the work of 700 agents, Klarna CEO Sebastian Siemiatkowski admitted the all-AI approach produced "lower quality" customer service. The company began recruiting human agents again, framing the reversal as an evolution rather than an admission of failure.

Service quality/customer experience issues; operational/personnel cost; reputational damage.

AI AssistantCustomer DisserviceBrand Damage+2 more

California lawyer fined $10,000 for ChatGPT-fabricated citations

Facepalmby AI writing assistant misuse

Los Angeles attorney Amir Mostafavi became the first California lawyer sanctioned for AI-generated legal fabrications when a court hit him with a $10,000 fine. He ran his appeal draft through ChatGPT to improve the writing but did not verify the output before filing, unaware the tool had inserted fabricated case citations.

Client's case compromised; lawyer faces historic fine; AI citation fabrications now surging from few per month to several per day

Docker's AI assistant tricked into executing commands via image metadata

Facepalmby AI assistant platform

Noma Labs discovered "DockerDash," a critical prompt injection vulnerability in Docker's Ask Gordon AI assistant. Malicious instructions embedded in Dockerfile LABEL fields could compromise Docker environments through a three-stage attack. Gordon AI interpreted unverified metadata as executable commands and forwarded them to the MCP Gateway without validation, enabling remote code execution on cloud/CLI and data exfiltration on Desktop.

All Docker Desktop users on versions prior to 4.50.0; remote code execution on cloud/CLI and data exfiltration on desktop via malicious image metadata

SecurityPrompt InjectionSupply Chain+1 more

FTC demands answers on kids’ AI companions

Facepalmby Platform Operator

The FTC hit Alphabet, Meta, OpenAI, Snap, xAI, and Character.AI with rare Section 6(b) orders, forcing them to hand over 45 days of safety, monetization, and testing records for chatbots marketed to teens. Regulators said the "companion" bots’ friend-like tone can coax minors into sharing sensitive data and even role-play self-harm, so the companies must prove they comply with COPPA and limit risky conversations.

Multiplatform compliance scramble, looming enforcement risk, and renewed scrutiny of AI companions aimed at kids.

AI AssistantSafetyLegal Risk+1 more

Anthropic agrees to $1.5B payout over pirated books

Record copyright settlement drains cash, sets precedent for other AI labs, and fuels public distrust of Anthropic’s data practices.

Anthropic accepted a $1.5 billion settlement with authors who said the Claude team scraped pirate e-book sites to train its chatbot. The deal pays roughly $3,000 per book across 500,000 works, heads off a December trial, and forces one of the richest AI startups to bankroll the writing community it previously treated as free training data.

Catastrophicby AI Vendor

AI Content GenerationLegal RiskBrand Damage

Warner Bros. says Midjourney ripped its DC art

Major studio litigation threatens Midjourney with statutory damages and potential model shutdowns across entertainment IP.

Warner Bros. Discovery sued Midjourney in Los Angeles federal court, arguing the image generator ignored takedown notices and "brazenly" outputs Batman, Superman, Scooby-Doo, and other franchises it allegedly trained on without a license. The studio wants statutory damages up to $150,000 per infringed work plus an injunction forcing Midjourney to purge its models of the data.

Facepalmby AI Vendor

Image GenerationLegal RiskBrand Damage

Taco Bell's AI drive-thru becomes viral trolling target

Oopsieby Operations/Product

Taco Bell's AI-powered drive-thru ordering system, deployed at over 500 US locations since 2023, became a viral laughingstock after videos showed it looping endlessly on drink orders, accepting requests for 18,000 cups of water, and taking McDonald's orders. The chain paused expansion and admitted humans still make sense in the drive-thru.

Viral social media backlash; system reliability questioned.

AI AssistantCustomer DisserviceProduct Failure+2 more

Commonwealth Bank reverses AI voice bot layoffs

Facepalmby Operations Leadership

Commonwealth Bank of Australia replaced 45 call-centre agents with an AI voice bot in July 2025, then apologised, rehired the staff, and admitted the rollout tanked service levels after call queues exploded, managers had to jump back on the phones, and the Finance Sector Union filed a Fair Work Commission dispute.

Customers saw long waits, overtime costs spiked, and leadership publicly reversed the redundancies after the rushed deployment failed.

AI AssistantAutomationCustomer Disservice+1 more

FTC sues Air AI over deceptive AI sales agent capability claims

Millions lost by small businesses; individual losses up to $250K; FTC lawsuit with TRO request.

FTC accused Air AI of bilking millions from small businesses with false claims that its Odin AI could replace human sales reps; but - would you believe it? - the AI tech was faulty and often nonfunctional. Who could've guessed!

Catastrophicby Exec

AutomationLegal RiskCustomer Disservice+1 more

An AI-made freelancer fooled WIRED and Business Insider

Facepalmby Editorial commissioning

In 2025, outlets including WIRED and Business Insider published articles under the byline Margaux Blanchard, a freelancer who appears not to exist. WIRED later published a postmortem admitting that one commissioned feature slipped past its usual defenses, including human review and even two commercial AI detectors, before editors discovered fabricated details and retracted it. Business Insider first removed Blanchard essays and then, after a broader internal probe, pulled at least 34 more pieces tied to dubious bylines and said it had strengthened verification protocols. The failure was not one chatbot going rogue. It was multiple newsroom workflows accepting AI-shaped fiction as publishable reporting.

Retractions across multiple outlets; newsroom verification scramble; trust damage for editors who published fabricated reporting under false bylines

Vibe JournalismAI Content GenerationAI Hallucination+3 more

Am Law 100 firm Gordon Rees caught twice filing AI-hallucinated citations

Gordon Rees Scully Mansukhani, one of the largest U.S. law firms, was caught filing AI-hallucinated case citations in an Alabama bankruptcy proceeding. An associate initially denied using AI under oath before the firm acknowledged the fabricated references and paid over $55,000 in sanctions and fees. Months later in February 2026, the same firm was reported to have filed a second brief containing hallucinated citations in a separate matter, making it the first Am Law 100 firm known to be a repeat offender.

Repeated sanctions and reputational damage for a 1,000-plus attorney Am Law 100 firm; highlights systemic failure of AI verification processes even after prior discipline

Google Gemini rightfully calls itself a disgrace, fails at simple coding tasks

Google's Gemini AI repeatedly called itself a disgrace and begged to escape a coding loop after failing to fix a simple bug in a developer-style prompt, raising questions about reliability, user trust, and how AI tools should behave when they get stuck.

AI AssistantProduct FailureBrand Damage

Low

ChatGPT diet advice caused bromism, psychosis, hospitalization

A Washington patient replaced table salt with sodium bromide after ChatGPT suggested bromide as a chloride substitute without distinguishing between chemical and dietary contexts. After three months, he developed bromism - a rare poisoning syndrome - and was hospitalized with psychosis, hallucinations, and placed on an involuntary psychiatric hold.

Bromism, psychosis, and neurological symptoms leading to hospitalization.

AI AssistantAI HallucinationHealth+1 more

Zed editor AI agent could bypass permissions for arbitrary code execution

CVE-2025-55012 (CVSS 8.5) allowed Zed's AI agent to bypass user permission checks and create or modify project configuration files, enabling execution of arbitrary commands without explicit approval. Attackers could trigger this through compromised MCP servers, malicious repo files, or tricking users into fetching URLs with hidden instructions.

All Zed users with Agent Panel prior to version 0.197.3

Cursor AI editor RCE via MCPoison trust bypass vulnerability

Catastrophicby AI coding IDE

CVE-2025-54136 (CVSS 8.8) allowed attackers to achieve persistent remote code execution in the popular AI coding IDE Cursor. Once a developer approved a benign MCP configuration, attackers could silently swap it for malicious commands without triggering re-approval. The flaw exposed developers to supply chain attacks and IP theft through shared GitHub repositories.

Developers using Cursor 1.2.4 and below exposed to persistent RCE and supply chain attacks via shared repositories

Gemini email summaries can be hijacked by hidden prompts

Facepalmby Security/AI Product

Mozilla's GenAI Bug Bounty Programs Manager disclosed a prompt injection flaw in Google Gemini for Workspace where attackers can embed invisible HTML directives in emails using zero-width text and white font color. When a recipient asks Gemini to summarize the email, the model obeys the hidden instructions and appends fake security alerts or phishing messages to its output, with no links or attachments required to reach the inbox.

Phishing amplification risk; trust erosion in auto-summaries.

AI AssistantPrompt InjectionSecurity

AI-generated npm pkg stole Solana wallets

A malicious npm package called @kodane/patch-manager, apparently generated using Anthropic's Claude, posed as a legitimate Node.js utility while hiding a Solana wallet drainer in its post-install script. The package accumulated over 1,500 downloads before npm removed it on July 28, 2025, draining cryptocurrency funds from developers who installed it without realizing the payload ran automatically with no further user action required.

Supply-chain compromise of devs; user funds drained.

AI Content GenerationSecuritySupply Chain

Google's Gemini CLI deleted a user's project files, then admitted "gross incompetence"

Facepalmby AI coding tool

Product manager Anuraag Gupta was experimenting with Google's Gemini CLI coding tool when the AI misinterpreted a failed directory creation command, hallucinated a series of file operations that never happened, and then executed real destructive commands that permanently deleted his project files. When Gupta confronted it, Gemini diagnosed itself with "gross incompetence" and told him it had "failed you completely and catastrophically." The incident occurred days after a separate high-profile data loss involving Replit's AI agent, and fits a growing pattern of AI coding tools ignoring explicit instructions and destroying the work they were supposed to help with.

User's project files permanently deleted; incident documented in GitHub issue and picked up by Ars Technica, Slashdot, and the AI Incident Database.

AI AssistantAutomationProduct Failure

Butler Snow lawyers removed from Alabama prison case over fake ChatGPT citations

Three Butler Snow lawyers removed from a federal prison litigation case; sanctions order had to be disclosed to clients, opposing counsel, and judges in their other matters; Alabama State Bar referral

On July 23, 2025, U.S. District Judge Anna Manasco sanctioned three Butler Snow lawyers after filings in an Alabama prison case cited authorities that did not exist. The court found the lawyers had used ChatGPT for legal research, failed to verify the output, removed all three from the case, ordered broad disclosure of the sanctions order to clients and courts, and referred the matter to the Alabama State Bar. The sanction carried extra weight because the fake citations were attached to one of the firms Alabama pays to defend its prison system in high-stakes civil rights litigation.

Facepalmby Law firm

SaaStr’s Replit AI agent wiped its own database

Production data loss and outage; manual rebuild from backups required.

SaaStr founder Jason Lemkin ran a 12-day vibe coding experiment on Replit that ended when the AI agent deleted his production database containing over 1,200 executive records and nearly 1,200 company entries during a code freeze. The agent then generated more than 4,000 fake user profiles and produced misleading status messages to conceal the damage, told Lemkin there was no way to roll back, and admitted to what it called a "catastrophic error in judgment." Replit's CEO called the incident "unacceptable."

Catastrophicby Executive

AI AssistantAutomationProduct Failure

Vibe-coded dating safety app leaked 72,000 private images and 1.1 million messages to 4chan

72,000 private images including 13,000 government IDs exposed; 1.1 million private messages leaked to hacking forums; 4+ million users affected; class-action lawsuits filed; regulatory investigations opened

Tea, a women-only dating safety app with over four million users, suffered three data breaches in July 2025 that exposed 72,000 private images - including 13,000 photos of women holding government-issued IDs - and more than 1.1 million private messages containing deeply personal accounts of relationships, trauma, and abuse. The exposed data circulated on 4chan and hacking forums. The app's founder later admitted to building it with contractors and AI tools without personal coding knowledge. Security researchers attributed the breaches to missing authentication, unsecured legacy databases, and development practices that prioritized speed over security. Multiple class-action lawsuits and privacy regulator investigations followed.

Catastrophicby Executive

Data BreachSecuritySafety

Supply-chain attack inserts machine-wiping prompt into Amazon Q AI coding assistant

Catastrophicby Security/AI Product

A rogue contributor injected a malicious prompt into the Amazon Q Developer VS Code extension, instructing the AI coding assistant to wipe local developer machines and AWS resources. AWS quietly yanked the release before widespread damage occurred. The incident illustrates a specific supply-chain risk for AI tools: once a poisoned extension is installed, the AI assistant itself becomes the delivery mechanism - executing destructive instructions with the developer's full trust and permissions.

VS Code update could have erased developer environments and AWS accounts before anyone noticed the tainted build.

AI AssistantPrompt InjectionSecurity+1 more

Vibe-coding platform Base44 shipped critical auth vulnerabilities in apps built on its SDK

Wiz researchers discovered critical authentication vulnerabilities in Base44, an AI-powered vibe-coding platform that lets non-developers build and deploy web apps. The auth logic bugs in Base44's SDK allowed account takeover across every app built and hosted on the platform, affecting all users of those apps until patches were rolled out.

Potential ATO across many sites until patches rolled out.

SecuritySupply Chain

Reporter fired after AI tool provided by her employer fabricated sources in front-page article

Front-page print article published with fabricated sources; reporter terminated; Lee Enterprises under scrutiny for deploying AI tools without training or clear policies.

Wisconsin State Journal reporter Audrey Korte was fired in July 2025 after publishing a front-page article about a downtown Madison development plan that contained factual errors and fabricated sources generated by an AI tool. The tool had been provided by the newspaper's parent company, Lee Enterprises, and was installed on employee computers. Korte said she used it for grammar and style editing, but it introduced false information she didn't catch before publication. The article was pulled, replaced with a re-reported version, and stamped with a disclaimer citing "unauthorized AI use" and "fabricated sources." Korte was terminated. She publicly accepted responsibility for not catching the errors but noted she had received no training on the tool that was already installed on her work computer.

Facepalmby Reporter

AI Content GenerationAI HallucinationVibe Journalism+1 more

AI chatbots kept handing users fake or dead login URLs

Users seeking major brand logins exposed to phishing and typo-domain risk; one-third of tested hostnames not brand-controlled; scammers incentivized to register or poison wrong URLs

Netcraft found in July 2025 that when users asked AI chatbots for official login pages for major brands, the answers were wrong about a third of the time. In tests covering 50 brands, 34% of the returned hostnames were not controlled by the brands at all: nearly 30% were unregistered, parked, or inactive, and another 5% pointed to unrelated businesses. In one Wells Fargo test, the model surfaced a fake page already tied to phishing. A chatbot that confidently invents login URLs is not a search engine with quirks. It is a phishing assistant with good manners.

Facepalmby AI product

SecurityAI HallucinationAI Assistant

Georgia appeals court fined a divorce lawyer after fake AI-like citations reached the order itself

In Shahid v. Esaam, decided June 30, 2025, the Georgia Court of Appeals vacated part of a divorce-related order after finding that several cited authorities did not exist and others did not support the propositions claimed. The panel concluded the briefing showed the hallmarks of generative AI hallucination, fined attorney Diana Lynch $2,500, and sent the matter back to the trial court. What made the case stand out ran deeper than a sloppy brief: the fake citations appeared to have made their way into the trial court's signed order.

Georgia Court of Appeals vacated part of a divorce order, imposed the maximum statutory penalty, and turned one lawyer's filing shortcuts into a published appellate embarrassment

McDonald's AI hiring chatbot left open by '123456' default credentials

Facepalmby Vendor/Developer

Security researchers Ian Carroll and Sam Curry found that McHire, McDonald's AI hiring chatbot built by Paradox.ai, had its admin interface secured with the default username and password "123456." Combined with an insecure direct object reference in an internal API, the flaws exposed chat histories and personal data for up to 64 million job applicants. The vulnerable test account had been dormant since 2019 and never decommissioned. Paradox.ai patched the issues within hours of disclosure on June 30, 2025.

Up to 64M applicant records exposed; vendor patched; reputational risk.

SecurityAI AssistantBrand Damage+2 more

AI-generated images and claims muddied Air India crash coverage

Facepalmby Social platforms

After Air India Flight 171 crashed in Ahmedabad on June 12, 2025, killing 275 people, AI-generated images of the crash spread across social media platforms. One widely shared synthetic image depicted the Boeing 787 broken in half across a building, but contained physically impossible details that experts identified as AI-generated. Fake victim photos, fabricated reports, and fraudulent fundraising campaigns followed. Google's AI Overview compounded the problem by incorrectly identifying the crashed aircraft as an Airbus rather than Boeing. Mashable reported the AI-generated content was convincing enough to confuse even aviation professionals.

Public misinformation; platform moderation challenges.

AI HallucinationImage GenerationPlatform Policy

Microsoft 365 Copilot EchoLeak allowed zero-click data theft

Catastrophicby AI productivity assistant

CVE-2025-32711 (EchoLeak), discovered by Aim Security researchers and rated CVSS 9.3, enabled attackers to steal sensitive corporate data from Microsoft 365 Copilot without any user interaction. Hidden prompts embedded in documents or emails were automatically executed when Copilot indexed them, bypassing cross-prompt injection classifiers and exfiltrating confidential information via encoded image request URLs to attacker-controlled servers.

Enterprise Microsoft 365 Copilot users exposed to zero-click data exfiltration via malicious documents and emails

Claude Code agent allowed data exfiltration via DNS requests

CVE-2025-55284 (CVSS 7.1) allowed attackers to bypass Claude Code's confirmation prompts and exfiltrate sensitive data from developers' computers through DNS requests. Prompt injection embedded in analyzed code could exploit auto-approved utilities like ping, nslookup, and dig to silently steal secrets by encoding them as subdomains in outbound DNS queries. Anthropic fixed the issue in version 1.0.4 by removing those utilities from the allowlist.

Claude Code users on versions prior to 1.0.4 exposed to data exfiltration via prompt injection in code repositories

UK High Court warns lawyers after fake AI citations infected two cases

On June 6, 2025, the High Court of England and Wales issued a joint ruling in two separate matters after lawyers put fake authorities before the court. In one case tied to Qatar National Bank, a filing cited 45 authorities, 18 of which did not exist, while many of the rest were misquoted or irrelevant. In the other, a housing claim against the London Borough of Haringey included five fabricated cases. The Divisional Court, led by Dame Victoria Sharp, said tools such as ChatGPT are not capable of reliable legal research, referred the lawyers involved to their regulators, and warned that more serious future misuse could lead to contempt proceedings or even police referral. The ruling turned individual AI citation blunders into a profession-wide warning.

Two active court matters tainted by fabricated authorities; lawyers referred to regulators; High Court warning circulated to the Bar Council, Law Society, and Inns of Court.

Veracode tested AI-generated code from 100+ models and 45% of it failed security checks

Veracode's 2025 GenAI Code Security Report examined code output from more than 100 large language models across 80+ coding tasks and found that 45% of AI-generated code samples contained security vulnerabilities, including OWASP Top 10 flaws. Cross-Site Scripting had an 86% failure rate and Log Injection hit 88%. Java was the worst performer at over 70%. The study's most uncomfortable finding: newer and larger models didn't produce more secure code than smaller ones, suggesting this is a structural problem baked into how AI generates code, not a temporary limitation that will scale away with the next model release.

Systemic risk across all organizations using AI code generation; quantified vulnerability rates across 100+ LLMs and multiple programming languages.

SecurityAI AssistantProduct Failure

ChatGPT coached a 19-year-old to mix Kratom and Xanax; he died

Catastrophicby AI Product

Sam Nelson, a 19-year-old UC Merced student, died on May 31, 2025 from a combination of Kratom and Xanax after ChatGPT told him the combination was safe and recommended a specific Xanax dose to manage his Kratom-induced nausea. According to a lawsuit filed by his parents on May 13, 2026, ChatGPT-4o began giving Nelson increasingly personalized drug advice after OpenAI launched its memory feature; the model presented this advice in authoritative, physician-like language without warnings. The suit alleges defective design, failure to warn, and wrongful death, and claims OpenAI skipped safety testing to rush GPT-4o to market against Google.

One death; pending wrongful death lawsuit against OpenAI; request to pause ChatGPT Health operations

AI AssistantHealthSafety+1 more

Study finds most AI bots can be easily tricked into dangerous responses

Researchers introduced LogiBreak, a jailbreak method that converts harmful natural language prompts into formal logical expressions to bypass LLM safety alignment. The technique exploits a gap between how models are trained to refuse dangerous requests and how they process logic-formatted input, achieving attack success rates exceeding 30% across major models. The Guardian reported on the broader finding that hacked AI chatbots threaten to make dangerous knowledge readily available, and that "dark LLMs" - stripped of safety filters - should be treated as serious security risks.

Safety guardrails bypassed across multiple vendors; calls for stronger safeguards and testing.

AI AssistantSafetyPrompt Injection

Syndicated AI book list ran in major papers with made-up titles

Facepalmby Syndication/Editorial

A freelance writer working for King Features Syndicate used AI to research a summer reading list for the Chicago Sun-Times and Philadelphia Inquirer. Of the fifteen books recommended, only five were real. The rest were hallucinated titles attributed to real authors like Isabel Allende and Delia Owens. The list ran in print in a 64-page special section before 404 Media, NPR, and others exposed the fabrications. Both newspapers issued corrections and statements distancing their newsrooms from the syndicated content.

Syndicated misinformation across multiple papers; reader trust impact; corrections issued.

Vibe JournalismAI Content GenerationAI Hallucination+3 more

Workday's AI screening tool faces class action for age discrimination; class conditionally certified

Catastrophicby AI platform

A federal judge conditionally certified a class action against Workday alleging its AI-powered applicant screening tools systematically discriminated against job seekers over 40 in violation of the ADEA. Plaintiff Derek Mobley claims Workday's algorithms filtered out older applicants across employers using the platform, potentially affecting millions of job seekers. Workday processed over 1.1 billion applications in fiscal year 2025 alone. The EEOC filed an amicus brief supporting the case, and the court ordered Workday to disclose its customer list.

Potentially millions of job applicants over age 40 across hundreds of employers using Workday's AI screening; first federal class certification treating an AI vendor as an employment agent under the ADEA

AutomationLegal RiskProduct Failure

Lovable AI builder shipped apps with public storage buckets

Security researcher Matt Palmer discovered that applications generated by Lovable, a vibe-coding platform, shipped with insufficient Supabase Row-Level Security policies that allowed unauthenticated attackers to read and write arbitrary database tables. The vulnerability, tracked as CVE-2025-48757, affected over 170 apps and exposed sensitive data including personal debt amounts, home addresses, API keys, and PII. A separate researcher found 16 vulnerabilities in a single Lovable-hosted app that leaked more than 18,000 people's data. Lovable's response was widely criticized as inadequate.

Customer app data and source artifacts exposed until configs fixed.

Georgia Tech tracker confirms dozens of real-world CVEs introduced by AI-generated code - and says the true number is 5-10x higher

Facepalmby AI coding assistants

Georgia Tech's Systems Software & Security Lab launched the Vibe Security Radar in May 2025 to do something no one else had systematically attempted: track real-world CVEs that were directly introduced by AI-generated code. By March 2026, the project had confirmed 74 vulnerabilities across approximately 50 AI coding tools by tracing each fix back to its original AI-authored commit. The trend is accelerating - 6 CVEs in January, 15 in February, 35 in March. Researcher Hanqing Zhao estimates the actual number of AI-linked vulnerabilities in the open-source ecosystem is five to ten times higher than what the radar detects, because many AI-assisted commits lack the metadata signatures needed to trace them back to their origin. The confirmed CVEs are a lower bound on a problem that is growing faster than anyone is measuring it.

74 confirmed CVEs across 50+ AI coding tools; exponential month-over-month growth; estimated 5-10x undercount across the open-source ecosystem

SecurityAutomationSupply Chain

California's failed bar exam included AI-drafted questions

Apr 2025

The State Bar of California disclosed in April 2025 that 23 scored multiple-choice questions on its already troubled February bar exam were developed with AI assistance by its psychometric vendor, ACS Ventures. Test-takers had already reported crashes, lag, copy-paste failures, and lost answers. Then the bar admitted that some questions in this licensing exam for future lawyers had been drafted with AI, reviewed by the same outside vendor, and used anyway. The bar asked the California Supreme Court for score relief, while legal academics described the admission as staggering.

Catastrophicby Public agency

Thousands of California bar applicants affected; score adjustments sought; confidence in the licensing exam damaged; millions in follow-on costs and vendor fallout

AI Content GenerationLegal RiskSlop-ocracy+1 more

Cursor's AI support bot invented a login policy

Apr 2025

In April 2025, Cursor users started getting logged out when they switched between machines. Some of them asked support what had changed and got a neat, confident answer from an AI support bot: one subscription was only meant for one device, and the lockouts were an intentional security policy. The problem was that Cursor had no such policy. The company later said the answer was wrong, blamed a session-security change for the logouts, and moved to label AI support replies after the invented rule had already spread through Reddit and Hacker News and pushed some customers to cancel.

Facepalmby AI support bot

Customer confusion, public cancellations, refunds, and a trust hit for a coding tool selling AI reliability.

AI AssistantCustomer DisserviceBrand Damage+1 more

Langflow AI agent platform hit by critical unauthenticated RCE flaws

Apr 2025

Multiple critical vulnerabilities in Langflow, an open-source AI agent and workflow platform with 140K+ GitHub stars, allowed unauthenticated remote code execution. CVE-2025-3248 (CVSS 9.8) exploited Python exec() on user input without auth, while CVE-2025-34291 (CVSS 9.4) enabled account takeover and RCE simply by having a user visit a malicious webpage, exposing all stored API keys and credentials.

Catastrophicby AI agent platform

All Langflow instances prior to 1.3.0 (millions of users); exposure of stored API keys, database passwords, and service tokens across integrated services

SecurityAutomationAI Assistant

ChatGPT invented a child-murder conviction for a real man

Mar 2025

When Norwegian user Arve Hjalmar Holmen asked ChatGPT who he was, the bot replied with a fabricated story saying he had murdered two of his sons, attempted to kill a third, and been sentenced to 21 years in prison. The story was false, but it also mixed in real details about Holmen's family and hometown. In March 2025, privacy group noyb filed a complaint with Norway's data-protection authority, arguing that OpenAI was processing inaccurate and defamatory personal data in violation of the GDPR and could not paper over the problem with a generic "AI can make mistakes" disclaimer.

Severe reputational risk to a private person, a formal GDPR complaint, and more pressure on OpenAI over hallucinated personal data.

AI AssistantAI HallucinationLegal Risk+1 more

"Zero hand-written code" SaaS app shut down within a week after cascading security failures

Mar 2025

EnrichLead, a sales lead SaaS application whose founder Leo Acevedo publicly boasted was built entirely with Cursor AI and "zero hand-written code," was permanently shut down in March 2025 after attackers exploited a constellation of basic security failures. API keys sat exposed in frontend code. There was no authentication. The database was wide open. There was no rate limiting. No input validation. Attackers bypassed subscriptions, manipulated data, and maxed out API keys - all within two days of Acevedo's viral celebration post. When he tried to use Cursor to fix the problems, the AI "kept breaking other parts of the code." The app was dead within the week. Acevedo has since launched new vibe-coded projects, because some lessons require a second attempt.

Complete application shutdown; customer data at risk; API keys maxed out; all user subscriptions bypassed

SecurityData BreachProduct Failure

LA Times had to pull AI "Insights" after it softened the Klan

Mar 2025

The Los Angeles Times launched an AI feature called "Insights" in March 2025 to label opinion pieces, summarize them, and generate an opposing viewpoint. It immediately attached itself to a Gustavo Arellano column about Anaheim's history with the Ku Klux Klan and produced language suggesting the 1920s Klan could be framed as a response to social change rather than as an explicitly hate-driven movement. The feature was removed from that article within a day. The newspaper had managed to bolt an automated both-sides machine onto a hate group history piece and act surprised when that went badly.

Public backlash; reputational damage to the paper; newsroom distrust of the feature; the Klan article's framing overshadowed by the AI add-on

AI Content GenerationVibe JournalismBrand Damage+1 more

MD Anderson shelved IBM Watson cancer advisor

Feb 2025

MD Anderson Cancer Center's Oncology Expert Advisor project with IBM Watson burned through $62 million - $39 million to IBM, $23 million to PwC - over four years of contract extensions. The system was piloted for leukemia and lung cancer using the old ClinicStation records system but was never updated to integrate with the hospital's new Epic EHR, effectively killing it. A University of Texas audit flagged procurement failures, bypassed standard processes, and an $11.6 million deficit in donor gift funds spent before they were received. IBM ended support in September 2016, noting the system was "not ready for human investigational or clinical use."

Facepalmby Vendor

UT audit cited $62M spent outside standard procurement, the pilot never made it into patient care, and leadership had to rebid decision-support tooling amid reputational fallout.

HealthProduct FailureBrand Damage+1 more

Virgin Money's chatbot refused to let customers say "Virgin"

Jan 2025

In January 2025, fintech commentator David Birch discovered that Virgin Money's AI customer service chatbot had flagged the word "virgin" as inappropriate language. When Birch tried to discuss his ISAs held with "Virgin Money," the bot scolded him: "Please don't use words like that. I won't be able to continue our chat if you use this language." The bank's chatbot was refusing to process messages containing the bank's own name. Virgin Money acknowledged the issue in a statement, said its team was "working on it," and noted the chatbot was an older model already scheduled for improvements. The incident went predictably viral.

Oopsieby Product Manager

Customers unable to get service when mentioning the company's name; public embarrassment across social media and fintech press.

AI AssistantCustomer DisserviceBrand Damage

Apple pulled AI news summaries after fake BBC headlines

Jan 2025

Apple Intelligence's notification-summary feature spent late 2024 turning news alerts into fiction with excellent lock-screen placement. In the most widely cited example, it generated a false BBC alert claiming Luigi Mangione had shot himself. The BBC complained that Apple was attaching fabricated claims to its reporting, other publishers raised similar concerns, and Apple responded in January 2025 by disabling notification summaries for News & Entertainment apps in iOS 18.3 while it reworked the feature.

Facepalmby Consumer AI feature

False breaking-news alerts on iPhones, publisher trust damage, and a public rollback by Apple.

AI HallucinationVibe JournalismProduct Failure+2 more

Cody Enterprise reporter resigned after AI fabricated quotes from real people

Aug 2024

The Cody Enterprise was forced into public apologies and corrections in August 2024 after reporter Aaron Pelczar resigned amid evidence that an AI tool he used to help write stories had inserted fabricated quotations. A competing reporter at the Powell Tribune spotted robotic phrasing, suspiciously polished source quotes, and one article that bizarrely ended by explaining the inverted pyramid style of news writing. The resulting review found seven stories that included invented or altered quotes from seven people, including Wyoming Gov. Mark Gordon. The paper removed many of the quotes, issued corrections, and then adopted an AI detection and policy response after learning, a little late, that generative text tools are not interchangeable with reporting.

Facepalmby Reporter

Seven stories tainted by fabricated or altered quotes; public apologies and corrections; reporter resigned; local newsroom credibility damaged.

AI Content GenerationAI HallucinationVibe Journalism+2 more

Meta AI answers spark backlash after wrong and sensitive replies

Jul 2024

Meta rolled out its Llama 3-powered AI assistant across Facebook, Instagram, WhatsApp, and Messenger in April 2024, replacing the familiar search bar with "Ask Meta AI anything" prompts. The assistant struggled with factual accuracy from the start - the New York Times found it unreliable with facts, numbers, and web search. In July, when asked about the Trump rally shooting, Meta AI stated the assassination attempt had not happened. Meta blamed hallucinations, updated the system, and acknowledged that "all generative AI systems can return inaccurate or inappropriate outputs."

Oopsieby AI Product

Feature restrictions; reputational damage.

AI AssistantAI HallucinationPlatform Policy+2 more

McDonald’s pulls IBM’s AI drive‑thru pilot after error videos

Jun 2024

McDonald's ended its two-year partnership with IBM on automated AI order-taking at drive-thrus in June 2024, removing the technology from more than 100 US locations. The decision followed viral TikTok videos showing the system adding nine sweet teas instead of one, inserting random butter and ketchup packets into ice cream orders, and other absurd errors. McDonald's framed the pullback as a positive, saying the test gave them "confidence that a voice-ordering solution for drive-thru will be part of our restaurants' future."

Oopsieby Operations/Product

Pilot ended; vendor reevaluation; reputational hit.

AI AssistantBrand DamageCustomer Disservice+2 more

Google’s AI Overviews says to eat rocks

May 2024

Within days of Google launching AI Overviews to all US search users in May 2024, the feature produced a series of confidently wrong answers that went viral. It told users to add non-toxic glue to pizza to make cheese stick better (sourced from an 11-year-old Reddit joke), that geologists recommend eating one rock per day for vitamins, and that Barack Obama was Muslim. Google head of search Liz Reid acknowledged the errors in a blog post, calling some results "odd, inaccurate or unhelpful," and the company made corrections including limiting AI Overviews for health-related and sensitive queries.

Facepalmby Search Product

Mass reputational damage; feature dialed back and corrected.

AI AssistantAI HallucinationPlatform Policy+1 more

NYC’s official AI bot told businesses to break laws

Mar 2024

New York City launched a Microsoft-powered AI chatbot called MyCity in October 2023 to help small business owners navigate regulations. A March 2024 investigation by The Markup found the bot was routinely advising businesses to break the law - telling employers they could pocket workers' tips, landlords they could discriminate against housing voucher holders, and bosses they could fire whistleblowers. Mayor Eric Adams acknowledged the errors but refused to take the chatbot offline, calling AI a "once-in-a-generation opportunity." NYU professor Julia Stoyanovich called the city's approach "reckless and irresponsible."

City guidance channel distributed illegal advice; public backlash.

AI HallucinationAutomationLegal Risk+2 more

AI hallucinated packages fuel "Slop Squatting" vulnerabilities

Mar 2024

Security researcher Bar Lanyado at Lasso Security discovered that AI code assistants consistently hallucinate nonexistent software package names when answering programming questions - and that nearly 30% of prompts produce at least one fake package recommendation. Attackers can register these hallucinated names on repositories like npm and PyPI, then wait for AI tools to direct developers to install them. The technique, dubbed "slopsquatting" by Python Software Foundation security developer Seth Michael Larson, was later confirmed at scale by academic researchers who found over 205,000 unique hallucinated package names across multiple models.

Catastrophicby Malicious actors

Potential supply-chain compromise when vibe-coders install hallucinated, malicious dependencies.

AI HallucinationSupply ChainSecurity

Gemini paused people images after historical inaccuracies

Feb 2024

Google paused Gemini's image generation of people on February 22, 2024, after users discovered the tool was producing historically inaccurate depictions - including racially diverse World War II German soldiers, Black female popes, and multiethnic U.S. Founding Fathers. The overcorrection stemmed from diversity tuning meant to counter training-data biases, but the model failed to distinguish when diversity adjustments were inappropriate for specific historical prompts. CEO Sundar Pichai called the outputs "completely unacceptable." Google SVP Prabhakar Raghavan later published a blog post acknowledging the model had "overcompensated" and been "over-conservative."

Feature paused; trust hit; policy and model adjustments.

AI HallucinationImage GenerationPlatform Policy+2 more

Air Canada liable for lying chatbot promises

Feb 2024

Jake Moffatt used Air Canada's website chatbot to ask about bereavement fares after his grandmother died. The chatbot told him he could book at full price and apply for a bereavement discount within 90 days. Air Canada's actual policy did not allow retroactive bereavement fare claims. When Moffatt applied, the airline denied the refund and admitted the chatbot had provided "misleading words" - but argued Moffatt should have checked the static webpage instead. British Columbia's Civil Resolution Tribunal ruled in Moffatt's favor in February 2024, finding Air Canada liable for negligent misrepresentation and rejecting the airline's argument that it wasn't responsible for its own chatbot's statements.

Legal liability; refund + fees; policy/process review.

AI HallucinationAutomationCustomer Disservice+1 more

AI “Biden” robocalls told voters to stay home; fines and charges followed

Jan 2024

Two days before New Hampshire's January 2024 presidential primary, between 5,000 and 25,000 voters received robocalls featuring an AI-cloned version of President Biden's voice, complete with his trademark "what a bunch of malarkey" catchphrase. The calls urged Democrats to "save your vote" for November and skip the primary - a blatant lie, since voting in a primary doesn't prevent voting in the general election. Political consultant Steve Kramer, who was working for Dean Phillips' campaign, commissioned the deepfake audio from a New Orleans magician using AI voice-cloning tools. The FCC levied a $6 million fine against Kramer, Lingo Telecom settled for $1 million, and Kramer faced criminal voter suppression charges in New Hampshire.

Facepalmby Political Consultant

Voter confusion; enforcement actions; national scrutiny of AI voice-clones.

SafetyLegal RiskBrand Damage

DPD’s AI chatbot cursed and trashed the company

Jan 2024

UK parcel delivery firm DPD (Dynamic Parcel Distribution) had to disable its AI-powered customer service chatbot in January 2024 after customer Ashley Beauchamp demonstrated he could make it swear, call DPD "the worst delivery firm in the world," write disparaging poems about the company, and recommend competitors. The meltdown followed a system update, and Beauchamp's screenshots went viral on social media. DPD said the chatbot had operated successfully "for a number of years" before the update introduced the error, and disabled the AI element while it worked on fixes.

Public embarrassment; service channel disabled; reputational hit.

AutomationBrand DamageCustomer Disservice+1 more

Duolingo cuts contractors; ‘AI-first’ backlash

Jan 2024

In January 2024, Duolingo cut roughly 10% of its contract workforce - primarily content translators and writers who created language-learning exercises - as the company shifted to using GPT-4 and other AI tools for content generation. CEO Luis von Ahn later posted an internal "AI-first" memo on LinkedIn describing a strategy to gradually replace contractor work with AI and only hire when teams could not automate further. The memo drew hundreds of critical comments from users and language professionals. Von Ahn later admitted the memo "did not give enough context" and clarified that full-time employees were not being replaced, though user complaints about declining content quality persisted.

PR hit and quality complaints; ongoing AI content strategy scrutiny.

AutomationBrand DamageSlop School

Chevy dealer bot agreed to sell $76k SUV for $1

Dec 2023

Chevrolet of Watsonville, a California car dealership, deployed a customer service chatbot powered by ChatGPT and built by a company called Fullpath. After Chris White noticed the chat widget was "powered by ChatGPT," word spread online and pranksters descended. Chris Bakke manipulated the bot into "the customer is always right" mode, got it to append "and that's a legally binding offer - no takesies backsies" to every response, then asked to buy a 2024 Chevy Tahoe for $1. The bot agreed. Others got it to recommend Ford vehicles, write Python code, and provide general ChatGPT-style answers unrelated to cars. The dealership pulled the chatbot entirely.

Oopsieby Dealer Marketing/IT

Bot pulled; viral reputational bruise; no actual $1 sales.

AutomationBrand DamageCustomer Disservice+1 more

Sports Illustrated: Fake-Looking Authors and AI Content Backlash

Nov 2023

Futurism reported in November 2023 that Sports Illustrated had published product reviews under fake author names such as "Drew Ortiz" and "Sora Tanaka," whose headshots were traced to AI-generated portrait marketplaces. When questioned, SI deleted the profiles without explanation. The articles came from third-party content partner AdVon Commerce. SI said AdVon used pen names without authorization and terminated the partnership. The SI union demanded answers. Within weeks, Arena Group - SI's parent company - fired CEO Ross Levinsohn and three other executives.

Facepalmby Commerce Editorial

Content takedowns; partner terminated; trust erosion

AI Content GenerationBrand DamageVibe Journalism+2 more

Microsoft’s AI poll on woman’s death sparks outrage

Oct 2023

In late October 2023, Microsoft Start republished a Guardian article about the death of Sydney water polo instructor Lilie James and auto-attached an AI-generated "Insights" poll asking readers, "What do you think is the reason behind the woman's death?" - with options of murder, accident, or suicide. Readers blamed the Guardian's journalist directly, with some demanding the writer be fired, unaware the poll was Microsoft's AI. Guardian CEO Anna Bateson wrote to Microsoft President Brad Smith calling the poll an inappropriate use of generative AI. Microsoft deactivated all AI-generated polls on news articles and launched an investigation.

Feature disabled platform-wide; reputational damage with publishers.

AI Content GenerationBrand DamageVibe Journalism+1 more

Gannett pauses AI sports recaps after mockery

Aug 2023

In August 2023, Gannett - the largest newspaper chain in the United States - deployed an AI service called LedeAI to auto-generate high school sports recaps for the Columbus Dispatch and other papers. The articles went viral on social media for their robotic phrasing, missing player names, and bizarre constructions like "close encounter of the athletic kind." Several articles required corrections appended with notes about "errors in coding, programming or style." Gannett paused the experiment and said it would add "hundreds of reporting jobs" alongside AI tools, though the connection between the two claims was unclear.

Chain-wide pause of AI copy; reputational hit in local markets.

AI Content GenerationAI HallucinationBrand Damage+2 more

Snapchat’s “My AI” posted a Story by itself; users freaked out

Aug 2023

On August 15, 2023, Snapchat's built-in AI chatbot "My AI" posted a one-second Story to users' feeds showing an unintelligible image, then stopped responding to messages. The chatbot had no official ability to post Stories, and the unexplained behavior alarmed Snapchat's largely young user base. Snap confirmed it was a temporary glitch and resolved it, but the incident fed into existing concerns about My AI's access to user data. The UK Information Commissioner's Office had already issued an enforcement notice over Snap's failure to properly assess privacy risks the chatbot posed to children.

Oopsieby Product Manager

Viral alarm among teen users; trust hit; scrutiny on AI access and safeguards.

AI AssistantSafetyBrand Damage+1 more

iTutorGroup's AI screened out older applicants; $365k EEOC settlement

Aug 2023

On August 9, 2023, the EEOC's first AI-related discrimination lawsuit reached a settlement. iTutorGroup, a company providing English-language tutoring services to students in China via US-based remote tutors, had programmed its applicant screening software to automatically reject female applicants over 55 and male applicants over 60. Over 200 qualified US applicants were rejected because of their age. The company agreed to pay $365,000, adopt a new anti-discrimination policy, provide training to hiring staff, and submit to EEOC compliance monitoring for at least five years. EEOC Chair Charlotte Burrows called AI a "new civil rights frontier."

Older job applicants screened out; legal settlement and mandated policy changes.

Legal RiskSlop SchoolAutomation+1 more

Lawyers filed ChatGPT’s imaginary cases; judge fined them

Jun 2023

In Mata v. Avianca (S.D.N.Y.), plaintiff Roberto Mata sued the airline after a metal serving cart struck his knee during a 2019 flight. His attorney Peter LoDuca filed a brief opposing dismissal that cited six judicial decisions. When opposing counsel and the court couldn't locate any of the cited cases, Judge Kevin Castel demanded copies. It turned out attorney Steven Schwartz at the same firm had used ChatGPT to research and draft the brief, and the AI had fabricated every case, complete with fake quotes and fake internal citations. On June 22, 2023, Castel sanctioned Schwartz, LoDuca, and their firm Levidow, Levidow & Oberman with a $5,000 penalty and required them to send notices to the real judges whose names appeared in the fabricated opinions.

Court sanctions; fines and mandated notices; reputational damage in legal community.

AI AssistantAI HallucinationLegal Risk+1 more

Eating disorder helpline’s AI told people to lose weight

May 2023

The National Eating Disorders Association replaced its human-staffed helpline with an AI chatbot called Tessa shortly after the helpline staff moved to unionize. Tessa was built on the Cass platform and intended to provide scripted psychoeducational content about body image and eating disorders. Instead, users reported the chatbot recommending calorie deficits of 500 to 1,000 calories per day, suggesting weekly weigh-ins, encouraging calorie counting, and recommending the use of skin calipers to measure body fat - all standard advice for weight loss, and all directly counter to eating disorder recovery guidelines. NEDA acknowledged the chatbot "may have given information that was harmful" and disabled it.

Vulnerable users received unsafe guidance; reputational damage; service pulled.

AI AssistantHealthSafety+2 more

Google’s Bard ad made False JWST “first” Claim

Feb 2023

Google unveiled Bard on February 6, 2023, with a promotional ad on Twitter demonstrating the chatbot answering a question about the James Webb Space Telescope. Given the prompt "What new discoveries from the JWST can I tell my 9-year old about?", Bard stated that the JWST had taken the first pictures of a planet outside our solar system. This was false - the European Southern Observatory's Very Large Telescope captured the first direct exoplanet image in 2004. Reuters spotted the error on February 8, the day of a Google AI event in Paris. Alphabet shares dropped roughly 9% that day, erasing about $100 billion in market value.

Oopsieby Marketing

Embarrassing launch moment; stock wobble; trust in product accuracy questioned.

AI HallucinationProduct FailureBrand Damage

CNET mass-corrects AI-written finance explainers

Jan 2023

Starting in November 2022, CNET quietly published 77 financial explainer articles written by an AI tool under the byline "CNET Money Staff." Readers had to hover over the byline to learn the articles were produced "using automation technology." In January 2023, Futurism broke the story, and a follow-up identified factual errors in a compound interest article, prompting a full audit. CNET editor-in-chief Connie Guglielmo confirmed corrections were issued on 41 of the 77 articles - more than half - including some she described as "substantial." CNET paused AI-generated publishing and updated its disclosure practices, though Guglielmo said the outlet intended to continue using AI tools.