How AI Phishing Attacks Work in 2026

In June 2026, the FBI and Google dismantled a Chinese cybercrime network called Outsider Enterprise that had generated over one million phishing URLs, stolen roughly 3.87 million credit card records, and caused an estimated $1.9 billion in losses since July 2023. The operation sold subscriptions starting at $88 per week and included more than 290 pre-built website templates. Its operators used Google's own Gemini AI to generate the code behind those fake sites.

That last detail is worth sitting with. A criminal network weaponized a major AI assistant to mass-produce convincing fraud infrastructure, then distributed it to buyers over Telegram like a SaaS product. The FBI's Operation Riptide took it down, but the playbook it exposed is not going away.

This is how AI-powered phishing actually works in 2026, why the filters you relied on in 2022 are failing, and what modern detection tools do differently.

How AI Changed Phishing

Hyper-personalization at scale

Traditional phishing was blunt. A poorly-worded email pretending to be your bank, sent to a million people in the hope that a few thousand would click. AI changed the economics completely. Language models can ingest a target's LinkedIn profile, recent press releases, email thread patterns, and org-chart data and produce a message that reads like it came from someone the recipient trusts, written in that person's actual register and referencing real shared context.

Business email compromise (BEC) attacks now routinely involve AI-drafted messages indistinguishable from the CFO's writing style. The personalization that once required hours of manual reconnaissance now takes seconds.

Deepfake voice and video

Voice cloning has matured to the point where a three-second audio sample, scraped from a public earnings call or YouTube interview, is enough to generate a convincing real-time phone call. Finance teams have wired funds after receiving calls that sounded exactly like their CEO. In 2025 and 2026 this vector moved from proof-of-concept to routine criminal tool. Video deepfakes are still more expensive to produce but have shown up in high-value wire-fraud scenarios.

Polymorphic URLs and infrastructure that rotates itself

The Outsider Enterprise case illustrates one of the most disruptive technical shifts: AI-assisted generation of URL and site variations at industrial scale. Rather than standing up one phishing site and waiting for it to be flagged, operators produce thousands of near-identical URLs that rotate and mutate continuously. By the time a blocklist is updated, the active URL has already changed. This is sometimes called polymorphic phishing infrastructure. Google linked the Outsider network to more than 9,000 fake websites and over a million distinct fraudulent URLs generated over roughly three years.

Phishing-as-a-Service lowers the skill floor

Operations like Outsider turn sophisticated attack techniques into products. A buyer with no technical knowledge pays a weekly subscription, picks a template impersonating a major brand, sends the generated URL via a bulk SMS service, and waits for stolen credentials to arrive in a dashboard. The barrier to entry has collapsed. The supply of would-be phishers has not.

Why Legacy Filters Fail

Most email security products deployed before 2022 rely on reputation-based filtering: check the sender domain against a blocklist, scan for known malicious URLs, look for attachment signatures. These approaches share a fundamental weakness: they require the attack to have been seen before.

Polymorphic URLs defeat reputation lists because each URL is new. AI-written text defeats signature-based content filters because there is no fixed pattern to match. Domains registered hours before an attack have no reputation at all, positive or negative. And because phishing kits now spin up plausible SSL certificates automatically, the padlock icon that users were trained to trust as a safety signal is meaningless.

The volume problem compounds everything. A network sending 2.5 million smishing messages in two weeks, as Outsider Enterprise reportedly did in May 2026, generates analyst alert queues that no human team can process manually.

How Modern AI Detection Works

Behavioral AI and relationship graphs

Rather than asking "have I seen this sender before," behavioral AI asks "does this communication pattern match what I know about how these two people actually interact?" Tools like Abnormal Security build a baseline of normal email behavior for every person in an organization: who they typically email, at what times, in what tone, requesting what kinds of actions. An invoice request that arrives from a domain registered yesterday, formatted slightly differently than usual, requesting a wire transfer to a new account is anomalous even if every individual element looks clean in isolation.

Darktrace applies a similar unsupervised learning approach at the network level, modeling what normal behavior looks like for every device and user and flagging deviations in real time without requiring labeled training data or known-threat signatures.

NLP intent analysis

Modern detection reads what an email is actually trying to accomplish, not just whether its links or attachments appear on lists. Sublime Security uses a message query language approach that lets security teams write detection rules around semantic intent, for example flagging any message that creates urgency around a financial transfer while referencing a relationship the sender cannot actually have. This catches novel social engineering even when the infrastructure is brand new.

Proofpoint applies NLP at scale across billions of messages, correlating linguistic patterns with behavioral and threat-intelligence signals to score intent across an entire organization's communication flow.

Link-time and time-of-click analysis

Because a URL can be clean at delivery and malicious three minutes later (a technique called delayed redirect), scanning links only at the point of arrival is not enough. Mimecast rewrites all URLs so that every click goes through a real-time safety check at the moment the user clicks, not when the email was received. This closes the window that polymorphic infrastructure exploits.

Human-layer training and simulation

Technology catches a lot. It does not catch everything, and the attacks that get through tend to be the most sophisticated ones. Cofense focuses on the human layer: running simulated phishing campaigns against real employees, measuring who clicks, and delivering targeted training immediately after. The behavioral data from those simulations feeds back into threat intelligence shared across Cofense's customer network, so a new attack pattern spotted at one organization is used to train employees at thousands of others.

Collaborative intelligence networks

Material Security takes a different angle by focusing on the mailbox as an archive risk. Most email security tools protect the inbox at delivery time. Material also scans historical email to identify sensitive data sitting in inboxes that a compromised credential could expose, and it enforces step-up authentication for access to email rather than treating inbox access as all-or-nothing. This limits the blast radius when a credential is stolen despite all the other defenses.

What to Look for When Choosing a Tool

Behavioral baseline depth. Ask how long it takes the product to establish a baseline for your organization and what data it uses. Tools that require only email metadata versus those that model full communication graphs have very different detection surfaces.

Time-to-detect for novel attacks. Legacy vendors will show you detection rates against known threats. The more important number is how the product performs against zero-day phishing infrastructure with no prior reputation. Ask for red-team test data or third-party evaluations.

Integration with your existing stack. An email security product that does not integrate with your SIEM, your identity provider, and your incident response workflow creates alert fatigue of its own. Check whether it exposes APIs and whether it plays well with the platforms your team already uses.

Coverage across channels. Email is not the only vector. Smishing (SMS phishing), Microsoft Teams messages, Slack, and collaboration platforms are increasingly used to deliver the same social-engineering payloads. Confirm whether the product covers the channels your employees actually use.

Explainability. Your security team needs to understand why something was flagged to triage it efficiently and to tune false positives. Black-box scoring without reasoning is harder to act on than a system that shows exactly which behavioral signals contributed to a detection.

Response automation. Detection speed matters, but so does remediation speed. Products that can automatically remove a malicious email from every inbox in your organization the moment it is identified, including emails already delivered, limit damage even when an attack gets through initial filters.

The Bigger Picture

The Outsider Enterprise takedown removed one major operation, but it documented a template that will be replicated. AI tools for generating convincing phishing infrastructure are widely available. Phishing-as-a-Service platforms will continue to commoditize the attack. The defenders who keep up are the ones treating this as an AI-versus-AI problem: automated, behavioral, and continuously learning, not a list-matching exercise run once at delivery time.

The FBI's Operation Riptide is a reminder that law enforcement can and does act. It is also a reminder that a single network running for three years can steal nearly four million credit cards before it is stopped. Waiting for takedowns is not a security strategy.

Where to go next

For a ranked breakdown of the leading platforms with pricing, use-case fit, and head-to-head comparisons, see the full guide: Best AI phishing detection tools.

How AI Phishing Attacks Work (And the Tools That Stop Them)