IPThreat - When the Phishing Link Cleared Every Filter and the User Clicked Anyway

By IPThreat Team June 6, 2026

#phishingdetection #cybersecuritydefense #urlanalysis #emailsecurity #threatintelligence #securityoperations #dnssecurity

The Filter Problem Nobody Wants to Talk About

Phishing URL detection sits at the center of enterprise security programs, yet attackers keep getting through. Security teams invest in email gateways, DNS filtering, proxy inspection, and browser isolation, and users still click malicious links that compromise credentials, install malware, or redirect to convincing lookalike pages. The problem rarely comes down to missing a single technology. It comes down to how detection layers are sequenced, tuned, and maintained in a threat environment that changes faster than most policy cycles.

The recent Dutch law enforcement operation that seized 800 servers and arrested two individuals for providing cybercrime infrastructure illustrates how professional phishing operations are run at scale. These operations use clean, freshly registered domains hosted on bulletproof infrastructure, rotate URLs faster than reputation feeds can update, and specifically test their links against major email security gateways before deploying them in campaigns. By the time a URL appears in a blocklist, it has often already served its purpose in the original campaign and is being retired anyway.

Understanding that dynamic changes how you should think about phishing URL detection. The goal shifts from blocking every known bad URL to reducing dwell time, catching novel patterns early, and making the detection stack adaptive rather than purely reactive.

Why Reputation Feeds Alone Are Structurally Insufficient

Most enterprise phishing defenses start and end with URL reputation. A link arrives in email, the gateway queries a reputation service, gets a clean verdict on a domain registered twelve hours ago, and delivers the message. The user clicks the link six hours later when the domain has been flagged, but the email was already in the inbox.

This is a timing problem with a structural cause. Reputation services require observation time. A domain needs to appear in crawls, generate complaints, or show up in honeypot traps before it earns a malicious verdict. Attackers account for this. They register domains weeks in advance, park them on legitimate hosting, let them age, and then weaponize them in tight deployment windows. The 911 S5 botnet, one of the largest residential proxy networks ever dismantled, specifically helped attackers route initial phishing campaign traffic through clean residential IP addresses to avoid triggering reputation-based blocks. That botnet's operational patterns continue to appear in successor networks, meaning the technique is still active.

The practical implication is that reputation checks remain a necessary baseline but a weak primary control. Every additional detection layer you operate increases the probability that a URL with a clean reputation score gets caught by something else before the user acts on it.

Lexical Analysis: What the URL Structure Itself Tells You

Lexical analysis examines the textual structure of a URL without making any network request or relying on reputation data. It operates on the hypothesis that phishing URLs have predictable structural patterns that differ from legitimate URLs, and that those patterns are detectable even when the domain is brand new.

Key features that lexical models examine include domain length, the presence of brand keywords in subdomains or paths (secure-bankofamerica-login.maliciousdomain.com), the use of hyphens to simulate legitimate domain patterns, the entropy of the domain name itself, the TLD in use, the presence of IP addresses in place of hostnames, URL encoding of characters in paths, and the ratio of special characters to alphanumeric characters. These features feed into classifiers that can score a URL's likelihood of being malicious before any reputation data is available.

Implementing lexical analysis in your pipeline requires a feature extraction layer that runs before or alongside your reputation check. Open-source libraries like URLextract and specialized tools like PhishTank's API, URLScan.io, or commercial offerings from vendors like Cisco Umbrella incorporate lexical scoring. Building an internal model using scikit-learn with training data from PhishTank and APWG feeds is achievable for teams with moderate data science capacity and gives you a model tuned to the specific brand terms and URL patterns relevant to your organization.

One practical deployment pattern is to score incoming URLs at the email gateway level and quarantine messages where the URL scores above a threshold but below a confident malicious verdict. This creates a review queue that human analysts or automated sandbox detonation can process rather than delivering the email or blocking it outright based on a borderline score.

Visual and HTML Content Analysis for Lookalike Pages

Some phishing URLs pass every pre-click check because the domain is clean, the URL structure looks normal, and no reputation feed has flagged it. The malicious content only becomes apparent when the page loads. This is where visual similarity detection and HTML analysis become relevant.

Visual similarity detection compares screenshots of rendered pages against a library of legitimate login page templates for common brands. Tools like Phoca, developed by researchers at Georgia Tech, and commercial equivalents use perceptual hashing and computer vision to identify pages that visually impersonate known brands regardless of the domain or URL structure. These tools are most valuable in post-click environments, proxy-based sandboxes, or email link rewriting systems where every URL is detonated before the final redirect reaches the user's browser.

HTML content analysis takes a different approach. It examines the raw source of the landing page for signals like form fields that submit to external domains, mismatched title tags and visible brand names, inline base64-encoded credential harvesting scripts, and the presence of legitimate CDN resources combined with suspicious form actions. This technique is particularly effective against phishing kits, which tend to reuse HTML templates with slight modifications. Extracting consistent structural fingerprints from known phishing kit HTML allows you to flag pages using the same kit even when every other indicator has been rotated.

Both approaches require URL detonation infrastructure. Whether you operate an on-premises sandbox, use a cloud-based detonation service, or integrate with a browser isolation platform, the principle is the same: render the page in a controlled environment, analyze the output, and make a verdict before the user's browser ever connects.

Certificate Transparency Logs as an Early Warning Layer

Every TLS certificate issued for a domain gets logged to public Certificate Transparency (CT) logs. Attackers who register phishing domains and immediately provision HTTPS certificates (which they almost always do, since a padlock icon provides false legitimacy to users) generate a CT log entry that is publicly observable within minutes of provisioning.

Monitoring CT logs for newly issued certificates tied to brand-impersonating domain patterns gives security teams advance warning before a phishing campaign launches. Tools like CertStream provide a real-time feed of CT log entries that you can subscribe to and filter against a list of brand keywords, common impersonation patterns, and your organization's own domain names and product names.

The practical workflow looks like this: a certificate is issued for secure-login-companyname-portal.com, CertStream delivers that event to your monitoring pipeline, your filter matches against a keyword list, an alert fires, an analyst reviews the domain, and your email gateway and DNS filter are updated before any phishing email using that domain is sent. In high-volume deployments, this process can be partially automated with a confidence scoring layer that routes high-confidence matches directly to blocking and lower-confidence matches to analyst review.

This approach does not catch every phishing campaign. Attackers using wildcard certificates on their own domains, or reusing previously provisioned certificates, generate no new CT log entries. But it catches a significant portion of freshly registered campaign infrastructure and has a much lower latency than waiting for the domain to appear in a reputation feed.

DNS Behavioral Signals During Active Campaigns

Phishing infrastructure has a specific DNS lifecycle. A domain is registered, name servers are provisioned, DNS records are created, the domain goes live, a campaign runs, and then the domain is either abandoned or repurposed. That lifecycle produces observable DNS behavioral signals that distinguish phishing infrastructure from legitimate domains.

Signals worth monitoring include domains with very low time-to-live values (attackers rotate DNS records quickly to complicate blocking), domains using newly registered name servers, domains where the A record resolves to hosting known to be associated with previous abuse, domains where WHOIS data is entirely privacy-masked with a registration date in the last seven days, and domains with MX records pointing to free email providers while the domain itself claims to be a financial institution.

Passive DNS data, available through providers like Farsight DNSDB, CIRCL's passive DNS service, and commercial threat intelligence platforms, lets you enrich suspicious domains with their historical DNS records. A domain that appeared to resolve to three different IP addresses in 48 hours, two of which were previously associated with phishing campaigns, carries a very different risk profile than a domain with a stable, multi-year DNS history even if both currently resolve to clean IP addresses.

Internal DNS resolvers that log all query data to a SIEM or threat intelligence platform give you the ability to detect phishing infrastructure at the moment your own users' machines make the first DNS query, which is before the HTTP connection and before any credential submission. Blocking at the DNS layer on a per-query basis is one of the highest-value, lowest-latency interventions available to defenders.

Machine Learning Models in Production: What Actually Works

Machine learning-based phishing URL detection appears in virtually every modern email security product, but understanding what these models actually do and where they fail helps you tune your deployment and identify gaps that need compensating controls.

Production ML models for phishing URL detection typically combine multiple feature sets: lexical features from the URL string itself, WHOIS and registration metadata, DNS resolution characteristics, TLS certificate attributes, HTML content features from detonated pages, and sometimes network-level features like hosting ASN reputation and IP geolocation. Ensemble models that combine these feature sets outperform single-feature models significantly because phishing operators routinely optimize individual indicators while neglecting others.

The failure mode that matters most in production is concept drift. A model trained on phishing data from six months ago may perform significantly worse against current campaigns because attackers adapt. The Iran-linked Handala group, for example, has demonstrated the ability to shift between attack techniques rapidly, recently expanding from digital campaigns to physical threat vectors, which illustrates how threat actors iterate and adapt. Phishing infrastructure operators do the same thing with URL structures and hosting patterns. Models need retraining on a regular cadence with fresh labeled data from recent campaigns.

For teams that operate their own models, a practical approach is to maintain a feedback loop from analyst verdicts back into the training pipeline. Every URL that an analyst marks as phishing or legitimate becomes a labeled training example. Over a quarter, this generates enough new data to meaningfully update the model and maintain accuracy against evolved techniques.

Implementing a Layered Detection Stack

What to Do Today

Start by auditing what your current email gateway actually does with URLs. Determine whether it rewrites links (enabling post-click detonation), whether it checks reputation at time-of-click in addition to time-of-delivery, and whether it performs any structural analysis beyond a simple reputation lookup. Many organizations discover that their gateway performs only delivery-time reputation checks, meaning a URL that goes live after delivery bypasses all automated analysis.

Enable DNS query logging on your internal resolvers if it is not already active. Route that log data to your SIEM. Create a detection rule that alerts on DNS queries to domains registered within the last 14 days. This rule will produce false positives from legitimate new services, but the volume is manageable and the detections include early-stage phishing infrastructure hits that no other layer would catch.

Subscribe to a CT log monitoring service or set up CertStream with a keyword filter covering your organization's brand names, product names, and executive names. Route alerts to a Slack channel or ticketing system for daily analyst review. The operational cost is low and the early warning value is high.

Actions for This Week

Implement URL detonation for all external links in inbound email, either through your existing email security platform's sandboxing feature or through a dedicated detonation service with an API integration. Configure the system to quarantine emails containing URLs that return malicious verdicts after detonation rather than delivering them with a warning banner, which users consistently ignore.

Build or acquire a domain lookalike monitoring capability. Tools like dnstwist generate permutations of your domains using typosquatting, homograph attacks, and common substitution patterns. Run dnstwist against your primary domains weekly and check each generated permutation against registration data. Domains that are registered but not by you are phishing candidates that warrant investigation and potential preemptive blocking.

Review your DNS resolver blocklist update frequency. If your DNS security layer updates its blocklist once daily, you have up to 24 hours of exposure for newly identified phishing domains. Push that update frequency to every hour or implement an API-based blocking mechanism that accepts real-time submissions.

Quarterly Security Posture Work

Evaluate your organization's phishing simulation data to identify which URL patterns and lure types your users consistently click. Many phishing simulation platforms categorize templates by technique. If your users show elevated click rates on finance-themed lures with invoice-related URL paths, feed that information back into your email gateway rules as a higher-sensitivity detection category.

Develop an internal threat intelligence process that converts analyst-confirmed phishing URLs into structured indicators and pushes them to all blocking layers within a defined SLA, targeting under 15 minutes from confirmation to block. This requires a ticketing or SOAR workflow that touches your email gateway, DNS filter, proxy, and endpoint security tools simultaneously rather than requiring individual manual updates to each platform.

Assess whether your current detection stack has visibility into phishing delivered through non-email channels. Phishing via SMS (smishing), messaging platforms like Teams and Slack, and QR codes in physical media all bypass email-centric defenses entirely. Extending mobile device management policies to include SMS content inspection and configuring Teams and Slack with URL reputation integrations closes gaps that attackers are actively exploiting, as observed in campaigns targeting enterprise collaboration tool users throughout 2025 and into 2026.

The Human Layer Cannot Be Engineered Away

Technical controls catch a substantial percentage of phishing URLs, but every detection stack has a false negative rate. Some links get through. When they do, the user is the last line of defense, and framing security awareness training in those terms changes what training should accomplish.

Effective awareness training for phishing URL recognition focuses on a small number of high-signal behaviors: checking the actual domain in a URL rather than the display text, recognizing that a padlock icon indicates only that the connection is encrypted and says nothing about the legitimacy of the site, understanding that urgency in email subject lines is a manipulation tactic, and knowing how to report a suspicious link through a one-click mechanism that feeds directly into the security team's queue.

Organizations that measure their awareness program effectiveness through simulated phishing click rates and report rates rather than quiz completion percentages get much more actionable data. A team with a 4% click rate and a 60% report rate on simulated phishing is in a better operational position than a team with a 2% click rate and a 10% report rate, because the second team's analysts are not seeing the near-misses that would inform detection improvements.

Keeping Detection Current Against a Moving Target

Phishing URL detection is not a problem that gets solved and stays solved. The attackers running these campaigns are professional, well-resourced, and actively test their infrastructure against enterprise defenses before deploying campaigns. The recent pattern of cybercrime networks providing attack-enabling services as a commercial product, visible in the Dutch server seizure operation and in the persistent ecosystem around residential proxy networks, means that individual threat actors have access to evasion testing infrastructure that would have been limited to nation-state operations a decade ago.

The defenders who perform best in this environment treat phishing URL detection as an adaptive process. They measure what their stack catches, analyze what gets through, update their models and rules accordingly, and track campaign evolution over time. That continuous improvement cycle, grounded in real data from real incidents, is what separates organizations that contain phishing incidents quickly from those that discover a compromise weeks after a user clicked a link that cleared every filter.