IPThreat - How Phishing URLs Get Past Your Filters and What Actually Catches Them

By IPThreat Team May 23, 2026

#phishingdetection #cybersecurity #threatintelligence #urlanalysis #emailsecurity #soc #maliciousurls

The Phishing Landscape Has Shifted Under Your Feet

Phishing remains the dominant initial access vector across virtually every industry sector, and the 2025 Verizon Data Breach Investigations Report reinforces what defenders are already experiencing firsthand: social engineering attacks have accelerated in frequency and sophistication. Healthcare organizations in particular are absorbing a disproportionate share of these attacks, with threat actors using increasingly convincing domain spoofing and URL obfuscation to bypass legacy filtering tools.

The challenge for cybersecurity professionals and IT administrators is no longer just recognizing a suspicious email. The real problem is detecting the malicious URL before a user clicks it, before the credential harvest begins, and before the incident becomes a breach notification. Modern phishing infrastructure uses short-lived domains, legitimate hosting services, and layered redirects that make static blocklist approaches functionally obsolete within hours of deployment.

The recent emergence of the Xdr33 implant, a variant of the CIA's HIVE attack kit, is a useful reminder that advanced persistent threat actors treat phishing as a precision delivery mechanism rather than a spray-and-pray nuisance. Watering hole campaigns pushing the ScanBox keylogger follow similar URL obfuscation patterns to traditional phishing kits, borrowing redirect chains and domain aging tricks that make detection harder without behavioral analysis. Understanding how these URLs are constructed and how to dissect them analytically is now a core operational skill.

Why Static Blocklists Fail Against Modern Phishing Infrastructure

Static blocklists operate on a fundamental assumption: that a malicious URL has been observed, reported, and cataloged before it reaches your users. Threat actors have long since engineered around this assumption. A phishing campaign can register a domain, deploy credential harvesting pages on compromised or legitimate cloud infrastructure, conduct a focused attack window of four to six hours, and abandon the infrastructure before any threat intelligence feed updates its blocklists.

The Webworm group's recently documented burrowing techniques illustrate this pattern clearly. Rather than relying on dedicated attacker-controlled infrastructure, groups like Webworm pivot through legitimate services and use domain fronting to make their callback and delivery URLs appear to originate from trusted providers. When a phishing URL routes through a CDN or cloud hosting provider that shares infrastructure with thousands of legitimate sites, IP-based blocking becomes counterproductive.

Beyond infrastructure abuse, attackers also exploit the aging behavior of threat intelligence systems. A domain registered sixty days before a campaign begins will have an established reputation score and minimal red flags in standard feeds. Combining aged domains with URL path randomization, protocol manipulation, and legitimate TLS certificates creates a delivery mechanism that bypasses most commercial email security gateways without requiring any novel technical capability.

Core Technical Indicators That Signal a Phishing URL

Effective phishing URL detection relies on a layered set of technical signals rather than any single indicator. These signals fall into several categories: lexical features of the URL itself, domain registration and infrastructure characteristics, behavioral signals from the landing page, and contextual signals from the surrounding email or message.

Lexical and Structural URL Analysis

The structure of a phishing URL frequently contains patterns that diverge from legitimate URLs in measurable ways. Common indicators include excessive subdomain depth, where legitimate brand URLs rarely exceed two subdomain levels but phishing URLs often construct chains like secure.login.verify.example-bank.attacker.com to push the legitimate-looking token toward the left while burying the actual registrable domain. Character substitution using homoglyphs, such as replacing a lowercase l with a numeral 1 or substituting Unicode characters that render identically to ASCII in most browsers, remains prevalent despite being a known technique.

URL length analysis provides a useful signal when applied statistically. Legitimate service URLs for authentication and account management fall within predictable length ranges for a given domain. Phishing URLs frequently contain long random-looking path components that serve as session tokens or campaign tracking identifiers, pushing total URL length well beyond what legitimate services use.

Keyword injection in the domain or path is another reliable indicator. Domains registered specifically for phishing campaigns frequently incorporate brand terms combined with action words: paypal-secure-verify, microsoft-account-update, chase-security-alert. Natural language processing models trained on labeled URL datasets can assign a suspicion score to these lexical patterns with reasonable accuracy even against domains that have never appeared in any threat feed.

Domain Registration and Infrastructure Signals

Domain age is one of the most operationally useful signals available to defenders. Newly registered domains, particularly those registered within the past thirty days, carry substantially higher base risk when observed in email traffic or proxy logs. Registration data from WHOIS and passive DNS sources can surface domains registered through privacy-masking registrars, which is disproportionately common in phishing infrastructure relative to legitimate commercial domains.

Registrar selection also provides signal. A small number of registrars account for a disproportionate share of abuse-registered domains. Cross-referencing observed domains against historical registrar abuse rates, which abuse.ch and similar projects publish, adds a scoring dimension that operates independently of content-based detection.

Autonomous System Number analysis reveals hosting infrastructure patterns. Phishing pages hosted on residential ISP infrastructure, certain bulletproof hosting ASNs, or on cloud providers' free tiers exhibit different risk profiles than domains hosted on enterprise-grade commercial infrastructure. Combining ASN reputation with domain age and lexical features creates a significantly stronger signal than any individual attribute.

Certificate Transparency logs have become a valuable source of early warning intelligence. Every certificate issued for a domain appears in public CT logs, and monitoring these logs for certificates issued to domains containing brand keywords or homograph variants of protected assets can surface phishing infrastructure before campaigns launch. Tools like certstream provide a real-time feed that security teams can monitor with keyword-matching rules aligned to their organization's brand and partner names.

Behavioral and Content Analysis

When static signals are insufficient, behavioral analysis of the URL's landing page provides the next layer of detection. Sandboxed URL detonation examines what a URL actually serves rather than making inferences from its structure. A URL presenting an HTML form that submits credentials to a different domain than the one displayed in the browser address bar is a definitive phishing indicator regardless of whether any static reputation signal flagged the URL beforehand.

Page content analysis looks for the presence of login form fields, brand imagery loaded from external sources, and discrepancies between the displayed domain and the domains receiving form submissions. Many commercial phishing kits source their brand assets directly from legitimate sites via hotlinking, which means the page will render correctly even when served from throwaway infrastructure. Detecting this pattern requires comparing the registrable domain of the form action attribute against the registrable domain in the URL being analyzed.

Redirect chain analysis is particularly important given how frequently modern phishing URLs use multi-stage redirects. A URL may begin at a legitimate-looking shortened link, redirect through a compromised website, and terminate at the actual credential harvesting page. Following the full redirect chain at analysis time, rather than evaluating only the initial URL, is mandatory for accurate detection. This is where many email security gateways fail when they evaluate URLs at delivery time without following redirects or defer evaluation until click time without adequate sandboxing.

Machine Learning Approaches That Work in Production

Machine learning has matured significantly as a component of phishing URL detection, and several approaches have demonstrated reliable performance in production environments. The key practical consideration is that no single model type works well across all detection scenarios, and production deployments benefit from ensemble approaches that combine multiple feature sets.

Feature-based classification using gradient boosting models such as XGBoost or LightGBM performs well on structured URL feature vectors. Features in these vectors typically include URL length, number of dots in the domain, presence of IP addresses in the domain field, count of special characters, number of subdomains, entropy of path components, and whether HTTPS is used. These models train quickly on labeled datasets, provide interpretable feature importance scores, and achieve high accuracy on known attack patterns. Their primary weakness is susceptibility to evasion by adversaries who understand and deliberately craft around the feature set.

Deep learning approaches using character-level convolutional neural networks or recurrent neural networks operating on raw URL strings capture patterns that engineered features miss, including novel obfuscation patterns that do not map cleanly to predefined feature categories. These models require more training data and computational resources but generalize better to previously unseen evasion techniques.

Graph-based detection is emerging as a high-value technique for phishing infrastructure identification. By mapping the relationships between domains, IP addresses, name servers, registrants, and certificates, graph models can identify clusters of phishing infrastructure that share common registration patterns or hosting configurations. Even when individual domains carry no individual red flags, their graph neighborhood may reveal strong association with known malicious infrastructure. This approach requires investment in passive DNS and certificate intelligence data sources but provides detection capability that is extremely difficult for attackers to evade without completely rebuilding their operational infrastructure.

Phishing URL Detection Checklist for Defenders

The following checklist provides a structured framework for evaluating and improving phishing URL detection capabilities across email security, proxy filtering, and endpoint protection layers.

Domain age verification: Confirm that your email security gateway and web proxy apply elevated risk scoring to domains registered within the past thirty days. Verify this behavior with a test using a newly registered benign domain to confirm the scoring policy is active.
Certificate Transparency monitoring: Deploy CT log monitoring with keyword rules covering your organization's brand names, executive names used in VIP impersonation scenarios, and the names of your major technology partners. Review alerts daily during initial deployment and refine keyword rules to reduce noise.
Full redirect chain evaluation: Audit whether your email security tool follows the complete redirect chain of URLs embedded in messages, including JavaScript-based redirects where supported. If your tool evaluates only the first URL in a chain, identify this gap and implement compensating controls such as click-time proxy evaluation.
Lexical analysis integration: Ensure that URL analysis includes lexical feature scoring. If your current tooling does not include this natively, evaluate whether your SIEM can apply regex-based scoring rules to proxy and email logs for common patterns including brand-keyword domain construction and excessive subdomain depth.
Homoglyph detection: Implement Unicode normalization and homoglyph detection on observed domains in email headers, message bodies, and proxy logs. Many organizations first discover they lack this capability when a homoglyph attack reaches a senior executive.
Sandboxed detonation coverage: Verify what percentage of inbound URLs receive sandboxed detonation. Understand which URL categories are excluded from detonation (common exclusions include internal URLs and certain file types) and assess whether those exclusions introduce exploitable gaps.
ASN and registrar reputation scoring: Integrate ASN reputation data into your web proxy policy so that URLs hosted on high-abuse ASNs receive additional scrutiny or require explicit user confirmation to proceed.
Passive DNS correlation: Subscribe to at least one passive DNS data source and build queries that surface domains sharing name server infrastructure, registrant contacts, or IP history with known phishing domains. Run these queries as part of your threat hunting cycle.
User reporting pipeline: Maintain a functioning user-report-a-phish workflow and measure time from report receipt to threat intelligence integration. User reports frequently surface phishing URLs before automated systems flag them, particularly in targeted spear phishing campaigns against specific business units.
Detection coverage testing: Run regular phishing simulation exercises using URLs that mimic current attacker techniques, including aged domains, legitimate hosting abuse, and redirect chains. Measure detection rates by layer and use gap analysis to prioritize tool improvements.

Real-World Detection Scenarios

Scenario One: Credential Harvesting Through Legitimate Cloud Storage

A threat actor uploads a static HTML credential harvesting page mimicking a Microsoft 365 login to a SharePoint or Google Drive storage bucket. The URL presented in the phishing email is a legitimate cloud storage domain, meaning domain reputation and age signals are entirely clean. Detection in this scenario depends on content analysis of the page served at the URL, specifically the discrepancy between the cloud storage domain in the address bar and the external domain receiving the form submission. Organizations whose email security tools do not perform content analysis of landing pages and analyze only URL reputation will not detect this attack at the gateway layer. The compensating control is endpoint DNS filtering that evaluates the domains contacted when the page loads in the browser, including the form submission target.

Scenario Two: Spear Phishing With Aged Domain and Valid TLS

A threat actor targeting a financial services organization registers a domain sixty days before the campaign, hosts a convincing login portal with a valid TLS certificate from a public CA, and sends targeted messages to finance department staff. The domain has clean reputation, valid HTTPS, and no prior abuse reports. Detection relies on lexical analysis revealing brand keyword injection in the domain name, CT log monitoring that flagged the domain at registration time, and behavioral analysis showing that the login form submits to a data exfiltration endpoint. Organizations that run CT monitoring with brand keyword rules will have had sixty days of advance warning before the campaign launched, giving them time to preemptively block the domain and alert staff.

Scenario Three: Multi-Stage Redirect Through Compromised WordPress

A phishing email contains a URL pointing to a legitimate but compromised WordPress blog. The compromised site redirects visitors to a credential harvesting page on throwaway infrastructure. The initial URL has high reputation because the WordPress site is legitimate and well-established. Detection requires redirect chain following at click time. When the user's browser or proxy follows the redirect and reaches the harvesting page, behavioral analysis flags the login form discrepancy and the terminal domain's age and hosting characteristics. This scenario illustrates why initial URL reputation alone is insufficient and why click-time evaluation with redirect following is operationally necessary.

Implementation Pitfalls That Undermine Detection Programs

Building a phishing URL detection program that works in practice requires avoiding several common implementation failures that appear to provide coverage but collapse under real attack conditions.

The most consequential pitfall is treating URL analysis as a delivery-time-only problem. Many organizations configure email security gateways to evaluate URLs when messages are delivered and treat that evaluation as final. Attackers who understand this behavior register clean domains, host benign content at delivery time, and swap in malicious content after delivery. Time-of-click evaluation that re-analyzes URLs when users actually click them is necessary to catch this technique, which is sometimes called late-stage URL swapping or content cloaking.

A second pitfall is excluding internal tools and trusted partner domains from analysis. Attackers increasingly compromise trusted partner infrastructure specifically to send phishing URLs that bypass trust exemptions. The Webworm group's recently documented burrowing techniques are a clear example: by operating through infrastructure that defenders have pre-exempted from scrutiny, attackers gain reliable delivery. Applying reduced scrutiny to URLs from trusted senders rather than eliminating scrutiny entirely, and never exempting based solely on sending domain, reduces this exposure.

Over-reliance on a single detection signal creates exploitable predictability. When an organization's detection logic is known to rely primarily on domain age, sophisticated attackers register aged domains. When detection logic is primarily lexical, attackers use random dictionary words rather than brand keywords. Combining multiple independent signal types forces attackers to satisfy all of them simultaneously, which increases the cost and complexity of evasion.

Inadequate logging of URL analysis decisions prevents retrospective investigation and detection improvement. When a phishing URL is reported by a user or identified through external threat intelligence, defenders need complete logs of when the URL was first observed in email or proxy traffic, what analysis scores it received, and whether any detection rules fired. Without this telemetry, tuning detection logic and measuring coverage improvement is largely impossible.

Finally, failing to connect detection outputs to measurable business outcomes results in detection programs that stagnate at an acceptable but suboptimal level. Security teams that measure phishing URL detection rates against their own simulations and against user-reported incidents, track mean time from URL first observed to detection, and tie these metrics to audit and compliance requirements build programs that improve continuously rather than remaining static while attacker techniques advance.

The acceleration of AI-assisted phishing construction documented in recent defender playbooks means URL obfuscation techniques are evolving faster than at any prior point. Detection programs built on layered technical signals, behavioral analysis, and continuous measurement are the ones that will continue functioning as the threat evolves rather than requiring complete replacement each time a new evasion pattern emerges.