The 5W Citation Source Audit

Research Report / Q1 2026

Published

May 2026

Updated

June 4, 2026

Research Base

11 datasets

Cadence

Quarterly · Monthly refresh

Read time

14 min

Download PDF →Jump to findings ↓Methodology note This refresh synthesizes published third-party datasets and 5W research properties. Findings should be interpreted as directional indicators rather than platform-specific measurement standards. 5W did not independently run or verify the underlying primary research for the third-party datasets cited. The Q2 2026 edition will layer 5W’s own 1,500-prompt primary research run on top of this baseline.

What’s new in this refresh — June 4, 2026

Three new findings (10–12): Claude’s premium-publisher pattern; passage-level citation behavior; the trade-press rerank.
A Claude Citation Pattern Map — first dedicated Claude view inside this audit — and a Citation Behavior Matrix mapping all five engines against six citation dimensions.
Two new datasets integrated: Lantern AI Citation Content Visibility Report (200M citations, February 2026); 5W Trade Press AI Index 2026 (synthesis of six published citation studies, ~680M citations across nine industries).

Footnote on volatility: every finding holds at time of refresh. Citation patterns shift on multi-week timescales. Expect Q2 2026 to re-baseline.

01 / Executive Summary

The PR Tier Hierarchy No Longer Reflects How Influence Works

For decades, public relations operated on a stable hierarchy: Tier 1 media, then trade media, then blogs. That hierarchy no longer reflects how influence works.

When users ask ChatGPT, Claude, Perplexity, Gemini, or Google AI Overviews about a brand, a category, or an executive, those systems do not rely on the traditional PR tier structure. They pull from a fragmented, dynamic, and structurally different source ecosystem — where Wikipedia and Reddit dominate, LinkedIn and YouTube are rapidly rising, review platforms drive recommendations, and traditional Tier 1 media is underrepresented.

This synthesis report draws on nine independently published research datasets covering hundreds of millions of citations and prompts to offer a unified working model of AI citation behavior.

The Five Core Findings

01 Wikipedia + Reddit = Structural Dominance. Together they account for over 25% of ChatGPT citations in the U.S. (Similarweb, Q1 2026) — more than any traditional media category.

02 The PR Tier System Is Misaligned With AI Reality. Reuters outranks Forbes. Forbes outranks most Tier 1 media. The Wall Street Journal and The New York Times often don't appear at all.

03 AI Citations Are Long-Tail, Not Winner-Take-All. Outside the dominant tier, distribution across many sources consistently outperforms concentration in a few.

04 Platforms Are Volatile. Reddit's share on ChatGPT collapsed from ~60% to ~10% of prompt responses in two weeks during September 2025 (SEMrush, Nov 2025). Static strategies fail.

05 – ach AI Engine Is Different. There is no single "AI SEO" strategy. Each platform requires distinct optimization.

The Core Insight
How This Was Built
Top Cited Domains
The Twelve Findings
Platform Differences
Industry Patterns
Operator Playbook
References & Limitations

02 / The Core Insight

AI Engines Don't Rank Authority. They Assemble Answers.

This is the single biggest shift from traditional PR. AI engines don't rank — they assemble. They favor extractable content, prioritize consensus across sources, and reward repetition over prestige.

For 18 months, the industry has been asking: what is our AI strategy? Most answers have been vague. Create more content. Get on Reddit. Build thought leadership. The advice is directionally correct — but misweighted and incomplete.

Multiple large-scale datasets published in 2025 and 2026 — from Similarweb, SEMrush, Profound, Peec AI, SE Ranking, Goodie, Ahrefs, and Evertune — now allow the industry to move from anecdote to operational framework.

The data converges on a structural insight that traditional PR thinking has not absorbed:

Three Consequences

AI engines favor extractable, structured content over narrative or prestige.
AI engines prioritize consensus across many sources over a single authoritative one.
AI engines reward repetition across the web over editorial endorsement.

Authority is no longer controlled by editors. It is distributed across platforms. The brands that win in AI visibility are not the ones with the most prestigious clip book. They are the ones whose name appears, consistently, in the structured surfaces the models actually pull from.

03 / How This Was Built

Eleven Datasets. Hundreds of Millions of Citations and Prompts.

This is a synthesis report. 5W did not independently run the underlying primary research and did not independently verify the data. The report integrates findings from eleven separately published studies covering January 2025 through May 2026, and surfaces patterns where the studies converge.

The studies use different units of measurement — some count citation events, others source domains, others unique prompts. The table below reports each in its native unit. Full URLs and publication dates appear in References & Limitations.

Source	Dataset	Coverage
Similarweb	~600,000 events	ChatGPT, Google AI Mode (Jan–Feb 2026, U.S.)
Peec AI	30M sources	ChatGPT, AI Mode, Gemini, Perplexity, AI Overviews
SEMrush	325K + 230K prompts	13-week cross-platform tracking
Profound	1.4M citations	Six AI models tracked
SE Ranking	129K domains	Domain-level correlation analysis
Goodie	5.7M citations	Feb–Jun 2025, four engines
Ahrefs	75K brands	December 2025 correlation study
Evertune	200M prompts	Long-tail distribution analysis
Passionfruit	12-month synthesis	March 2026 cross-study review
Lantern	200M citations	Feb 2026, ChatGPT, Perplexity, Gemini, Claude
5W Trade Press AI Index	~680M citations	Six published studies synthesized, 9 industries

The Q2 2026 Primary Research Run

The next edition will layer 5W's own primary research on top of this baseline. We will run 1,500 fixed prompts (600 branded, 600 category, 300 executive) across ChatGPT, Claude, Perplexity, Gemini, and Google AI Mode in a single calendar week, classify every citation against a 12-bucket taxonomy, and publish the dataset and methodology for public replication.

04 / The Leaderboard

The Top 20 Domains ChatGPT Cites

Similarweb's January–February 2026 dataset of approximately 600,000 citation events provides the cleanest single-platform leaderboard available. Three patterns stand out before the table loads:

Structured and community sources dominate. Wikipedia, Reddit, YouTube, LinkedIn, GitHub, and Fandom collectively exceed every traditional news outlet in the top 20.
WSJ, NYT, Bloomberg, and FT do not appear at all. Forbes is the only U.S. business publication on the list.
ChatGPT cites OpenAI itself third — ahead of Reuters and every news outlet measured. Google AI Mode does the same with Google properties.

#	Domain	Share
01	wikipedia.org	13.15%
02	reddit.com	11.97%
03	openai.com	6.21%
04	walmart.com	2.90%
05	youtube.com	2.67%
06	linkedin.com	2.42%
07	reuters.com	2.27%
08	nih.gov	2.22%
09	google.com	2.17%
10	amazon (media-amazon)	1.94%
11	wikimedia.org	1.93%
12	facebook.com	1.76%
13	ebay.com	1.75%
14	amazon.com	1.71%
15	github.com	1.62%
16	apple.com	1.48%
17	yahoo.com	1.44%
18	forbes.com	1.38%
19	fandom.com	1.29%
20	squarespace-cdn.com	1.29%

Source: Similarweb AI Citation Analysis, January–February 2026 (U.S.).

04.1 / Claude Citation Pattern Map

How Claude Sources Differently

The original Q1 audit centered the ChatGPT leaderboard because Similarweb’s January–February 2026 dataset offered the cleanest single-platform data. Claude’s citation behavior runs on a different architecture — and the difference is operationally material for any brand whose buyer mix includes Claude users.

Five patterns define Claude citation behavior:

Pattern	What the cited data shows
Retrieval Backend	Claude is reported to route web retrieval through Brave Search rather than Google. According to Profound’s 2025 analysis (via Tagliaferro), 86.7% of Claude-cited URLs in their sample overlapped with Brave top organic results. Not independently verified by 5W.
Journalism Bias	Across cited datasets, Claude is observed to weight premium long-form publishers heavily — The New York Times, The Atlantic, The New Yorker, The Economist. ChatGPT is observed to favor Forbes, Business Insider, Reuters. Different Tier 1 patterns appear for different engines.
Recency Window	Per the 5W AI Platform Citation Source Index 2026, approximately 36% of Claude’s journalism citations were drawn from the past 12 months, versus approximately 56% for ChatGPT. Claude appears to retain citation value for older authoritative coverage longer.
URL Structure	In the Oltre 2,170-URL Claude analysis: 56% of cited URLs were under a `/blog/` path, 47% used listicle structures, and 24% carried an explicit year token in the URL itself. Reported sample only.
Selectivity	Per analyses by Rankeo and Erlin (2026), Perplexity is reported to account for roughly 47% of all tracked AI citations across platforms; Claude is observed to cite more selectively per query in the same samples.

Sources: Profound (2025) via Tagliaferro · Oltre 2,170-URL Claude analysis (2026) · Erlin 501-site analysis (2026) · 5W AI Platform Citation Source Index 2026 · 5W Trade Press AI Index 2026. 5W did not independently verify the third-party measurements.

What This Means Operationally

Brave Search ranking may serve as a leading indicator of Claude visibility. Profound’s 2025 analysis reported 86.7% URL overlap between Claude citations and Brave top organic — reported by Profound; not independently verified. Most communications programs do not measure Brave.
Premium long-form placements appear to convert in Claude — even when the same outlets are observed as invisible in ChatGPT. The Tier 1 strategy is not dead; the cited data suggests it is reweighted by platform.
URL structure appears to carry retrieval signal. Year tokens in slugs may act as a freshness cue; listicle paths may act as an extraction cue.

05 / The Findings

Twelve Patterns That Define AI Citation Behavior

Each finding below is what the integrated dataset shows. Each is followed by what it means for communications strategy and the source(s) the finding rests on.

Reddit Is Infrastructure, Not a Channel

Reddit ranks #1 across every major AI engine measured.

Reddit ranks first in citation share across most major AI engines. Peec AI's 30-million-source analysis ranks Reddit number one across ChatGPT, Google AI Mode, Gemini, Perplexity, and AI Overviews. On Perplexity specifically, Evertune found Reddit accounts for as many as one in five of all citations.

The mechanism is structural. OpenAI announced a content licensing partnership with Reddit in 2024; Google has its own data agreement. SE Ranking's domain-level analysis found brands with millions of Reddit mentions averaged seven ChatGPT citations versus 1.8 for brands with minimal presence — a 3.9x multiplier.

What most people get wrong: this is not about posting. It is about presence and credibility over time. The platform's culture rewards substance, the LLMs subsequently cite the substance, and promotional behavior is filtered out within hours.

Sources: Peec AI 30M-source analysis; Evertune 200M-prompt analysis; SE Ranking 129K-domain study.

Wikipedia Is the Ground Truth Layer

The single most influential document in any brand's AI visibility profile.

Wikipedia is the most-cited single domain in ChatGPT (13.15% of U.S. citations) and a top source on every other major engine measured. It is widely documented across published research as a major training and citation source for the leading LLMs, and the most consistently retrieved authoritative source at inference time when models ground a factual answer.

If your Wikipedia page is weak, AI answers are weak. If it is missing, AI fills the gap, often incorrectly.

Correction to industry thinking: Wikipedia is not optional. The path to a strong page is not direct editing — Wikipedia's notability and reverter rules punish that. The path is earning citation-eligible secondary coverage that other editors then use to build the page.

Sources: Similarweb (Q1 2026); cross-referenced across Goodie, SEMrush, Spotlight.

LinkedIn Is the Fastest-Growing Signal

From rank #11 to #5 on ChatGPT in three months — the largest shift Profound observed all year.

LinkedIn climbed from approximately #11 on ChatGPT in November 2025 to #5 by February 2026 (Profound). SEMrush's 325,000-prompt study found LinkedIn cited in 14.3% of ChatGPT Search responses, 13.5% of Google AI Mode responses, and 5.3% of Perplexity responses. For B2B and software queries, Profound found LinkedIn is now the #1 most-cited domain across all six major AI platforms.

Critical nuance: ChatGPT and Google AI Mode pull approximately 59% of LinkedIn citations from individual member content. Perplexity inverts this, pulling about 59% from Company Pages. Both the leadership-publishing effort and the company-page operation matter — and they compound.

Leadership visibility is now a ranking factor. Most communications programs underinvest in named-leader publishing because it does not produce traditional earned-media metrics. The AI citation data overrules the traditional metric.

Sources: Profound (Feb 2026); SEMrush 325K-prompt study; ALM Corp synthesis.

YouTube Is the Hidden Power Signal

0.737 correlation with AI visibility — the strongest single predictor in any 2025–2026 study.

Ahrefs' December 2025 study of 75,000 brands found YouTube mentions correlated at 0.737 with appearances in ChatGPT, AI Mode, and AI Overviews — the strongest single correlation in their dataset.

AI engines read transcripts. Mentions persist indefinitely. The video itself is incidental — the transcript is the asset. A single ten-minute video with a substantive brand mention can generate citation lift for months.

The insight: a strong creator-led video mention can outperform a major media hit in AI visibility. Most communications programs do not budget against this. They should.

Source: Ahrefs 75K-brand correlation study, December 2025.

Forbes Is the Editorial Exception

The most-cited U.S. business publication on ChatGPT. WSJ, NYT, and Bloomberg do not appear in the top 20.

Forbes ranks 18th in Similarweb's ChatGPT dataset at 1.38% of all citations. The Wall Street Journal, The New York Times, Bloomberg, and Financial Times — all marquee Tier 1 PR targets — do not appear in the top 20 at all in this dataset.

Three structural reasons: paywalls limit body-text extraction; licensing disputes between LLM platforms and major news publishers have reduced indexing; long-form narrative features produce less clean factual extraction than tighter trade or contributor pieces.

Prestige does not equal extractability. Extractability does not equal citation.

This is not an argument to stop pitching Tier 1. Mainstream coverage retains its value for reputation, financial credibility, and as upstream feedstock to Wikipedia. It is an argument that an earned-media strategy concentrated in Tier 1 only is structurally underweighted on the AI citation layer.

Source: Similarweb AI Citation Analysis, Q1 2026.

Review Platforms Drive Decision Citations

Brands across G2, Capterra, Trustpilot, and Yelp see a 3x citation multiplier.

SE Ranking found brands listed on multiple review platforms averaged 4.6 to 6.3 ChatGPT citations versus 1.8 for absent brands. Peec AI confirmed Yelp and G2 specifically appear frequently in recommendation queries. Passionfruit's March 2026 synthesis found brands with G2, Capterra, Trustpilot, and Yelp profiles have approximately 3x higher citation probability than brands without them.

Review platforms function as third-party validation that AI engines treat as authoritative for vendor-comparison and recommendation queries. They provide structured ratings, comparative data, and clear extraction signals.

Action: claim and complete profiles on the three major platforms for the category. Encourage structured reviews — star ratings combined with specific use cases and pros/cons. Generic five-star reviews carry less signal than detailed mid-range reviews.

Sources: SE Ranking 129K-domain study; Peec AI; Passionfruit synthesis (Mar 2026).

AI Visibility Is Long-Tail, Not Winner-Take-All

Outside Wikipedia and Reddit, no domain exceeds 3% of ChatGPT citations.

Wikipedia and Reddit sit in their own tier on ChatGPT, at 13.15% and 11.97% of citations respectively. Below them, the distribution flattens dramatically: in the Similarweb data, no other domain exceeds 3% of ChatGPT citations except OpenAI's own properties (6.21%). The remaining seventeen domains in the top 20 together account for roughly 32%, and the rest of the citation share spreads across thousands of long-tail sources. Evertune's separate tracking across 200 million prompts confirms the broader pattern — outside the dominant tier, citation share is broadly distributed rather than concentrated.

This is a fundamentally different distribution from traditional SEO, where the top 10 results capture roughly two-thirds of clicks. AI search citations are a long tail with a few outliers — not a winner-take-all market.

Traditional SEO rewards rank concentration. AI visibility rewards distribution across many sources.

The strategic consequence: getting mentioned across many high-citation third-party domains is more valuable than ranking your own .com higher. Distributed mentions produce measurable lift in three to six weeks.

Sources: Similarweb (Q1 2026); Evertune 200M-prompt analysis.

Depth Beats Authority

Fandom outranks Wikipedia in Google AI Mode. Structure plus depth beats brand authority.

Fandom.com leads Google AI Mode's citation list at 7.16% — ahead of Wikipedia (5.21%), YouTube (4.91%), and Reddit (4.19%). The reason is not simply that AI Mode sees lots of entertainment queries. It is that Fandom pages are structurally optimized for what AI engines prefer.

Fandom pages run thousands of words covering one specific subject, organized under precise headings, maintained by communities with encyclopedic precision. Each page exists to answer one question about one thing.

Generalizable lesson: any brand publishing deep, single-topic reference pages on its area of expertise is building the structure AI engines reward. AI rarely cites homepages — most citations come from pages several folders deep. Specific beats broad. Deep beats wide.

Source: Similarweb AI Citation Analysis (Google AI Mode), Q1 2026.

Volatility Is Structural

Reddit's ChatGPT share collapsed from ~60% to ~10% of prompt responses in two weeks (Sept 2025).

The biggest shift of 2025 was the September collapse. Across SEMrush's 230,000-prompt 13-week tracking study, ChatGPT's citation share for Reddit dropped from approximately 60% of prompt responses to roughly 10% in a two-week window. Wikipedia followed a similar pattern, falling from roughly 55% to under 20%. Both partially recovered.

Forbes doubled its ChatGPT citation share in the same period. LinkedIn trended upward. Some weight redistributed; some collapsed entirely.

Annual AI visibility audits are obsolete. Quarterly is the floor. Monthly is competitive advantage.

The platforms tune retrieval behavior aggressively. Rankings shift meaningfully on a multi-week timescale. Brands measuring annually are reporting against citation patterns that no longer exist.

Source: SEMrush 230K-prompt 13-week tracking study, November 2025.

Claude Exhibits Strong Premium-Publisher Bias

Different Tier 1, different mechanics, longer memory.

Claude’s journalism citation profile appears structurally different from ChatGPT’s. Across measured AI citation studies, Claude is observed to lean into The New York Times, The Atlantic, The New Yorker, and The Economist. Per the 5W AI Platform Citation Source Index 2026, approximately 36% of Claude’s journalism citations were drawn from the past 12 months — compared to roughly 56% for ChatGPT. Claude appears to reward older, authoritative long-form coverage more than recent news.

The retrieval mechanism is also reported to differ. Claude is reported to route through Brave Search as its web backend. According to Profound’s 2025 analysis (via Tagliaferro), 86.7% of Claude-cited URLs in that sample overlapped with Brave top organic results — reported by Profound, not independently verified — suggesting Brave Search visibility may serve as a practical leading indicator of Claude citation lift.

Strategic consequence: Tier 1 placements that appear weak in ChatGPT may be central to Claude visibility based on the cited samples. Brands serving enterprise, legal, financial, and policy buyers should consider weighting premium long-form and Brave Search positioning alongside ChatGPT-optimized formats.

Sources: 5W AI Platform Citation Source Index 2026 · Profound (2025) via Tagliaferro · Oltre 2,170-URL Claude analysis (2026) · 5W Trade Press AI Index 2026. Directional indicators, not measurement standards.

Citations Happen at the Passage Level, Not the Page Level

Every paragraph can act as a retrieval unit. Structure beats length.

A single well-structured paragraph can earn a citation; the page around it can be ignored. Across the Oltre 2,170-URL Claude analysis: 56% of cited URLs sat under /blog/ paths, 47% used listicle structures, and 24% carried year tokens in the URL itself. The Pixelmojo synthesis of the Princeton/Georgia Tech GEO study (KDD 2024) reported that fluency plus statistics together outperformed any single tactic by an additional 5.5%.

The unit of optimization appears to be shifting from the article to the extractable claim — a clean sentence with a named entity, a specific number, and a date. Pages built as collections of citable atomic claims appear to outperform pages written as continuous narrative in the cited samples.

Example — citable paragraph (engine-ready): “Wikipedia accounts for 13.15% of ChatGPT citations in the U.S., according to Similarweb’s January–February 2026 dataset of approximately 600,000 citation events. Reddit follows at 11.97%.” Named source, specific number, explicit unit, date window. Every clause is verifiable and extracts cleanly.

One GEO recommendation: Rewrite the lead sentence of every top-traffic page so it contains one named entity, one specific number, and one timeframe. Then audit the next four paragraphs to ensure each contains at least one citable claim of the same form.

Sources: Oltre 2,170-URL Claude analysis (2026) · Pixelmojo synthesis of Princeton/Georgia Tech GEO Study (KDD 2024) · Erlin (2026). Directional indicators, not measurement standards.

The Trade Press Has Been Reranked — And It Has Clear Winners

PCMag appears more consistently than TechCrunch. Skift outpaces prestige titles. Axios outranks most mainstream peers across measured AI citation studies.

The 5W Trade Press AI Index 2026 synthesizes six published citation studies covering approximately 680 million citations across nine industry sectors. The synthesis surfaces a pattern: AI engines appear to have quietly reweighted the trade press in the cited datasets, and the rerank has named winners. PCMag is observed to lead technology. Skift leads travel. STAT leads healthcare. Bloomberg leads financial services. Axios leads public affairs. Prestige titles including TechCrunch are observed to be losing citation ground across measured studies.

Across the synthesized data, the top 15 domains across all platforms appear to capture approximately 68% of all consolidated AI citation share. In technology, PCMag (0.8%–1.6% share), TechRadar (0.3%–1.9%), and CIO.com (0.6%–2.1%) are reported in the top 10 across six to seven engines — out-citing higher-traffic general-news outlets.

Strategic consequence: the PR media list inherited from the 2010s no longer appears to reflect which placements carry AI citation weight. The list should be reaudited by vertical, by engine, and against the rerank winners. Trade press that looks unglamorous on a tearsheet may be the single highest-yield placement in the AI citation layer.

Source: 5W Trade Press AI Index 2026 (published on everything-pr.com), synthesizing data from Lantern, Similarweb, SEMrush, Profound, Peec AI, and Goodie. 5W did not independently run the underlying citation measurements.

06 / Platform Differences

Five Engines. Five Different Citation Patterns.

There is no single “AI SEO.” Each engine sources differently. A strategy that produces results on one platform is not transferable to another.

Platform	Top 5 Cited Domains (observed across cited studies)	Defining Pattern (as reported)
ChatGPT	Wikipedia, Reddit, OpenAI, Walmart, YouTube	Most Wikipedia-heavy. ~56% of journalism citations reported from past 12 months. Recent-news bias observed.
Claude	Wikipedia, Reddit, NYT, The Atlantic, The Economist	Reported to route through Brave Search. Premium long-form bias. Longer memory — ~36% of journalism citations recent. Profound (2025) reported 86.7% URL overlap with Brave top organic; not independently verified.
Google AI Mode	Fandom, Wikipedia, YouTube, Reddit, Google	Observed to favor Google-owned properties. Reported to cite ~9 domains per query.
Gemini	Reddit, YouTube, Wikipedia, Medium, Forbes	Observed to integrate Google search results directly. Strong traditional SEO converts to Gemini visibility more than to other engines.
Perplexity	Reddit, LinkedIn, NIH, Microsoft, G2	Research-credible bias. Per Rankeo (2026), reported to account for ~47% of tracked AI citations across platforms. Most footnote-explicit.
AI Overviews	YouTube, Reddit, Forbes, LinkedIn, Wikipedia	Reported to cite ~7.7 domains per query. YouTube observed in ~29.5% of AI Overviews — highest video weight of any engine measured.

Sources: Similarweb (Jan–Feb 2026) · Peec AI 30M-source analysis · Lantern AI Citation Content Visibility Report (Feb 2026, 200M citations) · 5W AI Platform Citation Source Index 2026 · 5W Trade Press AI Index 2026 · Profound · SEMrush · Rankeo (2026).

Key Patterns

Reddit appears in the top five on every platform. Universal channel.
YouTube is the most-cited domain in AI search by a significant margin once aggregated — Lantern’s 200M-citation analysis reports more than 2× the citation share of the second-ranked domain.
Claude and Perplexity diverge sharpest from ChatGPT. A brand visible in ChatGPT can be invisible in Claude — and vice versa.
AI Mode cites approximately 9 domains per query; AI Overviews cites 7.7. Wider citation pools demand a wider citation strategy.
ChatGPT is the most Wikipedia-heavy. Strong Wikipedia content disproportionately moves ChatGPT visibility.
Gemini integrates Google search results directly. Strong traditional SEO converts to Gemini visibility more than to any other engine.

AI Platform Citation Behavior Matrix

Directional ratings based on the integrated dataset. HIGH / MEDIUM / LOW indicate relative weight of each citation dimension within that engine’s citation mix — not absolute citation volume across engines.

Platform	Recency	Trade Press	Wikipedia	Reddit	Video	Long-form Journalism
ChatGPT	HIGH	MEDIUM	HIGH	HIGH	MEDIUM	LOW
Claude	LOW	MEDIUM	HIGH	HIGH	MEDIUM	HIGH
Gemini	MEDIUM	MEDIUM	HIGH	HIGH	HIGH	MEDIUM
Perplexity	MEDIUM	HIGH	MEDIUM	HIGH	LOW	LOW
AI Overviews	MEDIUM	MEDIUM	MEDIUM	HIGH	VERY HIGH	MEDIUM

Ratings are qualitative synthesis, not normalized platform measurements. Drawn from the integrated dataset of eleven sources cited in this report. Patterns shift on multi-week timescales; rerun quarterly.

07 / Industry Patterns

Citation Behavior Varies Sharply by Vertical

The presence and quality of vertical trade media is one of the strongest predictors of how a category is described in AI. Categories with strong specialized trades see those trades dominate. Categories without them see Reddit, Wikipedia, and review sites fill the vacuum.

Industry	Dominant Citation Sources
B2B SaaS	G2, Capterra, Reddit (r/SaaS, r/sysadmin), GitHub, LinkedIn, vertical trades
Beauty	Reddit (r/SkincareAddiction), Glossy, WWD, Allure, dermatologist sources
Fintech	American Banker, Banking Dive, Reddit (r/personalfinance), .gov, SEC filings
Healthcare	NIH, .gov, .edu, peer-reviewed journals, STAT, Healthcare Dive
Travel	TripAdvisor, Yelp, Reddit (r/travel), Skift, Hotel Management
Cannabis	MJBizDaily, Marijuana Moment, Leafly, Reddit (r/trees, state subreddits)
Legal	Law360, Above the Law, ALM properties, bar associations, case law
CPG	Modern Retail, Retail Dive, Food Dive, AdAge, Reddit, review aggregators

Synthesis based on patterns observed across the eleven source datasets and 5W’s industry experience.

The Overarching Pattern

In high-stakes verticals — healthcare, finance, legal — government, academic, and authoritative sources carry disproportionate weight. The models recognize where source authority matters most.

In consumer-facing verticals — beauty, travel, CPG — community platforms, review aggregators, and influencer-published content lead. Trust signals are distributed across many sources rather than concentrated in editorial brands.

In B2B — SaaS, professional services — review platforms (G2, Capterra) and LinkedIn lead, with vertical trades providing the editorial layer. Wikipedia matters less than in consumer categories.

08 / The Operator Playbook

What This Means for Brands

5W Recommends

Four Moves Now

Drawn from the patterns observed in this audit. Sequenced for execution.

1Audit Brave for Claude. Brave Search ranking is reported by Profound (2025) as a leading indicator of Claude citation visibility. Most communications programs do not measure Brave. Start there for any brand whose buyer mix includes Claude users.
2Build Reddit, YouTube, and LinkedIn citation surfaces. Reddit appears in the top five citations of every major engine measured. YouTube appears in approximately 29.5% of AI Overviews. LinkedIn climbed from approximately rank #11 to #5 on ChatGPT in three months (Profound). All three are distributed-authority assets, not social channels.
3Rewrite top pages into citable claim blocks. Per Finding 11 above: every paragraph can act as a retrieval unit. Lead each section with a named entity, a specific number, and a date. Make the page a collection of extractable claims.
4Re-rank media lists by AI citation value, not prestige. The PR media list inherited from the 2010s no longer reflects which placements carry AI citation weight in the cited samples. PCMag is reported more consistently than TechCrunch. Skift outpaces prestige titles. Reaudit by vertical, engine, and rerank winners.

AI engines reward distribution, not concentration. The brand that appears in many places consistently beats the brand that appears in one place authoritatively.

To win in AI visibility, brands must execute against four mutually reinforcing levers. None is optional. Each one feeds the others.

01 Control Your Ground Truth

Wikipedia page — accurate, complete, well-sourced to citation-eligible publications.
Owned site — About page, leadership bios, product pages, and newsroom written in the language you want repeated by AI.
Schema markup and structured data on every page that matters.
Press releases and corporate communications consistent with your ground-truth language.

02 Build Distributed Authority

Reddit — active brand presence, founder/operator participation, AMAs, expert contribution.
LinkedIn — named-leader publishing on a weekly cadence; active company page.
YouTube — seeded mentions in category creator content, reviews, comparisons, tutorials.
Review platforms — G2, Capterra, Trustpilot, Yelp profiles claimed and structured.

03 Create Extractable Content

Case studies with specific numbers, named clients, and structured outcomes.
FAQs that answer one question per page in clear, extractable language.
Deep vertical reference pages that own a single topic decisively.
Original research and proprietary data — citations compound on findings nobody else has.

04 Increase Repetition Across Sources

Mentions are more important than backlinks.
Distribution across many sources is more important than ranking in any one.
Earned coverage in citation-eligible publications feeds Wikipedia upstream.
Repetition across the surfaces AI engines pull from creates the consensus signal that drives citation.

FAQ

Frequently Asked Questions

Which sources does ChatGPT cite most often?

Per Similarweb’s January–February 2026 dataset of approximately 600,000 citation events in the U.S., the top three domains ChatGPT cites are Wikipedia (13.15%), Reddit (11.97%), and OpenAI.com (6.21%). Wikipedia and Reddit together account for over 25% of all measured ChatGPT citations. Note: AI citation patterns are volatile and shift on multi-week timescales; refer to the latest edition of the 5W Citation Source Audit for current data.

Does Claude use Brave Search?

Claude is reported to use Brave Search as its web retrieval backend. According to Profound’s 2025 analysis via Tagliaferro, 86.7% of Claude-cited URLs in that sample overlapped with Brave top organic search results, which may suggest Brave Search ranking as a leading indicator of Claude citation visibility. This finding is reported by Profound via Tagliaferro and was not independently verified by 5W.

Why does Reddit appear so often in AI citations?

Reddit appears in the top five most-cited domains across all major AI engines measured, including ChatGPT, Google AI Mode, Gemini, Perplexity, and Google AI Overviews. The structural reasons reported across the cited studies: OpenAI announced a content licensing partnership with Reddit in 2024; Google has a separate data agreement; and Reddit’s substantive, threaded content extracts cleanly into AI answers. Per Peec AI’s 30-million-source analysis, Reddit is observed as a universal top-five citation source across all major engines in the cited sample.

What is AI citation share?

AI citation share is the percentage of an AI engine’s cited sources that come from a given domain, measured across a fixed sample of prompts or responses. It is the AI-era equivalent of search market share. A brand’s AI citation share is the percentage of relevant prompts for which an AI engine references the brand, its domain, or third-party content about the brand. 5W publishes quarterly Citation Source Audits to track citation share across ChatGPT, Claude, Perplexity, Gemini, and Google AI Overviews.

How do brands improve AI visibility?

Four mutually reinforcing levers, drawn from the patterns observed across the cited datasets: (1) Control ground truth — accurate, well-sourced Wikipedia page; clear owned-site language; structured data and schema markup. (2) Build distributed authority — active Reddit presence, named-leader LinkedIn publishing, YouTube creator mentions, claimed review-platform profiles. (3) Create extractable content — case studies with specific numbers, FAQs answering one question per page, deep vertical reference pages, original research with proprietary data. (4) Increase repetition across sources — mentions across many citation-eligible sources appear to matter more than ranking on any one. Given the multi-week volatility documented in this audit, programs should consider measuring AI visibility quarterly rather than annually.

Benchmark Your Brand Against the Data.

5W runs custom AI Visibility Audits across all four major LLMs, identifying gaps and quantifying opportunity.

Request an Audit →

09 / References & Limitations

Sources and What 5W Did Not Verify (Updated June 2026)

Primary Source Studies

Similarweb (Apr 2026). The Most Cited Domains by LLMs.
similarweb.com/blog/marketing/geo/most-cited-domains-llms
SEMrush (Nov 2025). The Most-Cited Domains in AI: A 3-Month Study.
semrush.com/blog/most-cited-domains-ai
Goodie (Sep 2025). What Are the Most Cited Domains in LLMs•
higoodie.com/blog/most-cited-domains-in-llms
Contently (Apr 2026). Top 10 Sources LLMs Cite Most in 2026.
contently.com/2026/04/29/top-sources-llms-cite
Passionfruit (Mar 2026). How LLMs Search for Citations.
getpassionfruit.com/blog/how-llms-search-for-citations
Wellows (Nov 2025). Cited by ChatGPT: 7K Queries, 485K Citations.
wellows.com/insights/chatgpt-citations-report
xSeek. AI Source Radar: Track What Sources LLMs Cite.
xseek.io/sources
Profound (via ALM Corp synthesis). LinkedIn rank shift on ChatGPT, Nov 2025–Feb 2026.
almcorp.com/blog/linkedin-ai-search-citations-2026
Ahrefs (Dec 2025). 75K-brand correlation study (via BrandMentions synthesis).
brandmentions.link/ahrefs-brand-mentions
Lantern (Feb 2026). AI Citation Content Visibility Report. 200M+ citations across ChatGPT, Perplexity, Gemini, and Claude.
5W (May 2026). The AI Platform Citation Source Index 2026 — 50 sources ranked across five engines. Published on everything-pr.com and 5wpr.com.
5W (May 2026). The Trade Press AI Index 2026 — across nine industries. Synthesizes six published citation studies, ~680M citations. Published on everything-pr.com.
Oltre (2026). How Claude Picks Sources: Technical Breakdown — 2,170-URL Claude analysis.
Pixelmojo (Apr 2026). GEO playbook synthesis of Princeton/Georgia Tech GEO Study (KDD 2024).
Erlin (2026). 501-site Claude SEO analysis.
Profound (2025) via Tagliaferro — Claude/Brave Search URL overlap analysis.

What 5W Did Not Verify

5W did not run the underlying primary research in this Q1 edition. The Q2 edition will include 5W's own primary research run as described above.
Where studies disagree at the margin, the largest dataset is generally weighted most heavily; specific disagreements are surfaced in the body of each finding.
All percentages, rankings, and correlations are reported as published by the original researchers.
The June 2026 refresh integrates two 5W research properties (the AI Platform Citation Source Index 2026 and the Trade Press AI Index 2026). Both were produced by 5W Research from synthesized third-party data; 5W did not independently run the underlying primary measurements for either. Where these properties are cited in this Q1 audit, readers should consult each property’s published methodology for full sourcing detail.

Limitations

This is a U.S.-focused synthesis. International citation patterns are not covered in this edition.
The September 2025 volatility event documented in Finding 9 is a clear warning that citation patterns shift on multi-week timescales. Findings holding in Q1 2026 may shift by Q2.
Some claims about LLM training data composition are widely discussed in the trade press but not fully documented by the model developers themselves. Where this is the case, the report uses cautious language and stops short of asserting specific training-weight figures.

Related Reports

Coming Q3 2026