EDITION 01 — THE 5W RETRIEVAL INDEX — VOLUME I

AI Media

Q2 2026

The Unvarnished Read

The labs are the press. OpenAI, Anthropic, DeepMind, Google AI Research, Meta FAIR, and Hugging Face publish more cited content than every paywalled prestige publication that covers them. Below the lab tier, two structural dynamics define the sector: open community substrates (Hacker News, r/MachineLearning, r/LocalLLaMA, Stack Overflow) carry the citation load that journalism carries in other sectors, and the highest-authority paywalled outlets — The Information, Stratechery, Bloomberg, The Wall Street Journal, the Financial Times — sit suppressed below their actual influence on the operator class.

The most-read AI journalism is not the most-cited AI journalism. The training-data economy and the paywall economy run in opposite directions, and the gap defines the sector. AI media grades B because the journalism layer is strong but the citation economy is dominated by sources outside the journalism layer entirely.

The System

How AI answers about ai media work.

The labs publish their own canon. OpenAI's research pages, Anthropic's research and Claude model cards, DeepMind's blog, Google AI Research, Meta FAIR, and Hugging Face's documentation are routinely cited as primary sources for the products they ship. No other sector has this dynamic — Pfizer does not out-cite STAT News on Lipitor. In AI, the manufacturers are the press of record on their own products.

Newsletters provide the synthesis layer. Stratechery, Platformer, Big Technology, Pragmatic Engineer, Import AI, AI Snake Oil, One Useful Thing, Interconnects, and Latent Space carry the analysis the engines pull for "what is happening in AI" queries. Substack is more structurally important to AI retrieval than to any other sector.

Forums and community substrates carry the connective tissue. Hacker News, r/MachineLearning, r/LocalLLaMA, and Stack Overflow show up disproportionately for opinion, technical disagreement, and practitioner-experience queries. r/MachineLearning is cited above some mid-tier trade press despite being a forum.

The paywall economy suppresses the prestige tier. The Information, Stratechery, Bloomberg AI, FT Tech, WSJ Tech, and NYT Tech produce the highest-quality AI journalism — and the engines cannot cite what they cannot reach. Paywalls cost the prestige tier 10–25 composite points each.

Geography is U.S.-dominated. Severely. Citation density follows the U.S. AI press to a degree disproportionate even to the U.S. share of the AI industry. UK presence is moderate. Chinese AI press (Caixin, Synced, China Daily Tech) is almost entirely absent from English-language engine retrieval despite the scale of Chinese AI.

55 properties across established tech press, lab and institutional publishers, newsletter and analyst tier, research and reference infrastructure, community and forums, and specialist trade.

The Rankings

Source scores and retrieval tiers.

The Structural Finding

The Lab-as-Publisher Effect

In every sector 5W has modeled, the press is the press and the brands are the brands. Pfizer does not write the cited reference on Lipitor — STAT News or the New England Journal of Medicine does. Tesla does not write the cited reference on its driver-assistance safety record — the IIHS or Reuters does. Procter & Gamble does not write the cited reference on its skincare formulations.

In AI, the manufacturers publish the primary source for the manufactured thing. OpenAI's GPT-4 system card is the cited reference for GPT-4. Anthropic's Claude model card is the cited reference for Claude. DeepMind's technical reports are cited above the journalism covering them. Hugging Face's documentation is the cited reference for nearly every open-source model on the platform. Google AI Research and Meta FAIR routinely sit in the top tier of citations for their own architectures.

This is the Lab-as-Publisher Effect. The labs are not subjects of coverage. They are publishers. They control the retrieval graph for their own products. The journalism layer sits on top of the lab layer, not in place of it. Three secondary patterns reinforce: the Training-Data Paradox (paywalled prestige journalism suppressed below its influence), the (Hacker News and r/MachineLearning carrying load conventional journalism Community Substrate carries elsewhere), and the (definitional queries route to Wikipedia consistently primary). Wikipedia Authority Layer

What Moves It

Operating moves for this sector.

For paywalled prestige publications — open the AI archive. The Information, Stratechery, Bloomberg, FT, and WSJ each forfeit 10–25 composite points to access controls. Opening AI-specific evergreen content recovers retrieval without altering the subscription model on news.
For open journalism — invest in schema and named-entity density. TechCrunch, The Verge, and MIT Tech Review already score high. The next 5 points come from Article schema, Organization schema for every lab and model named, and internal linking to primary lab pages.
For newsletter operators — durability over recency. The engines retrieve from the durable archive, not the inbox. Newsletter platforms that surface the archive (Substack public pages, dedicated index pages) out-cite those that bury old issues.
For labs — protect the publishing surface. Lab pages with strong structure, clear authorship, and stable URLs cement retrieval. Lab pages buried in blog mazes lose to journalism that covers the same material.
For operators trying to be cited AI coverage — the retrieval-effective placements are not the prestige in placements. Open press (TechCrunch, Verge, MIT TR, VentureBeat, Ars), lab partnership surfaces, and community substrate move the needle in engine retrieval. Stratechery and The Information move the needle in human readership. They are not interchangeable channels.

Get in touch

Let's build your next chapter.

Tell us what you're working on. A senior strategist will respond within one business day.

Email: info@5wpr.com
Phone: 212.999.5585
Offices: New York · HQ469 7th Avenue, Floor 8
New York, NY 10018
Miami100 SE 2nd Street, Floor 38
Miami, FL 33131
Tampa110 South 12th Street
Tampa, FL 33602