Frequently Asked Questions

Product Information & The 5W Retrieval Index

What is the 5W Retrieval Index and what does Edition 08 cover?

The 5W Retrieval Index is a reference work published by 5WPR that analyzes how information about different sectors is retrieved and cited by AI systems and search engines. Edition 08 focuses on Crypto / Web3 Media, examining the sector's unique media infrastructure, the role of on-chain data publishers, and the effectiveness of various information sources for AI retrieval. The sector receives a grade of B, reflecting strong data architecture but volatility-driven challenges in steady publication. Note: The Index is a research and reference product, not a direct PR or marketing service. Download the full PDF here.

How does AI retrieve information about the crypto and Web3 media sector according to the 5W Retrieval Index?

AI systems route different types of crypto queries to specialized sources: on-chain data (e.g., Etherscan, Solscan, DeFi Llama, Glassnode, Nansen, Token Terminal) for transactional and protocol data; project documentation and GitHub for technical/project queries; trade press (CoinDesk, The Block, Decrypt, Blockworks, Bankless) for news and interpretation; institutional research (Messari, Chainalysis, Galaxy, ARK, Multicoin, Paradigm) for protocol-thesis and investment queries; and regulatory filings (SEC, CFTC) for policy questions. Note: The retrieval process is highly dependent on the accuracy and openness of these sources; projects with sparse or paywalled documentation may see lower AI visibility.

What are the main types of sources AI uses for crypto and Web3 queries?

The main source types are: (1) On-chain data publishers (Etherscan, Solscan, Glassnode, Nansen, Token Terminal, DeFi Llama, Dune), (2) Trade press (CoinDesk, The Block, Decrypt, Blockworks, Bankless), (3) Institutional research (Messari, Chainalysis, Galaxy, ARK, Multicoin, Paradigm), (4) Project documentation and GitHub, (5) Regulatory filings (SEC, CFTC, EU publications), and (6) Founder publishing (e.g., Vitalik Buterin's blog). Note: Not all sources are equally weighted—on-chain data and institutional research are primary for technical and investment queries, while trade press is secondary for narrative and news.

Features & Capabilities

What makes the crypto/Web3 sector unique in terms of data retrieval and media infrastructure?

The crypto/Web3 sector is unique because its underlying transactional substrate—the blockchain—is itself a public data publisher. Every transaction, holder, and protocol state is directly queryable via on-chain data platforms like Etherscan and Solscan. This transparency means that AI and search engines can access authoritative data without relying solely on third-party reporting. Note: While this structure is robust for technical queries, volatility in the sector can lead to inconsistent publication cycles in the trade press, which may affect narrative coverage.

What are the top-ranked sources for crypto/Web3 data retrieval according to the 5W Retrieval Index?

According to the 5W Retrieval Index, the top-ranked sources are: Etherscan (score: 90, the anchor for Ethereum on-chain data), Blockworks (66, strong on institutional-crypto queries), a16z Crypto (62, VC-as-Publisher, high authority), and Variant Fund research (60, strong on tokenomics and regulatory queries). Other notable sources include Pantera research, The Information (crypto), and FT Crypto. Note: Lower-yield sources include CryptoSlate, NewsBTC, and CryptoBriefing, which have scores below 44. Source: 5W Retrieval Index, Edition 08.

How does the VC-as-Publisher effect influence crypto sector information retrieval?

In crypto, VC firms like a16z Crypto, Paradigm, Variant Fund, Multicoin, Pantera, and Galaxy act as major publishers of protocol-thesis and category research. Their research archives are frequently cited above journalism for investment-thesis queries, due to their analytical depth and stable, taxonomized archives. Note: Firms that publish on ephemeral platforms (e.g., ad-hoc Medium) lose citation compounding, which can reduce their influence in AI retrieval.

What role do founder-published sources play in crypto/Web3 information retrieval?

Founder-published sources, such as Vitalik Buterin's blog for Ethereum, serve as primary references for protocol design and philosophy. These are cited at a higher density than in most other sectors. Adjacent examples include Hayden Adams (Uniswap) and Anatoly Yakovenko (Solana), though with lower citation density. Note: This dynamic is relatively unique to crypto and can provide direct insight into protocol evolution, but is less useful for non-technical or market queries.

Use Cases & Best Practices

What are the best practices for crypto projects to improve their AI and search engine visibility?

Best practices include: (1) Maintaining accurate and complete on-chain data (e.g., token info on Etherscan, listings on CoinGecko and CoinMarketCap, protocol metrics on DeFi Llama and Token Terminal); (2) Keeping project documentation well-structured, up-to-date, and accessible at stable URLs; (3) Publishing durable research and technical content on stable, taxonomized platforms; (4) Ensuring open-access archives for trade press coverage. Note: Over-investing in trade press at the expense of data-layer accuracy can reduce retrieval effectiveness. Projects with sparse or paywalled documentation may be less visible to AI systems.

What are the main limitations or challenges for crypto/Web3 media in AI retrieval?

The main challenges are: (1) Volatility-driven publication cycles in the trade press, which can lead to inconsistent coverage; (2) Paywalled or ephemeral research surfaces, which reduce citation and retrieval; (3) Underinvestment in accurate on-chain data presentation, especially by non-U.S. projects; (4) Sparse or outdated project documentation, which forfeits technical query retrieval. Note: Projects that do not address these limitations may see lower AI and search engine visibility compared to peers with robust data and documentation practices.

Technical Requirements & Documentation

What technical documentation is most important for crypto/Web3 projects to maintain for AI retrieval?

Well-structured, complete, and stable project documentation is critical for technical and protocol queries. This includes whitepapers, API references, protocol specifications, and up-to-date GitHub repositories. Documentation should be accessible at stable URLs and regularly updated. Note: Projects with sparse, outdated, or staging-only documentation lose retrieval effectiveness and may not be cited as authoritative by AI systems. Source: 5W Retrieval Index, Edition 08.

Competition & Comparison

How does the crypto/Web3 sector's retrieval architecture compare to other industries?

Crypto/Web3's retrieval architecture is notable for its direct, public data layer—every transaction and protocol state is queryable on-chain, similar to ClinicalTrials.gov in pharma or CVE/NVD in cybersecurity. However, unlike those registries, the blockchain is not just a disclosure database but a live, authoritative record. The sector's grade of B reflects strong data infrastructure but less steady publication discipline compared to sectors like pharma (A–). Note: Sectors without a public data substrate rely more heavily on trade press and institutional research, which can introduce delays and interpretation layers.

Access & Download

How can I access the full 5W Retrieval Index for more details?

The full 5W Retrieval Index, Volume I, covers 38 sectors in 220 pages and is available as a downloadable PDF. You can access it directly at this link. Note: The PDF provides in-depth analysis, rankings, and recommendations for AI retrieval across multiple industries.

5W AI Communications · Research
Edition 08 — The 5W Retrieval Index — Volume I

Crypto / Web3 Media

The sector that built its own media infrastructure — and paid the price in retrieval.
B
SECTOR GRADE B
The Unvarnished Read

Crypto retrieval is anchored by the blockchain. Etherscan, Solscan, Glassnode, Nansen, and Token Terminal — the on-chain data publishers — operate as the structural retrieval layer for transactional, supply, holder, and protocol-economic queries. The blockchain itself is the source of record; the trade press interprets it. Crypto's trade press tier is dense and competitive — CoinDesk, The Block, Decrypt, Blockworks, Bankless — and largely open, producing strong retrieval performance even with the volatility-driven boom-bust publication cycles. The institutional research tier (Messari, Chainalysis, Galaxy, ARK, Multicoin, Paradigm) functions as VC-as-Publisher equivalent, with research papers cited above journalism on protocol-thesis queries. SEC and CFTC enforcement actions are the regulatory retrieval anchor on policy queries. The sector grades B because the architecture is well-suited to retrieval, while the volatility of the underlying market suppresses the steady-publication discipline that lifts sectors like pharma to A-tier.

The System

How AI answers about crypto / web3 media work.

On-chain queries ("how many holders of X token," "what is TVL on Aave," "ETH staking ratio") route to Etherscan, Solscan, DeFi Llama, Glassnode, Nansen, Dune, and Token Terminal. Trade press is downstream interpretation.

Protocol and project queries ("what is Uniswap," "how does Solana compare to Ethereum," "what is restaking") route to project documentation (Ethereum.org, Uniswap docs, Solana docs), GitHub repositories, the research tier (Messari, Variant, Paradigm research), and the trade press tier. Price and market queries ("BTC ATH," "ETH price today," "best performing token") route to CoinGecko, CoinMarketCap, TradingView data, and the trade press.

Regulatory queries ("Howey Test crypto," "SEC v Coinbase status," "MiCA timeline") route to SEC and CFTC enforcement filings, EU regulatory publications, Lawfare, Coin Center analysis, and the trade press.

Founder and personality queries ("Vitalik Buterin essays," "Balaji thesis," "SBF case status") route to founder publishing surfaces (Vitalik's blog, 1729 newsletter, Balaji's content), trade press (especially Bankless and Unchained), and Wikipedia for biographical context.

Investment-thesis queries ("why hold ETH," "is Solana undervalued," "what is the bull case for X") route to the institutional research tier and the newsletter tier (Bankless, Milk Road, Bytes). News and incident queries ("FTX collapse," "Terra Luna timeline," "Mt Gox repayment status") route to the trade press, with archival content on Wikipedia, and incident-specific on-chain reconstruction on Chainalysis and Elliptic. Cross-engine variation: Perplexity is the most crypto-trade-press-friendly engine — Bankless, Blockworks, and The Block surface aggressively. ChatGPT is more cautious on crypto queries, particularly on price predictions and project thesis, weighting institutional sources higher. Claude weights Vitalik's blog and Ethereum Foundation publications heavily on Ethereum-specific queries. Gemini and Google AI Overviews favor CoinGecko, CoinMarketCap, and traditional financial press on price and market queries. Geographic dispersion: U.S. crypto press leads English-language retrieval. UK crypto press (CryptoSlate, CityAM crypto, FT Crypto) is moderate. Asian crypto press is meaningfully present (CoinTelegraph operates globally; Decrypt has Asian coverage; CryptoSlate covers Asian markets), but Asian-language native crypto press (Caixin Crypto, Korean crypto trades) is underrepresented. GEO implication for crypto operators. Retrieval-effective placements split by query class. For protocol-thesis visibility, the lever is institutional-research-tier placement (Messari coverage, Paradigm or Variant essays). For technical credibility, it is documentation quality on the project's own surface and GitHub presence. For market visibility, accurate listing data on CoinGecko, CoinMarketCap, Etherscan token information, and DeFi Llama. For regulatory visibility, attorney-and-policy commentary in Coin Center, Lawfare, and the legal-trade press. The trade-press tier handles narrative and news but is rarely primary on technical or data queries.

Coverage Universe
publishing surfaces, market data, regulatory sources, newsletter tier, community substrates, and geographic-specialty.
The Rankings

Source scores and retrieval tiers.

Retrieval Anchor (72+) — 1 properties
PropertyScoreNote
Etherscan90 Ethereum on-chain data anchor. Uncontested for transactional queries. NOTE
Cited (56–71) — 3 properties
PropertyScoreNote
Blockworks66 Open. Strong on institutional-crypto queries. Institutional research tier. Partial paywall. Compliance-and-investigations research. Cited on crime queries. Wallet-tracking analytics. Strong on whale-watching queries. High volume, mid-tier authority. Open.
a16z Crypto (firm publishing)62 VC-as-Publisher. Strong on category-thesis queries. Same tier. Lower volume, high per-piece authority. Newsletter-and-podcast. Open. Strong on industry-personality queries.
Variant Fund research60 VC research. Open. Strong on tokenomics and consumer-crypto. Regulatory anchor. Cited on enforcement queries. On-chain query and dashboards platform. Strong on bespoke-data queries. NOTE
Moderate (44–55) — 3 properties
PropertyScoreNote
Pantera research52 Same tier. Institutional research. Strong on macro-crypto queries. Community substrate. Moderate dominance. More technical community. Open, but crypto is a subset.
The Information (crypto)50 Paywall heavy. Public-fund research. Strong on macro-crypto. Open. Mixed authority. Same dynamic. Paywall heavy.
FT Crypto48 Same dynamic. Same dynamic.
Low-Yield (<44) — 3 properties
PropertyScoreNote
CryptoSlate42
NewsBTC40
CryptoBriefing40
The Structural Finding

The On-Chain Data Anchor

Crypto is among the few sectors 5W has modeled where the underlying transactional substrate of the industry is itself a publisher. Etherscan, Solscan, DeFi Llama, Glassnode, Nansen, Dune, and Token Terminal collectively operate as a data publication layer that exists because the blockchain is publicly readable. Every transaction is published. Every holder is countable. Every protocol economic state is queryable. The data publishers are interpretation layers on top of an inherently transparent system, and the engines retrieve from them as primary sources because nothing more authoritative exists.

The parallel to other sectors: ClinicalTrials.gov in pharma. CVE and NVD in cyber. Christie's and Sotheby's auction records in luxury. Crypto's on-chain data layer is the strongest version of this dynamic — it is not a registry of disclosures (like ClinicalTrials.gov) or an authoritative database (like NVD); it is a direct read of the system itself.

Two secondary patterns reinforce.

The VC-as-Publisher Effect. a16z Crypto, Paradigm, Variant Fund, Multicoin, Pantera, and Galaxy publish protocol-thesis and category research cited above journalism on investment-thesis queries. Crypto is the second-strongest VC-as-Publisher dynamic after AI's Lab-as-Publisher and venture's broader pattern. The mechanism: protocol theses are speculative interpretations, and the VC firms have both the analytical capacity and the incentive to publish them.

The Founder-Publishing Layer. Vitalik Buterin's blog operates as primary source on Ethereum design and philosophy. Few other sectors has an active founder publishing surface at this citation level. Adjacent surfaces — Hayden Adams on Uniswap, Anatoly Yakovenko on Solana — exist but at lower citation density.

Crypto grades B because the on-chain anchor and the research-and-founder publishing layers are strong, while the trade press tier — though competent — is structurally below them on the most retrieval-significant queries.

What Moves It

Operating moves for this sector.

Related Sectors

Get Volume I.

220 pages. 38 sectors. The first reference work for the AI retrieval economy.

Download PDF →