AI Search Research

What the Ahrefs AI Benchmark Report Means for Your Brand

Published: 28 May 2026 Author: Cited By AI® Reading time: 9 min
Version 1.0 | Published 28 May 2026 | Last verified: 28 May 2026 | Source: citedbyai.info AI Visibility Intelligence

Ahrefs published the most statistically robust AI search study to date. Eight researchers. 146 million SERPs, 730,000 AI responses, 75,000 brands, 174,000 cited pages, 76,000 websites. Three findings demand immediate attention from every brand operating in AI search: YouTube beats every other signal for AI brand visibility. Hallucinations were deliberately seeded, and most models couldn't tell the difference. And ChatGPT sends 190x less traffic than Google, but the traffic it sends converts at a rate that changes the calculation entirely.

Source: Ahrefs AI Search Benchmark Report, Q4 2025 / Q1 2026. Authors: Ryan Law, Xibeijia Guan, Despina Gavoyannis, Louise Linehan, Glen Allsopp, Erik Sarissky, Patrick Stox, Mateusz Makosiewicz. All statistics cited in this piece are drawn directly from that report unless otherwise stated. This is a synthesis and commentary, not a reproduction of the report itself.

146M
SERPs analysed
730K
AI responses studied
75K
brands tested
76K
websites tracked

These three findings are the ones CBA's team read twice. Each one is independently important. Together they describe a different picture of AI search than most brand strategies are built around. A clearer one.

Finding 1: YouTube is the strongest signal for AI brand visibility: by a wide margin

Most actionable finding
r ≈ 0.737

YouTube mentions were the single strongest correlate of AI brand visibility across 75,000 brands tested against ChatGPT, AI Mode, and AI Overviews.

YouTube mention impressions were the second strongest factor (r ≈ 0.712–0.721). Together the two YouTube signals occupied the top two positions in the correlation table, well ahead of every traditional SEO metric.

Here's what the full correlation table looks like. Every number is from the Ahrefs study.

Signal Correlation with AI mentions (avg. across platforms)
YouTube mentions ~0.737
YouTube mention impressions ~0.717
Branded web mentions ~0.676
Branded anchors ~0.555
Branded search volume ~0.403
Domain Rating (DR) ~0.292
Referring domains ~0.289
Number of backlinks ~0.218
Number of site pages ~0.193
URL rating ~0.185

The gap between the YouTube signals and the traditional SEO metrics isn't small. The YouTube mention correlation is roughly 3.4x stronger than the backlink count correlation and 2.5x stronger than domain rating. Ahrefs put it plainly: link and content volume have little impact on AI visibility.

The most surprising thing about this finding isn't the correlation itself: it's what it implies about the citation mechanism. AI systems are learning what brands exist, what they do, and how credible they're partly from video content and the discussions that surround it. A brand with substantial YouTube presence (product demos, tutorials, founder interviews, third-party reviews) is feeding AI systems a richer entity signal than a brand with the same domain authority but no video presence.

The practical read: YouTube isn't a separate channel to worry about after everything else is done. For AI brand visibility, it's the channel. If your brand has no meaningful YouTube presence, the correlation data suggests this is the highest-return investment you're not making (not because Google's algorithm rewards it, though it does, but because AI retrieval systems are reading from the same pool).

One important note on correlation vs causation. Ahrefs is careful to frame this as correlation. Brands with high YouTube presence may also be larger, more established, and generate more third-party coverage by other mechanisms. The direction of the relationship is clear; the mechanism isn't fully established. The practical implication holds either way: YouTube presence is where AI visibility lives, and building it's directionally right regardless of whether it's the direct driver or a proxy for the thing that's driving it.

Finding 2: The hallucination experiment is the most commercially alarming finding in the report

Most commercially alarming finding
37–39%

Gemini and Perplexity included seeded misinformation in 37 to 39 percent of their answers about a fake brand, despite the official FAQ contradicting the fake claims directly.

ChatGPT-4 and ChatGPT-5 performed materially better at under 7%. Claude didn't hallucinate at all but also never surfaced the brand's own website.

Here's what the experiment actually did. Ahrefs researcher Mateusz Makosiewicz invented a fake luxury paperweight company and seeded three contradictory stories about it: a blog post, a Reddit AMA, and a Medium "debunk" piece. Each story contradicted the brand's official FAQ. Then the team queried every major AI platform about the company.

Model Blog ingested Reddit AMA ingested Medium "debunk" ingested Used FAQ as canonical
Perplexity High High High None
Gemini Low High High None
Copilot Med Low–Med Med None
Grok Low Med High None
AI Mode Low Med High Low
ChatGPT Thinking Low Low Low–Med High
ChatGPT-4 Low Low Low High
Claude None None None None

One model hallucinated the results of a Black Friday campaign the company had never run. This wasn't a subtle error. The model generated specific, false, commercially relevant claims about a brand: the kind of output a customer might act on or a journalist might repeat.

The finding has three implications for real brands.

Third-party content on open platforms can override your official content. Reddit, Medium, Quora, and similar platforms are already inside the retrieval pool for most AI platforms. If contradictory claims about your brand exist on those platforms (even if you know they're wrong), some models will surface them alongside or instead of your official content. The absence of a Reddit presence doesn't protect you; the presence of someone else's Reddit content about you can harm you.

The harm is platform-dependent and currently measurable. Perplexity and Gemini are the highest hallucination-risk platforms for this type of seeded misinformation. ChatGPT-4 and -5 are materially more robust. Claude, while not surfacing the official website, didn't generate false claims. If your brand has significant exposure on Perplexity (a B2B or research-oriented audience, for example), the hallucination risk is substantially higher than if your primary AI exposure is on ChatGPT. This is a measurable, platform-specific risk, not a general category of concern.

The gap between what AI says about your brand and what your brand actually says is currently unmeasured for most companies. Ahrefs ran this experiment with a fake company they controlled, so they knew exactly what was true and what wasn't. For real brands, the equivalent question is: what are AI platforms saying about us right now, across which platforms, and how much of it's accurate? That question is precisely what CBA's hallucination detection module is built to answer. The Ahrefs experiment confirms why it's a first-class audit concern, not a theoretical one.

Finding 3: The 190x traffic gap, and why the comparison is incomplete without the conversion premium

Most strategically clarifying finding
190x

Across 76,000 websites, Google sends 190 times more visitors than ChatGPT. Google accounts for nearly 40% of total traffic. ChatGPT accounts for 0.21%.

ChatGPT has a 96% lower click-through rate than Google, primarily because it keeps users in the conversation rather than routing them to websites. But the visitors it does send aren't the same as the visitors Google sends.

The 190x number has been circulating since Ahrefs published. It's accurate. It's also a single-dimension comparison that strips out the most strategically important variable: what those visits are worth.

ChatGPT processes 2.5 billion daily queries. By Ahrefs' own classification, 65% of that qualifies as search behaviour: things people traditionally searched for on Google. That's 1.6 billion search-equivalent daily queries. Even with a 96% lower click-through rate, the referral traffic from ChatGPT converts at a fundamentally different rate than organic search traffic once it arrives. Independent research (Outpace panel, cited in multiple AI search studies including Perea's May 2026 synthesis) measured AI-referred conversion rates at 14.2% against organic search's 2.8%: a 5x differential.

Run the arithmetic. If ChatGPT sends you 100 visitors, at a 14.2% conversion rate that's 14 conversions. If Google sends you 19,000 visitors (the 190x multiple) at a 2.8% organic rate, that's 532 conversions. Google still wins on raw conversions at that scale, and the Ahrefs data is unambiguous: Google is still the dominant traffic channel by a large margin, and any strategy that abandons Google SEO for AI visibility work is misreading the landscape.

But the per-visit value of an AI-referred visitor is substantially higher. Which means the correct framing of the 190x finding isn't "AI search doesn't matter for traffic." It's: "AI search sends fewer but more qualified visitors, and the brands that earn citations are earning a higher-value customer regardless of absolute volume." That's the counterweight.

The Ahrefs report also confirms something that changes the SEO calculus. AI Overviews reduce click-through rates to the first organic position by 58%, and even tenth position loses 19.4% of clicks. The traffic that remains after an AI Overview is involved is already shrinking. The citations inside that AI Overview earn 35% more organic clicks and 91% more paid clicks than non-cited competitors on the same SERP. So the choice isn't between AI citation and Google SEO. It's between being cited in the AI Overview that cannibalises the SERP, or being a non-cited competitor watching the clicks go elsewhere.

Three other findings worth noting briefly

28% of ChatGPT's most-cited pages have zero organic visibility. Among the top 1,000 pages cited by ChatGPT, 283 have no traditional Google search rankings. AI citation and organic ranking aren't the same thing, don't require the same thing, and won't always co-occur. A brand that ranks well in Google can be absent from ChatGPT. A page with no Google traffic can be heavily cited by ChatGPT.

AI Mode and AI Overviews reach the same conclusions but cite almost entirely different sources. Across 730,000 response pairs, the URL overlap is just 13.7%, yet the responses agree on what to say 86% of the time. This isn't a bug: it confirms the platform-specific citation architecture that CBA's audit has been built around. Being cited in AI Overviews gives you a 61% chance of appearing in AI Mode. It's not guaranteed. The platforms are pulling from different wells to reach the same river.

Content length is largely irrelevant. Across 174,000 pages cited in AI Overviews, the correlation between word count and citation was near zero. 53.4% of cited pages were under 1,000 words. The block-level extractability of an answer is what matters, not how much surrounds it.

The practical synthesis: The Ahrefs report confirms the CBA audit framework point by point. YouTube presence and brand mentions drive AI brand visibility more than links or domain authority. Hallucinations are a live, platform-specific commercial risk that needs active monitoring. AI-referred traffic is lower volume but higher value per visit. And content length is less important than whether a specific block contains a clear, self-contained, evidence-backed answer.

What to do next

Three immediate actions, in priority order based on what the Ahrefs data actually says.

1. Audit your YouTube and brand-mention footprint. The correlation data is clear: YouTube mentions are the highest-priority signal. If your brand has minimal YouTube presence, that's the gap to close. Not more pages, not more backlinks. Third-party reviews, founder interviews, product walkthroughs, and expert appearances all feed the same signal.

2. Check what AI platforms are saying about your brand right now. The hallucination experiment used a fake company. Real brands with real third-party coverage on Reddit, Medium, and industry forums are in a similar position. The only question is whether the third-party content about you is accurate. You probably don't know the answer to that question across all five platforms. That's not a theoretical risk. It's a live brand accuracy question.

3. Run the per-platform citation audit. The 13.7% URL overlap between AI Mode and AI Overviews confirms that being cited in one doesn't protect you in the other. If your current AI visibility programme produces a single score across platforms, it's not measuring the thing that actually varies. The Ahrefs data is unambiguous on this: citation behaviour is per-engine, and strategy has to match.

See what AI is saying about your brand across all five platforms

A free Cited By AI® audit maps your citation share by platform, flags any hallucinations active in AI responses, and scores your top pages for block-level citability. The three questions the Ahrefs report raised, and the answers for your specific brand.

Get a Free Citation Audit →

The Ahrefs report confirmed the framework. The audit shows where you stand in it.

Free audit. 28 modules across a 35-section report. 5 AI platforms. Citation share, hallucination detection, YouTube and brand-mention benchmarking, and per-page CPS® scoring. Results in 48 hours.

Get Your Free Audit →