Does llms.txt help with AI citations?

The evidence says no, not as a citation lever. Ahrefs analysed 137,000 domains in June 2026 and found that 97% of published llms.txt files received zero requests in May 2026. Of the 3% that were read, AI retrieval bots (the bots most associated with AI search citation, such as OAI-SearchBot and PerplexityBot) accounted for just 1.1% of requests. The largest AI consumer was agentic infrastructure like Claude-Code, suggesting llms.txt has utility for AI coding agents parsing developer documentation rather than for citation in AI search. Google's own 15 May 2026 guidance states machine-readable files like llms.txt are not required to appear in generative AI search.

Does schema markup increase AI citations?

A controlled Ahrefs study (11 May 2026) tracked 1,885 pages adding JSON-LD schema against 4,000 matched control pages using difference-in-differences analysis. Adding schema produced no statistically significant citation uplift on Google AI Mode (+2.4%) or ChatGPT (+2.2%), and a small decline on Google AI Overviews (-4.6%) for pages already visible to AI. A separate searchVIU experiment found that five major AI systems extract only visible HTML during retrieval and ignore JSON-LD entirely. Schema retains upstream value for crawlability and entity recognition, but it is not the citation lever it is often sold as for already-visible pages.

Why do AEO and GEO tools recommend schema and llms.txt if they don't drive citations?

Schema and llms.txt are easy to measure, easy to generate, and easy to sell as a productised checklist. They also correlate with citation in observational data, because sites that implement them tend to also invest in content quality, links, and technical maintenance. Controlled studies that isolate the schema or llms.txt variable find the correlation does not hold as causation. The tactics persist in the category because they are convenient to package, not because controlled evidence supports them as citation drivers.

Cornerstone Resource

What Actually Drives AI Citations, and What's Just Sold to You

Q: What actually drives AI citations?

The research points to content-level signals measured at the block level rather than technical markup. The Princeton KDD GEO benchmarks found the largest citation uplifts came from citing sources (+40.6%), adding quotations (+35.1%), and adding statistics (+32.9%). These map to content qualities: clear block structure aligned to retrieval chunk size, high fact density, an answer-first architecture, self-containment so each passage stands alone, and visible freshness signals. The Citation Probability Score framework scores these five pillars at the 134 to 167 word block level, the unit AI retrieval systems actually evaluate.

Published: 17 June 2026 Author: Cited By AI® Reading time: 11 min

Version 1.0 | Published 17 June 2026 | Last verified: 17 June 2026 | Synthesises three independent studies published between May and June 2026

In the space of five weeks, three independent studies landed on the same conclusion. The two tactics sold hardest across the AEO and GEO category, schema markup and llms.txt files, aren't the citation levers they're marketed as. One of those studies analysed 137,000 domains and found that 97% of llms.txt files are never read by anything at all. Here's what the evidence says actually drives AI citations, and what's just being sold to you.

If you have bought, or been pitched, an AI visibility product in 2026, the pitch almost certainly led with one of two things: add structured data so AI can understand your pages, or publish an llms.txt file so AI can find your content. Both sound technical, both are easy to productise, and both can be generated in minutes. Neither, on the controlled evidence now available, measurably moves AI citations on pages that are already visible to AI systems.

This is not an argument that these tactics are worthless. Schema has genuine upstream value. llms.txt has a real, narrow use case. The argument is narrower and more useful than that: the specific claim that adding schema or publishing llms.txt will increase your AI citations does not survive contact with controlled data. Three separate research efforts, using different methods and different datasets, now point the same way.

The three studies, in one place

Study 1

Schema markup: tracked across 1,885 pages, citations barely moved

Ahrefs (Louise Linehan & Xibeijia Guan, reviewed by Ryan Law), 11 May 2026

Ahrefs tracked 1,885 pages that added JSON-LD schema between August 2025 and March 2026, against 4,000 matched control pages, using difference-in-differences analysis across four separate statistical tests. Adding schema produced no statistically significant citation uplift on Google AI Mode (+2.4%) or ChatGPT (+2.2%), and a small statistically significant decline on Google AI Overviews (-4.6%). A separate searchVIU experiment confirmed the mechanism: five major AI systems extract only visible HTML during retrieval and ignore JSON-LD entirely.

Study 2

llms.txt: 137,000 domains analysed, 97% of files never read

Ahrefs (Louise Linehan & Xibeijia Guan, reviewed by Ryan Law), 15 June 2026

Ahrefs analysed the server logs and live traffic of 137,000 domains. 28% publish an llms.txt file. Of those, 97% received zero requests in May 2026: no bots, no humans, nothing. Of the 3% that were read, 96% of requests came from bots, and AI retrieval bots, the category most associated with AI search citation, accounted for just 1.1% of those requests. The single largest AI consumer was agentic infrastructure such as Claude-Code, pointing to a developer-tooling use case rather than a citation one.

Study 3

Google's own guidance: neither file is required for AI search

Google Search Central, "Optimising for generative AI features", 15 May 2026

Google's official AI optimisation guide includes a section literally titled "Mythbusting generative AI search". On schema: structured data is not required for generative AI search, and there is no special schema.org markup needed. On machine-readable files: you do not need to create new AI text files, markup, or Markdown to appear in generative AI search. This is the platform itself, on the record, stating that the two tactics most heavily sold for its AI surfaces are not requirements.

Three studies, three methods, one direction. A controlled experiment on schema. A 137,000-domain traffic analysis on llms.txt. And the largest platform's own published guidance covering both. When independent research using different methodologies converges like this, the conclusion is no longer a matter of opinion or vendor positioning.

The llms.txt finding deserves a closer look

The schema study and Google's guidance have been discussed at length elsewhere, including in our own deep-dives linked below. The newest study, the 137,000-domain llms.txt analysis published on 15 June 2026, contains the single most striking data point of the three, and it has not yet been widely absorbed.

97%

of published llms.txt files received zero requests in May 2026. Not from AI bots, not from search crawlers, not from humans. Nothing fetched them at all.

Ahrefs, "97% of llms.txt Files Never Get Read", 15 June 2026 (137,000 domains analysed)

The 28% adoption figure tells you how effectively the tactic has been sold. More than one in four domains in the study have published an llms.txt file, despite the fact that no major AI platform has ever committed to reading it. Adoption has been driven by speculation that AI platforms might start consuming the file, not by any confirmation that they do.

The 97% figure tells you what that adoption produced. Of roughly 38,000 domains with a valid file, almost none received a single request for it in the measured month. And the small fraction that did get read were not read by the bots site owners are hoping for. When Ahrefs classified every user agent fetching llms.txt into twelve categories, the bot most associated with AI search visibility came in last.

Bot category (of the 3% of files that were read)	Share of requests
SEO audit tools (SiteAuditBot, WebPageTest)	21.7%
Other and unidentified scrapers	14.9%
General web crawlers (Googlebot, Amazonbot)	13.1%
Tech profiling tools (BuiltWith, Dataprovider)	11.6%
AI agents & agentic infrastructure (Claude-Code)	10.5%
GEO/AEO tools studying llms.txt	5.8%
AI training crawlers (GPTBot, ClaudeBot)	5.3%
llms.txt discoverability bots	3.6%
Service & social bots (Slackbot)	2.9%
Research bots (incl. security & prompt-injection)	2.7%
AI assistants (ChatGPT-User, Claude-User)	2.5%
AI retrieval bots (OAI-SearchBot, PerplexityBot)	1.1%

Highlighted rows are AI bot categories. The bottom row, AI retrieval bots, is the category most directly tied to AI search citation. Combined, all four AI categories make up 19.5% of requests, but that is 19.5% of the 3% of files read at all, meaning roughly 0.6% of published llms.txt files ever receive any AI bot request whatsoever. "Fetched" also does not mean "read", so every figure is a ceiling on actual consumption.

Two details make the picture unambiguous. First, Slackbot, a chat app's link-preview bot, fetched llms.txt files more often than PerplexityBot did. Perplexity is one of the AI search engines the file was supposedly designed to help. Second, Ahrefs found zero requests from AI bots for llms.txt files that don't exist. AI bots never go looking for the file opportunistically; they only ever fetch one that happens to be in a path they were already visiting.

The one genuine use case the data supports is the agentic one. AI agents and agentic infrastructure, the category that includes coding agents like Claude-Code, were the largest single AI consumer at 10.5%. This lines up with what Google's John Mueller said when pressed on the contradiction between Search Central's guidance and Chrome's Lighthouse audit: llms.txt is, in his words, a temporary crutch for AI coding tools parsing developer documentation, not something most sites need for search.

If you publish an llms.txt file today, the most likely outcome by far is that nothing ever fetches it. Ahrefs, "97% of llms.txt Files Never Get Read", 15 June 2026

Why these tactics get sold anyway

If the controlled evidence is this clear, why does the category keep leading with schema and llms.txt? The answer is not conspiracy. It is convenience. Both tactics share three properties that make them easy to sell and hard to resist.

They're easy to measure. A tool can check in milliseconds whether your site has schema or an llms.txt file, assign a pass or fail, and put it on a dashboard. A binary checkbox feels like progress. They're easy to generate. Both can be produced automatically, which makes them an ideal productised deliverable: the tool finds the gap and the tool fills it, in one motion. And they correlate with citation in observational data, because sites that bother implementing them tend to also invest in content, links, and maintenance. Strip the schema out and the rest of the quality stack still carries the page. The correlation is real; the causation isn't.

None of that makes the tools acting in bad faith. It makes them optimised for what is packageable rather than for what the controlled evidence says moves citations. The distinction matters because a buyer who spends their AI visibility budget on the packageable layer is spending it on eligibility signals while their actual citation problem, which lives in the content, goes untouched.

Sold as the citation lever

JSON-LD schema markup on existing pages
llms.txt file at the domain root
Markdown copies of your pages
Machine-readability "score" as the headline metric
Checkbox audits of technical presence

What the evidence says drives citation

Block structure aligned to retrieval chunk size
Fact density: named statistics and entities
Answer-first architecture per passage
Self-containment so each block stands alone
Visible freshness signals and recency cues

What the evidence says does drive citations

If schema and llms.txt are not the levers, what is? The same body of research that disconfirms the markup-first tactics points clearly at the alternative. The Princeton and KDD GEO benchmarks, the peer-reviewed foundation under most credible AI citation research, measured which interventions actually moved citation rates in production AI systems. The top three were citing sources (+40.6%), adding quotations (+35.1%), and adding statistics (+32.9%).

Notice what those three have in common. None of them is a technical file or a markup format. All three are properties of the content itself: the density of verifiable, attributed, specific information inside the passage. That is the signal AI retrieval systems are actually responding to, because retrieval-augmented generation, the mechanism Google confirmed in its own May 2026 guidance, works by retrieving and synthesising individual passages, not by reading markup or index files.

This is the basis for the Citation Probability Score framework. CPS scores content at the block level, the 134 to 167 word chunk that production retrieval systems actually embed and evaluate, across five pillars that map directly onto the research: Content Structure, Fact Density, Answer Architecture, Self-Containment, and Freshness. It does not score schema presence or llms.txt presence as citation drivers, because the controlled evidence says they aren't. It scores the content qualities that the GEO benchmarks, the Ahrefs studies, and Google's own mechanism documentation all point to.

The honest version of the schema and llms.txt story. Both have real value in the right place. Schema supports crawlability, rich results, and entity recognition. llms.txt is a legitimate signal for the agentic web, where browser and coding agents orient themselves on a site. Neither is a citation lever on already-visible pages. A consultancy that sells them as citation drivers after this evidence isn't reading the research. A consultancy that dismisses them as worthless is overcorrecting. The accurate position is that they solve a different layer of the problem than the one most buyers think they're paying to solve.

Go deeper on each study

This page is the synthesis. Each of the three studies has its own dedicated analysis where the methodology, the caveats, and the per-platform implications are worked through in full.

Deep-dive · Study 1

Schema vs Content Signals: What Actually Drives AI Citations

The full breakdown of the Ahrefs schema study, the searchVIU mechanistic confirmation, and the two-stack model that separates technical readiness from citation decisions.

Deep-dive · llms.txt & the layers

Machine-Readability Is the Floor, Not the Ceiling

Why machine-readability is eligibility, not selection, and how llms.txt sits on the agentic-readiness layer rather than the citation layer. Covers the Lighthouse vs Search Central contradiction in full.

Deep-dive · Study 3

Google's AI Optimisation Guide: What It Says and What It Doesn't

A close reading of Google's 15 May 2026 guide, including all four mythbusts, the RAG and query fan-out confirmations, and what the "still SEO" framing does and doesn't mean.

The evidence base

CPS Research Foundation

The full evidence stack behind the five-pillar framework, including the disconfirming evidence section that documents what the research says doesn't drive citations.

What to do with this

The practical takeaway is a reallocation, not a teardown. If your AI visibility programme currently leads with schema scoring and llms.txt generation as the headline deliverables, those belong in the technical readiness layer, where they do useful work, not at the top of your citation strategy. Keep them. Stop expecting them to move citations.

Then move your attention, and your budget, to where the controlled evidence points: the content itself, scored at the block level. Read the first paragraph of your most important pages. Does it open with a declarative answer or with brand narrative? Is it dense with verifiable facts, or thin and descriptive? Does it stand on its own, or does it depend on the heading above it that the retrieval system never sees? Those are the questions that determine whether you get cited. No file at your domain root changes the answer to any of them.

See what actually drives your citations, scored at the block level

CBA's free audit scores your content across all five CPS pillars, the signals the research says move citations, across ChatGPT, Perplexity, Claude, Gemini, and Copilot. Not a schema checkbox. The actual lever. Results in 48 hours.

Get a Free Five-Platform Audit →

Three studies say it's not the markup. It's the content.

Free audit. Block-level CPS scoring across all five pillars and all five AI platforms. The signals the research says actually drive citations. Results in 48 hours.

Get Your Free Audit →